LOCUS       BC052575                3331 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type VI, alpha 1, mRNA (cDNA clone MGC:59702
            IMAGE:6598940), complete cds.
ACCESSION   BC052575
VERSION     BC052575.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3331)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3331)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (16-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J.,
            Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAL Plate: 47 Row: o Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            15011912.
FEATURES             Location/Qualifiers
     source          1..3331
                     /db_xref="H-InvDB:HIT000053797"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59702 IMAGE:6598940"
                     /tissue_type="Ovary, teratocarcinoma"
                     /clone_lib="NIH_MGC_109"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3331
                     /gene="COL6A1"
                     /gene_synonym="OPLL"
                     /db_xref="GeneID:1291"
                     /db_xref="HGNC:HGNC:2211"
                     /db_xref="MIM:120220"
     CDS             83..3169
                     /gene="COL6A1"
                     /gene_synonym="OPLL"
                     /codon_start=1
                     /product="collagen, type VI, alpha 1"
                     /protein_id="AAH52575.1"
                     /db_xref="GeneID:1291"
                     /db_xref="HGNC:HGNC:2211"
                     /db_xref="MIM:120220"
                     /translation="MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDT
                     SESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQG
                     LTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGH
                     PLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADW
                     GQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGL
                     PGEKGEAGDPGRPGDLGPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVK
                     GEMGYPGLPGCKGSPGFDGIQGPPGPKGDPGAFGLKGEKGEPGADGEAGRPGSSGPSG
                     DEGQPGEPGPPGEKGEAGDEGNPGPDGAPGERGGPGERGPRGTPGTRGPRGDPGEAGP
                     QGDQGREGPVGVPGDPGEAGPIGPKGYRGDEGPPGSEGARGAPGPAGPPGDPGLMGER
                     GEDGPAGNGTEGFPGFPGYPGNRGAPGINGTKGYPGLKGDEGEAGDPGDDNNDIAPRG
                     VKGAKGYRGPEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSE
                     SIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIR
                     NVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTP
                     LNVLCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLE
                     DAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLA
                     ERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDAT
                     DVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVV
                     VVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG"
BASE COUNT      754 a   1019 c   1100 g    458 t
ORIGIN
        1 cagaaggcag cctcggtctc tgggcggcgg cggcggccca ctctgccctg gccgcgctgt
       61 gtggtgaccg caggcccgag acatgagggc ggcccgtgct ctgctgcccc tgctgctgca
      121 ggcctgctgg acagccgcgc aggatgagcc ggagaccccg agggccgtgg ccttccagga
      181 ctgccccgtg gacctgttct ttgtgctgga cacctctgag agcgtggccc tgaggctgaa
      241 gccctacggg gccctcgtgg acaaagtcaa gtccttcacc aagcgcttca tcgacaacct
      301 gagggacagg tactaccgct gtgaccgaaa cctggtgtgg aacgcaggcg cgctgcacta
      361 cagtgacgag gtggagatca tccaaggcct cacgcgcatg cctggcggcc gcgacgcact
      421 caaaagcagc gtggacgcgg tcaagtactt tgggaagggc acctacaccg actgcgctat
      481 caagaagggg ctggagcagc tcctcgtggg gggctcccac ctgaaggaga ataagtacct
      541 gattgtggtg accgacgggc accccctgga gggctacaag gaaccctgtg gggggctgga
      601 ggatgctgtg aacgaggcca agcacctggg cgtcaaagtc ttctcggtgg ccatcacacc
      661 cgaccacctg gagccgcgtc tgagcatcat cgccacggac cacacgtacc ggcgcaactt
      721 cacggcggct gactggggcc agagccgcga cgcagaggag gccatcagcc agaccatcga
      781 caccatcgtg gacatgatca aaaataacgt ggagcaagtg tgctgctcct tcgaatgcca
      841 gcctgcaaga ggacctccgg ggctccgggg cgaccccggc tttgagggag aacgaggcaa
      901 gccggggctc ccaggagaga agggagaagc cggagatcct ggaagacccg gggacctcgg
      961 acctgttggg taccagggaa tgaagggaga aaaagggagc cgtggggaga agggctccag
     1021 gggacccaag ggctacaagg gagagaaggg caagcgtggc atcgacgggg tggacggcgt
     1081 gaagggggag atggggtacc caggcctgcc aggctgcaag ggctcgcccg ggtttgacgg
     1141 cattcaagga ccccctggcc ccaagggaga ccccggcgcc tttggactga aaggagaaaa
     1201 gggcgagcct ggagctgacg gggaggcggg gagaccaggg agctcgggac catctggaga
     1261 cgagggccag ccgggagagc ctgggccccc cggagagaaa ggagaggcgg gcgacgaggg
     1321 gaacccagga cctgacggtg cccccgggga gcggggtggc cctggagaga gaggaccacg
     1381 ggggacccca ggcacacggg gaccaagagg agaccctggt gaagctggcc cgcagggtga
     1441 tcagggaaga gaaggccccg ttggtgtccc tggagacccg ggcgaggctg gccctatcgg
     1501 acctaaaggc taccgaggcg atgagggtcc cccagggtcc gagggtgcca gaggagcccc
     1561 aggacctgcc ggaccccctg gagacccggg gctgatgggt gaaaggggag aagacggccc
     1621 cgctggaaat ggcaccgagg gcttccccgg cttccccggg tatccgggca acaggggcgc
     1681 tcccgggata aacggcacga agggctaccc cggcctcaag ggggacgagg gagaagccgg
     1741 ggaccccgga gacgataaca acgacattgc accccgagga gtcaaaggag caaaggggta
     1801 ccggggtccc gagggccccc agggaccccc aggacaccaa ggaccgcctg ggccggacga
     1861 atgcgagatt ttggacatca tcatgaaaat gtgctcttgc tgtgaatgca agtgcggccc
     1921 catcgacctc ctgttcgtgc tggacagctc agagagcatt ggcctgcaga acttcgagat
     1981 tgccaaggac ttcgtcgtca aggtcatcga ccggctgagc cgggacgagc tggtcaagtt
     2041 cgagccaggg cagtcgtacg cgggtgtggt gcagtacagc cacagccaga tgcaggagca
     2101 cgtgagcctg cgcagcccca gcatccggaa cgtgcaggag ctcaaggaag ccatcaagag
     2161 cctgcagtgg atggcgggcg gcaccttcac gggggaggcc ctgcagtaca cgcgggacca
     2221 gctgctgccg cccagcccga acaaccgcat cgccctggtc atcactgacg ggcgctcaga
     2281 cactcagagg gacaccacac cgctcaacgt gctctgcagc cccggcatcc aggtggtctc
     2341 cgtgggcatc aaagacgtgt ttgacttcat cccaggctca gaccagctca atgtcatttc
     2401 ttgccaaggc ctggcaccat cccagggccg gcccggcctc tcgctggtca aggagaacta
     2461 tgcagagctg ctggaggatg ccttcctgaa gaatgtcacc gcccagatct gcatagacaa
     2521 gaagtgtcca gattacacct gccccatcac gttctcctcc ccggctgaca tcaccatcct
     2581 gctggacggc tccgccagcg tgggcagcca caactttgac accaccaagc gcttcgccaa
     2641 gcgcctggcc gagcgcttcc tcacagcggg caggacggac cccgcccacg acgtgcgggt
     2701 ggcggtggtg cagtacagcg gcacgggcca gcagcgccca gagcgggcgt cgctgcagtt
     2761 cctgcagaac tacacggccc tggccagtgc cgtcgatgcc atggacttta tcaacgacgc
     2821 caccgacgtc aacgatgccc tgggctatgt gacccgcttc taccgcgagg cctcgtccgg
     2881 cgctgccaag aagaggctgc tgctcttctc agatggcaac tcgcagggcg ccacgcccgc
     2941 tgccatcgag aaggccgtgc aggaagccca gcgggcaggc atcgagatct tcgtggtggt
     3001 cgtgggccgc caggtgaatg agccccacat ccgcgtcctg gtcaccggca agacggccga
     3061 gtacgacgtg gcctacggcg agagccacct gttccgtgtc cccagctacc aggccctgct
     3121 ccgcggtgtc ttccaccaga cagtctccag gaaggtggcg ctgggctagc ccaccctgca
     3181 cgccggcacc aaaccctgtc ctcccacccc tccccactca tcactaaaca gagtaaaatg
     3241 tgatgcgaat tttcccgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//