LOCUS BC052575 3331 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens collagen, type VI, alpha 1, mRNA (cDNA clone MGC:59702
IMAGE:6598940), complete cds.
ACCESSION BC052575
VERSION BC052575.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3331)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3331)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (16-MAY-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 47 Row: o Column: 14
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 15011912.
FEATURES Location/Qualifiers
source 1..3331
/db_xref="H-InvDB:HIT000053797"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:59702 IMAGE:6598940"
/tissue_type="Ovary, teratocarcinoma"
/clone_lib="NIH_MGC_109"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3331
/gene="COL6A1"
/gene_synonym="OPLL"
/db_xref="GeneID:1291"
/db_xref="HGNC:HGNC:2211"
/db_xref="MIM:120220"
CDS 83..3169
/gene="COL6A1"
/gene_synonym="OPLL"
/codon_start=1
/product="collagen, type VI, alpha 1"
/protein_id="AAH52575.1"
/db_xref="GeneID:1291"
/db_xref="HGNC:HGNC:2211"
/db_xref="MIM:120220"
/translation="MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDT
SESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQG
LTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGH
PLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADW
GQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGL
PGEKGEAGDPGRPGDLGPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVK
GEMGYPGLPGCKGSPGFDGIQGPPGPKGDPGAFGLKGEKGEPGADGEAGRPGSSGPSG
DEGQPGEPGPPGEKGEAGDEGNPGPDGAPGERGGPGERGPRGTPGTRGPRGDPGEAGP
QGDQGREGPVGVPGDPGEAGPIGPKGYRGDEGPPGSEGARGAPGPAGPPGDPGLMGER
GEDGPAGNGTEGFPGFPGYPGNRGAPGINGTKGYPGLKGDEGEAGDPGDDNNDIAPRG
VKGAKGYRGPEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSE
SIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIR
NVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTP
LNVLCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLE
DAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLA
ERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDAT
DVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVV
VVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG"
BASE COUNT 754 a 1019 c 1100 g 458 t
ORIGIN
1 cagaaggcag cctcggtctc tgggcggcgg cggcggccca ctctgccctg gccgcgctgt
61 gtggtgaccg caggcccgag acatgagggc ggcccgtgct ctgctgcccc tgctgctgca
121 ggcctgctgg acagccgcgc aggatgagcc ggagaccccg agggccgtgg ccttccagga
181 ctgccccgtg gacctgttct ttgtgctgga cacctctgag agcgtggccc tgaggctgaa
241 gccctacggg gccctcgtgg acaaagtcaa gtccttcacc aagcgcttca tcgacaacct
301 gagggacagg tactaccgct gtgaccgaaa cctggtgtgg aacgcaggcg cgctgcacta
361 cagtgacgag gtggagatca tccaaggcct cacgcgcatg cctggcggcc gcgacgcact
421 caaaagcagc gtggacgcgg tcaagtactt tgggaagggc acctacaccg actgcgctat
481 caagaagggg ctggagcagc tcctcgtggg gggctcccac ctgaaggaga ataagtacct
541 gattgtggtg accgacgggc accccctgga gggctacaag gaaccctgtg gggggctgga
601 ggatgctgtg aacgaggcca agcacctggg cgtcaaagtc ttctcggtgg ccatcacacc
661 cgaccacctg gagccgcgtc tgagcatcat cgccacggac cacacgtacc ggcgcaactt
721 cacggcggct gactggggcc agagccgcga cgcagaggag gccatcagcc agaccatcga
781 caccatcgtg gacatgatca aaaataacgt ggagcaagtg tgctgctcct tcgaatgcca
841 gcctgcaaga ggacctccgg ggctccgggg cgaccccggc tttgagggag aacgaggcaa
901 gccggggctc ccaggagaga agggagaagc cggagatcct ggaagacccg gggacctcgg
961 acctgttggg taccagggaa tgaagggaga aaaagggagc cgtggggaga agggctccag
1021 gggacccaag ggctacaagg gagagaaggg caagcgtggc atcgacgggg tggacggcgt
1081 gaagggggag atggggtacc caggcctgcc aggctgcaag ggctcgcccg ggtttgacgg
1141 cattcaagga ccccctggcc ccaagggaga ccccggcgcc tttggactga aaggagaaaa
1201 gggcgagcct ggagctgacg gggaggcggg gagaccaggg agctcgggac catctggaga
1261 cgagggccag ccgggagagc ctgggccccc cggagagaaa ggagaggcgg gcgacgaggg
1321 gaacccagga cctgacggtg cccccgggga gcggggtggc cctggagaga gaggaccacg
1381 ggggacccca ggcacacggg gaccaagagg agaccctggt gaagctggcc cgcagggtga
1441 tcagggaaga gaaggccccg ttggtgtccc tggagacccg ggcgaggctg gccctatcgg
1501 acctaaaggc taccgaggcg atgagggtcc cccagggtcc gagggtgcca gaggagcccc
1561 aggacctgcc ggaccccctg gagacccggg gctgatgggt gaaaggggag aagacggccc
1621 cgctggaaat ggcaccgagg gcttccccgg cttccccggg tatccgggca acaggggcgc
1681 tcccgggata aacggcacga agggctaccc cggcctcaag ggggacgagg gagaagccgg
1741 ggaccccgga gacgataaca acgacattgc accccgagga gtcaaaggag caaaggggta
1801 ccggggtccc gagggccccc agggaccccc aggacaccaa ggaccgcctg ggccggacga
1861 atgcgagatt ttggacatca tcatgaaaat gtgctcttgc tgtgaatgca agtgcggccc
1921 catcgacctc ctgttcgtgc tggacagctc agagagcatt ggcctgcaga acttcgagat
1981 tgccaaggac ttcgtcgtca aggtcatcga ccggctgagc cgggacgagc tggtcaagtt
2041 cgagccaggg cagtcgtacg cgggtgtggt gcagtacagc cacagccaga tgcaggagca
2101 cgtgagcctg cgcagcccca gcatccggaa cgtgcaggag ctcaaggaag ccatcaagag
2161 cctgcagtgg atggcgggcg gcaccttcac gggggaggcc ctgcagtaca cgcgggacca
2221 gctgctgccg cccagcccga acaaccgcat cgccctggtc atcactgacg ggcgctcaga
2281 cactcagagg gacaccacac cgctcaacgt gctctgcagc cccggcatcc aggtggtctc
2341 cgtgggcatc aaagacgtgt ttgacttcat cccaggctca gaccagctca atgtcatttc
2401 ttgccaaggc ctggcaccat cccagggccg gcccggcctc tcgctggtca aggagaacta
2461 tgcagagctg ctggaggatg ccttcctgaa gaatgtcacc gcccagatct gcatagacaa
2521 gaagtgtcca gattacacct gccccatcac gttctcctcc ccggctgaca tcaccatcct
2581 gctggacggc tccgccagcg tgggcagcca caactttgac accaccaagc gcttcgccaa
2641 gcgcctggcc gagcgcttcc tcacagcggg caggacggac cccgcccacg acgtgcgggt
2701 ggcggtggtg cagtacagcg gcacgggcca gcagcgccca gagcgggcgt cgctgcagtt
2761 cctgcagaac tacacggccc tggccagtgc cgtcgatgcc atggacttta tcaacgacgc
2821 caccgacgtc aacgatgccc tgggctatgt gacccgcttc taccgcgagg cctcgtccgg
2881 cgctgccaag aagaggctgc tgctcttctc agatggcaac tcgcagggcg ccacgcccgc
2941 tgccatcgag aaggccgtgc aggaagccca gcgggcaggc atcgagatct tcgtggtggt
3001 cgtgggccgc caggtgaatg agccccacat ccgcgtcctg gtcaccggca agacggccga
3061 gtacgacgtg gcctacggcg agagccacct gttccgtgtc cccagctacc aggccctgct
3121 ccgcggtgtc ttccaccaga cagtctccag gaaggtggcg ctgggctagc ccaccctgca
3181 cgccggcacc aaaccctgtc ctcccacccc tccccactca tcactaaaca gagtaaaatg
3241 tgatgcgaat tttcccgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//