LOCUS BC054498 5060 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:57503
IMAGE:6474549), complete cds.
ACCESSION BC054498
VERSION BC054498.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5060)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5060)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-JUL-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 107 Row: p Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES Location/Qualifiers
source 1..5060
/db_xref="H-InvDB:HIT000054073"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:57503 IMAGE:6474549"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..5060
/gene="COL1A2"
/db_xref="GeneID:1278"
/db_xref="HGNC:HGNC:2198"
/db_xref="MIM:120160"
CDS 101..4201
/gene="COL1A2"
/codon_start=1
/product="collagen, type I, alpha 2"
/protein_id="AAH54498.1"
/db_xref="GeneID:1278"
/db_xref="HGNC:HGNC:2198"
/db_xref="MIM:120160"
/translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER
GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP
GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG
PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL
PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA
GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG
AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP
PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR
GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG
DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE
VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP
GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG
APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE
RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP
GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG
PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS
PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA
GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG
LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP
AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI
ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI
RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN
YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK"
BASE COUNT 1097 a 1278 c 1465 g 1220 t
ORIGIN
1 gtactggcca cgactgcatg cccgcgcccg ccaggtgata cctccgccgg tgacccaggg
61 gctctgcgac acaaggagtc tgcatgtcta agtgctagac atgctcagct ttgtggatac
121 gcggactttg ttgctgcttg cagtaacctt atgcctagca acatgccaat ctttacaaga
181 ggaaactgta agaaagggcc cagccggaga tagaggacca cgtggagaaa ggggtccacc
241 aggcccccca ggcagagatg gtgaagatgg tcccacaggc cctcctggtc cacctggtcc
301 tcctggcccc cctggtctcg gtgggaactt tgctgctcag tatgatggaa aaggagttgg
361 acttggccct ggaccaatgg gcttaatggg acctagaggc ccacctggtg cagctggagc
421 cccaggccct caaggtttcc aaggacctgc tggtgagcct ggtgaacctg gtcaaactgg
481 tcctgcaggt gctcgtggtc cagctggccc tcctggcaag gctggtgaag atggtcaccc
541 tggaaaaccc ggacgacctg gtgagagagg agttgttgga ccacagggtg ctcgtggttt
601 ccctggaact cctggacttc ctggcttcaa aggcattagg ggacacaatg gtctggatgg
661 attgaaggga cagcccggtg ctcctggtgt gaagggtgaa cctggtgccc ctggtgaaaa
721 tggaactcca ggtcaaacag gagcccgtgg gcttcctggt gagagaggac gtgttggtgc
781 ccctggccca gctggtgccc gtggcagtga tggaagtgtg ggtcccgtgg gtcctgctgg
841 tcccattggg tctgctggcc ctccaggctt cccaggtgcc cctggcccca agggtgaaat
901 tggagctgtt ggtaacgctg gtcctgctgg tcccgccggt ccccgtggtg aagtgggtct
961 tccaggcctc tccggccccg ttggacctcc tggtaatcct ggagcaaacg gccttactgg
1021 tgccaagggt gctgctggcc ttcccggcgt tgctggggct cccggcctcc ctggaccccg
1081 cggtattcct ggccctgttg gtgctgccgg tgctactggt gccagaggac ttgttggtga
1141 gcctggtcca gctggctcca aaggagagag cggtaacaag ggtgagcccg gctctgctgg
1201 gccccaaggt cctcctggtc ccagtggtga agaaggaaag agaggcccta atggggaagc
1261 tggatctgcc ggccctccag gacctcctgg gctgagaggt agtcctggtt ctcgtggtct
1321 tcctggagct gatggcagag ctggcgtcat gggccctcct ggtagtcgtg gtgcaagtgg
1381 ccctgctgga gtccgaggac ctaatggaga tgctggtcgc cctggggagc ctggtctcat
1441 gggacccaga ggtcttcctg gttcccctgg aaatatcggc cccgctggaa aagaaggtcc
1501 tgtcggcctc cctggcatcg acggcaggcc tggcccaatt ggcccagctg gagcaagagg
1561 agagcctggc aacattggat tccctggacc caaaggcccc actggtgatc ctggcaaaaa
1621 cggtgataaa ggtcatgctg gtcttgctgg tgctcggggt gctccaggtc ctgatggaaa
1681 caatggtgct cagggacctc ctggaccaca gggtgttcaa ggtggaaaag gtgaacaggg
1741 tcccgctggt cctccaggct tccagggtct gcctggcccc tcaggtcccg ctggtgaagt
1801 tggcaaacca ggagaaaggg gtctccatgg tgagtttggt ctccctggtc ctgctggtcc
1861 aagaggggaa cgcggtcccc caggtgagag tggtgctgcc ggtcctactg gtcctattgg
1921 aagccgaggt ccttctggac ccccagggcc tgatggaaac aagggtgaac ctggtgtggt
1981 tggtgctgtg ggcactgctg gtccatctgg tcctagtgga ctcccaggag agaggggtgc
2041 tgctggcata cctggaggca agggagaaaa gggtgaacct ggtctcagag gtgaaattgg
2101 taaccctggc agagatggtg ctcgtggtgc tcctggtgct gtaggtgccc ctggtcctgc
2161 tggagccaca ggtgaccggg gcgaagctgg ggctgctggt cctgctggtc ctgctggtcc
2221 tcggggaagc cctggtgaac gtggtgaggt cggtcctgct ggccccaatg gatttgctgg
2281 tcctgctggt gctgctggtc aacctggtgc taaaggagaa agaggagcca aagggcctaa
2341 gggtgaaaac ggtgttgttg gtcccacagg ccccgttgga gctgctggcc cagctggtcc
2401 aaatggtccc cccggtcctg ctggaagtcg tggtgatgga ggcccccctg gtatgactgg
2461 tttccctggt gctgctggac ggactggtcc cccaggaccc tctggtattt ctggccctcc
2521 tggtccccct ggtcctgctg ggaaagaagg gcttcgtggt cctcgtggtg accaaggtcc
2581 agttggccga actggagaag taggtgcagt tggtccccct ggcttcgctg gtgagaaggg
2641 tccctctgga gaggctggta ctgctggacc tcctggcact ccaggtcctc agggtcttct
2701 tggtgctcct ggtattctgg gtctccctgg ctcgagaggt gaacgtggtc taccaggtgt
2761 tgctggtgct gtgggtgaac ctggtcctct tggcattgcc ggccctcctg gggcccgtgg
2821 tcctcctggt gctgtgggta gtcctggagt caacggtgct cctggtgaag ctggtcgtga
2881 tggcaaccct gggaacgatg gtcccccagg tcgcgatggt caacccggac acaagggaga
2941 gcgcggttac cctggcaata ttggtcccgt tggtgctgca ggtgcacctg gtcctcatgg
3001 ccccgtgggt cctgctggca aacatggaaa ccgtggtgaa actggtcctt ctggtcctgt
3061 tggtcctgct ggtgctgttg gcccaagagg tcctagtggc ccacaaggca ttcgtggcga
3121 taagggagag cccggtgaaa aggggcccag aggtcttcct ggcttaaagg gacacaatgg
3181 attgcaaggt ctgcctggta tcgctggtca ccatggtgat caaggtgctc ctggctccgt
3241 gggtcctgct ggtcctaggg gccctgctgg tccttctggc cctgctggaa aagatggtcg
3301 cactggacat cctggtacag ttggacctgc tggcattcga ggccctcagg gtcaccaagg
3361 ccctgctggc ccccctggtc cccctggccc tcctggacct ccaggtgtaa gcggtggtgg
3421 ttatgacttt ggttacgatg gagacttcta cagggctgac cagcctcgct cagcaccttc
3481 tctcagaccc aaggactatg aagttgatgc tactctgaag tctctcaaca accagattga
3541 gacccttctt actcctgaag gctctagaaa gaacccagct cgcacatgcc gtgacttgag
3601 actcagccac ccagagtgga gcagtggtta ctactggatt gaccctaacc aaggatgcac
3661 tatggatgct atcaaagtat actgtgattt ctctactggc gaaacctgta tccgggccca
3721 acctgaaaac atcccagcca agaactggta taggagctcc aaggacaaga aacacgtctg
3781 gctaggagaa actatcaatg ctggcagcca gtttgaatat aatgtagaag gagtgacttc
3841 caaggaaatg gctacccaac ttgccttcat gcgcctgctg gccaactatg cctctcagaa
3901 catcacctac cactgcaaga acagcattgc atacatggat gaggagactg gcaacctgaa
3961 aaaggctgtc attctacagg gctctaatga tgttgaactt gttgctgagg gcaacagcag
4021 gttcacttac actgttcttg tagatggctg ctctaaaaag acaaatgaat ggggaaagac
4081 aatcattgaa tacaaaacaa ataagccatc acgcctgccc ttccttgata ttgcaccttt
4141 ggacatcggt ggtgctgacc aggaattctt tgtggacatt ggcccagtct gtttcaaata
4201 aatgaactca atctaaatta aaaaagaaag aaatttgaaa aaactttctc tttgccattt
4261 cttcttcttc ttttttaact gaaagctgaa tccttccatt tcttctgcac atctacttgc
4321 ttaaattgtg ggcaaaagag aaaaagaagg attgatcaga gcattgtgca atacagtttc
4381 attaactcct tcccccgctc ccccaaaaat ttgaattttt ttttcaacac tcttacacct
4441 gttatggaaa atgtcaacct ttgtaagaaa accaaaataa aaattgaaaa ataaaaacca
4501 taaacatttg caccacttgt ggcttttgaa tatcttccac agagggaagt ttaaaaccca
4561 aacttccaaa ggtttaaact acctcaaaac actttcccat gagtgtgatc cacattgtta
4621 ggtgctgacc tagacagaga tgaactgagg tccttgtttt gttttgttca taatacaaag
4681 gtgctaatta atagtatttc agatacttga agaatgttga tggtgctaga agaatttgag
4741 aagaaatact cctgtattga gttgtatcgt gtggtgtatt ttttaaaaaa tttgatttag
4801 cattcatatt ttccatctta ttcccaatta aaagtatgca gattatttgc ccaaatcttc
4861 ttcagattca gcatttgttc tttgccagtc tcattttcat cttcttccat ggttccacag
4921 aagctttgtt tcttgggcaa gcagaaaaat taaattgtac ctattttgta tatgtgagat
4981 gtttaaataa attgtgaaaa aaatgaaata aagcatgttt ggttttccaa aaaaaaaaaa
5041 aaaaaaaaaa aaaaaaaaaa
//