LOCUS BC042586 4572 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:30044
IMAGE:4803351), complete cds.
ACCESSION BC042586
VERSION BC042586.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4572)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4572)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: James Cleaver, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 42 Row: f Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES Location/Qualifiers
source 1..4572
/db_xref="H-InvDB:HIT000259092"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:30044 IMAGE:4803351"
/tissue_type="Skin, normal"
/clone_lib="NCI_CGAP_Skn3"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6.1"
gene 1..4572
/gene="COL1A2"
/db_xref="GeneID:1278"
/db_xref="HGNC:HGNC:2198"
/db_xref="MIM:120160"
CDS 119..4219
/gene="COL1A2"
/codon_start=1
/product="collagen, type I, alpha 2"
/protein_id="AAH42586.1"
/db_xref="GeneID:1278"
/db_xref="HGNC:HGNC:2198"
/db_xref="MIM:120160"
/translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER
GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP
GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG
PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL
PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA
GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG
AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP
PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR
GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG
DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE
VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP
GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG
APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE
RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP
GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG
PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS
PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA
GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG
LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP
AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI
ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI
RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN
YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK"
BASE COUNT 955 a 1197 c 1382 g 1038 t
ORIGIN
1 gtttcggcta agttggaggt actggccacg actgcatgcc cgcgcccgcc aggtgatacc
61 tccgccggtg acccaggggc tctgcgacac aaggagtctg catgtctaag tgctagacat
121 gctcagcttt gtggatacgc ggactttgtt gctgcttgca gtaaccttat gcctagcaac
181 atgccaatct ttacaagagg aaactgtaag aaagggccca gccggagata gaggaccacg
241 tggagaaagg ggtccaccag gccccccagg cagagatggt gaagatggtc ccacaggccc
301 tcctggtcca cctggtcctc ctggcccccc tggtctcggt gggaactttg ctgctcagta
361 tgatggaaaa ggagttggac ttggccctgg accaatgggc ttaatgggac ctagaggccc
421 acctggtgca gctggagccc caggccctca aggtttccaa ggacctgctg gtgagcctgg
481 tgaacctggt caaactggtc ctgcaggtgc tcgtggtcca gctggccctc ctggcaaggc
541 tggtgaagat ggtcaccctg gaaaacccgg acgacctggt gagagaggag ttgttggacc
601 acagggtgct cgtggtttcc ctggaactcc tggacttcct ggcttcaaag gcattagggg
661 acacaatggt ctggatggat tgaagggaca gcccggtgct cctggtgtga agggtgaacc
721 tggtgcccct ggtgaaaatg gaactccagg tcaaacagga gcccgtgggc ttcctggtga
781 gagaggacgt gttggtgccc ctggcccagc tggtgcccgt ggcagtgatg gaagtgtggg
841 tcccgtgggt cctgctggtc ccattgggtc tgctggccct ccaggcttcc caggtgcccc
901 tggccccaag ggtgaaattg gagctgttgg taacgctggt cctgctggtc ccgccggtcc
961 ccgtggtgaa gtgggtcttc caggcctctc cggccccgtt ggacctcctg gtaatcctgg
1021 agcaaacggc cttactggtg ccaagggtgc tgctggcctt cccggcgttg ctggggctcc
1081 cggcctccct ggaccccgcg gtattcctgg ccctgttggt gctgccggtg ctactggtgc
1141 cagaggactt gttggtgagc ctggtccagc tggctccaaa ggagagagcg gtaacaaggg
1201 tgagcccggc tctgctgggc cccaaggtcc tcctggtccc agtggtgaag aaggaaagag
1261 aggccctaat ggggaagctg gatctgccgg ccctccagga cctcctgggc tgagaggtag
1321 tcctggttct cgtggtcttc ctggagctga tggcagagct ggcgtcatgg gccctcctgg
1381 tagtcgtggt gcaagtggcc ctgctggagt ccgaggacct aatggagatg ctggtcgccc
1441 tggggagcct ggtctcatgg gacccagagg tcttcctggt tcccctggaa atatcggccc
1501 cgctggaaaa gaaggtcctg tcggcctccc tggcatcgac ggcaggcctg gcccaattgg
1561 cccagctgga gcaagaggag agcctggcaa cattggattc cctggaccca aaggccccac
1621 tggtgatcct ggcaaaaacg gtgataaagg tcatgctggt cttgctggtg ctcggggtgc
1681 tccaggtcct gatggaaaca atggtgctca gggacctcct ggaccacagg gtgttcaagg
1741 tggaaaaggt gaacagggtc ccgctggtcc tccaggcttc cagggtctgc ctggcccctc
1801 aggtcccgct ggtgaagttg gcaaaccagg agaaaggggt ctccatggtg agtttggtct
1861 ccctggtcct gctggtccaa gaggggaacg cggtccccca ggtgagagtg gtgctgccgg
1921 tcctactggt cctattggaa gccgaggtcc ttctggaccc ccagggcctg atggaaacaa
1981 gggtgaacct ggtgtggttg gtgctgtggg cactgctggt ccatctggtc ctagtggact
2041 cccaggagag aggggtgctg ctggcatacc tggaggcaag ggagaaaagg gtgaacctgg
2101 tctcagaggt gaaattggta accctggcag agatggtgct cgtggtgctc ctggtgctgt
2161 aggtgcccct ggtcctgctg gagccacagg tgaccggggc gaagctgggg ctgctggtcc
2221 tgctggtcct gctggtcctc ggggaagccc tggtgaacgt ggtgaggtcg gtcctgctgg
2281 ccccaatgga tttgctggtc ctgctggtgc tgctggtcaa cctggtgcta aaggagaaag
2341 aggagccaaa gggcctaagg gtgaaaacgg tgttgttggt cccacaggcc ccgttggagc
2401 tgctggccca gctggtccaa atggtccccc cggtcctgct ggaagtcgtg gtgatggagg
2461 cccccctggt atgactggtt tccctggtgc tgctggacgg actggtcccc caggaccctc
2521 tggtatttct ggccctcctg gtccccctgg tcctgctggg aaagaagggc ttcgtggtcc
2581 tcgtggtgac caaggtccag ttggccgaac tggagaagta ggtgcagttg gtccccctgg
2641 cttcgctggt gagaagggtc cctctggaga ggctggtact gctggacctc ctggcactcc
2701 aggtcctcag ggtcttcttg gtgctcctgg tattctgggt ctccctggct cgagaggtga
2761 acgtggtcta ccaggtgttg ctggtgctgt gggtgaacct ggtcctcttg gcattgccgg
2821 ccctcctggg gcccgtggtc ctcctggtgc tgtgggtagt cctggagtca acggtgctcc
2881 tggtgaagct ggtcgtgatg gcaaccctgg gaacgatggt cccccaggtc gcgatggtca
2941 acccggacac aagggagagc gcggttaccc tggcaatatt ggtcccgttg gtgctgcagg
3001 tgcacctggt cctcatggcc ccgtgggtcc tgctggcaaa catggaaacc gtggtgaaac
3061 tggtccttct ggtcctgttg gtcctgctgg tgctgttggc ccaagaggtc ctagtggccc
3121 acaaggcatt cgtggcgata agggagagcc cggtgaaaag gggcccagag gtcttcctgg
3181 cttaaaggga cacaatggat tgcaaggtct gcctggtatc gctggtcacc atggtgatca
3241 aggtgctcct ggctccgtgg gtcctgctgg tcctaggggc cctgctggtc cttctggccc
3301 tgctggaaaa gatggtcgca ctggacatcc tggtacagtt ggacctgctg gcattcgagg
3361 ccctcagggt caccaaggcc ctgctggccc ccctggtccc cctggccctc ctggacctcc
3421 aggtgtaagc ggtggtggtt atgactttgg ttacgatgga gacttctaca gggctgacca
3481 gcctcgctca gcaccttctc tcagacccaa ggactatgaa gttgatgcta ctctgaagtc
3541 tctcaacaac cagattgaga cccttcttac tcctgaaggc tctagaaaga acccagctcg
3601 cacatgccgt gacttgagac tcagccaccc agagtggagc agtggttact actggattga
3661 ccctaaccaa ggatgcacta tggatgctat caaagtatac tgtgatttct ctactggcga
3721 aacctgtatc cgggcccaac ctgaaaacat cccagccaag aactggtata ggagctccaa
3781 ggacaagaaa cacgtctggc taggagaaac tatcaatgct ggcagccagt ttgaatataa
3841 tgtagaagga gtgacttcca aggaaatggc tacccaactt gccttcatgc gcctgctggc
3901 caactatgcc tctcagaaca tcacctacca ctgcaagaac agcattgcat acatggatga
3961 ggagactggc aacctgaaaa aggctgtcat tctacagggc tctaatgatg ttgaacttgt
4021 tgctgagggc aacagcaggt tcacttacac tgttcttgta gatggctgct ctaaaaagac
4081 aaatgaatgg ggaaagacaa tcattgaata caaaacaaat aagccatcac gcctgccctt
4141 ccttgatatt gcacctttgg acatcggtgg tgctgaccag gaattctttg tggacattgg
4201 cccagtctgt ttcaaataaa tgaactcaat ctaaattaaa aaagaaagaa atttgaaaaa
4261 actttctctt tgccatttct tcttcttctt ttttaactga aagctgaatc cttccatttc
4321 ttctgcacat ctacttgctt aaattgtggg caaaagagaa aaagaaggat tgatcagagc
4381 attgtgcaat acagtttcat taactccttc cctcgctccc ccaaaaattt gaattttttt
4441 ttcaacactc ttacacctgt tatggaaaat gtcaaccttt gtaagaaaac caaaataaaa
4501 attgaaaaat aaaaaccata aacatttgca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
4561 aaaaaaaaaa aa
//