LOCUS       BC042586                4572 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:30044
            IMAGE:4803351), complete cds.
ACCESSION   BC042586
VERSION     BC042586.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4572)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4572)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: James Cleaver, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 42 Row: f Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES             Location/Qualifiers
     source          1..4572
                     /db_xref="H-InvDB:HIT000259092"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:30044 IMAGE:4803351"
                     /tissue_type="Skin, normal"
                     /clone_lib="NCI_CGAP_Skn3"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6.1"
     gene            1..4572
                     /gene="COL1A2"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
     CDS             119..4219
                     /gene="COL1A2"
                     /codon_start=1
                     /product="collagen, type I, alpha 2"
                     /protein_id="AAH42586.1"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
                     /translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER
                     GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP
                     GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG
                     PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL
                     PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA
                     GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG
                     AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP
                     PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR
                     GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG
                     DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE
                     VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP
                     GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG
                     APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE
                     RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP
                     GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG
                     PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS
                     PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA
                     GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG
                     LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP
                     AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI
                     ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI
                     RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN
                     YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
                     TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK"
BASE COUNT          955 a         1197 c         1382 g         1038 t
ORIGIN      
        1 gtttcggcta agttggaggt actggccacg actgcatgcc cgcgcccgcc aggtgatacc
       61 tccgccggtg acccaggggc tctgcgacac aaggagtctg catgtctaag tgctagacat
      121 gctcagcttt gtggatacgc ggactttgtt gctgcttgca gtaaccttat gcctagcaac
      181 atgccaatct ttacaagagg aaactgtaag aaagggccca gccggagata gaggaccacg
      241 tggagaaagg ggtccaccag gccccccagg cagagatggt gaagatggtc ccacaggccc
      301 tcctggtcca cctggtcctc ctggcccccc tggtctcggt gggaactttg ctgctcagta
      361 tgatggaaaa ggagttggac ttggccctgg accaatgggc ttaatgggac ctagaggccc
      421 acctggtgca gctggagccc caggccctca aggtttccaa ggacctgctg gtgagcctgg
      481 tgaacctggt caaactggtc ctgcaggtgc tcgtggtcca gctggccctc ctggcaaggc
      541 tggtgaagat ggtcaccctg gaaaacccgg acgacctggt gagagaggag ttgttggacc
      601 acagggtgct cgtggtttcc ctggaactcc tggacttcct ggcttcaaag gcattagggg
      661 acacaatggt ctggatggat tgaagggaca gcccggtgct cctggtgtga agggtgaacc
      721 tggtgcccct ggtgaaaatg gaactccagg tcaaacagga gcccgtgggc ttcctggtga
      781 gagaggacgt gttggtgccc ctggcccagc tggtgcccgt ggcagtgatg gaagtgtggg
      841 tcccgtgggt cctgctggtc ccattgggtc tgctggccct ccaggcttcc caggtgcccc
      901 tggccccaag ggtgaaattg gagctgttgg taacgctggt cctgctggtc ccgccggtcc
      961 ccgtggtgaa gtgggtcttc caggcctctc cggccccgtt ggacctcctg gtaatcctgg
     1021 agcaaacggc cttactggtg ccaagggtgc tgctggcctt cccggcgttg ctggggctcc
     1081 cggcctccct ggaccccgcg gtattcctgg ccctgttggt gctgccggtg ctactggtgc
     1141 cagaggactt gttggtgagc ctggtccagc tggctccaaa ggagagagcg gtaacaaggg
     1201 tgagcccggc tctgctgggc cccaaggtcc tcctggtccc agtggtgaag aaggaaagag
     1261 aggccctaat ggggaagctg gatctgccgg ccctccagga cctcctgggc tgagaggtag
     1321 tcctggttct cgtggtcttc ctggagctga tggcagagct ggcgtcatgg gccctcctgg
     1381 tagtcgtggt gcaagtggcc ctgctggagt ccgaggacct aatggagatg ctggtcgccc
     1441 tggggagcct ggtctcatgg gacccagagg tcttcctggt tcccctggaa atatcggccc
     1501 cgctggaaaa gaaggtcctg tcggcctccc tggcatcgac ggcaggcctg gcccaattgg
     1561 cccagctgga gcaagaggag agcctggcaa cattggattc cctggaccca aaggccccac
     1621 tggtgatcct ggcaaaaacg gtgataaagg tcatgctggt cttgctggtg ctcggggtgc
     1681 tccaggtcct gatggaaaca atggtgctca gggacctcct ggaccacagg gtgttcaagg
     1741 tggaaaaggt gaacagggtc ccgctggtcc tccaggcttc cagggtctgc ctggcccctc
     1801 aggtcccgct ggtgaagttg gcaaaccagg agaaaggggt ctccatggtg agtttggtct
     1861 ccctggtcct gctggtccaa gaggggaacg cggtccccca ggtgagagtg gtgctgccgg
     1921 tcctactggt cctattggaa gccgaggtcc ttctggaccc ccagggcctg atggaaacaa
     1981 gggtgaacct ggtgtggttg gtgctgtggg cactgctggt ccatctggtc ctagtggact
     2041 cccaggagag aggggtgctg ctggcatacc tggaggcaag ggagaaaagg gtgaacctgg
     2101 tctcagaggt gaaattggta accctggcag agatggtgct cgtggtgctc ctggtgctgt
     2161 aggtgcccct ggtcctgctg gagccacagg tgaccggggc gaagctgggg ctgctggtcc
     2221 tgctggtcct gctggtcctc ggggaagccc tggtgaacgt ggtgaggtcg gtcctgctgg
     2281 ccccaatgga tttgctggtc ctgctggtgc tgctggtcaa cctggtgcta aaggagaaag
     2341 aggagccaaa gggcctaagg gtgaaaacgg tgttgttggt cccacaggcc ccgttggagc
     2401 tgctggccca gctggtccaa atggtccccc cggtcctgct ggaagtcgtg gtgatggagg
     2461 cccccctggt atgactggtt tccctggtgc tgctggacgg actggtcccc caggaccctc
     2521 tggtatttct ggccctcctg gtccccctgg tcctgctggg aaagaagggc ttcgtggtcc
     2581 tcgtggtgac caaggtccag ttggccgaac tggagaagta ggtgcagttg gtccccctgg
     2641 cttcgctggt gagaagggtc cctctggaga ggctggtact gctggacctc ctggcactcc
     2701 aggtcctcag ggtcttcttg gtgctcctgg tattctgggt ctccctggct cgagaggtga
     2761 acgtggtcta ccaggtgttg ctggtgctgt gggtgaacct ggtcctcttg gcattgccgg
     2821 ccctcctggg gcccgtggtc ctcctggtgc tgtgggtagt cctggagtca acggtgctcc
     2881 tggtgaagct ggtcgtgatg gcaaccctgg gaacgatggt cccccaggtc gcgatggtca
     2941 acccggacac aagggagagc gcggttaccc tggcaatatt ggtcccgttg gtgctgcagg
     3001 tgcacctggt cctcatggcc ccgtgggtcc tgctggcaaa catggaaacc gtggtgaaac
     3061 tggtccttct ggtcctgttg gtcctgctgg tgctgttggc ccaagaggtc ctagtggccc
     3121 acaaggcatt cgtggcgata agggagagcc cggtgaaaag gggcccagag gtcttcctgg
     3181 cttaaaggga cacaatggat tgcaaggtct gcctggtatc gctggtcacc atggtgatca
     3241 aggtgctcct ggctccgtgg gtcctgctgg tcctaggggc cctgctggtc cttctggccc
     3301 tgctggaaaa gatggtcgca ctggacatcc tggtacagtt ggacctgctg gcattcgagg
     3361 ccctcagggt caccaaggcc ctgctggccc ccctggtccc cctggccctc ctggacctcc
     3421 aggtgtaagc ggtggtggtt atgactttgg ttacgatgga gacttctaca gggctgacca
     3481 gcctcgctca gcaccttctc tcagacccaa ggactatgaa gttgatgcta ctctgaagtc
     3541 tctcaacaac cagattgaga cccttcttac tcctgaaggc tctagaaaga acccagctcg
     3601 cacatgccgt gacttgagac tcagccaccc agagtggagc agtggttact actggattga
     3661 ccctaaccaa ggatgcacta tggatgctat caaagtatac tgtgatttct ctactggcga
     3721 aacctgtatc cgggcccaac ctgaaaacat cccagccaag aactggtata ggagctccaa
     3781 ggacaagaaa cacgtctggc taggagaaac tatcaatgct ggcagccagt ttgaatataa
     3841 tgtagaagga gtgacttcca aggaaatggc tacccaactt gccttcatgc gcctgctggc
     3901 caactatgcc tctcagaaca tcacctacca ctgcaagaac agcattgcat acatggatga
     3961 ggagactggc aacctgaaaa aggctgtcat tctacagggc tctaatgatg ttgaacttgt
     4021 tgctgagggc aacagcaggt tcacttacac tgttcttgta gatggctgct ctaaaaagac
     4081 aaatgaatgg ggaaagacaa tcattgaata caaaacaaat aagccatcac gcctgccctt
     4141 ccttgatatt gcacctttgg acatcggtgg tgctgaccag gaattctttg tggacattgg
     4201 cccagtctgt ttcaaataaa tgaactcaat ctaaattaaa aaagaaagaa atttgaaaaa
     4261 actttctctt tgccatttct tcttcttctt ttttaactga aagctgaatc cttccatttc
     4321 ttctgcacat ctacttgctt aaattgtggg caaaagagaa aaagaaggat tgatcagagc
     4381 attgtgcaat acagtttcat taactccttc cctcgctccc ccaaaaattt gaattttttt
     4441 ttcaacactc ttacacctgt tatggaaaat gtcaaccttt gtaagaaaac caaaataaaa
     4501 attgaaaaat aaaaaccata aacatttgca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     4561 aaaaaaaaaa aa
//