LOCUS       BC054498                5060 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:57503
            IMAGE:6474549), complete cds.
ACCESSION   BC054498
VERSION     BC054498.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5060)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 5060)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUL-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 107 Row: p Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES             Location/Qualifiers
     source          1..5060
                     /db_xref="H-InvDB:HIT000054073"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:57503 IMAGE:6474549"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..5060
                     /gene="COL1A2"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
     CDS             101..4201
                     /gene="COL1A2"
                     /codon_start=1
                     /product="collagen, type I, alpha 2"
                     /protein_id="AAH54498.1"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
                     /translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER
                     GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP
                     GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG
                     PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL
                     PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA
                     GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG
                     AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP
                     PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR
                     GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG
                     DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE
                     VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP
                     GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG
                     APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE
                     RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP
                     GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG
                     PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS
                     PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA
                     GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG
                     LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP
                     AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI
                     ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI
                     RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN
                     YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
                     TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK"
BASE COUNT         1097 a         1278 c         1465 g         1220 t
ORIGIN      
        1 gtactggcca cgactgcatg cccgcgcccg ccaggtgata cctccgccgg tgacccaggg
       61 gctctgcgac acaaggagtc tgcatgtcta agtgctagac atgctcagct ttgtggatac
      121 gcggactttg ttgctgcttg cagtaacctt atgcctagca acatgccaat ctttacaaga
      181 ggaaactgta agaaagggcc cagccggaga tagaggacca cgtggagaaa ggggtccacc
      241 aggcccccca ggcagagatg gtgaagatgg tcccacaggc cctcctggtc cacctggtcc
      301 tcctggcccc cctggtctcg gtgggaactt tgctgctcag tatgatggaa aaggagttgg
      361 acttggccct ggaccaatgg gcttaatggg acctagaggc ccacctggtg cagctggagc
      421 cccaggccct caaggtttcc aaggacctgc tggtgagcct ggtgaacctg gtcaaactgg
      481 tcctgcaggt gctcgtggtc cagctggccc tcctggcaag gctggtgaag atggtcaccc
      541 tggaaaaccc ggacgacctg gtgagagagg agttgttgga ccacagggtg ctcgtggttt
      601 ccctggaact cctggacttc ctggcttcaa aggcattagg ggacacaatg gtctggatgg
      661 attgaaggga cagcccggtg ctcctggtgt gaagggtgaa cctggtgccc ctggtgaaaa
      721 tggaactcca ggtcaaacag gagcccgtgg gcttcctggt gagagaggac gtgttggtgc
      781 ccctggccca gctggtgccc gtggcagtga tggaagtgtg ggtcccgtgg gtcctgctgg
      841 tcccattggg tctgctggcc ctccaggctt cccaggtgcc cctggcccca agggtgaaat
      901 tggagctgtt ggtaacgctg gtcctgctgg tcccgccggt ccccgtggtg aagtgggtct
      961 tccaggcctc tccggccccg ttggacctcc tggtaatcct ggagcaaacg gccttactgg
     1021 tgccaagggt gctgctggcc ttcccggcgt tgctggggct cccggcctcc ctggaccccg
     1081 cggtattcct ggccctgttg gtgctgccgg tgctactggt gccagaggac ttgttggtga
     1141 gcctggtcca gctggctcca aaggagagag cggtaacaag ggtgagcccg gctctgctgg
     1201 gccccaaggt cctcctggtc ccagtggtga agaaggaaag agaggcccta atggggaagc
     1261 tggatctgcc ggccctccag gacctcctgg gctgagaggt agtcctggtt ctcgtggtct
     1321 tcctggagct gatggcagag ctggcgtcat gggccctcct ggtagtcgtg gtgcaagtgg
     1381 ccctgctgga gtccgaggac ctaatggaga tgctggtcgc cctggggagc ctggtctcat
     1441 gggacccaga ggtcttcctg gttcccctgg aaatatcggc cccgctggaa aagaaggtcc
     1501 tgtcggcctc cctggcatcg acggcaggcc tggcccaatt ggcccagctg gagcaagagg
     1561 agagcctggc aacattggat tccctggacc caaaggcccc actggtgatc ctggcaaaaa
     1621 cggtgataaa ggtcatgctg gtcttgctgg tgctcggggt gctccaggtc ctgatggaaa
     1681 caatggtgct cagggacctc ctggaccaca gggtgttcaa ggtggaaaag gtgaacaggg
     1741 tcccgctggt cctccaggct tccagggtct gcctggcccc tcaggtcccg ctggtgaagt
     1801 tggcaaacca ggagaaaggg gtctccatgg tgagtttggt ctccctggtc ctgctggtcc
     1861 aagaggggaa cgcggtcccc caggtgagag tggtgctgcc ggtcctactg gtcctattgg
     1921 aagccgaggt ccttctggac ccccagggcc tgatggaaac aagggtgaac ctggtgtggt
     1981 tggtgctgtg ggcactgctg gtccatctgg tcctagtgga ctcccaggag agaggggtgc
     2041 tgctggcata cctggaggca agggagaaaa gggtgaacct ggtctcagag gtgaaattgg
     2101 taaccctggc agagatggtg ctcgtggtgc tcctggtgct gtaggtgccc ctggtcctgc
     2161 tggagccaca ggtgaccggg gcgaagctgg ggctgctggt cctgctggtc ctgctggtcc
     2221 tcggggaagc cctggtgaac gtggtgaggt cggtcctgct ggccccaatg gatttgctgg
     2281 tcctgctggt gctgctggtc aacctggtgc taaaggagaa agaggagcca aagggcctaa
     2341 gggtgaaaac ggtgttgttg gtcccacagg ccccgttgga gctgctggcc cagctggtcc
     2401 aaatggtccc cccggtcctg ctggaagtcg tggtgatgga ggcccccctg gtatgactgg
     2461 tttccctggt gctgctggac ggactggtcc cccaggaccc tctggtattt ctggccctcc
     2521 tggtccccct ggtcctgctg ggaaagaagg gcttcgtggt cctcgtggtg accaaggtcc
     2581 agttggccga actggagaag taggtgcagt tggtccccct ggcttcgctg gtgagaaggg
     2641 tccctctgga gaggctggta ctgctggacc tcctggcact ccaggtcctc agggtcttct
     2701 tggtgctcct ggtattctgg gtctccctgg ctcgagaggt gaacgtggtc taccaggtgt
     2761 tgctggtgct gtgggtgaac ctggtcctct tggcattgcc ggccctcctg gggcccgtgg
     2821 tcctcctggt gctgtgggta gtcctggagt caacggtgct cctggtgaag ctggtcgtga
     2881 tggcaaccct gggaacgatg gtcccccagg tcgcgatggt caacccggac acaagggaga
     2941 gcgcggttac cctggcaata ttggtcccgt tggtgctgca ggtgcacctg gtcctcatgg
     3001 ccccgtgggt cctgctggca aacatggaaa ccgtggtgaa actggtcctt ctggtcctgt
     3061 tggtcctgct ggtgctgttg gcccaagagg tcctagtggc ccacaaggca ttcgtggcga
     3121 taagggagag cccggtgaaa aggggcccag aggtcttcct ggcttaaagg gacacaatgg
     3181 attgcaaggt ctgcctggta tcgctggtca ccatggtgat caaggtgctc ctggctccgt
     3241 gggtcctgct ggtcctaggg gccctgctgg tccttctggc cctgctggaa aagatggtcg
     3301 cactggacat cctggtacag ttggacctgc tggcattcga ggccctcagg gtcaccaagg
     3361 ccctgctggc ccccctggtc cccctggccc tcctggacct ccaggtgtaa gcggtggtgg
     3421 ttatgacttt ggttacgatg gagacttcta cagggctgac cagcctcgct cagcaccttc
     3481 tctcagaccc aaggactatg aagttgatgc tactctgaag tctctcaaca accagattga
     3541 gacccttctt actcctgaag gctctagaaa gaacccagct cgcacatgcc gtgacttgag
     3601 actcagccac ccagagtgga gcagtggtta ctactggatt gaccctaacc aaggatgcac
     3661 tatggatgct atcaaagtat actgtgattt ctctactggc gaaacctgta tccgggccca
     3721 acctgaaaac atcccagcca agaactggta taggagctcc aaggacaaga aacacgtctg
     3781 gctaggagaa actatcaatg ctggcagcca gtttgaatat aatgtagaag gagtgacttc
     3841 caaggaaatg gctacccaac ttgccttcat gcgcctgctg gccaactatg cctctcagaa
     3901 catcacctac cactgcaaga acagcattgc atacatggat gaggagactg gcaacctgaa
     3961 aaaggctgtc attctacagg gctctaatga tgttgaactt gttgctgagg gcaacagcag
     4021 gttcacttac actgttcttg tagatggctg ctctaaaaag acaaatgaat ggggaaagac
     4081 aatcattgaa tacaaaacaa ataagccatc acgcctgccc ttccttgata ttgcaccttt
     4141 ggacatcggt ggtgctgacc aggaattctt tgtggacatt ggcccagtct gtttcaaata
     4201 aatgaactca atctaaatta aaaaagaaag aaatttgaaa aaactttctc tttgccattt
     4261 cttcttcttc ttttttaact gaaagctgaa tccttccatt tcttctgcac atctacttgc
     4321 ttaaattgtg ggcaaaagag aaaaagaagg attgatcaga gcattgtgca atacagtttc
     4381 attaactcct tcccccgctc ccccaaaaat ttgaattttt ttttcaacac tcttacacct
     4441 gttatggaaa atgtcaacct ttgtaagaaa accaaaataa aaattgaaaa ataaaaacca
     4501 taaacatttg caccacttgt ggcttttgaa tatcttccac agagggaagt ttaaaaccca
     4561 aacttccaaa ggtttaaact acctcaaaac actttcccat gagtgtgatc cacattgtta
     4621 ggtgctgacc tagacagaga tgaactgagg tccttgtttt gttttgttca taatacaaag
     4681 gtgctaatta atagtatttc agatacttga agaatgttga tggtgctaga agaatttgag
     4741 aagaaatact cctgtattga gttgtatcgt gtggtgtatt ttttaaaaaa tttgatttag
     4801 cattcatatt ttccatctta ttcccaatta aaagtatgca gattatttgc ccaaatcttc
     4861 ttcagattca gcatttgttc tttgccagtc tcattttcat cttcttccat ggttccacag
     4921 aagctttgtt tcttgggcaa gcagaaaaat taaattgtac ctattttgta tatgtgagat
     4981 gtttaaataa attgtgaaaa aaatgaaata aagcatgttt ggttttccaa aaaaaaaaaa
     5041 aaaaaaaaaa aaaaaaaaaa
//