LOCUS       BC036531                4752 bp    mRNA    linear   HUM 21-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 1, mRNA (cDNA clone MGC:33668
            IMAGE:5264710), complete cds.
ACCESSION   BC036531
VERSION     BC036531.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4752)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4752)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (16-AUG-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 25, 2003 this sequence version replaced BC036531.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 47 Row: i Column: 5
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 14719826.
FEATURES             Location/Qualifiers
     source          1..4752
                     /db_xref="H-InvDB:HIT000051732"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33668 IMAGE:5264710"
                     /tissue_type="Brain, hippocampus"
                     /clone_lib="NIH_MGC_95"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..4752
                     /gene="COL1A1"
                     /gene_synonym="OI4"
                     /db_xref="GeneID:1277"
                     /db_xref="HGNC:HGNC:2197"
                     /db_xref="MIM:120150"
     CDS             117..4511
                     /gene="COL1A1"
                     /gene_synonym="OI4"
                     /codon_start=1
                     /product="collagen, type I, alpha 1"
                     /protein_id="AAH36531.1"
                     /db_xref="GeneID:1277"
                     /db_xref="HGNC:HGNC:2197"
                     /db_xref="MIM:120150"
                     /translation="MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNG
                     LRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESP
                     TDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFA
                     PQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMG
                     PRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGA
                     KGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPP
                     GPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADG
                     QPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGE
                     PGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPA
                     GERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPG
                     PPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGP
                     AGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFP
                     GERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPG
                     PKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGA
                     PGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNV
                     GAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETG
                     PAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGF
                     PGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSP
                     GAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAG
                     PQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGS
                     AGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQP
                     PQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDL
                     KMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD
                     KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMD
                     QQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSR
                     LRIIDVAPLDVGAPDQEFGFDVGHVCFL"
BASE COUNT          821 a         1544 c         1485 g          902 t
ORIGIN      
        1 agacgggagt ttctcctcgg ggtcggagca ggaggcacgc ggagtgtgag gccacgcatg
       61 agcggacgct aaccccctcc ccagccacaa agagtctaca tgtctagggt ctagacatgt
      121 tcagctttgt ggacctccgg ctcctgctcc tcttagcggc caccgccctc ctgacgcacg
      181 gccaagagga aggccaagtc gagggccaag acgaagacat cccaccaatc acctgcgtac
      241 agaacggcct caggtaccat gaccgagacg tgtggaaacc cgagccctgc cggatctgcg
      301 tctgcgacaa cggcaaggtg ttgtgcgatg acgtgatctg tgacgagacc aagaactgcc
      361 ccggcgccga agtccccgag ggcgagtgct gtcccgtctg ccccgacggc tcagagtcac
      421 ccaccgacca agaaaccacc ggcgtcgagg gacccaaggg agacactggc ccccgaggcc
      481 caaggggacc cgcaggcccc cctggccgag atggcatccc tggacagcct ggacttcccg
      541 gaccccccgg accccccgga cctcccggac cccctggcct cggaggaaac tttgctcccc
      601 agctgtctta tggctatgat gagaaatcaa ccggaggaat ttccgtgcct ggccccatgg
      661 gtccctctgg tcctcgtggt ctccctggcc cccctggtgc acctggtccc caaggcttcc
      721 aaggtccccc tggtgagcct ggcgagcctg gagcttcagg tcccatgggt ccccgaggtc
      781 ccccaggtcc ccctggaaag aatggagatg atggggaagc tggaaaacct ggtcgtcctg
      841 gtgagcgtgg gcctcctggg cctcagggtg ctcgaggatt gcccggaaca gctggcctcc
      901 ctggaatgaa gggacacaga ggtttcagtg gtttggatgg tgccaaggga gatgctggtc
      961 ctgctggtcc taagggtgag cctggcagcc ctggtgaaaa tggagctcct ggtcagatgg
     1021 gcccccgtgg cctgcctggt gagagaggtc gccctggagc ccctggccct gctggtgctc
     1081 gtggaaatga tggtgctact ggtgctgccg ggccccctgg tcccaccggc cccgctggtc
     1141 ctcctggctt ccctggtgct gttggtgcta agggtgaagc tggtccccaa gggccccgag
     1201 gctctgaagg tccccagggt gtgcgtggtg agcctggccc ccctggccct gctggtgctg
     1261 ctggccctgc tggaaaccct ggtgctgatg gacagcctgg tgctaaaggt gccaatggtg
     1321 ctcctggtat tgctggtgct cctggcttcc ctggtgcccg aggcccctct ggaccccagg
     1381 gccccggcgg ccctcctggt cccaagggta acagcggtga acctggtgct cctggcagca
     1441 aaggagacac tggtgctaag ggagagcctg gccctgttgg tgttcaagga ccccctggcc
     1501 ctgctggaga ggaaggaaag cgaggagctc gaggtgaacc cggacccact ggcctgcccg
     1561 gaccccctgg cgagcgtggt ggacctggta gccgtggttt ccctggcgca gatggtgttg
     1621 ctggtcccaa gggtcccgct ggtgaacgtg gttctcctgg ccctgctggc cccaaaggat
     1681 ctcctggtga agctggtcgt cccggtgaag ctggtctgcc tggtgccaag ggtctgactg
     1741 gaagccctgg cagccctggt cctgatggca aaactggccc ccctggtccc gccggtcaag
     1801 atggtcgccc cggaccccca ggcccacctg gtgcccgtgg tcaggctggt gtgatgggat
     1861 tccctggacc taaaggtgct gctggagagc ccggcaaggc tggagagcga ggtgttcccg
     1921 gaccccctgg cgctgtcggt cctgctggca aagatggaga ggctggagct cagggacccc
     1981 ctggccctgc tggtcccgct ggcgagagag gtgaacaagg ccctgctggc tcccccggat
     2041 tccagggtct ccctggtcct gctggtcctc caggtgaagc aggcaaacct ggtgaacagg
     2101 gtgttcctgg agaccttggc gcccctggcc cctctggagc aagaggcgag agaggtttcc
     2161 ctggcgagcg tggtgtgcaa ggtccccctg gtcctgctgg tccccgaggg gccaacggtg
     2221 ctcccggcaa cgatggtgct aagggtgatg ctggtgcccc tggagctccc ggtagccagg
     2281 gcgcccctgg ccttcaggga atgcctggtg aacgtggtgc agctggtctt ccagggccta
     2341 agggtgacag aggtgatgct ggtcccaaag gtgctgatgg ctctcctggc aaagatggcg
     2401 tccgtggtct gaccggcccc attggtcctc ctggccctgc tggtgcccct ggtgacaagg
     2461 gtgaaagtgg tcccagcggc cctgctggtc ccactggagc tcgtggtgcc cccggagacc
     2521 gtggtgagcc tggtcccccc ggccctgctg gctttgctgg cccccctggt gctgacggcc
     2581 aacctggtgc taaaggcgaa cctggtgatg ctggtgctaa aggcgatgct ggtccccctg
     2641 gccctgccgg acccgctgga ccccctggcc ccattggtaa tgttggtgct cctggagcca
     2701 aaggtgctcg cggcagcgct ggtccccctg gtgctactgg tttccctggt gctgctggcc
     2761 gagtcggtcc tcctggcccc tctggaaatg ctggaccccc tggccctcct ggtcctgctg
     2821 gcaaagaagg cggcaaaggt ccccgtggtg agactggccc tgctggacgt cctggtgaag
     2881 ttggtccccc tggtccccct ggccctgctg gcgagaaagg atcccctggt gctgatggtc
     2941 ctgctggtgc tcctggtact cccgggcctc aaggtattgc tggacagcgt ggtgtggtcg
     3001 gcctgcctgg tcagagagga gagagaggct tccctggtct tcctggcccc tctggtgaac
     3061 ctggcaaaca aggtccctct ggagcaagtg gtgaacgtgg tccccctggt cccatgggcc
     3121 cccctggatt ggctggaccc cctggtgaat ctggacgtga gggggctcct ggtgccgaag
     3181 gttcccctgg acgagacggt tctcctggcg ccaagggtga ccgtggtgag accggccccg
     3241 ctggaccccc tggtgctcct ggtgctcctg gtgcccctgg ccccgttggc cctgctggca
     3301 agagtggtga tcgtggtgag actggtcctg ctggtcccgc cggtcctgtc ggccctgttg
     3361 gcgcccgtgg ccccgccgga ccccaaggcc cacgtggtga caagggtgag acaggcgaac
     3421 agggcgacag aggcataaag ggtcaccgtg gcttctctgg cctccagggt ccccctggcc
     3481 ctcctggctc tcctggtgaa caaggtccct ctggagcctc tggtcctgct ggtccccgag
     3541 gtccccctgg ctctgctggt gctcctggca aagatggact caacggtctc cctggcccca
     3601 ttgggccccc tggtcctcgc ggtcgcactg gtgatgctgg tcctgttggt ccccccggcc
     3661 ctcctggacc tcctggtccc cctggtcctc ccagcgctgg tttcgacttc agcttcctgc
     3721 cccagccacc tcaagagaag gctcacgatg gtggccgcta ctaccgggct gatgatgcca
     3781 atgtggttcg tgaccgtgac ctcgaggtgg acaccaccct caagagcctg agccagcaga
     3841 tcgagaacat ccggagccca gagggcagcc gcaagaaccc cgcccgcacc tgccgtgacc
     3901 tcaagatgtg ccactctgac tggaagagtg gagagtactg gattgacccc aaccaaggct
     3961 gcaacctgga tgccatcaaa gtcttctgca acatggagac tggtgagacc tgcgtgtacc
     4021 ccactcagcc cagtgtggcc cagaagaact ggtacatcag caagaacccc aaggacaaga
     4081 ggcatgtctg gttcggcgag agcatgaccg atggattcca gttcgagtat ggcggccagg
     4141 gctccgaccc tgccgatgtg gccatccagc tgaccttcct gcgcctgatg tccaccgagg
     4201 cctcccagaa catcacctac cactgcaaga acagcgtggc ctacatggac cagcagactg
     4261 gcaacctcaa gaaggccctg ctcctccagg gctccaacga gatcgagatc cgcgccgagg
     4321 gcaacagccg cttcacctac agcgtcactg tcgatggctg cacgagtcac accggagcct
     4381 ggggcaagac agtgattgaa tacaaaacca ccaagacctc ccgcctgcgc atcatcgatg
     4441 tggccccctt ggacgttggt gccccagacc aggaattcgg cttcgacgtt ggccatgtct
     4501 gcttcctgta aactccctcc atcccaacct ggctccctcc cacccaacca actttccccc
     4561 caacccggaa acagacaagc aacccaaact gaaccccctc aaaagccaaa aaatgggaga
     4621 caatttcaca tggactttgg aaaatatttt tttcctttgc attcatctct caaacttagt
     4681 ttttatcttt gaccaaccga acatgaccaa aaaccaaaag tgcattcaac cttaccaaaa
     4741 aaaaaaaaaa aa
//