LOCUS BC036531 4752 bp mRNA linear HUM 21-JUL-2006 DEFINITION Homo sapiens collagen, type I, alpha 1, mRNA (cDNA clone MGC:33668 IMAGE:5264710), complete cds. ACCESSION BC036531 VERSION BC036531.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4752) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4752) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (16-AUG-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 25, 2003 this sequence version replaced BC036531.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 47 Row: i Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 14719826. FEATURES Location/Qualifiers source 1..4752 /db_xref="H-InvDB:HIT000051732" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:33668 IMAGE:5264710" /tissue_type="Brain, hippocampus" /clone_lib="NIH_MGC_95" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..4752 /gene="COL1A1" /gene_synonym="OI4" /db_xref="GeneID:1277" /db_xref="HGNC:HGNC:2197" /db_xref="MIM:120150" CDS 117..4511 /gene="COL1A1" /gene_synonym="OI4" /codon_start=1 /product="collagen, type I, alpha 1" /protein_id="AAH36531.1" /db_xref="GeneID:1277" /db_xref="HGNC:HGNC:2197" /db_xref="MIM:120150" /translation="MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNG LRYHDRDVWKPEPCRICVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESP TDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFA PQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMG PRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGA KGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPP GPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNPGADG QPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTGAKGE PGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPA GERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPG PPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGP AGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFP GERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPG PKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGA PGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNV GAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETG PAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGF PGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSP GAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAG PQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGS AGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAGFDFSFLPQP PQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDL KMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMD QQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSR LRIIDVAPLDVGAPDQEFGFDVGHVCFL" BASE COUNT 821 a 1544 c 1485 g 902 t ORIGIN 1 agacgggagt ttctcctcgg ggtcggagca ggaggcacgc ggagtgtgag gccacgcatg 61 agcggacgct aaccccctcc ccagccacaa agagtctaca tgtctagggt ctagacatgt 121 tcagctttgt ggacctccgg ctcctgctcc tcttagcggc caccgccctc ctgacgcacg 181 gccaagagga aggccaagtc gagggccaag acgaagacat cccaccaatc acctgcgtac 241 agaacggcct caggtaccat gaccgagacg tgtggaaacc cgagccctgc cggatctgcg 301 tctgcgacaa cggcaaggtg ttgtgcgatg acgtgatctg tgacgagacc aagaactgcc 361 ccggcgccga agtccccgag ggcgagtgct gtcccgtctg ccccgacggc tcagagtcac 421 ccaccgacca agaaaccacc ggcgtcgagg gacccaaggg agacactggc ccccgaggcc 481 caaggggacc cgcaggcccc cctggccgag atggcatccc tggacagcct ggacttcccg 541 gaccccccgg accccccgga cctcccggac cccctggcct cggaggaaac tttgctcccc 601 agctgtctta tggctatgat gagaaatcaa ccggaggaat ttccgtgcct ggccccatgg 661 gtccctctgg tcctcgtggt ctccctggcc cccctggtgc acctggtccc caaggcttcc 721 aaggtccccc tggtgagcct ggcgagcctg gagcttcagg tcccatgggt ccccgaggtc 781 ccccaggtcc ccctggaaag aatggagatg atggggaagc tggaaaacct ggtcgtcctg 841 gtgagcgtgg gcctcctggg cctcagggtg ctcgaggatt gcccggaaca gctggcctcc 901 ctggaatgaa gggacacaga ggtttcagtg gtttggatgg tgccaaggga gatgctggtc 961 ctgctggtcc taagggtgag cctggcagcc ctggtgaaaa tggagctcct ggtcagatgg 1021 gcccccgtgg cctgcctggt gagagaggtc gccctggagc ccctggccct gctggtgctc 1081 gtggaaatga tggtgctact ggtgctgccg ggccccctgg tcccaccggc cccgctggtc 1141 ctcctggctt ccctggtgct gttggtgcta agggtgaagc tggtccccaa gggccccgag 1201 gctctgaagg tccccagggt gtgcgtggtg agcctggccc ccctggccct gctggtgctg 1261 ctggccctgc tggaaaccct ggtgctgatg gacagcctgg tgctaaaggt gccaatggtg 1321 ctcctggtat tgctggtgct cctggcttcc ctggtgcccg aggcccctct ggaccccagg 1381 gccccggcgg ccctcctggt cccaagggta acagcggtga acctggtgct cctggcagca 1441 aaggagacac tggtgctaag ggagagcctg gccctgttgg tgttcaagga ccccctggcc 1501 ctgctggaga ggaaggaaag cgaggagctc gaggtgaacc cggacccact ggcctgcccg 1561 gaccccctgg cgagcgtggt ggacctggta gccgtggttt ccctggcgca gatggtgttg 1621 ctggtcccaa gggtcccgct ggtgaacgtg gttctcctgg ccctgctggc cccaaaggat 1681 ctcctggtga agctggtcgt cccggtgaag ctggtctgcc tggtgccaag ggtctgactg 1741 gaagccctgg cagccctggt cctgatggca aaactggccc ccctggtccc gccggtcaag 1801 atggtcgccc cggaccccca ggcccacctg gtgcccgtgg tcaggctggt gtgatgggat 1861 tccctggacc taaaggtgct gctggagagc ccggcaaggc tggagagcga ggtgttcccg 1921 gaccccctgg cgctgtcggt cctgctggca aagatggaga ggctggagct cagggacccc 1981 ctggccctgc tggtcccgct ggcgagagag gtgaacaagg ccctgctggc tcccccggat 2041 tccagggtct ccctggtcct gctggtcctc caggtgaagc aggcaaacct ggtgaacagg 2101 gtgttcctgg agaccttggc gcccctggcc cctctggagc aagaggcgag agaggtttcc 2161 ctggcgagcg tggtgtgcaa ggtccccctg gtcctgctgg tccccgaggg gccaacggtg 2221 ctcccggcaa cgatggtgct aagggtgatg ctggtgcccc tggagctccc ggtagccagg 2281 gcgcccctgg ccttcaggga atgcctggtg aacgtggtgc agctggtctt ccagggccta 2341 agggtgacag aggtgatgct ggtcccaaag gtgctgatgg ctctcctggc aaagatggcg 2401 tccgtggtct gaccggcccc attggtcctc ctggccctgc tggtgcccct ggtgacaagg 2461 gtgaaagtgg tcccagcggc cctgctggtc ccactggagc tcgtggtgcc cccggagacc 2521 gtggtgagcc tggtcccccc ggccctgctg gctttgctgg cccccctggt gctgacggcc 2581 aacctggtgc taaaggcgaa cctggtgatg ctggtgctaa aggcgatgct ggtccccctg 2641 gccctgccgg acccgctgga ccccctggcc ccattggtaa tgttggtgct cctggagcca 2701 aaggtgctcg cggcagcgct ggtccccctg gtgctactgg tttccctggt gctgctggcc 2761 gagtcggtcc tcctggcccc tctggaaatg ctggaccccc tggccctcct ggtcctgctg 2821 gcaaagaagg cggcaaaggt ccccgtggtg agactggccc tgctggacgt cctggtgaag 2881 ttggtccccc tggtccccct ggccctgctg gcgagaaagg atcccctggt gctgatggtc 2941 ctgctggtgc tcctggtact cccgggcctc aaggtattgc tggacagcgt ggtgtggtcg 3001 gcctgcctgg tcagagagga gagagaggct tccctggtct tcctggcccc tctggtgaac 3061 ctggcaaaca aggtccctct ggagcaagtg gtgaacgtgg tccccctggt cccatgggcc 3121 cccctggatt ggctggaccc cctggtgaat ctggacgtga gggggctcct ggtgccgaag 3181 gttcccctgg acgagacggt tctcctggcg ccaagggtga ccgtggtgag accggccccg 3241 ctggaccccc tggtgctcct ggtgctcctg gtgcccctgg ccccgttggc cctgctggca 3301 agagtggtga tcgtggtgag actggtcctg ctggtcccgc cggtcctgtc ggccctgttg 3361 gcgcccgtgg ccccgccgga ccccaaggcc cacgtggtga caagggtgag acaggcgaac 3421 agggcgacag aggcataaag ggtcaccgtg gcttctctgg cctccagggt ccccctggcc 3481 ctcctggctc tcctggtgaa caaggtccct ctggagcctc tggtcctgct ggtccccgag 3541 gtccccctgg ctctgctggt gctcctggca aagatggact caacggtctc cctggcccca 3601 ttgggccccc tggtcctcgc ggtcgcactg gtgatgctgg tcctgttggt ccccccggcc 3661 ctcctggacc tcctggtccc cctggtcctc ccagcgctgg tttcgacttc agcttcctgc 3721 cccagccacc tcaagagaag gctcacgatg gtggccgcta ctaccgggct gatgatgcca 3781 atgtggttcg tgaccgtgac ctcgaggtgg acaccaccct caagagcctg agccagcaga 3841 tcgagaacat ccggagccca gagggcagcc gcaagaaccc cgcccgcacc tgccgtgacc 3901 tcaagatgtg ccactctgac tggaagagtg gagagtactg gattgacccc aaccaaggct 3961 gcaacctgga tgccatcaaa gtcttctgca acatggagac tggtgagacc tgcgtgtacc 4021 ccactcagcc cagtgtggcc cagaagaact ggtacatcag caagaacccc aaggacaaga 4081 ggcatgtctg gttcggcgag agcatgaccg atggattcca gttcgagtat ggcggccagg 4141 gctccgaccc tgccgatgtg gccatccagc tgaccttcct gcgcctgatg tccaccgagg 4201 cctcccagaa catcacctac cactgcaaga acagcgtggc ctacatggac cagcagactg 4261 gcaacctcaa gaaggccctg ctcctccagg gctccaacga gatcgagatc cgcgccgagg 4321 gcaacagccg cttcacctac agcgtcactg tcgatggctg cacgagtcac accggagcct 4381 ggggcaagac agtgattgaa tacaaaacca ccaagacctc ccgcctgcgc atcatcgatg 4441 tggccccctt ggacgttggt gccccagacc aggaattcgg cttcgacgtt ggccatgtct 4501 gcttcctgta aactccctcc atcccaacct ggctccctcc cacccaacca actttccccc 4561 caacccggaa acagacaagc aacccaaact gaaccccctc aaaagccaaa aaatgggaga 4621 caatttcaca tggactttgg aaaatatttt tttcctttgc attcatctct caaacttagt 4681 ttttatcttt gaccaaccga acatgaccaa aaaccaaaag tgcattcaac cttaccaaaa 4741 aaaaaaaaaa aa //