LOCUS BC042586 4572 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:30044 IMAGE:4803351), complete cds. ACCESSION BC042586 VERSION BC042586.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4572) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4572) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: James Cleaver, M.D. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 42 Row: f Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 48762933. FEATURES Location/Qualifiers source 1..4572 /db_xref="H-InvDB:HIT000259092" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:30044 IMAGE:4803351" /tissue_type="Skin, normal" /clone_lib="NCI_CGAP_Skn3" /lab_host="DH10B" /note="Vector: pCMV-SPORT6.1" gene 1..4572 /gene="COL1A2" /db_xref="GeneID:1278" /db_xref="HGNC:HGNC:2198" /db_xref="MIM:120160" CDS 119..4219 /gene="COL1A2" /codon_start=1 /product="collagen, type I, alpha 2" /protein_id="AAH42586.1" /db_xref="GeneID:1278" /db_xref="HGNC:HGNC:2198" /db_xref="MIM:120160" /translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK" BASE COUNT 955 a 1197 c 1382 g 1038 t ORIGIN 1 gtttcggcta agttggaggt actggccacg actgcatgcc cgcgcccgcc aggtgatacc 61 tccgccggtg acccaggggc tctgcgacac aaggagtctg catgtctaag tgctagacat 121 gctcagcttt gtggatacgc ggactttgtt gctgcttgca gtaaccttat gcctagcaac 181 atgccaatct ttacaagagg aaactgtaag aaagggccca gccggagata gaggaccacg 241 tggagaaagg ggtccaccag gccccccagg cagagatggt gaagatggtc ccacaggccc 301 tcctggtcca cctggtcctc ctggcccccc tggtctcggt gggaactttg ctgctcagta 361 tgatggaaaa ggagttggac ttggccctgg accaatgggc ttaatgggac ctagaggccc 421 acctggtgca gctggagccc caggccctca aggtttccaa ggacctgctg gtgagcctgg 481 tgaacctggt caaactggtc ctgcaggtgc tcgtggtcca gctggccctc ctggcaaggc 541 tggtgaagat ggtcaccctg gaaaacccgg acgacctggt gagagaggag ttgttggacc 601 acagggtgct cgtggtttcc ctggaactcc tggacttcct ggcttcaaag gcattagggg 661 acacaatggt ctggatggat tgaagggaca gcccggtgct cctggtgtga agggtgaacc 721 tggtgcccct ggtgaaaatg gaactccagg tcaaacagga gcccgtgggc ttcctggtga 781 gagaggacgt gttggtgccc ctggcccagc tggtgcccgt ggcagtgatg gaagtgtggg 841 tcccgtgggt cctgctggtc ccattgggtc tgctggccct ccaggcttcc caggtgcccc 901 tggccccaag ggtgaaattg gagctgttgg taacgctggt cctgctggtc ccgccggtcc 961 ccgtggtgaa gtgggtcttc caggcctctc cggccccgtt ggacctcctg gtaatcctgg 1021 agcaaacggc cttactggtg ccaagggtgc tgctggcctt cccggcgttg ctggggctcc 1081 cggcctccct ggaccccgcg gtattcctgg ccctgttggt gctgccggtg ctactggtgc 1141 cagaggactt gttggtgagc ctggtccagc tggctccaaa ggagagagcg gtaacaaggg 1201 tgagcccggc tctgctgggc cccaaggtcc tcctggtccc agtggtgaag aaggaaagag 1261 aggccctaat ggggaagctg gatctgccgg ccctccagga cctcctgggc tgagaggtag 1321 tcctggttct cgtggtcttc ctggagctga tggcagagct ggcgtcatgg gccctcctgg 1381 tagtcgtggt gcaagtggcc ctgctggagt ccgaggacct aatggagatg ctggtcgccc 1441 tggggagcct ggtctcatgg gacccagagg tcttcctggt tcccctggaa atatcggccc 1501 cgctggaaaa gaaggtcctg tcggcctccc tggcatcgac ggcaggcctg gcccaattgg 1561 cccagctgga gcaagaggag agcctggcaa cattggattc cctggaccca aaggccccac 1621 tggtgatcct ggcaaaaacg gtgataaagg tcatgctggt cttgctggtg ctcggggtgc 1681 tccaggtcct gatggaaaca atggtgctca gggacctcct ggaccacagg gtgttcaagg 1741 tggaaaaggt gaacagggtc ccgctggtcc tccaggcttc cagggtctgc ctggcccctc 1801 aggtcccgct ggtgaagttg gcaaaccagg agaaaggggt ctccatggtg agtttggtct 1861 ccctggtcct gctggtccaa gaggggaacg cggtccccca ggtgagagtg gtgctgccgg 1921 tcctactggt cctattggaa gccgaggtcc ttctggaccc ccagggcctg atggaaacaa 1981 gggtgaacct ggtgtggttg gtgctgtggg cactgctggt ccatctggtc ctagtggact 2041 cccaggagag aggggtgctg ctggcatacc tggaggcaag ggagaaaagg gtgaacctgg 2101 tctcagaggt gaaattggta accctggcag agatggtgct cgtggtgctc ctggtgctgt 2161 aggtgcccct ggtcctgctg gagccacagg tgaccggggc gaagctgggg ctgctggtcc 2221 tgctggtcct gctggtcctc ggggaagccc tggtgaacgt ggtgaggtcg gtcctgctgg 2281 ccccaatgga tttgctggtc ctgctggtgc tgctggtcaa cctggtgcta aaggagaaag 2341 aggagccaaa gggcctaagg gtgaaaacgg tgttgttggt cccacaggcc ccgttggagc 2401 tgctggccca gctggtccaa atggtccccc cggtcctgct ggaagtcgtg gtgatggagg 2461 cccccctggt atgactggtt tccctggtgc tgctggacgg actggtcccc caggaccctc 2521 tggtatttct ggccctcctg gtccccctgg tcctgctggg aaagaagggc ttcgtggtcc 2581 tcgtggtgac caaggtccag ttggccgaac tggagaagta ggtgcagttg gtccccctgg 2641 cttcgctggt gagaagggtc cctctggaga ggctggtact gctggacctc ctggcactcc 2701 aggtcctcag ggtcttcttg gtgctcctgg tattctgggt ctccctggct cgagaggtga 2761 acgtggtcta ccaggtgttg ctggtgctgt gggtgaacct ggtcctcttg gcattgccgg 2821 ccctcctggg gcccgtggtc ctcctggtgc tgtgggtagt cctggagtca acggtgctcc 2881 tggtgaagct ggtcgtgatg gcaaccctgg gaacgatggt cccccaggtc gcgatggtca 2941 acccggacac aagggagagc gcggttaccc tggcaatatt ggtcccgttg gtgctgcagg 3001 tgcacctggt cctcatggcc ccgtgggtcc tgctggcaaa catggaaacc gtggtgaaac 3061 tggtccttct ggtcctgttg gtcctgctgg tgctgttggc ccaagaggtc ctagtggccc 3121 acaaggcatt cgtggcgata agggagagcc cggtgaaaag gggcccagag gtcttcctgg 3181 cttaaaggga cacaatggat tgcaaggtct gcctggtatc gctggtcacc atggtgatca 3241 aggtgctcct ggctccgtgg gtcctgctgg tcctaggggc cctgctggtc cttctggccc 3301 tgctggaaaa gatggtcgca ctggacatcc tggtacagtt ggacctgctg gcattcgagg 3361 ccctcagggt caccaaggcc ctgctggccc ccctggtccc cctggccctc ctggacctcc 3421 aggtgtaagc ggtggtggtt atgactttgg ttacgatgga gacttctaca gggctgacca 3481 gcctcgctca gcaccttctc tcagacccaa ggactatgaa gttgatgcta ctctgaagtc 3541 tctcaacaac cagattgaga cccttcttac tcctgaaggc tctagaaaga acccagctcg 3601 cacatgccgt gacttgagac tcagccaccc agagtggagc agtggttact actggattga 3661 ccctaaccaa ggatgcacta tggatgctat caaagtatac tgtgatttct ctactggcga 3721 aacctgtatc cgggcccaac ctgaaaacat cccagccaag aactggtata ggagctccaa 3781 ggacaagaaa cacgtctggc taggagaaac tatcaatgct ggcagccagt ttgaatataa 3841 tgtagaagga gtgacttcca aggaaatggc tacccaactt gccttcatgc gcctgctggc 3901 caactatgcc tctcagaaca tcacctacca ctgcaagaac agcattgcat acatggatga 3961 ggagactggc aacctgaaaa aggctgtcat tctacagggc tctaatgatg ttgaacttgt 4021 tgctgagggc aacagcaggt tcacttacac tgttcttgta gatggctgct ctaaaaagac 4081 aaatgaatgg ggaaagacaa tcattgaata caaaacaaat aagccatcac gcctgccctt 4141 ccttgatatt gcacctttgg acatcggtgg tgctgaccag gaattctttg tggacattgg 4201 cccagtctgt ttcaaataaa tgaactcaat ctaaattaaa aaagaaagaa atttgaaaaa 4261 actttctctt tgccatttct tcttcttctt ttttaactga aagctgaatc cttccatttc 4321 ttctgcacat ctacttgctt aaattgtggg caaaagagaa aaagaaggat tgatcagagc 4381 attgtgcaat acagtttcat taactccttc cctcgctccc ccaaaaattt gaattttttt 4441 ttcaacactc ttacacctgt tatggaaaat gtcaaccttt gtaagaaaac caaaataaaa 4501 attgaaaaat aaaaaccata aacatttgca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4561 aaaaaaaaaa aa //