LOCUS       BC054498                5060 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 2, mRNA (cDNA clone MGC:57503
            IMAGE:6474549), complete cds.
ACCESSION   BC054498
VERSION     BC054498.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5060)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 5060)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUL-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 107 Row: p Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES             Location/Qualifiers
     source          1..5060
                     /db_xref="H-InvDB:HIT000054073"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:57503 IMAGE:6474549"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..5060
                     /gene="COL1A2"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
     CDS             101..4201
                     /gene="COL1A2"
                     /codon_start=1
                     /product="collagen, type I, alpha 2"
                     /protein_id="AAH54498.1"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
                     /translation="MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGER
                     GPPGPPGRDGEDGPTGPPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPP
                     GAAGAPGPQGFQGPAGEPGEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVG
                     PQGARGFPGTPGLPGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGL
                     PGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPA
                     GPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVG
                     AAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP
                     PGPPGLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGRPGEPGLMGPR
                     GLPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNG
                     DKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPSGPAGE
                     VGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP
                     GVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARGAPGAVG
                     APGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGE
                     RGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPP
                     GPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAG
                     PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGS
                     PGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAAGAPGPHGPVGPA
                     GKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLPGLKGHNGLQG
                     LPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGP
                     AGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLKSLNNQI
                     ETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCI
                     RAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLLAN
                     YASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
                     TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK"
BASE COUNT     1097 a   1278 c   1465 g   1220
t ORIGIN 1 gtactggcca cgactgcatg cccgcgcccg ccaggtgata cctccgccgg tgacccaggg 61 gctctgcgac acaaggagtc tgcatgtcta agtgctagac atgctcagct ttgtggatac 121 gcggactttg ttgctgcttg cagtaacctt atgcctagca acatgccaat ctttacaaga 181 ggaaactgta agaaagggcc cagccggaga tagaggacca cgtggagaaa ggggtccacc 241 aggcccccca ggcagagatg gtgaagatgg tcccacaggc cctcctggtc cacctggtcc 301 tcctggcccc cctggtctcg gtgggaactt tgctgctcag tatgatggaa aaggagttgg 361 acttggccct ggaccaatgg gcttaatggg acctagaggc ccacctggtg cagctggagc 421 cccaggccct caaggtttcc aaggacctgc tggtgagcct ggtgaacctg gtcaaactgg 481 tcctgcaggt gctcgtggtc cagctggccc tcctggcaag gctggtgaag atggtcaccc 541 tggaaaaccc ggacgacctg gtgagagagg agttgttgga ccacagggtg ctcgtggttt 601 ccctggaact cctggacttc ctggcttcaa aggcattagg ggacacaatg gtctggatgg 661 attgaaggga cagcccggtg ctcctggtgt gaagggtgaa cctggtgccc ctggtgaaaa 721 tggaactcca ggtcaaacag gagcccgtgg gcttcctggt gagagaggac gtgttggtgc 781 ccctggccca gctggtgccc gtggcagtga tggaagtgtg ggtcccgtgg gtcctgctgg 841 tcccattggg tctgctggcc ctccaggctt cccaggtgcc cctggcccca agggtgaaat 901 tggagctgtt ggtaacgctg gtcctgctgg tcccgccggt ccccgtggtg aagtgggtct 961 tccaggcctc tccggccccg ttggacctcc tggtaatcct ggagcaaacg gccttactgg 1021 tgccaagggt gctgctggcc ttcccggcgt tgctggggct cccggcctcc ctggaccccg 1081 cggtattcct ggccctgttg gtgctgccgg tgctactggt gccagaggac ttgttggtga 1141 gcctggtcca gctggctcca aaggagagag cggtaacaag ggtgagcccg gctctgctgg 1201 gccccaaggt cctcctggtc ccagtggtga agaaggaaag agaggcccta atggggaagc 1261 tggatctgcc ggccctccag gacctcctgg gctgagaggt agtcctggtt ctcgtggtct 1321 tcctggagct gatggcagag ctggcgtcat gggccctcct ggtagtcgtg gtgcaagtgg 1381 ccctgctgga gtccgaggac ctaatggaga tgctggtcgc cctggggagc ctggtctcat 1441 gggacccaga ggtcttcctg gttcccctgg aaatatcggc cccgctggaa aagaaggtcc 1501 tgtcggcctc cctggcatcg acggcaggcc tggcccaatt ggcccagctg gagcaagagg 1561 agagcctggc aacattggat tccctggacc caaaggcccc actggtgatc ctggcaaaaa 1621 cggtgataaa ggtcatgctg gtcttgctgg tgctcggggt gctccaggtc ctgatggaaa 1681 caatggtgct 
cagggacctc ctggaccaca gggtgttcaa ggtggaaaag gtgaacaggg 1741 tcccgctggt cctccaggct tccagggtct gcctggcccc tcaggtcccg ctggtgaagt 1801 tggcaaacca ggagaaaggg gtctccatgg tgagtttggt ctccctggtc ctgctggtcc 1861 aagaggggaa cgcggtcccc caggtgagag tggtgctgcc ggtcctactg gtcctattgg 1921 aagccgaggt ccttctggac ccccagggcc tgatggaaac aagggtgaac ctggtgtggt 1981 tggtgctgtg ggcactgctg gtccatctgg tcctagtgga ctcccaggag agaggggtgc 2041 tgctggcata cctggaggca agggagaaaa gggtgaacct ggtctcagag gtgaaattgg 2101 taaccctggc agagatggtg ctcgtggtgc tcctggtgct gtaggtgccc ctggtcctgc 2161 tggagccaca ggtgaccggg gcgaagctgg ggctgctggt cctgctggtc ctgctggtcc 2221 tcggggaagc cctggtgaac gtggtgaggt cggtcctgct ggccccaatg gatttgctgg 2281 tcctgctggt gctgctggtc aacctggtgc taaaggagaa agaggagcca aagggcctaa 2341 gggtgaaaac ggtgttgttg gtcccacagg ccccgttgga gctgctggcc cagctggtcc 2401 aaatggtccc cccggtcctg ctggaagtcg tggtgatgga ggcccccctg gtatgactgg 2461 tttccctggt gctgctggac ggactggtcc cccaggaccc tctggtattt ctggccctcc 2521 tggtccccct ggtcctgctg ggaaagaagg gcttcgtggt cctcgtggtg accaaggtcc 2581 agttggccga actggagaag taggtgcagt tggtccccct ggcttcgctg gtgagaaggg 2641 tccctctgga gaggctggta ctgctggacc tcctggcact ccaggtcctc agggtcttct 2701 tggtgctcct ggtattctgg gtctccctgg ctcgagaggt gaacgtggtc taccaggtgt 2761 tgctggtgct gtgggtgaac ctggtcctct tggcattgcc ggccctcctg gggcccgtgg 2821 tcctcctggt gctgtgggta gtcctggagt caacggtgct cctggtgaag ctggtcgtga 2881 tggcaaccct gggaacgatg gtcccccagg tcgcgatggt caacccggac acaagggaga 2941 gcgcggttac cctggcaata ttggtcccgt tggtgctgca ggtgcacctg gtcctcatgg 3001 ccccgtgggt cctgctggca aacatggaaa ccgtggtgaa actggtcctt ctggtcctgt 3061 tggtcctgct ggtgctgttg gcccaagagg tcctagtggc ccacaaggca ttcgtggcga 3121 taagggagag cccggtgaaa aggggcccag aggtcttcct ggcttaaagg gacacaatgg 3181 attgcaaggt ctgcctggta tcgctggtca ccatggtgat caaggtgctc ctggctccgt 3241 gggtcctgct ggtcctaggg gccctgctgg tccttctggc cctgctggaa aagatggtcg 3301 cactggacat cctggtacag ttggacctgc tggcattcga ggccctcagg gtcaccaagg 3361 ccctgctggc ccccctggtc 
cccctggccc tcctggacct ccaggtgtaa gcggtggtgg 3421 ttatgacttt ggttacgatg gagacttcta cagggctgac cagcctcgct cagcaccttc 3481 tctcagaccc aaggactatg aagttgatgc tactctgaag tctctcaaca accagattga 3541 gacccttctt actcctgaag gctctagaaa gaacccagct cgcacatgcc gtgacttgag 3601 actcagccac ccagagtgga gcagtggtta ctactggatt gaccctaacc aaggatgcac 3661 tatggatgct atcaaagtat actgtgattt ctctactggc gaaacctgta tccgggccca 3721 acctgaaaac atcccagcca agaactggta taggagctcc aaggacaaga aacacgtctg 3781 gctaggagaa actatcaatg ctggcagcca gtttgaatat aatgtagaag gagtgacttc 3841 caaggaaatg gctacccaac ttgccttcat gcgcctgctg gccaactatg cctctcagaa 3901 catcacctac cactgcaaga acagcattgc atacatggat gaggagactg gcaacctgaa 3961 aaaggctgtc attctacagg gctctaatga tgttgaactt gttgctgagg gcaacagcag 4021 gttcacttac actgttcttg tagatggctg ctctaaaaag acaaatgaat ggggaaagac 4081 aatcattgaa tacaaaacaa ataagccatc acgcctgccc ttccttgata ttgcaccttt 4141 ggacatcggt ggtgctgacc aggaattctt tgtggacatt ggcccagtct gtttcaaata 4201 aatgaactca atctaaatta aaaaagaaag aaatttgaaa aaactttctc tttgccattt 4261 cttcttcttc ttttttaact gaaagctgaa tccttccatt tcttctgcac atctacttgc 4321 ttaaattgtg ggcaaaagag aaaaagaagg attgatcaga gcattgtgca atacagtttc 4381 attaactcct tcccccgctc ccccaaaaat ttgaattttt ttttcaacac tcttacacct 4441 gttatggaaa atgtcaacct ttgtaagaaa accaaaataa aaattgaaaa ataaaaacca 4501 taaacatttg caccacttgt ggcttttgaa tatcttccac agagggaagt ttaaaaccca 4561 aacttccaaa ggtttaaact acctcaaaac actttcccat gagtgtgatc cacattgtta 4621 ggtgctgacc tagacagaga tgaactgagg tccttgtttt gttttgttca taatacaaag 4681 gtgctaatta atagtatttc agatacttga agaatgttga tggtgctaga agaatttgag 4741 aagaaatact cctgtattga gttgtatcgt gtggtgtatt ttttaaaaaa tttgatttag 4801 cattcatatt ttccatctta ttcccaatta aaagtatgca gattatttgc ccaaatcttc 4861 ttcagattca gcatttgttc tttgccagtc tcattttcat cttcttccat ggttccacag 4921 aagctttgtt tcttgggcaa gcagaaaaat taaattgtac ctattttgta tatgtgagat 4981 gtttaaataa attgtgaaaa aaatgaaata aagcatgttt ggttttccaa aaaaaaaaaa 5041 aaaaaaaaaa aaaaaaaaaa //