LOCUS       AAH42586.1 1366 aa PRT HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 2 protein.
ACCESSION   BC042586-1
PROTEIN_ID  AAH42586.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 4572)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
  PUBMED    12477932
REFERENCE   2 (bases 1 to 4572)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: James Cleaver, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 42 Row: f Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            48762933.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000259092"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:30044 IMAGE:4803351"
                     /tissue_type="Skin, normal"
                     /clone_lib="NCI_CGAP_Skn3"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6.1"
     protein         /gene="COL1A2"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
BEGIN
        1 MLSFVDTRTL LLLAVTLCLA TCQSLQEETV RKGPAGDRGP RGERGPPGPP GRDGEDGPTG
       61 PPGPPGPPGP PGLGGNFAAQ YDGKGVGLGP GPMGLMGPRG PPGAAGAPGP QGFQGPAGEP
      121 GEPGQTGPAG ARGPAGPPGK AGEDGHPGKP GRPGERGVVG PQGARGFPGT PGLPGFKGIR
      181 GHNGLDGLKG QPGAPGVKGE PGAPGENGTP GQTGARGLPG ERGRVGAPGP AGARGSDGSV
      241 GPVGPAGPIG SAGPPGFPGA PGPKGEIGAV GNAGPAGPAG PRGEVGLPGL SGPVGPPGNP
      301 GANGLTGAKG AAGLPGVAGA PGLPGPRGIP GPVGAAGATG ARGLVGEPGP AGSKGESGNK
      361 GEPGSAGPQG PPGPSGEEGK RGPNGEAGSA GPPGPPGLRG SPGSRGLPGA DGRAGVMGPP
      421 GSRGASGPAG VRGPNGDAGR PGEPGLMGPR GLPGSPGNIG PAGKEGPVGL PGIDGRPGPI
      481 GPAGARGEPG NIGFPGPKGP TGDPGKNGDK GHAGLAGARG APGPDGNNGA QGPPGPQGVQ
      541 GGKGEQGPAG PPGFQGLPGP SGPAGEVGKP GERGLHGEFG LPGPAGPRGE RGPPGESGAA
      601 GPTGPIGSRG PSGPPGPDGN KGEPGVVGAV GTAGPSGPSG LPGERGAAGI PGGKGEKGEP
      661 GLRGEIGNPG RDGARGAPGA VGAPGPAGAT GDRGEAGAAG PAGPAGPRGS PGERGEVGPA
      721 GPNGFAGPAG AAGQPGAKGE RGAKGPKGEN GVVGPTGPVG AAGPAGPNGP PGPAGSRGDG
      781 GPPGMTGFPG AAGRTGPPGP SGISGPPGPP GPAGKEGLRG PRGDQGPVGR TGEVGAVGPP
      841 GFAGEKGPSG EAGTAGPPGT PGPQGLLGAP GILGLPGSRG ERGLPGVAGA VGEPGPLGIA
      901 GPPGARGPPG AVGSPGVNGA PGEAGRDGNP GNDGPPGRDG QPGHKGERGY PGNIGPVGAA
      961 GAPGPHGPVG PAGKHGNRGE TGPSGPVGPA GAVGPRGPSG PQGIRGDKGE PGEKGPRGLP
     1021 GLKGHNGLQG LPGIAGHHGD QGAPGSVGPA GPRGPAGPSG PAGKDGRTGH PGTVGPAGIR
     1081 GPQGHQGPAG PPGPPGPPGP PGVSGGGYDF GYDGDFYRAD QPRSAPSLRP KDYEVDATLK
     1141 SLNNQIETLL TPEGSRKNPA RTCRDLRLSH PEWSSGYYWI DPNQGCTMDA IKVYCDFSTG
     1201 ETCIRAQPEN IPAKNWYRSS KDKKHVWLGE TINAGSQFEY NVEGVTSKEM ATQLAFMRLL
     1261 ANYASQNITY HCKNSIAYMD EETGNLKKAV ILQGSNDVEL VAEGNSRFTY TVLVDGCSKK
     1321 TNEWGKTIIE YKTNKPSRLP FLDIAPLDIG GADQEFFVDI GPVCFK
//