LOCUS       AAH42586.1              1366 aa    PRT              HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type I, alpha 2 protein.
ACCESSION   BC042586-1
PROTEIN_ID  AAH42586.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4572)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4572)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: James Cleaver, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 42 Row: f Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000259092"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:30044 IMAGE:4803351"
                     /tissue_type="Skin, normal"
                     /clone_lib="NCI_CGAP_Skn3"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6.1"
     protein         /gene="COL1A2"
                     /db_xref="GeneID:1278"
                     /db_xref="HGNC:HGNC:2198"
                     /db_xref="MIM:120160"
BEGIN
        1 MLSFVDTRTL LLLAVTLCLA TCQSLQEETV RKGPAGDRGP RGERGPPGPP GRDGEDGPTG
       61 PPGPPGPPGP PGLGGNFAAQ YDGKGVGLGP GPMGLMGPRG PPGAAGAPGP QGFQGPAGEP
      121 GEPGQTGPAG ARGPAGPPGK AGEDGHPGKP GRPGERGVVG PQGARGFPGT PGLPGFKGIR
      181 GHNGLDGLKG QPGAPGVKGE PGAPGENGTP GQTGARGLPG ERGRVGAPGP AGARGSDGSV
      241 GPVGPAGPIG SAGPPGFPGA PGPKGEIGAV GNAGPAGPAG PRGEVGLPGL SGPVGPPGNP
      301 GANGLTGAKG AAGLPGVAGA PGLPGPRGIP GPVGAAGATG ARGLVGEPGP AGSKGESGNK
      361 GEPGSAGPQG PPGPSGEEGK RGPNGEAGSA GPPGPPGLRG SPGSRGLPGA DGRAGVMGPP
      421 GSRGASGPAG VRGPNGDAGR PGEPGLMGPR GLPGSPGNIG PAGKEGPVGL PGIDGRPGPI
      481 GPAGARGEPG NIGFPGPKGP TGDPGKNGDK GHAGLAGARG APGPDGNNGA QGPPGPQGVQ
      541 GGKGEQGPAG PPGFQGLPGP SGPAGEVGKP GERGLHGEFG LPGPAGPRGE RGPPGESGAA
      601 GPTGPIGSRG PSGPPGPDGN KGEPGVVGAV GTAGPSGPSG LPGERGAAGI PGGKGEKGEP
      661 GLRGEIGNPG RDGARGAPGA VGAPGPAGAT GDRGEAGAAG PAGPAGPRGS PGERGEVGPA
      721 GPNGFAGPAG AAGQPGAKGE RGAKGPKGEN GVVGPTGPVG AAGPAGPNGP PGPAGSRGDG
      781 GPPGMTGFPG AAGRTGPPGP SGISGPPGPP GPAGKEGLRG PRGDQGPVGR TGEVGAVGPP
      841 GFAGEKGPSG EAGTAGPPGT PGPQGLLGAP GILGLPGSRG ERGLPGVAGA VGEPGPLGIA
      901 GPPGARGPPG AVGSPGVNGA PGEAGRDGNP GNDGPPGRDG QPGHKGERGY PGNIGPVGAA
      961 GAPGPHGPVG PAGKHGNRGE TGPSGPVGPA GAVGPRGPSG PQGIRGDKGE PGEKGPRGLP
     1021 GLKGHNGLQG LPGIAGHHGD QGAPGSVGPA GPRGPAGPSG PAGKDGRTGH PGTVGPAGIR
     1081 GPQGHQGPAG PPGPPGPPGP PGVSGGGYDF GYDGDFYRAD QPRSAPSLRP KDYEVDATLK
     1141 SLNNQIETLL TPEGSRKNPA RTCRDLRLSH PEWSSGYYWI DPNQGCTMDA IKVYCDFSTG
     1201 ETCIRAQPEN IPAKNWYRSS KDKKHVWLGE TINAGSQFEY NVEGVTSKEM ATQLAFMRLL
     1261 ANYASQNITY HCKNSIAYMD EETGNLKKAV ILQGSNDVEL VAEGNSRFTY TVLVDGCSKK
     1321 TNEWGKTIIE YKTNKPSRLP FLDIAPLDIG GADQEFFVDI GPVCFK
//