LOCUS AAH42586.1 1366 aa PRT HUM 15-JUL-2006
DEFINITION Homo sapiens collagen, type I, alpha 2 protein.
ACCESSION BC042586-1
PROTEIN_ID AAH42586.1
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4572)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4572)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: James Cleaver, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 42 Row: f Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 48762933.
FEATURES Qualifiers
source /db_xref="H-InvDB:HIT000259092"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:30044 IMAGE:4803351"
/tissue_type="Skin, normal"
/clone_lib="NCI_CGAP_Skn3"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6.1"
protein /gene="COL1A2"
/db_xref="GeneID:1278"
/db_xref="HGNC:HGNC:2198"
/db_xref="MIM:120160"
BEGIN
1 MLSFVDTRTL LLLAVTLCLA TCQSLQEETV RKGPAGDRGP RGERGPPGPP GRDGEDGPTG
61 PPGPPGPPGP PGLGGNFAAQ YDGKGVGLGP GPMGLMGPRG PPGAAGAPGP QGFQGPAGEP
121 GEPGQTGPAG ARGPAGPPGK AGEDGHPGKP GRPGERGVVG PQGARGFPGT PGLPGFKGIR
181 GHNGLDGLKG QPGAPGVKGE PGAPGENGTP GQTGARGLPG ERGRVGAPGP AGARGSDGSV
241 GPVGPAGPIG SAGPPGFPGA PGPKGEIGAV GNAGPAGPAG PRGEVGLPGL SGPVGPPGNP
301 GANGLTGAKG AAGLPGVAGA PGLPGPRGIP GPVGAAGATG ARGLVGEPGP AGSKGESGNK
361 GEPGSAGPQG PPGPSGEEGK RGPNGEAGSA GPPGPPGLRG SPGSRGLPGA DGRAGVMGPP
421 GSRGASGPAG VRGPNGDAGR PGEPGLMGPR GLPGSPGNIG PAGKEGPVGL PGIDGRPGPI
481 GPAGARGEPG NIGFPGPKGP TGDPGKNGDK GHAGLAGARG APGPDGNNGA QGPPGPQGVQ
541 GGKGEQGPAG PPGFQGLPGP SGPAGEVGKP GERGLHGEFG LPGPAGPRGE RGPPGESGAA
601 GPTGPIGSRG PSGPPGPDGN KGEPGVVGAV GTAGPSGPSG LPGERGAAGI PGGKGEKGEP
661 GLRGEIGNPG RDGARGAPGA VGAPGPAGAT GDRGEAGAAG PAGPAGPRGS PGERGEVGPA
721 GPNGFAGPAG AAGQPGAKGE RGAKGPKGEN GVVGPTGPVG AAGPAGPNGP PGPAGSRGDG
781 GPPGMTGFPG AAGRTGPPGP SGISGPPGPP GPAGKEGLRG PRGDQGPVGR TGEVGAVGPP
841 GFAGEKGPSG EAGTAGPPGT PGPQGLLGAP GILGLPGSRG ERGLPGVAGA VGEPGPLGIA
901 GPPGARGPPG AVGSPGVNGA PGEAGRDGNP GNDGPPGRDG QPGHKGERGY PGNIGPVGAA
961 GAPGPHGPVG PAGKHGNRGE TGPSGPVGPA GAVGPRGPSG PQGIRGDKGE PGEKGPRGLP
1021 GLKGHNGLQG LPGIAGHHGD QGAPGSVGPA GPRGPAGPSG PAGKDGRTGH PGTVGPAGIR
1081 GPQGHQGPAG PPGPPGPPGP PGVSGGGYDF GYDGDFYRAD QPRSAPSLRP KDYEVDATLK
1141 SLNNQIETLL TPEGSRKNPA RTCRDLRLSH PEWSSGYYWI DPNQGCTMDA IKVYCDFSTG
1201 ETCIRAQPEN IPAKNWYRSS KDKKHVWLGE TINAGSQFEY NVEGVTSKEM ATQLAFMRLL
1261 ANYASQNITY HCKNSIAYMD EETGNLKKAV ILQGSNDVEL VAEGNSRFTY TVLVDGCSKK
1321 TNEWGKTIIE YKTNKPSRLP FLDIAPLDIG GADQEFFVDI GPVCFK
//