LOCUS       AAH47305.1   1075 aa   PRT   HUM   04-NOV-2003
DEFINITION  Homo sapiens COL4A1 protein.
ACCESSION   BC047305-1
PROTEIN_ID  AAH47305.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3504)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
  PUBMED    12477932
REFERENCE   2  (bases 1 to 3504)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: David N. Louis, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 93 Row: e Column: 14.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000098508"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4941939"
                     /tissue_type="Brain, anaplastic oligodendroglioma with
                     1p/19q loss"
                     /clone_lib="NCI_CGAP_Brn67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     protein         /gene="COL4A1"
                     /db_xref="GeneID:1282"
                     /db_xref="MIM:120130"
BEGIN
        1 GSRGDTGPPG PPGYGPAGPI GDKGQAGFPG GPGSPGLPGP KGEPGKIVPL PGPPGAEGLP
       61 GSPGFPGPQG DRGFPGTPGR PGLPGEKGAV GQPGIGFPGP PGPKGVDGLP GDMGPPGTPG
      121 RPGFNGLPGN PGVQGQKGEP GVGLPGLKGL PGLPGIPGTP GEKGSIGVPG VPGEHGAIGP
      181 PGLQGIRGEP GPPGLPGSVG SPGVPGIGPP GARGPPGGQG PPGLSGPPGI KGEKGFPGFP
      241 GLDMPGPKGD KGAQGLPGIT GQSGLPGLPG QQGAPGIPGF PGSKGEMGVM GTPGQPGSPG
      301 PVGAPGLPGE KGDHGFPGSS GPRGDPGLKG DKGDVGLPGK PGSMDKVDMG SMKGQKGDQG
      361 EKGQIGPIGE KGSRGDPGTP GVPGKDGQAG QPGQPGPKGD PGISGTPGAP GLPGPKGSVG
      421 GMGLPGTPGE KGVPGIPGPQ GSPGLPGDKG AKGEKGQAGP PGIGIPGLRG EKGDQGIAGF
      481 PGSPGEKGEK GSIGIPGMPG SPGLKGSPGS VGYPGSPGLP GEKGDKGLPG LDGIPGVKGE
      541 AGLPGTPGPT GPAGQKGEPG SDGIPGSAGE KGEPGLPGRG FPGFPGAKGD KGSKGEVGFP
      601 GLAGSPGIPG SKGEQGFMGP PGPQGQPGLP GSPGHATEGP KGDRGPQGQP GLPGLPGPMG
      661 PPGLPGIDGV KGDKGNPGWP GAPGVPGPKG DPGFQGMPGI GGSPGITGSK GDMGPPGVPG
      721 FQGPKGLPGL QGIKGDQGDH GVPGAKGLPG PPGPPGPYDI IKGEPGLPGP EGPPGLKGLQ
      781 GLPGPKGQQG VTGLVGIPGP PGIPGFDGAP GQKGEMGPAG PTGPRGFPGP PGPDGLPGSM
      841 GPPGTPSVDH GFLVTRHSQT IDDPQCPSGT KILYHGYSLL YVQGNERAHG QDLGTAGSCL
      901 RKFSTMPFLF CNINNVCNFA SRNDYSYWLS TPEPMPMSMA PITGENIRPF ISRCAVCEAP
      961 AMVMAVHSQT IQIPPCPSGW SSLWIGYSFV MHTSAGAEGS GQALASPGSC LEEFRSAPFI
     1021 ECHGRGTCNY YANAYSFWLA TIERSEMFKK PTPSTLKAGE LRTHVSRCQV CMRRT
//