LOCUS AAI51221.1 1669 aa PRT HUM 24-JUL-2007 DEFINITION Homo sapiens COL4A1 protein protein. ACCESSION BC151220-1 PROTEIN_ID AAI51221.1 SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6517) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 6517) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (23-JUL-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Novartis Institute for Biomedical Research cDNA Library Preparation: Novartis Institute for Biomedical Research cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 306 Row: l Column: 7. FEATURES Qualifiers source /db_xref="H-InvDB:HIT000435929" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:165004 IMAGE:40148649" /tissue_type="Donated clones,Novartis FGA collection" /clone_lib="NIH_MGC_417" /lab_host="DH5a" /note="Vector: pCMV-SPORT6" protein /gene="COL4A1" /gene_synonym="arresten" /db_xref="GeneID:1282" /db_xref="HGNC:HGNC:2202" /db_xref="MIM:120130" BEGIN 1 MGPRLSVWLL LLPAALLLHE EHSRAAAKGG CAGSGCGKCD CHGVKGQKGE RGLPGLQGVI 61 GFPGMQGPEG PQGPPGQKGD TGEPGLPGTK GTRGPPGASG YPGNPGLPGI PGQDGPPGPP 121 GIPGCNGTKG ERGPLGPPGL PGFAGNPGPP GLPGMKGDPG EILGHVPGML LKGERGFPGI 181 PGTPGPPGLP GLQGPVGPPG FTGPPGPPGP PGPPGEKGQM GLSFQGPKGD KGDQGVSGPP 241 GVPGQAQVQE KGDFATKGEK GQKGEPGFQG MPGVGEKGEP GKPGPRGKPG KDGDKGEKGS 301 PGFPGEPGYP GLIGRQGPQG EKGEAGPPGP PGIVIGTGPL GEKGERGYPG TPGPRGEPGP 361 KGFPGLPGQP GPPGLPVPGQ AGAPGFPGER GEKGDRGFPG TSLPGPSGRD GLPGPPGSPG 421 PPGQPGYTNG IVECQPGPPG DQGPPGIPGQ PGFIGEIGEK GQKGESCLIC DIDGYRGPPG 481 PQGPPGEIGF PGQPGAKGDR GLPGRDGVAG VPGPQGTPGL IGQPGAKGEP GEFYFDLRLK 541 GDKGDPGFPG QPGMPGRAGS PGRDGHPGLP GPKGSPGSVG LKGERGPPGG VGFPGSRGDT 601 GPPGPPGYGP AGPIGDKGQA GFPGGPGSPG LPGPKGEPGK IVPLPGPPGA EGLPGSPGFP 661 GPQGDRGFPG TPGRPGLPGE KGAVGQPGIG FPGPPGPKGV DGLPGDMGPP GTPGRPGFNG 721 LPGNPGVQGQ KGEPGVGLPG LKGLPGLPGI PGTPGEKGSI GVPGVPGEHG AIGPPGLQGI 781 RGEPGPPGLP GSVGSPGVPG IGPPGARGPP GGQGPPGLSG PPGIKGEKGF PGFPGLDMPG 841 PKGDKGAQGL PGITGQSGLP GLPGQQGAPG IPGFPGSKGE MGVMGTPGQP GSPGPVGAPG 901 LPGEKGDHGF PGSSGPRGDP GLKGDKGDVG LPGKPGSMDK VDMGSMKGQK GDQGEKGQIG 961 PIGEKGSRGD PGTPGVPGKD GQAGQPGQPG PKGDPGISGT PGAPGLPGPK GSVGGMGLPG 1021 TPGEKGVPGI PGPQGSPGLP GDKGAKGEKG QAGPPGIGIP GLRGEKGDQG IAGFPGSPGE 1081 KGEKGSIGIP GMPGSPGLKG SPGSVGYPGS PGLPGEKGDK GLPGLDGIPG VKGEAGLPGT 1141 PGPTGPAGQK GEPGSDGIPG SAGEKGEPGL PGRGFPGFPG AKGDKGSKGE VGFPGLAGSP 1201 GIPGSKGEQG FMGPPGPQGQ PGLPGSPGHA TEGPKGDRGP QGQPGLPGLP GPMGPPGLPG 1261 IDGVKGDKGN PGWPGAPGVP GPKGDPGFQG MPGIGGSPGI TGSKGDMGPP GVPGFQGPKG 1321 LPGLQGIKGD QGDHGVPGAK GLPGPPGPPG PYDIIKGEPG LPGPEGPPGL KGLQGLPGPK 1381 GQQGVTGLVG IPGPPGIPGF DGAPGQKGEM GPAGPTGPRG FPGPPGPDGL PGSMGPPGTP 1441 SVDHGFLVTR HSQTIDDPQC PSGTKILYHG YSLLYVQGNE RAHGQDLGTA GSCLRKFSTM 1501 PFLFCNINNV CNFASRNDYS YWLSTPEPMP MSMAPITGEN IRPFISRCAV CEAPAMVMAV 1561 HSQTIQIPPC PSGWSSLWIG YSFVMHTSAG AEGSGQALAS PGSCLEEFRS APFIECHGRG 1621 TCNYYANAYS FWLATIERSE MFKKPTPSTL KAGELRTHVS RCQVCMRRT //