LOCUS       AAH42612.1   628 aa   PRT   HUM   03-NOV-2005
DEFINITION  Homo sapiens SORBS1 protein.
ACCESSION   BC042612-1
PROTEIN_ID  AAH42612.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3215)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3215)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &
            Shiraki Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M.,
            Nanavati, A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 71 Row: e Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 5454055
            This clone has the following problem: The cds is short compared
            to the longest cds in the locus.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000052725"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4821975"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     protein         /gene="SORBS1"
                     /gene_synonym="CAP"
                     /gene_synonym="DKFZp586P1422"
                     /gene_synonym="FLAF2"
                     /gene_synonym="FLJ12406"
                     /gene_synonym="KIAA1296"
                     /gene_synonym="R85FL"
                     /gene_synonym="SH3P12"
                     /gene_synonym="SORB1"
                     /db_xref="GeneID:10580"
                     /db_xref="HGNC:HGNC:14565"
                     /db_xref="IMGT/GENE-DB:14565"
                     /db_xref="MIM:605264"
BEGIN
        1 MSSECDGGSK AVMNGLAPGS NGQDKDMDPT KICTGKGAVT LRASSSYRET PSSSPASPQE
       61 TRQHESKPDE WRLSSSADAN GNAQPSSLAA KGYRSVHPNL PSDKSQDATS SSAAQPEVIV
      121 VPLYLVNTDR GQEGTARPPT PLGPLGCVPT IPATASAASP LTFPTLDDFI PPHLQRWPHH
      181 SQPARASGSF APISQTPPSF SPPPPLVPPA PEDLRRVSEP DLTGAVSSTD SSPLLNEVSS
      241 SLIGTDSQAF PSVSKPSSAY PSTTIVNPTI VLLQHNREQQ KRLSSLSDPV SERRVGEQDS
      301 APTQEKPTSP GKAIEKRAKD DSRRVVKSTQ DLSDVSMDEV GIPLRNTERS KDWYKTMFKQ
      361 IHKLNRDDDS DLYSPRYSFS EDTKSPLSVP RSKSEMSYID GEKVVKRSAT LPLPARSSSL
      421 KSSSERNDWE PPDKKVDTRK YRAEPKSIYE YQPGKSSVLT NEKMSSAISP TPEISSETPG
      481 YIYSSNFHAV KRESDGAPGD LTSLENERQI YKSVLEGGDI PLQGLSGLKR PSSSASTKDS
      541 ESPRHFIPAD YLESTEEFIR RRHDDKEKLL ADQRRLKREQ EEADIAARRH TGVIPTHHQF
      601 ITNERFGDLL NIDDTAKRKS GSEKYDWA
//