LOCUS BC042612 3215 bp mRNA linear HUM 03-NOV-2005 DEFINITION Homo sapiens sorbin and SH3 domain containing 1, mRNA (cDNA clone IMAGE:4821975), complete cds. ACCESSION BC042612 VERSION BC042612.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3215) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3215) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 71 Row: e Column: 6 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 5454055 This clone has the following problem: The cds is short compared to the longest cds in the locus. FEATURES Location/Qualifiers source 1..3215 /db_xref="H-InvDB:HIT000052725" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4821975" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..3215 /gene="SORBS1" /gene_synonym="CAP" /gene_synonym="DKFZp586P1422" /gene_synonym="FLAF2" /gene_synonym="FLJ12406" /gene_synonym="KIAA1296" /gene_synonym="R85FL" /gene_synonym="SH3P12" /gene_synonym="SORB1" /db_xref="GeneID:10580" /db_xref="HGNC:HGNC:14565" /db_xref="IMGT/GENE-DB:14565" /db_xref="MIM:605264" CDS 284..2170 /gene="SORBS1" /gene_synonym="CAP" /gene_synonym="DKFZp586P1422" /gene_synonym="FLAF2" /gene_synonym="FLJ12406" /gene_synonym="KIAA1296" /gene_synonym="R85FL" /gene_synonym="SH3P12" /gene_synonym="SORB1" /codon_start=1 /product="SORBS1 protein" /protein_id="AAH42612.1" /db_xref="GeneID:10580" /db_xref="HGNC:HGNC:14565" /db_xref="IMGT/GENE-DB:14565" /db_xref="MIM:605264" /translation="MSSECDGGSKAVMNGLAPGSNGQDKDMDPTKICTGKGAVTLRAS SSYRETPSSSPASPQETRQHESKPDEWRLSSSADANGNAQPSSLAAKGYRSVHPNLPS DKSQDATSSSAAQPEVIVVPLYLVNTDRGQEGTARPPTPLGPLGCVPTIPATASAASP LTFPTLDDFIPPHLQRWPHHSQPARASGSFAPISQTPPSFSPPPPLVPPAPEDLRRVS EPDLTGAVSSTDSSPLLNEVSSSLIGTDSQAFPSVSKPSSAYPSTTIVNPTIVLLQHN REQQKRLSSLSDPVSERRVGEQDSAPTQEKPTSPGKAIEKRAKDDSRRVVKSTQDLSD VSMDEVGIPLRNTERSKDWYKTMFKQIHKLNRDDDSDLYSPRYSFSEDTKSPLSVPRS KSEMSYIDGEKVVKRSATLPLPARSSSLKSSSERNDWEPPDKKVDTRKYRAEPKSIYE YQPGKSSVLTNEKMSSAISPTPEISSETPGYIYSSNFHAVKRESDGAPGDLTSLENER QIYKSVLEGGDIPLQGLSGLKRPSSSASTKDSESPRHFIPADYLESTEEFIRRRHDDK EKLLADQRRLKREQEEADIAARRHTGVIPTHHQFITNERFGDLLNIDDTAKRKSGSEK YDWA" BASE COUNT 942 a 808 c 738 g 727 t ORIGIN 1 agtagagatg ggttcctaat gtcatcgttt atttctcatg gctgccagta ataaacacga 61 aagaagaact aaatatagtg agttatgtac tcaccagctc atggtattat cctgaacttg 121 ccgcactgag aggaaattct gtagctctgt ttagtagatg gtacagagta aaccccaggc 181 agagagtttt catcttcttg tcatcccgca gaatatttct gatgactaga gctctcagag 241 cagttcagag ccccagttgc agacgacttg tcctgccacc accatgagtt ctgaatgtga 301 tggtggttcc aaagctgtga tgaatggctt ggcacctggc agcaatgggc aagacaaaga 361 catggatcct acaaaaatct gcactgggaa gggagcggtg actctccggg cctcgtcttc 421 ctacagggaa accccaagca gtagccctgc gagccctcag gaaacccggc aacacgaaag 481 caaaccagat gagtggaggc tttcttccag tgctgatgcc aatggaaatg cccagccctc 541 ttcactcgct gccaagggct acagaagtgt gcatcccaac cttccttctg acaagtccca 601 ggatgccact tcctccagtg cagcccagcc ggaggtaata gttgtccctc tctacctggt 661 taatactgac agagggcaag aaggcactgc cagacctcca acacctctgg ggcctcttgg 721 ctgcgtcccc acaatcccag cgactgcctc tgccgcctca cctctgacct tcccgactct 781 agatgatttc attccccctc atctgcagag gtggccccac cacagccagc cagcccgcgc 841 ctctggctcc tttgccccca ttagccagac gccaccatcc ttctcaccac cacctccgct 901 ggtccctcct gccccggagg acctccgcag agtctcggag cctgacctca cgggagctgt 961 ttcgagtacc gattccagtc ctctactaaa tgaagtttct tcttccctta ttggaactga 1021 ttcccaagcc tttccatcag ttagcaagcc ttcatccgcc tatccctcca caacgattgt 1081 caatcctact attgtgctct tgcaacacaa tcgagaacag caaaaacgac tcagtagcct 1141 ttcagatcct gtctcagaaa gaagagtggg agagcaggac tcagcaccaa cccaggaaaa 1201 acccacctca cctggcaagg ctattgaaaa aagagcaaag gatgacagta ggcgggtggt 1261 gaagagcact caggacttaa gcgatgtttc catggatgaa gtgggcatcc cactccggaa 1321 cactgagaga tcaaaagact ggtacaagac tatgtttaaa cagatccaca aactgaacag 1381 agatgatgat tcagatctgt actctcccag atactcattt tctgaagaca caaaatctcc 1441 cctttctgtg cctcgctcaa aaagtgagat gagctacatt gatggtgaga aggtagtcaa 1501 gaggtcggcc acactacccc tcccagcccg ctcttcctca ctgaagtcaa gctcagaaag 1561 aaatgactgg gaacccccag ataagaaagt agacacaaga aaatatcgtg cagagcccaa 1621 gagcatttac gaatatcagc ctggcaagtc ttccgttctg accaacgaaa agatgagctc 1681 agccatcagc cctactccgg aaatttcttc agagactcct ggatatatat attcttccaa 1741 cttccatgca gtgaagaggg aatcagacgg ggctcctggg gatctcacta gcttggagaa 1801 tgagagacaa atttataaaa gtgtcttgga aggtggtgac atccctcttc agggcctgag 1861 tgggctcaag cgaccatcca gctctgcttc cactaaagat tcagaatcgc caagacattt 1921 tataccagct gattacttgg aatccacgga agaatttatt cgaagacgtc atgatgataa 1981 agagaaactt ttagcggacc agagacgact taaacgcgag caagaagagg ctgatattgc 2041 agctcgacgc cacacaggcg tcattccgac gcaccatcag tttatcacta atgagcgctt 2101 tggggacctc ctcaatatag acgatactgc aaaaaggaaa tctgggtcag agaaatatga 2161 ctgggcatag tggcttacac ctgtaatccc agtactttgg gaagtcaagg tgggagaatt 2221 gcttgagccc aggagttcga caccagcttg ggcaacataa tgagacctgc cagagccaaa 2281 tttgacttta aagctcagac actggaggag cttcctctgc agaagggaga tattgtttac 2341 atttataagc aaattgatca gaactggtat gaaggagaac accacggccg ggtgggaatc 2401 ttcccacgca cctacatcga gcttcttcct cctgctgaga aggcacagcc caaaaagttg 2461 acaccagtgc aggttttgga atatggagaa gctattgcta agtttaactt taatggtgat 2521 acacaagtag aaatgtcctt cagaaagggt gagaggatca cactgctccg gcaggtagat 2581 gagaactggt acgaagggag gatcccgggg acatcccgac aaggcatctt ccccatcacc 2641 tacgtggatg tgatcaagcg accactggtg aaaaaccctg tggattacat ggacctgcct 2701 ttctcctcct ccccaagtcg cagtgccact gcaagcccac agcaacctca agcccagcag 2761 cgaagagtca cccccgacag gagtcaaacc tcacaagatt tatttagcta tcaagcatta 2821 tatagctata taccacagaa tgatgatgag ttggaactcc gcgatggaga tatcgttgat 2881 gtcatggaaa aatgtgacga tggatggttt gttggtactt caagaaggac aaagcagttt 2941 ggtacttttc caggcaacta tgtaaaacct ttgtatctat aagaagactg aaaaccatgg 3001 agattatttt tattggagga ggaagcatca ttcatgaacc gatcttttta gttgagtcag 3061 taggaaaatt aatacagtgg ataaagtaag aagcaaaaga cagggacaga gaagtgttgt 3121 gtttaaaacc caagcctgtc taaggttact gtgtattaga cagggccgaa ctagtgtgct 3181 gagcaaaaaa aaaaaaaata aaaaaaaaaa aaaaa //