LOCUS       BC042612                3215 bp    mRNA    linear   HUM 03-NOV-2005
DEFINITION  Homo sapiens sorbin and SH3 domain containing 1, mRNA (cDNA clone
            IMAGE:4821975), complete cds.
ACCESSION   BC042612
VERSION     BC042612.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3215)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3215)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 71 Row: e Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 5454055
            This clone has the following problem: The cds is short compared to
            the longest cds in the locus.
FEATURES             Location/Qualifiers
     source          1..3215
                     /db_xref="H-InvDB:HIT000052725"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4821975"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3215
                     /gene="SORBS1"
                     /gene_synonym="CAP"
                     /gene_synonym="DKFZp586P1422"
                     /gene_synonym="FLAF2"
                     /gene_synonym="FLJ12406"
                     /gene_synonym="KIAA1296"
                     /gene_synonym="R85FL"
                     /gene_synonym="SH3P12"
                     /gene_synonym="SORB1"
                     /db_xref="GeneID:10580"
                     /db_xref="HGNC:HGNC:14565"
                     /db_xref="IMGT/GENE-DB:14565"
                     /db_xref="MIM:605264"
     CDS             284..2170
                     /gene="SORBS1"
                     /gene_synonym="CAP"
                     /gene_synonym="DKFZp586P1422"
                     /gene_synonym="FLAF2"
                     /gene_synonym="FLJ12406"
                     /gene_synonym="KIAA1296"
                     /gene_synonym="R85FL"
                     /gene_synonym="SH3P12"
                     /gene_synonym="SORB1"
                     /codon_start=1
                     /product="SORBS1 protein"
                     /protein_id="AAH42612.1"
                     /db_xref="GeneID:10580"
                     /db_xref="HGNC:HGNC:14565"
                     /db_xref="IMGT/GENE-DB:14565"
                     /db_xref="MIM:605264"
                     /translation="MSSECDGGSKAVMNGLAPGSNGQDKDMDPTKICTGKGAVTLRAS
                     SSYRETPSSSPASPQETRQHESKPDEWRLSSSADANGNAQPSSLAAKGYRSVHPNLPS
                     DKSQDATSSSAAQPEVIVVPLYLVNTDRGQEGTARPPTPLGPLGCVPTIPATASAASP
                     LTFPTLDDFIPPHLQRWPHHSQPARASGSFAPISQTPPSFSPPPPLVPPAPEDLRRVS
                     EPDLTGAVSSTDSSPLLNEVSSSLIGTDSQAFPSVSKPSSAYPSTTIVNPTIVLLQHN
                     REQQKRLSSLSDPVSERRVGEQDSAPTQEKPTSPGKAIEKRAKDDSRRVVKSTQDLSD
                     VSMDEVGIPLRNTERSKDWYKTMFKQIHKLNRDDDSDLYSPRYSFSEDTKSPLSVPRS
                     KSEMSYIDGEKVVKRSATLPLPARSSSLKSSSERNDWEPPDKKVDTRKYRAEPKSIYE
                     YQPGKSSVLTNEKMSSAISPTPEISSETPGYIYSSNFHAVKRESDGAPGDLTSLENER
                     QIYKSVLEGGDIPLQGLSGLKRPSSSASTKDSESPRHFIPADYLESTEEFIRRRHDDK
                     EKLLADQRRLKREQEEADIAARRHTGVIPTHHQFITNERFGDLLNIDDTAKRKSGSEK
                     YDWA"
BASE COUNT          942 a          808 c          738 g          727 t
ORIGIN      
        1 agtagagatg ggttcctaat gtcatcgttt atttctcatg gctgccagta ataaacacga
       61 aagaagaact aaatatagtg agttatgtac tcaccagctc atggtattat cctgaacttg
      121 ccgcactgag aggaaattct gtagctctgt ttagtagatg gtacagagta aaccccaggc
      181 agagagtttt catcttcttg tcatcccgca gaatatttct gatgactaga gctctcagag
      241 cagttcagag ccccagttgc agacgacttg tcctgccacc accatgagtt ctgaatgtga
      301 tggtggttcc aaagctgtga tgaatggctt ggcacctggc agcaatgggc aagacaaaga
      361 catggatcct acaaaaatct gcactgggaa gggagcggtg actctccggg cctcgtcttc
      421 ctacagggaa accccaagca gtagccctgc gagccctcag gaaacccggc aacacgaaag
      481 caaaccagat gagtggaggc tttcttccag tgctgatgcc aatggaaatg cccagccctc
      541 ttcactcgct gccaagggct acagaagtgt gcatcccaac cttccttctg acaagtccca
      601 ggatgccact tcctccagtg cagcccagcc ggaggtaata gttgtccctc tctacctggt
      661 taatactgac agagggcaag aaggcactgc cagacctcca acacctctgg ggcctcttgg
      721 ctgcgtcccc acaatcccag cgactgcctc tgccgcctca cctctgacct tcccgactct
      781 agatgatttc attccccctc atctgcagag gtggccccac cacagccagc cagcccgcgc
      841 ctctggctcc tttgccccca ttagccagac gccaccatcc ttctcaccac cacctccgct
      901 ggtccctcct gccccggagg acctccgcag agtctcggag cctgacctca cgggagctgt
      961 ttcgagtacc gattccagtc ctctactaaa tgaagtttct tcttccctta ttggaactga
     1021 ttcccaagcc tttccatcag ttagcaagcc ttcatccgcc tatccctcca caacgattgt
     1081 caatcctact attgtgctct tgcaacacaa tcgagaacag caaaaacgac tcagtagcct
     1141 ttcagatcct gtctcagaaa gaagagtggg agagcaggac tcagcaccaa cccaggaaaa
     1201 acccacctca cctggcaagg ctattgaaaa aagagcaaag gatgacagta ggcgggtggt
     1261 gaagagcact caggacttaa gcgatgtttc catggatgaa gtgggcatcc cactccggaa
     1321 cactgagaga tcaaaagact ggtacaagac tatgtttaaa cagatccaca aactgaacag
     1381 agatgatgat tcagatctgt actctcccag atactcattt tctgaagaca caaaatctcc
     1441 cctttctgtg cctcgctcaa aaagtgagat gagctacatt gatggtgaga aggtagtcaa
     1501 gaggtcggcc acactacccc tcccagcccg ctcttcctca ctgaagtcaa gctcagaaag
     1561 aaatgactgg gaacccccag ataagaaagt agacacaaga aaatatcgtg cagagcccaa
     1621 gagcatttac gaatatcagc ctggcaagtc ttccgttctg accaacgaaa agatgagctc
     1681 agccatcagc cctactccgg aaatttcttc agagactcct ggatatatat attcttccaa
     1741 cttccatgca gtgaagaggg aatcagacgg ggctcctggg gatctcacta gcttggagaa
     1801 tgagagacaa atttataaaa gtgtcttgga aggtggtgac atccctcttc agggcctgag
     1861 tgggctcaag cgaccatcca gctctgcttc cactaaagat tcagaatcgc caagacattt
     1921 tataccagct gattacttgg aatccacgga agaatttatt cgaagacgtc atgatgataa
     1981 agagaaactt ttagcggacc agagacgact taaacgcgag caagaagagg ctgatattgc
     2041 agctcgacgc cacacaggcg tcattccgac gcaccatcag tttatcacta atgagcgctt
     2101 tggggacctc ctcaatatag acgatactgc aaaaaggaaa tctgggtcag agaaatatga
     2161 ctgggcatag tggcttacac ctgtaatccc agtactttgg gaagtcaagg tgggagaatt
     2221 gcttgagccc aggagttcga caccagcttg ggcaacataa tgagacctgc cagagccaaa
     2281 tttgacttta aagctcagac actggaggag cttcctctgc agaagggaga tattgtttac
     2341 atttataagc aaattgatca gaactggtat gaaggagaac accacggccg ggtgggaatc
     2401 ttcccacgca cctacatcga gcttcttcct cctgctgaga aggcacagcc caaaaagttg
     2461 acaccagtgc aggttttgga atatggagaa gctattgcta agtttaactt taatggtgat
     2521 acacaagtag aaatgtcctt cagaaagggt gagaggatca cactgctccg gcaggtagat
     2581 gagaactggt acgaagggag gatcccgggg acatcccgac aaggcatctt ccccatcacc
     2641 tacgtggatg tgatcaagcg accactggtg aaaaaccctg tggattacat ggacctgcct
     2701 ttctcctcct ccccaagtcg cagtgccact gcaagcccac agcaacctca agcccagcag
     2761 cgaagagtca cccccgacag gagtcaaacc tcacaagatt tatttagcta tcaagcatta
     2821 tatagctata taccacagaa tgatgatgag ttggaactcc gcgatggaga tatcgttgat
     2881 gtcatggaaa aatgtgacga tggatggttt gttggtactt caagaaggac aaagcagttt
     2941 ggtacttttc caggcaacta tgtaaaacct ttgtatctat aagaagactg aaaaccatgg
     3001 agattatttt tattggagga ggaagcatca ttcatgaacc gatcttttta gttgagtcag
     3061 taggaaaatt aatacagtgg ataaagtaag aagcaaaaga cagggacaga gaagtgttgt
     3121 gtttaaaacc caagcctgtc taaggttact gtgtattaga cagggccgaa ctagtgtgct
     3181 gagcaaaaaa aaaaaaaata aaaaaaaaaa aaaaa
//