LOCUS BC042612 3215 bp mRNA linear HUM 03-NOV-2005
DEFINITION Homo sapiens sorbin and SH3 domain containing 1, mRNA (cDNA clone
IMAGE:4821975), complete cds.
ACCESSION BC042612
VERSION BC042612.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3215)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3215)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 71 Row: e Column: 6
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 5454055
This clone has the following problem: The cds is short compared to
the longest cds in the locus.
FEATURES Location/Qualifiers
source 1..3215
/db_xref="H-InvDB:HIT000052725"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4821975"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..3215
/gene="SORBS1"
/gene_synonym="CAP"
/gene_synonym="DKFZp586P1422"
/gene_synonym="FLAF2"
/gene_synonym="FLJ12406"
/gene_synonym="KIAA1296"
/gene_synonym="R85FL"
/gene_synonym="SH3P12"
/gene_synonym="SORB1"
/db_xref="GeneID:10580"
/db_xref="HGNC:HGNC:14565"
/db_xref="IMGT/GENE-DB:14565"
/db_xref="MIM:605264"
CDS 284..2170
/gene="SORBS1"
/gene_synonym="CAP"
/gene_synonym="DKFZp586P1422"
/gene_synonym="FLAF2"
/gene_synonym="FLJ12406"
/gene_synonym="KIAA1296"
/gene_synonym="R85FL"
/gene_synonym="SH3P12"
/gene_synonym="SORB1"
/codon_start=1
/product="SORBS1 protein"
/protein_id="AAH42612.1"
/db_xref="GeneID:10580"
/db_xref="HGNC:HGNC:14565"
/db_xref="IMGT/GENE-DB:14565"
/db_xref="MIM:605264"
/translation="MSSECDGGSKAVMNGLAPGSNGQDKDMDPTKICTGKGAVTLRAS
SSYRETPSSSPASPQETRQHESKPDEWRLSSSADANGNAQPSSLAAKGYRSVHPNLPS
DKSQDATSSSAAQPEVIVVPLYLVNTDRGQEGTARPPTPLGPLGCVPTIPATASAASP
LTFPTLDDFIPPHLQRWPHHSQPARASGSFAPISQTPPSFSPPPPLVPPAPEDLRRVS
EPDLTGAVSSTDSSPLLNEVSSSLIGTDSQAFPSVSKPSSAYPSTTIVNPTIVLLQHN
REQQKRLSSLSDPVSERRVGEQDSAPTQEKPTSPGKAIEKRAKDDSRRVVKSTQDLSD
VSMDEVGIPLRNTERSKDWYKTMFKQIHKLNRDDDSDLYSPRYSFSEDTKSPLSVPRS
KSEMSYIDGEKVVKRSATLPLPARSSSLKSSSERNDWEPPDKKVDTRKYRAEPKSIYE
YQPGKSSVLTNEKMSSAISPTPEISSETPGYIYSSNFHAVKRESDGAPGDLTSLENER
QIYKSVLEGGDIPLQGLSGLKRPSSSASTKDSESPRHFIPADYLESTEEFIRRRHDDK
EKLLADQRRLKREQEEADIAARRHTGVIPTHHQFITNERFGDLLNIDDTAKRKSGSEK
YDWA"
BASE COUNT 942 a 808 c 738 g 727 t
ORIGIN
1 agtagagatg ggttcctaat gtcatcgttt atttctcatg gctgccagta ataaacacga
61 aagaagaact aaatatagtg agttatgtac tcaccagctc atggtattat cctgaacttg
121 ccgcactgag aggaaattct gtagctctgt ttagtagatg gtacagagta aaccccaggc
181 agagagtttt catcttcttg tcatcccgca gaatatttct gatgactaga gctctcagag
241 cagttcagag ccccagttgc agacgacttg tcctgccacc accatgagtt ctgaatgtga
301 tggtggttcc aaagctgtga tgaatggctt ggcacctggc agcaatgggc aagacaaaga
361 catggatcct acaaaaatct gcactgggaa gggagcggtg actctccggg cctcgtcttc
421 ctacagggaa accccaagca gtagccctgc gagccctcag gaaacccggc aacacgaaag
481 caaaccagat gagtggaggc tttcttccag tgctgatgcc aatggaaatg cccagccctc
541 ttcactcgct gccaagggct acagaagtgt gcatcccaac cttccttctg acaagtccca
601 ggatgccact tcctccagtg cagcccagcc ggaggtaata gttgtccctc tctacctggt
661 taatactgac agagggcaag aaggcactgc cagacctcca acacctctgg ggcctcttgg
721 ctgcgtcccc acaatcccag cgactgcctc tgccgcctca cctctgacct tcccgactct
781 agatgatttc attccccctc atctgcagag gtggccccac cacagccagc cagcccgcgc
841 ctctggctcc tttgccccca ttagccagac gccaccatcc ttctcaccac cacctccgct
901 ggtccctcct gccccggagg acctccgcag agtctcggag cctgacctca cgggagctgt
961 ttcgagtacc gattccagtc ctctactaaa tgaagtttct tcttccctta ttggaactga
1021 ttcccaagcc tttccatcag ttagcaagcc ttcatccgcc tatccctcca caacgattgt
1081 caatcctact attgtgctct tgcaacacaa tcgagaacag caaaaacgac tcagtagcct
1141 ttcagatcct gtctcagaaa gaagagtggg agagcaggac tcagcaccaa cccaggaaaa
1201 acccacctca cctggcaagg ctattgaaaa aagagcaaag gatgacagta ggcgggtggt
1261 gaagagcact caggacttaa gcgatgtttc catggatgaa gtgggcatcc cactccggaa
1321 cactgagaga tcaaaagact ggtacaagac tatgtttaaa cagatccaca aactgaacag
1381 agatgatgat tcagatctgt actctcccag atactcattt tctgaagaca caaaatctcc
1441 cctttctgtg cctcgctcaa aaagtgagat gagctacatt gatggtgaga aggtagtcaa
1501 gaggtcggcc acactacccc tcccagcccg ctcttcctca ctgaagtcaa gctcagaaag
1561 aaatgactgg gaacccccag ataagaaagt agacacaaga aaatatcgtg cagagcccaa
1621 gagcatttac gaatatcagc ctggcaagtc ttccgttctg accaacgaaa agatgagctc
1681 agccatcagc cctactccgg aaatttcttc agagactcct ggatatatat attcttccaa
1741 cttccatgca gtgaagaggg aatcagacgg ggctcctggg gatctcacta gcttggagaa
1801 tgagagacaa atttataaaa gtgtcttgga aggtggtgac atccctcttc agggcctgag
1861 tgggctcaag cgaccatcca gctctgcttc cactaaagat tcagaatcgc caagacattt
1921 tataccagct gattacttgg aatccacgga agaatttatt cgaagacgtc atgatgataa
1981 agagaaactt ttagcggacc agagacgact taaacgcgag caagaagagg ctgatattgc
2041 agctcgacgc cacacaggcg tcattccgac gcaccatcag tttatcacta atgagcgctt
2101 tggggacctc ctcaatatag acgatactgc aaaaaggaaa tctgggtcag agaaatatga
2161 ctgggcatag tggcttacac ctgtaatccc agtactttgg gaagtcaagg tgggagaatt
2221 gcttgagccc aggagttcga caccagcttg ggcaacataa tgagacctgc cagagccaaa
2281 tttgacttta aagctcagac actggaggag cttcctctgc agaagggaga tattgtttac
2341 atttataagc aaattgatca gaactggtat gaaggagaac accacggccg ggtgggaatc
2401 ttcccacgca cctacatcga gcttcttcct cctgctgaga aggcacagcc caaaaagttg
2461 acaccagtgc aggttttgga atatggagaa gctattgcta agtttaactt taatggtgat
2521 acacaagtag aaatgtcctt cagaaagggt gagaggatca cactgctccg gcaggtagat
2581 gagaactggt acgaagggag gatcccgggg acatcccgac aaggcatctt ccccatcacc
2641 tacgtggatg tgatcaagcg accactggtg aaaaaccctg tggattacat ggacctgcct
2701 ttctcctcct ccccaagtcg cagtgccact gcaagcccac agcaacctca agcccagcag
2761 cgaagagtca cccccgacag gagtcaaacc tcacaagatt tatttagcta tcaagcatta
2821 tatagctata taccacagaa tgatgatgag ttggaactcc gcgatggaga tatcgttgat
2881 gtcatggaaa aatgtgacga tggatggttt gttggtactt caagaaggac aaagcagttt
2941 ggtacttttc caggcaacta tgtaaaacct ttgtatctat aagaagactg aaaaccatgg
3001 agattatttt tattggagga ggaagcatca ttcatgaacc gatcttttta gttgagtcag
3061 taggaaaatt aatacagtgg ataaagtaag aagcaaaaga cagggacaga gaagtgttgt
3121 gtttaaaacc caagcctgtc taaggttact gtgtattaga cagggccgaa ctagtgtgct
3181 gagcaaaaaa aaaaaaaata aaaaaaaaaa aaaaa
//