LOCUS BC001744 2907 bp mRNA linear HUM 22-FEB-2007
DEFINITION Homo sapiens SATB homeobox 1, mRNA (cDNA clone MGC:1624
IMAGE:3533915), complete cds.
ACCESSION BC001744
VERSION BC001744.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2907)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2907)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (16-JAN-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 8 Row: e Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 33356175.
FEATURES Location/Qualifiers
source 1..2907
/db_xref="H-InvDB:HIT000030618"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1624 IMAGE:3533915"
/tissue_type="Lung, small cell carcinoma"
/clone_lib="NIH_MGC_7"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2907
/gene="SATB1"
/db_xref="GeneID:6304"
/db_xref="HGNC:HGNC:10541"
/db_xref="MIM:602075"
CDS 173..2464
/gene="SATB1"
/codon_start=1
/product="SATB1 protein"
/protein_id="AAH01744.1"
/db_xref="GeneID:6304"
/db_xref="HGNC:HGNC:10541"
/db_xref="MIM:602075"
/translation="MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGR
LGSTGAKMQGVPLKHSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVR
KDMLFNQLIEMALLSLGYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDV
YHVVTLKIQLHSCPKLEDLPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISS
IVNSTYYANVSAAKCQEFGRWYKHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPG
NTAEQPPSPAQLSHGSQPSVRTPLPNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVN
RLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAV
FARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERER
SLNAASAMGPAPLISTPPSRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKR
AKVSQALFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAI
YEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRL
PPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQL
DLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTL
FSVKLEEELSVEGNTDINTDLKD"
BASE COUNT 866 a 705 c 661 g 675 t
ORIGIN
1 aggcaactgg taaccacctc atttggggat gtttctgcct tgctagcagt gccagagaga
61 acttcatcat tgtcacctca tcaaagacta ctttttcaga catctcctgt agggctagat
121 tcagagagca gcttctgata tttggagggt gatctttaga cagtgactga gtatggatca
181 tttgaacgag gcaactcagg ggaaagaaca ttcagaaatg tctaacaatg tgagtgatcc
241 gaagggtcca ccagccaaga ttgcccgcct ggagcagaac gggagcccgc taggaagagg
301 aaggcttggg agtacaggtg caaaaatgca gggagtgcct ttaaaacact cgggccatct
361 gatgaaaacc aaccttagga aaggaaccat gctgccagtt ttctgtgtgg tggaacatta
421 tgaaaacgcc attgaatatg attgcaagga ggagcatgca gaatttgtgc tggtgagaaa
481 ggatatgctt ttcaaccagc tgatcgaaat ggcattgctg tctctaggtt attcacatag
541 ctctgctgcc caggccaaag ggctaatcca ggttggaaag tggaatccag ttccactgtc
601 ttacgtgaca gatgcccctg atgctacagt agcagatatg cttcaagatg tgtatcatgt
661 ggtcacattg aaaattcagt tacacagttg ccccaaacta gaagacttgc ctcccgaaca
721 atggtcgcac accacagtga ggaatgctct gaaggactta ctgaaagata tgaatcagag
781 ttcattggcc aaggagtgcc ccctttcaca gagtatgatt tcttccattg tgaacagtac
841 ttactatgca aatgtctcag cagcaaaatg tcaagaattt ggaaggtggt acaaacattt
901 caagaagaca aaagatatga tggttgaaat ggatagtctt tctgagctat cccagcaagg
961 cgccaatcat gtcaattttg gccagcaacc agttccaggg aacacagccg agcagcctcc
1021 atcccctgcg cagctctccc atggcagcca gccctctgtc cggacacctc ttccaaacct
1081 gcaccctggg ctcgtatcaa cacctatcag tcctcaattg gtcaaccagc agctggtgat
1141 ggctcagctg ctgaaccagc agtatgcagt gaatagactt ttagcccagc agtccttaaa
1201 ccaacaatac ttgaaccacc ctccccctgt cagtagatct atgaataagc ctttggagca
1261 acaggtttcg accaacacag aggtgtcttc cgaaatctac cagtgggtac gcgatgaact
1321 gaaacgagca ggaatctccc aggcggtatt tgcacgtgtg gcttttaaca gaactcaggg
1381 cttgctttca gaaatcctcc gaaaggaaga ggaccccaag actgcatccc agtctttgct
1441 ggtaaacctt cgggctatgc agaatttctt gcagttaccg gaagctgaaa gagaccgaat
1501 ataccaggac gaaagggaaa ggagcttgaa tgctgcctcg gccatgggtc ctgcccccct
1561 catcagcaca ccacccagcc gtcctcccca ggtgaaaaca gctactattg ccactgaaag
1621 gaatgggaaa ccagagaaca ataccatgaa cattaatgct tccatttatg atgagattca
1681 gcaggaaatg aagcgtgcta aagtgtctca agcactgttt gcaaaggttg cagcaaccaa
1741 aagccaggga tggttgtgcg agctgttacg ctggaaagaa gatccttctc cagaaaacag
1801 aaccctgtgg gagaacctct ccatgatccg aaggttcctc agtcttcctc agccagaacg
1861 tgatgccatt tatgaacagg agagcaacgc ggtgcatcac catggcgaca ggccgcccca
1921 cattatccat gttccagcag agcagattca gcaacagcag cagcaacagc aacagcagca
1981 gcagcagcag caggcaccgc cgcctccaca gccacagcag cagccacaga caggccctcg
2041 gctcccccca cggcaaccca cggtggcctc tccagcagag tcagatgagg aaaaccgaca
2101 gaagacccgg ccacgaacaa aaatttcagt ggaagccttg ggaatcctcc agagtttcat
2161 acaagacgtg ggcctgtacc ctgacgaaga ggccatccag actctgtctg cccagctcga
2221 ccttcccaag tacaccatca tcaagttctt tcagaaccag cggtactatc tcaagcacca
2281 cggcaaactg aaggacaatt ccggtttaga ggtcgatgtg gcagaatata aagaagagga
2341 gctgctgaag gatttggaag agagtgtcca agataaaaat actaacaccc ttttttcagt
2401 gaaactagaa gaagagctgt cagtggaagg aaacacagac attaatactg atttgaaaga
2461 ctgagataaa agtatttgtt tcgttcaaca gtgccactgg tatttactaa caaaatgaaa
2521 agtccacctt gtcttctctc agaaaacctt tgttgttcat tgtttggcca atgaatcttc
2581 aaaaacttgc acaaacagaa aagttggaaa aggataatac agactgcact aaatgttttc
2641 ctctgtttta caaactgctt ggcagcccca ggtgaagcat caaggattgt ttggtattaa
2701 aatttgtgtt cacgggatgc accaaagtgt gtaccccgta agcatgaaac cagtgttttt
2761 tgtttttttt ttagttctta ttccggagcc tcaaacaagc attatacctt ctgtgattat
2821 gatttcctct cctataatta tttctgtagc actccacact gatctttgga aacttgcccc
2881 ttatttaaaa aaaaaaaaaa aaaaaaa
//