LOCUS       BC001744                2907 bp    mRNA    linear   HUM 22-FEB-2007
DEFINITION  Homo sapiens SATB homeobox 1, mRNA (cDNA clone MGC:1624
            IMAGE:3533915), complete cds.
ACCESSION   BC001744
VERSION     BC001744.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2907)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2907)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald,  Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 8 Row: e Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 33356175.
FEATURES             Location/Qualifiers
     source          1..2907
                     /db_xref="H-InvDB:HIT000030618"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:1624 IMAGE:3533915"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2907
                     /gene="SATB1"
                     /db_xref="GeneID:6304"
                     /db_xref="HGNC:HGNC:10541"
                     /db_xref="MIM:602075"
     CDS             173..2464
                     /gene="SATB1"
                     /codon_start=1
                     /product="SATB1 protein"
                     /protein_id="AAH01744.1"
                     /db_xref="GeneID:6304"
                     /db_xref="HGNC:HGNC:10541"
                     /db_xref="MIM:602075"
                     /translation="MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGR
                     LGSTGAKMQGVPLKHSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVR
                     KDMLFNQLIEMALLSLGYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDV
                     YHVVTLKIQLHSCPKLEDLPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISS
                     IVNSTYYANVSAAKCQEFGRWYKHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPG
                     NTAEQPPSPAQLSHGSQPSVRTPLPNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVN
                     RLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAV
                     FARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERER
                     SLNAASAMGPAPLISTPPSRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKR
                     AKVSQALFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAI
                     YEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRL
                     PPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQL
                     DLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTL
                     FSVKLEEELSVEGNTDINTDLKD"
BASE COUNT          866 a          705 c          661 g          675 t
ORIGIN      
        1 aggcaactgg taaccacctc atttggggat gtttctgcct tgctagcagt gccagagaga
       61 acttcatcat tgtcacctca tcaaagacta ctttttcaga catctcctgt agggctagat
      121 tcagagagca gcttctgata tttggagggt gatctttaga cagtgactga gtatggatca
      181 tttgaacgag gcaactcagg ggaaagaaca ttcagaaatg tctaacaatg tgagtgatcc
      241 gaagggtcca ccagccaaga ttgcccgcct ggagcagaac gggagcccgc taggaagagg
      301 aaggcttggg agtacaggtg caaaaatgca gggagtgcct ttaaaacact cgggccatct
      361 gatgaaaacc aaccttagga aaggaaccat gctgccagtt ttctgtgtgg tggaacatta
      421 tgaaaacgcc attgaatatg attgcaagga ggagcatgca gaatttgtgc tggtgagaaa
      481 ggatatgctt ttcaaccagc tgatcgaaat ggcattgctg tctctaggtt attcacatag
      541 ctctgctgcc caggccaaag ggctaatcca ggttggaaag tggaatccag ttccactgtc
      601 ttacgtgaca gatgcccctg atgctacagt agcagatatg cttcaagatg tgtatcatgt
      661 ggtcacattg aaaattcagt tacacagttg ccccaaacta gaagacttgc ctcccgaaca
      721 atggtcgcac accacagtga ggaatgctct gaaggactta ctgaaagata tgaatcagag
      781 ttcattggcc aaggagtgcc ccctttcaca gagtatgatt tcttccattg tgaacagtac
      841 ttactatgca aatgtctcag cagcaaaatg tcaagaattt ggaaggtggt acaaacattt
      901 caagaagaca aaagatatga tggttgaaat ggatagtctt tctgagctat cccagcaagg
      961 cgccaatcat gtcaattttg gccagcaacc agttccaggg aacacagccg agcagcctcc
     1021 atcccctgcg cagctctccc atggcagcca gccctctgtc cggacacctc ttccaaacct
     1081 gcaccctggg ctcgtatcaa cacctatcag tcctcaattg gtcaaccagc agctggtgat
     1141 ggctcagctg ctgaaccagc agtatgcagt gaatagactt ttagcccagc agtccttaaa
     1201 ccaacaatac ttgaaccacc ctccccctgt cagtagatct atgaataagc ctttggagca
     1261 acaggtttcg accaacacag aggtgtcttc cgaaatctac cagtgggtac gcgatgaact
     1321 gaaacgagca ggaatctccc aggcggtatt tgcacgtgtg gcttttaaca gaactcaggg
     1381 cttgctttca gaaatcctcc gaaaggaaga ggaccccaag actgcatccc agtctttgct
     1441 ggtaaacctt cgggctatgc agaatttctt gcagttaccg gaagctgaaa gagaccgaat
     1501 ataccaggac gaaagggaaa ggagcttgaa tgctgcctcg gccatgggtc ctgcccccct
     1561 catcagcaca ccacccagcc gtcctcccca ggtgaaaaca gctactattg ccactgaaag
     1621 gaatgggaaa ccagagaaca ataccatgaa cattaatgct tccatttatg atgagattca
     1681 gcaggaaatg aagcgtgcta aagtgtctca agcactgttt gcaaaggttg cagcaaccaa
     1741 aagccaggga tggttgtgcg agctgttacg ctggaaagaa gatccttctc cagaaaacag
     1801 aaccctgtgg gagaacctct ccatgatccg aaggttcctc agtcttcctc agccagaacg
     1861 tgatgccatt tatgaacagg agagcaacgc ggtgcatcac catggcgaca ggccgcccca
     1921 cattatccat gttccagcag agcagattca gcaacagcag cagcaacagc aacagcagca
     1981 gcagcagcag caggcaccgc cgcctccaca gccacagcag cagccacaga caggccctcg
     2041 gctcccccca cggcaaccca cggtggcctc tccagcagag tcagatgagg aaaaccgaca
     2101 gaagacccgg ccacgaacaa aaatttcagt ggaagccttg ggaatcctcc agagtttcat
     2161 acaagacgtg ggcctgtacc ctgacgaaga ggccatccag actctgtctg cccagctcga
     2221 ccttcccaag tacaccatca tcaagttctt tcagaaccag cggtactatc tcaagcacca
     2281 cggcaaactg aaggacaatt ccggtttaga ggtcgatgtg gcagaatata aagaagagga
     2341 gctgctgaag gatttggaag agagtgtcca agataaaaat actaacaccc ttttttcagt
     2401 gaaactagaa gaagagctgt cagtggaagg aaacacagac attaatactg atttgaaaga
     2461 ctgagataaa agtatttgtt tcgttcaaca gtgccactgg tatttactaa caaaatgaaa
     2521 agtccacctt gtcttctctc agaaaacctt tgttgttcat tgtttggcca atgaatcttc
     2581 aaaaacttgc acaaacagaa aagttggaaa aggataatac agactgcact aaatgttttc
     2641 ctctgtttta caaactgctt ggcagcccca ggtgaagcat caaggattgt ttggtattaa
     2701 aatttgtgtt cacgggatgc accaaagtgt gtaccccgta agcatgaaac cagtgttttt
     2761 tgtttttttt ttagttctta ttccggagcc tcaaacaagc attatacctt ctgtgattat
     2821 gatttcctct cctataatta tttctgtagc actccacact gatctttgga aacttgcccc
     2881 ttatttaaaa aaaaaaaaaa aaaaaaa
//