LOCUS       BC001744                2907 bp    mRNA    linear   HUM 22-FEB-2007
DEFINITION  Homo sapiens SATB homeobox 1, mRNA (cDNA clone MGC:1624
            IMAGE:3533915), complete cds.
ACCESSION   BC001744
VERSION     BC001744.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2907)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2907)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang,
            Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney,
            Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John,
            Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong,
            Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio,
            Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin,
            Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran,
            Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai,
            George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt,
            Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 8 Row: e Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 33356175.
FEATURES Location/Qualifiers source 1..2907 /db_xref="H-InvDB:HIT000030618" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:1624 IMAGE:3533915" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2907 /gene="SATB1" /db_xref="GeneID:6304" /db_xref="HGNC:HGNC:10541" /db_xref="MIM:602075" CDS 173..2464 /gene="SATB1" /codon_start=1 /product="SATB1 protein" /protein_id="AAH01744.1" /db_xref="GeneID:6304" /db_xref="HGNC:HGNC:10541" /db_xref="MIM:602075" /translation="MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGR LGSTGAKMQGVPLKHSGHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVR KDMLFNQLIEMALLSLGYSHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVADMLQDV YHVVTLKIQLHSCPKLEDLPPEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISS IVNSTYYANVSAAKCQEFGRWYKHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPG NTAEQPPSPAQLSHGSQPSVRTPLPNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVN RLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEIYQWVRDELKRAGISQAV FARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERER SLNAASAMGPAPLISTPPSRPPQVKTATIATERNGKPENNTMNINASIYDEIQQEMKR AKVSQALFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLPQPERDAI YEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQQQQQQQQAPPPPQPQQQPQTGPRL PPRQPTVASPAESDEENRQKTRPRTKISVEALGILQSFIQDVGLYPDEEAIQTLSAQL DLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEYKEEELLKDLEESVQDKNTNTL FSVKLEEELSVEGNTDINTDLKD" BASE COUNT 866 a 705 c 661 g 675 t ORIGIN 1 aggcaactgg taaccacctc atttggggat gtttctgcct tgctagcagt gccagagaga 61 acttcatcat tgtcacctca tcaaagacta ctttttcaga catctcctgt agggctagat 121 tcagagagca gcttctgata tttggagggt gatctttaga cagtgactga gtatggatca 181 tttgaacgag gcaactcagg ggaaagaaca ttcagaaatg tctaacaatg tgagtgatcc 241 gaagggtcca ccagccaaga ttgcccgcct ggagcagaac gggagcccgc taggaagagg 301 aaggcttggg agtacaggtg caaaaatgca gggagtgcct ttaaaacact cgggccatct 361 gatgaaaacc aaccttagga aaggaaccat gctgccagtt ttctgtgtgg tggaacatta 421 tgaaaacgcc attgaatatg attgcaagga ggagcatgca gaatttgtgc tggtgagaaa 481 ggatatgctt ttcaaccagc tgatcgaaat ggcattgctg tctctaggtt 
attcacatag 541 ctctgctgcc caggccaaag ggctaatcca ggttggaaag tggaatccag ttccactgtc 601 ttacgtgaca gatgcccctg atgctacagt agcagatatg cttcaagatg tgtatcatgt 661 ggtcacattg aaaattcagt tacacagttg ccccaaacta gaagacttgc ctcccgaaca 721 atggtcgcac accacagtga ggaatgctct gaaggactta ctgaaagata tgaatcagag 781 ttcattggcc aaggagtgcc ccctttcaca gagtatgatt tcttccattg tgaacagtac 841 ttactatgca aatgtctcag cagcaaaatg tcaagaattt ggaaggtggt acaaacattt 901 caagaagaca aaagatatga tggttgaaat ggatagtctt tctgagctat cccagcaagg 961 cgccaatcat gtcaattttg gccagcaacc agttccaggg aacacagccg agcagcctcc 1021 atcccctgcg cagctctccc atggcagcca gccctctgtc cggacacctc ttccaaacct 1081 gcaccctggg ctcgtatcaa cacctatcag tcctcaattg gtcaaccagc agctggtgat 1141 ggctcagctg ctgaaccagc agtatgcagt gaatagactt ttagcccagc agtccttaaa 1201 ccaacaatac ttgaaccacc ctccccctgt cagtagatct atgaataagc ctttggagca 1261 acaggtttcg accaacacag aggtgtcttc cgaaatctac cagtgggtac gcgatgaact 1321 gaaacgagca ggaatctccc aggcggtatt tgcacgtgtg gcttttaaca gaactcaggg 1381 cttgctttca gaaatcctcc gaaaggaaga ggaccccaag actgcatccc agtctttgct 1441 ggtaaacctt cgggctatgc agaatttctt gcagttaccg gaagctgaaa gagaccgaat 1501 ataccaggac gaaagggaaa ggagcttgaa tgctgcctcg gccatgggtc ctgcccccct 1561 catcagcaca ccacccagcc gtcctcccca ggtgaaaaca gctactattg ccactgaaag 1621 gaatgggaaa ccagagaaca ataccatgaa cattaatgct tccatttatg atgagattca 1681 gcaggaaatg aagcgtgcta aagtgtctca agcactgttt gcaaaggttg cagcaaccaa 1741 aagccaggga tggttgtgcg agctgttacg ctggaaagaa gatccttctc cagaaaacag 1801 aaccctgtgg gagaacctct ccatgatccg aaggttcctc agtcttcctc agccagaacg 1861 tgatgccatt tatgaacagg agagcaacgc ggtgcatcac catggcgaca ggccgcccca 1921 cattatccat gttccagcag agcagattca gcaacagcag cagcaacagc aacagcagca 1981 gcagcagcag caggcaccgc cgcctccaca gccacagcag cagccacaga caggccctcg 2041 gctcccccca cggcaaccca cggtggcctc tccagcagag tcagatgagg aaaaccgaca 2101 gaagacccgg ccacgaacaa aaatttcagt ggaagccttg ggaatcctcc agagtttcat 2161 acaagacgtg ggcctgtacc ctgacgaaga ggccatccag actctgtctg cccagctcga 2221 
ccttcccaag tacaccatca tcaagttctt tcagaaccag cggtactatc tcaagcacca 2281 cggcaaactg aaggacaatt ccggtttaga ggtcgatgtg gcagaatata aagaagagga 2341 gctgctgaag gatttggaag agagtgtcca agataaaaat actaacaccc ttttttcagt 2401 gaaactagaa gaagagctgt cagtggaagg aaacacagac attaatactg atttgaaaga 2461 ctgagataaa agtatttgtt tcgttcaaca gtgccactgg tatttactaa caaaatgaaa 2521 agtccacctt gtcttctctc agaaaacctt tgttgttcat tgtttggcca atgaatcttc 2581 aaaaacttgc acaaacagaa aagttggaaa aggataatac agactgcact aaatgttttc 2641 ctctgtttta caaactgctt ggcagcccca ggtgaagcat caaggattgt ttggtattaa 2701 aatttgtgtt cacgggatgc accaaagtgt gtaccccgta agcatgaaac cagtgttttt 2761 tgtttttttt ttagttctta ttccggagcc tcaaacaagc attatacctt ctgtgattat 2821 gatttcctct cctataatta tttctgtagc actccacact gatctttgga aacttgcccc 2881 ttatttaaaa aaaaaaaaaa aaaaaaa //