LOCUS       HSU70212                4007 bp    mRNA    linear   HUM 06-JUL-1997
DEFINITION  Human SIM1 mRNA, complete cds.
ACCESSION   U70212
VERSION     U70212.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4007)
  AUTHORS   Chrast,R., Scott,H.S., Chen,H., Kudoh,J., Rossier,C., Minoshima,S.,
            Wang,Y., Shimizu,N. and Antonarakis,S.E.
  TITLE     Cloning of two human homologs of the Drosophila single-minded gene
            SIM1 on chromosome 6q and SIM2 on 21q within the Down syndrome
            chromosomal region
  JOURNAL   Genome Res. 7 (6), 615-624 (1997)
   PUBMED   9199934
REFERENCE   2  (bases 1 to 4007)
  AUTHORS   Chrast,R., Rossier,C. and Antonarakis,S.E.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-SEP-1996) Medical Genetics, Geneva University Medical
            School, 1 rue Michel Servet, Geneva 1211, Switzerland
FEATURES             Location/Qualifiers
     source          1..4007
                     /db_xref="H-InvDB:HIT000221095"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="6"
                     /map="6q"
     gene            1..4007
                     /gene="SIM1"
     CDS             217..2517
                     /gene="SIM1"
                     /note="similar to the single-minded protein in Drosophila
                     melanogaster, SwissProt Accession Number P05709"
                     /codon_start=1
                     /product="hSIM1"
                     /protein_id="AAB62395.1"
                     /translation="MKEKSKNAARTRREKENSEFYELAKLLPLASAITSQVDKASIIR
                     LTTSYLKMRVVFPEGLGEAWGHSSRTSPLDNVGRELGSHLLQTLDGFIFVVAPDGKIM
                     YISETASVHLGLSQVELTGNSIYEYIHPADHDEMTAVLTAHQPYHSHFVQEYEIERSF
                     FLRMKCVLAKRNAGLTCGGYKVIHCSGYLKIRQYSLDMSPFDGCYQNVGLVAVGHSLP
                     PSAVTEIKLHSNMFMFRASLDMKLIFLDSRVAELTGYEPQDLIEKTLYHHVHGCDTFH
                     LRCAHHLLLVKGQVTTKYYRFLAKHGGWVWVQSYATIVHNSRSSRPHCIVSVNYVLTD
                     TEYKGLQLSLDQISASKPAFSYTSSSTPTMTDNRKGAKSRLSSSKSKSRTSPYPQYSG
                     FHTERSESDHDSQWGGSPLTDTASPQLLDPADRPGSQHDASCAYRQFSDRSSLCYGFA
                     LDHSRLVEERHFHTQACEGGRCEAGRYFLGTPQAGREPWWGSRAALPLTKASPESREA
                     YENSMPHIASVHRIHGRGHWDEDSVVSSPDPGSASESGDRYRTEQYQSSPHEPSKIET
                     LIRATQQMIKEEENRLQLRKAPSDQLASINGAGKKHSLCFANYQQPPPTGEVCHGSAL
                     ANTSPCDHIQQREGKMLSPHENDYDNSPTALSRISSPNSDRISKSSLILAKDYLHSDI
                     SPHQTAGDHPTVSPNCFGSHRQYFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSHFD
                     VTSHLRMQPDPAQGHKGTSVIITNGS"
BASE COUNT         1159 a          967 c          899 g          981 t
ORIGIN      
        1 ggatccgcgc gaattttcaa agaacatatt ttccgttcac ccccgctggt cttttactgc
       61 catcaataca ctgttcttgg tgcaaatacc tcagcctctt tattcaaagt atgttttatg
      121 ttttngccaa atatgatctc taattgaaag tttatttttg gttttggatg aatctgcgga
      181 gcttaagttg tgagaagaaa gggggaacaa gacacaatga aagaaaagtc caaaaatgct
      241 gcgcggacta ggagggagaa ggaaaacagc gaattttatg aactggctaa attactgcct
      301 ttggcctcgg ctatcacctc gcaggtggac aaagcatcca taatcagact cacgaccagc
      361 tatctcaaaa tgagagtggt gttcccagaa gggctcggcg aggcgtgggg ccactcaagt
      421 cggaccagcc ccctggacaa cgttggccga gaactgggct cccatctgct ccagaccctg
      481 gatggcttca tcttcgtggt agccccagat gggaagatca tgtacatctc agagacagcc
      541 tcagtccact tgggtctttc tcaggtagag ctgaccggaa acagcattta tgaatacatt
      601 cacccggcag accacgacga gatgacggcg gtgctcaccg cccatcaacc ctaccactct
      661 cacttcgtgc aggagtatga gatcgagcgc tccttcttcc tgaggatgaa gtgcgtcttg
      721 gccaagcgta acgccggcct cacctgtggc ggctacaagg tcatccactg cagcggctac
      781 ttgaagatcc gccagtacag cctggacatg tcccccttcg acggctgcta ccaaaacgtg
      841 ggcctggtgg ccgtgggcca ctcgctgcct cccagcgccg tcacggagat caagctacac
      901 agcaatatgt ttatgttccg cgccagcctg gacatgaagc tcatctttct ggactccagg
      961 gtggcggagc tgacggggta cgaacctcag gacctgattg agaagactct gtaccaccat
     1021 gtgcacggct gcgacacctt ccacctgcgc tgcgcgcacc atttgctgct ggtgaaggga
     1081 caggtgacca ccaagtacta caggttcctg gcgaaacacg gcggctgggt atgggtgcaa
     1141 agctacgcga ccatcgtgca caacagtcgc tcctccaggc cacactgtat cgtcagcgtc
     1201 aactatgtcc tcacagacac agaatacaaa gggctgcagc tctccctgga tcagatctca
     1261 gcctccaaac cagccttctc ctataccagc agctccaccc ccaccatgac tgacaacaga
     1321 aagggggcca aatcccggct ctccagctca aagtcaaaat ccaggacttc cccataccct
     1381 cagtattcgg gatttcacac agaaagatcg gaatctgatc atgacagcca gtggggcgga
     1441 agtcccttga ccgacacggc ctctccgcag cttctggacc ccgccgatag gcctggctcc
     1501 cagcacgacg catcgtgcgc ctacagacag ttttcggacc gcagctctct ctgctatggc
     1561 tttgcgcttg accactcgag gctggtggaa gagaggcatt tccataccca ggcctgtgaa
     1621 ggaggccgat gtgaggcagg caggtacttc ctgggaacgc cgcaggccgg gagggagccc
     1681 tggtggggct ctcgcgcagc cttgcccctg acaaaggcct ccccagaaag cagagaagcc
     1741 tatgaaaaca gcatgcctca catcgcttca gtccacagga tccatgggcg aggtcattgg
     1801 gatgaagata gtgtggtcag ttctccagac cctgggtcgg ccagtgaatc aggtgaccga
     1861 tatcgtactg agcagtatca aagtagccca catgaaccca gcaaaattga aactcttata
     1921 agagccactc agcaaatgat taaagaagaa gagaacagat tacagctaag gaaagccccc
     1981 tcagaccaac tggcttccat taatggggct gggaaaaaac actccctgtg ttttgcaaac
     2041 taccaacagc ccccaccaac aggtgaagtc tgccatggct ctgctcttgc caacacttca
     2101 ccatgtgacc atatccagca gagagaggga aaaatgttga gcccccatga aaatgactat
     2161 gacaacagtc ccaccgcact atctcggata agtagtccca attcggatcg catttcaaaa
     2221 tccagtttga tcctagctaa agactatctg cattcggata tatctcctca tcagacagca
     2281 ggagaccacc ctactgtctc tccaaactgc tttggctctc accggcagta ttttgacaag
     2341 catgcttaca cattaactgg atatgccctg gagcacttat atgacagcga aaccattaga
     2401 aactattcct tgggctgtaa tggctcacac tttgatgtaa cttcccatct gaggatgcaa
     2461 ccagacccag cacaaggaca caagggaaca tctgttataa taaccaacgg aagctgatgt
     2521 tttgctgaaa tattttgttc tttaaggatc tctgaaacat atttatagtt taatacccca
     2581 ttaccagcat ttactatgcc acagattgtt agagagtata acttaagtta ctgggtattt
     2641 gatacgtgtt cctataaaat caaagaaaac atagcactag cattcagggt tatacacaga
     2701 aaagggagct aaattgaata cacaaatttc ccctctaatt atatgggaac cagaatagat
     2761 aaattttgac ttgaaaaata ttcatgtaga tcaagtgtgc atatatacta catgagagga
     2821 ctgatgaatg acaacattgc attgtgacta tccagtgatc ctcaaacaca caaactatta
     2881 cttacaaact gcggtataca ttttacatat ggaaatatag gctatgtaat gtaaatacat
     2941 caaaaatggg taattttctt tgactctgtc acactaaact tcttaacgaa atttccattc
     3001 ccaaaataac tgagaaagag agagatacat cttataaact gacttctttg tggtttcaaa
     3061 tcagccagct catttggttc aggcataaat tagagaaatg gttctggata tggtgcaaaa
     3121 atgagttttc acctggtatc cattataaac aatcaggaag aggtaatttt tcaccttgct
     3181 tttcagttag acaaggacca ggattgcact gacatggcgc tgagggtttt tctaagtaag
     3241 aacactgaga tattgggaca cacatcaaaa acctggagtg ctcaattgga agtagttcta
     3301 tgaatatgga aaggccagag gcagagtgaa ataaaatgct atctcaaagt ttaacacaat
     3361 ttaagggctc agcataagta aacaacatat ttggggtttg cttgtaaaac caactaaata
     3421 aaaaattcaa accaattcac ccagaaaaaa gaccaatagg tgcaaaaata aaaggaaaac
     3481 cagtgaagtg ccacatgaca gcagtgttaa gtgtttgaaa acgtttcaaa gcacatatgt
     3541 gccaatgtga caacatgtgg aaagcctcag gagagagtct aagataaaag cttaggctga
     3601 tagacaagta gttaagagct aagagcagta ctctgaagga ataggcaaaa tgtttatttt
     3661 ccttattgtt tgtaaacaac aaacttggtc ttacatctgt gtggtatagt agaaaggcca
     3721 gctgactaga tctctggatt ctaattttgg ccctacctgt aacttaattt tgtgaccaca
     3781 gttgtaccat tcaccgtgcc tgggctctag tttcctggtt tgtaaggcag ccccagcgtt
     3841 catgttctgt gatagagcag aactgaactt attacctaat taactctctg ctatgagttg
     3901 tcaagactga tcattctgtt ttttctgtac acagaagttt agatgctttg tgacttaagc
     3961 aggtgtgtgg gctcctttag gcaggttaca gttaatttct agagtcg
//