LOCUS HUMGS1 2058 bp mRNA linear HUM 31-DEC-1994 DEFINITION Human GS1 (protein of unknown function) mRNA, complete cds. ACCESSION M86934 VERSION M86934.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2058) AUTHORS Salido,E.C., Yen,P.H., Koprivnikar,K., Yu,L.C. and Shapiro,L.J. TITLE The human enamel protein gene amelogenin is expressed from both the X and the Y chromosomes JOURNAL Am. J. Hum. Genet. 50 (2), 303-316 (1992) PUBMED 1734713 REFERENCE 2 (sites) AUTHORS Yen,P.H., Ellison,J., Salido,E.C., Mohandas,T. and Shapiro,L. TITLE Isolation of a new gene from the distal short arm of the human X chromosome that escapes X-inactivation JOURNAL Hum. Mol. Genet. 1 (1), 47-52 (1992) PUBMED 1284467 COMMENT Original source text: Homo sapiens (tissue library: Clontech) cDNA to mRNA. FEATURES Location/Qualifiers source 1..2058 /db_xref="H-InvDB:HIT000196656" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /map="Xp22.3" /tissue_lib="Clontech" gene 1..2058 /gene="GS1" CDS 36..680 /gene="GS1" /note="Gene from Xp22.3 which escapes X-inactivation. Function unknown." /codon_start=1 /protein_id="AAA58622.1" /translation="MDGLLLDTERLYSVVFQEICNRYDKKYSWDVKSLVMGKKALEAA QIIIDVLQLPMSKEELVEESQTKLKEVFPMAALMPGAEKLIIHLRKHGIPFALATSSG SASFDMKTSRHKEFFSLFSHIVLGDDPEVQHGKPDPDIFLACAKRFSPPPAMEKCLVF EDAPNGVEAALAAGMQAVMVPDGNLSRDLTTKATLVLNSLQDFQPELFGLPSYE" regulatory 2030..2035 /regulatory_class="polyA_signal_sequence" /gene="GS1" polyA_site 2058 /gene="GS1" BASE COUNT 542 a 455 c 467 g 594 t ORIGIN Xp22.3. 1 cgcccccgca gcccgtcacc cacctcatct ttgacatgga cggacttctt ctggatactg 61 aacggctgta ttcagtggtg tttcaagaaa tatgtaatcg ctatgacaag aaatacagct 121 gggatgtaaa gtccctggtt atgggtaaga aggcattaga ggcggcacag attataatag 181 acgtcttgca gctcccgatg tccaaagagg agctggtgga agaaagccaa acgaagttaa 241 aggaagtgtt ccccatggct gcgctcatgc caggggcgga gaaactcatc atccacctgc 301 ggaaacatgg catccccttt gcactagcca ccagctcggg gtccgcgtcg ttcgatatga 361 agacaagccg ccacaaggag ttcttcagct tgttttccca cattgtgctg ggagatgacc 421 ccgaagtgca gcatggcaag ccagacccag acatcttcct agcttgtgcc aagaggttct 481 ctccccctcc tgctatggag aagtgccttg tctttgaaga tgctcccaat ggggtggagg 541 cggccctggc agctgggatg caggcggtca tggttcctga cggaaacttg agccgagatc 601 tgacaacaaa ggccaccctg gtgctgaatt ccctgcagga cttccagccc gagctgtttg 661 gtttgccctc ctatgagtga gagggagggc ctcagtcttc cgcccccagc ccactctcat 721 ggtccacact gctgggggaa agggaaagga aatcagcaac tcttcaatcc caacctgcgc 781 tgtgatttta gcctcctgag attggagttt ccatcccatg ttggtttgtc ccagtctaac 841 gtgttgataa aatgtgactt gacggttgag acaaaaaata cagtagagac agaaacgaag 901 cccagaacaa agatgaaact tgaattacca tctcagaagt caagctgatg gagtatgtga 961 taaagtgaat gtacatgtat atacacacac acctccatat atacacgtgt gtatcagttt 1021 ggtaatatgc aggtaggcat tacatgcata tgtatgtaga catatgcatg catgtatatg 1081 taaaatatat acttttccaa gacaaaatgg aacatcactt ctcctagttt ttctgaacac 1141 tggctgggaa atgtaaactg tgtatgcata taagtatatg ctttatgtat gcatatgtat 1201 gtagatatgt ttatatctat cgtctgcatc actctcctca gtgttgatgt caacatgcaa 1261 tgacaactga taaagcgaga tggtagttct gcctggtttg cagtctgagt gggaaagtcc 1321 tgtttttgat gagcactcct tgttagctaa catttaaatt ctttttgtga cctcagaatg 1381 tctctggatc tttcctcatt gactgactct gtgccacgtc atccatagtt tattgttagt 1441 atgaacacaa ctgtaacatt tacctggtat ctacatcctt acctgcattg gaaaatgttt 1501 gctacctcac aacaaccatt tgcctccttt aagaacactg atgggctgca ctttttggat 1561 agaaatagaa tttgatttca gaatgtatgc ttggtgagtc tcagtgccca ggaacacttt 1621 tggaataatt tatcagacat tgaacttctg tgattaatcg cttttataga tttactcagt 1681 ctttaaaatt cgtctctgat ttgccagaga aaaacggtgg tagccatgga aatcgggagt 1741 gaaggagcac tgcttcattg tggctcagcc cttcctaggg gcctctgccc tttgatgtcc 1801 ttgagctact cttcagctct ggaagttgtg gacaaaccgt aggaatgtat gtgtgcgtgt 1861 ggtggagtga ttgtctgtga atgacaggcc ctggctattg attgatgttg catcaattta 1921 gcaaattcat ttcctcattc ttgatggcct gaatatatgt ctgcactttt aatgctcctc 1981 ttaaccagtt gtaacatctt accatttccc taccaaattg aattagttta ataaaatctt 2041 ttgacacatg ttaaaaac //