LOCUS HUMELI 2677 bp mRNA linear HUM 27-APR-1993 DEFINITION Human erythroid isoform protein 4.1 mRNA, complete cds. ACCESSION J03796 VERSION J03796.1 KEYWORDS erythroid protein 4.1. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2677) AUTHORS Tang,T.K., Leto,T.L., Correas,I., Alonso,M.A., Marchesi,V.T. and Benz,E.J. Jr. TITLE Selective expression of an erythroid-specific isoform of protein 4.1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (11), 3713-3717 (1988) PUBMED 3375238 COMMENT Original source text: Human T-cell leukemia line MOLT-4, cDNA to mRNA, clones pTM-[1,2]. Draft entry and printed copy of sequence for [1] kindly provided by T.K.Tang, 26-APR-1988. FEATURES Location/Qualifiers source 1..2677 /db_xref="H-InvDB:HIT000191200" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" CDS 47..2374 /note="erythroid protein 4.1 isoform A" /codon_start=1 /protein_id="AAA35793.1" /translation="MTTEKSLVTEAENSQHQQKEEGEEAINSGQQEPQQEESCQTAAE GDNWCEQKLKASNGDTPTHEDLTKNKERTSESRGLSRLFSSFLKRPKSQVSEEEGKEV ESDKEKGEGGQKEIEFGTSLDEEIILKAPIAAPEPELKTDPSLDLHSLSSAETQPAQE ELREDPDFEIKEGEGLEECSKIEVKEESPQSKAETELKASQKPIRKHRNMHCKVSLLD DTVYECVVETWLDSAKEIKKQVRGVPWNFTFNVKFYPPDPAQLTEDITRYYLCLQLRQ DIVAGRLPCSFATLALLGSYTIQSELGDYDPELHGVDYVSDFKLAPNQTKELEEKVME LHKSYRSMTPAQADLEFLENAKKLSMYGVDLHKAKDLEGVDIILGVCSSGLLVYKDKL RINRFPWPKVLKISYKRSSFFIKIRPGEQEQYESTIGFKLPSYRAAKKLWKVCVEHHT FFRLTSTDTIPKSKFLALGSKFRYSGRTQAQTRQASALIDRPAPHFERTASKRASRSL DGAAAVDSADRSPRPTSAPAITQGQVAEGGVLDASAKKTVVPKAQKETVKAEVKKEDE PPEQAEPEPTEAWKDLDKSQEEIKKHHASISELKKNFMESVPEPRPSEWDKRLSTHSP FRTLNINGQIPTGEGPPLVKTQTVTISDNANAVKSEIPTKDVPIVHTETKTITYEAAQ TDDNSGDLDPGVLLTAQTITSETPSSTTTTKITKTVKGGISETRIEKRIVITGDADID HDQVLVQAIKEAKEQHPDMSVTKVVVHQETEIADE" CDS 674..2374 /note="erythroid protein 4.1 isoform B" /codon_start=1 /protein_id="AAA35794.1" /translation="MHCKVSLLDDTVYECVVETWLDSAKEIKKQVRGVPWNFTFNVKF YPPDPAQLTEDITRYYLCLQLRQDIVAGRLPCSFATLALLGSYTIQSELGDYDPELHG VDYVSDFKLAPNQTKELEEKVMELHKSYRSMTPAQADLEFLENAKKLSMYGVDLHKAK DLEGVDIILGVCSSGLLVYKDKLRINRFPWPKVLKISYKRSSFFIKIRPGEQEQYEST IGFKLPSYRAAKKLWKVCVEHHTFFRLTSTDTIPKSKFLALGSKFRYSGRTQAQTRQA SALIDRPAPHFERTASKRASRSLDGAAAVDSADRSPRPTSAPAITQGQVAEGGVLDAS AKKTVVPKAQKETVKAEVKKEDEPPEQAEPEPTEAWKDLDKSQEEIKKHHASISELKK NFMESVPEPRPSEWDKRLSTHSPFRTLNINGQIPTGEGPPLVKTQTVTISDNANAVKS EIPTKDVPIVHTETKTITYEAAQTDDNSGDLDPGVLLTAQTITSETPSSTTTTKITKT VKGGISETRIEKRIVITGDADIDHDQVLVQAIKEAKEQHPDMSVTKVVVHQETEIADE " BASE COUNT 873 a 604 c 603 g 597 t ORIGIN 828 bp upstream of HincII site; chromosome 1p32-1pter. 1 agaacgcggt cggcccggtc cccgccgcac ccagcccagc aacatcatga caacagagaa 61 gagtttagtg actgaggccg aaaattcaca gcaccaacag aaggaagagg gtgaggaagc 121 cataaactca ggccaacaag aacctcagca ggaggaatct tgtcaaacag cagctgaagg 181 agataattgg tgtgaacaga agctgaaagc ttctaatgga gacactccta cacatgaaga 241 cttgaccaag aacaaggagc ggacatcaga aagcagagga ctttcacgac tattctcctc 301 gtttctcaaa aggcccaaat ctcaggtgtc cgaggaagaa ggcaaagaag tagagtcaga 361 taaagaaaaa ggtgaaggag gtcagaaaga gatagaattt ggaaccagtc ttgatgaaga 421 gatcatttta aaggccccaa ttgcagctcc tgaaccggaa ctcaaaacag acccatcttt 481 ggatcttcat tcattaagca gtgcagaaac acagcctgct caggaagaac tcagagaaga 541 tccagatttt gaaattaagg aaggagaagg acttgaagag tgctccaaaa tagaagtaaa 601 agaagaaagc cctcaatcaa aagcagaaac agaattaaaa gcttcccaaa aaccaatcag 661 aaaacacagg aacatgcact gcaaggtttc tttgttggat gacacagttt atgaatgtgt 721 tgtggagaca tggctggatt ccgccaaaga aataaaaaag caggttcgtg gtgtcccttg 781 gaattttaca tttaatgtaa agttttatcc acctgaccca gcacagttaa cagaagacat 841 aacaagatat tatttatgtc ttcagcttcg gcaggacata gttgcaggac gtctgccctg 901 ttcctttgca accttagcat tattaggttc ttacaccatc cagtctgaac tgggagacta 961 cgacccagaa ctccatggcg tggattatgt tagtgatttt aaactggccc cgaatcagac 1021 caaggaactt gaagagaagg tcatggaact gcataagtca tacaggtcca tgactccagc 1081 tcaggctgac ttggagtttc ttgagaatgc caaaaagttg tctatgtatg gagttgatct 1141 tcataaagca aaggacttgg aaggagtaga tatcatccta ggtgtctgct ctagtggcct 1201 tctggtttac aaagataagc tgagaattaa ccgcttccct tggcccaaag tgctgaagat 1261 ttcttataaa cgtagtagct ttttcatcaa gattcggcct ggagagcaag agcagtatga 1321 aagtaccatc ggattcaaac ttcccagtta ccgagcagct aagaaattat ggaaagtctg 1381 tgtagaacat cacacgtttt tcagattgac atctacagac accattccca aaagcaaatt 1441 tcttgcgcta ggatccaaat ttcgatacag tggccggact caagctcaga ccaggcaagc 1501 tagtgctcta attgacaggc ctgccccaca cttcgagcgt acagcaagta aacgggcgtc 1561 ccggagcctc gatggagcag cagctgtcga ttcggcagac cgaagtcctc ggcccacttc 1621 tgcacctgcc attactcagg gtcaggttgc agaaggtggc gtcctagatg cctctgctaa 1681 aaaaacagtg gtccctaaag cacagaagga aacagtgaag gctgaagtga aaaaggaaga 1741 cgagccacct gagcaagctg agccagagcc cacagaagca tggaaggatt tagacaagag 1801 tcaagaggag atcaaaaaac atcatgccag catcagtgag ctgaaaaaga acttcatgga 1861 gtctgtacca gaaccacggc ctagtgaatg ggataaacgc ttatccactc actcaccctt 1921 ccgaactctt aacatcaatg ggcaaatccc cacaggagaa ggacctcccc tggtgaagac 1981 acaaactgtc accatctcag ataatgccaa tgctgtgaaa agtgaaatcc caaccaaaga 2041 cgtccctatt gtccacactg agaccaagac catcacttat gaggctgccc agactgacga 2101 caacagtgga gacttggacc caggagtctt gctgacagct caaactatca catctgagac 2161 cccaagcagc accaccacaa ctaaaattac caagactgta aaaggtggga tttcagagac 2221 acgtattgaa aagagaattg tgatcacagg agatgctgat attgaccatg atcaggtcct 2281 tgtacaagcc atcaaggagg caaaggagca gcacccagac atgtcagtga ccaaggtggt 2341 cgtccaccag gagaccgaga ttgctgatga gtgagctcag gaactaacct accccaactc 2401 tgcccttctc ccatccaaga gaaaccagca aaatgataaa gaagctaacc tgccatagtc 2461 agacttcaga ctttcaagat tattctaaat caccagaaaa ttaatttcag tttctattgg 2521 gagtttatac caagagattc ttctagatct cattgatcct tttgaagagc tttttctata 2581 ttaggatatc agaattgttc aacttttcac tctatagact gttttaagag ttttggggtt 2641 ttttttaatt gggtggtttg taaccccttc agcctag //