LOCUS       AH002766                3043 bp    DNA     linear   HUM 10-JUN-2016
DEFINITION  Homo sapiens engrailed protein (EN1) gene, complete cds.
ACCESSION   AH002766 L12698 L12699
VERSION     AH002766.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3043)
  AUTHORS   Logan,C., Hanks,M.C., Noble-Topham,S., Nallainathan,D.,
            Provart,N.J. and Joyner,A.L.
  TITLE     Cloning and sequence comparison of the mouse, human, and chicken
            engrailed genes reveal potential functional domains and
            regulatory regions
  JOURNAL   Dev. Genet. 13 (5), 345-358 (1992)
   PUBMED   1363401
COMMENT     On or before Jun 10, 2016 this sequence version replaced L12698.1,
            L12699.1, AH002766.1.
FEATURES             Location/Qualifiers
     source          1..3043
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /clone_lib="EMBL3 genomic from Dr. N. Landau"
                     /dev_stage="adult"
     gene            <1..3043
                     /gene="EN1"
     mRNA            join(1..1877,1978..3043)
                     /gene="EN1"
                     /product="engrailed protein"
     exon            1..1877
                     /gene="EN1"
                     /number=1
     5'UTR           1..1015
                     /gene="EN1"
     CDS             join(1016..1877,1978..2294)
                     /gene="EN1"
                     /function="transcription factor"
                     /standard_name="engrailed"
                     /note="putative"
                     /codon_start=1
                     /product="engrailed protein"
                     /protein_id="AAA53502.2"
                     /translation="MEEQQPEPKSQRDSALGGAAAATPGGLSLSLSPGASGSSGSGSD
                     GDSVPVSPQPAPPSPPAAPCLPPLAHHPHLPPHPPPPPPQHLAAPAHQPQPAAQLHRT
                     TNFFIDNILRPDFGCKKEQPPPQLLVAAAARGGAGGGGRVERDRGQTAAGRDPVHPLG
                     TRAPGAASLLCAPDANCGPPDGSQPAAAGAGASKAGNPAAAAAAAAAAVAAAAAAAAA
                     KPSDTGGGGSGGGAGSPGAQGTKYPEHGNPAILLMGSANGGPVVKTDSQQPLVWPAWV
                     YCTRYSDRPSSGPRTRKLKKKKNEKEDKRPRTAFTAEQLQRLKAEFQANRYITEQRRQ
                     TLAQELSLNESQIKIWFQNKRAKIKKATGIKNGLALHLMAQGLYNHSTTTVQDKDESE
                     "
     misc_feature    2019..2198
                     /gene="EN1"
                     /note="Region: homeobox"
     gap             1878..1977
                     /estimated_length=unknown
     exon            1978..3043
                     /gene="EN1"
                     /number=2
     3'UTR           2295..3043
                     /gene="EN1"
                     /note="G00-119-869; putative"
BASE COUNT      623 a    929 c    830 g    561 t
ORIGIN
        1 gagctcacag acccataatc ctgcatttct ctaacaagtt gtttatggag ttgcttctcc
       61 atttgcctac atcccaaaat tcacccctcc cggtttcttc tgccccctcc tgagtcccgg
      121 cctgaaggag ggggagggac gcgggtgcgg cgcgggtggg ggagggcgga cccgacgcac
      181 agggccagcg ccgaggcgcc ccctctccgc cagcggttga cgcccccgga ttatttatcc
      241 gcaaagtccc gcgcgcgccc attgggccga ggcccgagtg tcagcgcgag tcccggctcg
      301 ccattggctc cgcacacgtg cggccctgac tcacgtgctt ccggtttgaa ggcaaaaagt
      361 gtgcctgggt gatttttttt ttaagcgaga gagtttgtgc aaagatccga gctgtcagag
      421 atttgaaaaa aaaaaaaaaa acaaaaaaaa aaaaaccagc ccggcgctgg cggagacgcg
      481 ctctccctgc aaaaaaagca aaggcgatta aaggcgctgc cagcctcacg ctctgggcac
      541 agctgagcgt gacactcggg gaagtcaaac ccctcactac tgcctaggaa gatggctaga
      601 ctttaaatac tatttttttc cctttaagaa aaaaattatt ggagcttttt ttcttgcttt
      661 ctttttcctt ttctttttct ttttttcctt catttttttg gccgtggctt actccccatt
      721 taaatcaaat cattgaatct ggttgcagaa agaaaaaaga aatagccaag tgtctccata
      781 tctggatgtc tacaaattag agagggagag acagcgagat ctatctgcta gataagaacg
      841 agcgatccag gccagacgcc tgagcttttt tcctgcaccc gccccgtgcc ttcgctgagg
      901 cttcgcctgc ctccttcctc cgcgcacccc cacgggccgc tggcaaagtg gggtggggag
      961 cgaggcggtg ggggcggggg ccggcgcggc ggccggggcg gcggggcggc cgagcatgga
     1021 agaacagcag ccggaaccta aaagtcagcg cgactcggcc ctcggcggcg cggcggcggc
     1081 gactccgggc ggcctcagcc tgagcctcag tccgggcgcc agcggcagca gcggcagcgg
     1141 cagcgatgga gacagcgtgc cggtgtcccc gcagcctgcg cccccctcgc cgcccgcggc
     1201 gccttgcctg ccgcccctgg cccaccaccc gcacctcccc ccacaccccc cgcccccgcc
     1261 gcctcagcat ctcgcggcgc ctgctcacca gccgcagcca gcggcccagc tgcaccgcac
     1321 caccaacttt ttcatcgaca acatcctgag gccggacttc ggctgcaaaa aggagcagcc
     1381 gccaccgcag cttctggtgg ctgcggcggc cagaggaggc gcaggaggag gaggccgggt
     1441 cgagcgtgac agaggccaga ctgccgcagg tagagaccct gtccacccgt tgggcacccg
     1501 ggcgccaggc gctgcctcgc tcctgtgcgc cccggacgcg aactgtggcc cacccgacgg
     1561 ctcccagcca gccgccgccg gcgcgggcgc gtctaaagct gggaacccgg ctgcggcggc
     1621 ggcggcggcc gcggcggcag tggcggcggc ggcggcggcc gcagcagcca agccctcgga
     1681 caccggtggc ggcggcagtg gaggcggcgc ggggagcccc ggagcgcagg gcaccaaata
     1741 cccggagcac ggcaacccgg ctatcctact tatgggctca gccaacggcg ggcccgtggt
     1801 caaaactgac tcgcagcagc ctctcgtatg gcccgcctgg gtgtactgca cacgttattc
     1861 ggatcgtcca tcctccgnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     1921 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnngtc
     1981 cgcgcaccag gaagctgaag aagaagaaga acgagaagga ggacaagcgg ccgcggaccg
     2041 cgttcacggc cgagcagctg cagagactca aggcggagtt ccaggcaaac cgctacatca
     2101 cggagcagcg gcggcagacc ctggcccagg aactcagcct caacgagtcc cagatcaaga
     2161 tctggttcca gaacaagcgc gccaagatca agaaagccac aggcatcaag aacggcctgg
     2221 cgctgcacct catggcccag ggactgtaca accactccac caccacggtc caggacaaag
     2281 acgagagcga gtagccgcca caggccgggg ccgcgcccgc gccccctccc ggcaccgccg
     2341 ccgtcgtctc ccggcccctc gctgggggag aaagcatctg ctccaaggag ggagggagcg
     2401 cagggaaaag agcgagagag acagaaagag agcctcagaa tggacaatga cgttgaaacg
     2461 cagcattttt gaaaagggag aaagactcgg acaggtgcta tcgaaaaata agatccattc
     2521 tctattccca gtataaggga cgaaactgcg aactccttaa agctctatct agccaaaccg
     2581 cttacgacct tgtatatatt taatttcagg taaggaaaac acatacgtgt agcgatctct
     2641 atttgctgga catttttatt aatctccttt attattattg ttataattat tataattatt
     2701 ataattattt tatggccctc ccccaccgcc tcgctgcccc cgcccagttt cgttttcgtt
     2761 gcctttttca tttgaatgtc attgcttctc cggtgcctcc cgacccgcat cgccggccct
     2821 ggtttctctg ggacttttct ttgtgtgcga gagtgtgttt cctttcgtgt ctgcccacct
     2881 cttctccccc acctcccggg tcccttctgt cggtctgtct gttctgcccc cctttcgttt
     2941 tccggagact tgttgagaaa tacgacccca cagactgcga gactgaaccg ccgctacaag
     3001 ccaaagattt tattatgttc agaaacctgt agtctgaaat aaa
//