LOCUS BC010623 3791 bp mRNA linear HUM 04-OCT-2003 DEFINITION Homo sapiens nuclear factor (erythroid-derived 2)-like 1, mRNA (cDNA clone MGC:9037 IMAGE:3871147), complete cds. ACCESSION BC010623 VERSION BC010623.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3791) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3791) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (10-JUL-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 13 Row: k Column: 7 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4505378. FEATURES Location/Qualifiers source 1..3791 /db_xref="H-InvDB:HIT000035017" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:9037 IMAGE:3871147" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..3791 /gene="NFE2L1" /gene_synonym="LCR-F1" /gene_synonym="NRF1" /gene_synonym="TCF11" /db_xref="GeneID:4779" /db_xref="MIM:163260" CDS 634..2862 /gene="NFE2L1" /gene_synonym="LCR-F1" /gene_synonym="NRF1" /gene_synonym="TCF11" /codon_start=1 /product="NFE2L1 protein" /protein_id="AAH10623.1" /db_xref="GeneID:4779" /db_xref="MIM:163260" /translation="MLSLKKYLTEGLLQFTILLSLIGVRVDVDTYLTSQLPPLREIIL GPSSAYTQTQFHNLRNTLDGYGIHPKSIDLDNYFTARRLLSQVRALDRFQVPTTEVNA WLVHRDPEGSVSGSQPNSGLALESSSGLQDVTGPDNGVRESETEQGFGEDLEDLGAVA PPVSGDLTKEDIDLIDILWRQDIDLGAGREVFDYSHRQKEQDVEKELRDGGEQDTWAG EGAEALARNLLVDGETGESFPAQFPADISSITEAVPSESEPPALQNNLLSPLLTGTES PFDLEQQWQDLMSIMEMQAMEVNTSASEILYSAPPGDPLSTNYSLAPNTPINQNVSLH QASLGGCSQDFLLFSPEVESLPVASSSTLLPLAPSNSTSLNSTFGSTNLTGLFFPPQL NGTANDTAGPELPDPLGGLLDEAMLDEISLMDLAIEEGFNPVQASQLEEEFDSDSGLS LDSSHSPSSLSSSEGSSSSSSSSSSSSSSASSSASSSFSEEGAVGYSSDSETLDLEEA EGAVGYQPEYSKFCRMSYQDPAQLSCLPYLEHVGHNHTYNMAPSALDSADLPPPSALK KGSKEKQADFLDKQMSRDEHRARAMKIPFTNDKIINLPVEEFNELLSKYQLSEAQLSL IRDIRRRGKNKMAAQNCRKRKLDTILNLERDVEDLQRDKARLLREKVEFLRSLRQMKQ KVQSLYQEVFGRLRDENGRPYSPSQYALQYAGDGSVLLIPRTMADQQARRQERKPKDR RK" misc_feature 2506..2691 /gene="NFE2L1" /gene_synonym="LCR-F1" /gene_synonym="NRF1" /gene_synonym="TCF11" /note="BRLZ; Region: basic region leucin zipper" /db_xref="CDD:smart00338" BASE COUNT 878 a 1023 c 1115 g 775 t ORIGIN 1 ggagggggcg gggaggtaag cggaggctcc gagctctagg ccggccggcg gtggcggcgg 61 cgaggccggg actcgggctt agggcctgct gtggaggcag cggcggacgc cgagctaagc 121 agtttctctg gaaacccccc tggtaagtgt ggaggaggcg ggacactctg acccaagacg 181 aaaggcctgt agctccagcc aaagaaaata aaccttagga gggagaagga aaaaaaaaat 241 ccatcagctg ttcctgagaa cagcctgcat tggaatctac agagaggaca actaatgtga 301 gtgaggaagt gactgtatgt ggactgtgga gaaagtaagt cacgtgggcc cttgaggacc 361 tggactgggt taggaacagt tgtactttca gaggtgaggt gtcgagaagg gaaagtgaat 421 gtggtctgga gtgtgtcctt ggccttggct ccacagggtg tgctttcctc tggggccgtc 481 agggagctca tcccttgtgt tctgccaggg tggggtacgg ggtttgacac tgaggagggt 541 aacctgctgg ctggagcggc agagcagtgg ccttgatttg tcttttggaa gattttaaaa 601 accaaaaagc ataaacattc tggtccttca gcaatgcttt ctctgaagaa atacttaacg 661 gaaggacttc tccagttcac cattctgctg agtttgattg gggtacgggt ggacgtggat 721 acttacctga cctcacagct tcccccactc cgggagatca tcctggggcc cagttctgcc 781 tatactcaga cccagttcca caacctgagg aataccttgg atggctatgg tatccacccc 841 aagagcatag acctggacaa ttacttcact gcccggcggc tcctcagtca ggtgagggcc 901 ctggacaggt tccaggtgcc aaccactgag gtaaatgcct ggctggttca ccgagaccca 961 gaggggtctg tctctggcag tcagcccaac tcaggcctcg ccctcgagag ttccagtggc 1021 ctccaagatg tgacaggccc agacaacggg gtgcgagaaa gcgaaacgga gcagggattc 1081 ggtgaagatt tggaggattt gggggctgta gcccccccag tcagtggaga cttaaccaaa 1141 gaggacatag atctgattga catcctttgg cgacaggata ttgatctggg ggctgggcgt 1201 gaggtttttg actatagtca ccgccagaag gagcaggatg tggagaagga gctgcgagat 1261 ggaggcgagc aggacacctg ggcaggcgag ggcgcggaag ctctggcacg gaacctgcta 1321 gtggatggag agactgggga gagcttccct gcacagtttc cagcagacat ttccagcata 1381 acagaagcag tgcctagtga gagtgagccc cctgctcttc aaaacaacct cttgtctcct 1441 cttctgaccg ggacagagtc accatttgat ttggaacagc agtggcaaga tctcatgtcc 1501 atcatggaaa tgcaggccat ggaagtgaac acatcagcaa gtgaaatcct gtacagtgcc 1561 cctcctggag acccactgag caccaactac agccttgccc ccaacactcc catcaatcag 1621 aatgtcagcc tgcatcaggc gtccctgggg ggctgcagcc aggacttctt actcttcagc 1681 cccgaggtgg aaagcctgcc tgtggccagt agctccacgc tgctcccgtt ggcccccagc 1741 aattctacca gcctcaactc caccttcggc tccaccaacc tgacagggct cttctttcca 1801 ccccagctca atggcacagc caatgacaca gcaggcccag agctgcctga ccctttgggg 1861 ggtctgttag atgaagctat gttggatgag atcagcctta tggacctggc cattgaagaa 1921 ggctttaacc ctgtgcaggc ctcccagctg gaggaggaat ttgactctga ctcaggcctt 1981 tccttagact cgagccatag cccttcttcc ctaagcagct ctgaaggcag ttcttcctct 2041 tcttcctcct cctcttcctc ttcttcctct gcttcttcct ctgcctcttc ctccttttct 2101 gaggaaggtg cggttggcta cagctctgac tctgagaccc tggatctgga agaggccgag 2161 ggtgctgtgg gctaccagcc tgagtattcc aagttctgcc gcatgagcta ccaggatcca 2221 gctcagctct catgcctgcc ctacctggag cacgtgggcc acaaccacac atacaacatg 2281 gcacccagtg ccctggactc agccgacctg ccaccaccca gtgccctcaa gaaaggcagc 2341 aaggagaagc aggctgactt cctggacaag cagatgagcc gggatgagca ccgagcccga 2401 gccatgaaga tccctttcac caatgacaaa atcatcaacc tgcctgtgga ggagttcaat 2461 gaactgctgt ccaaatacca gttgagtgaa gcccagctga gcctcatccg agacatccgg 2521 cgccggggca agaacaagat ggcggcgcag aactgccgca agcgcaagct ggacaccatc 2581 ctgaatctgg agcgtgatgt ggaggacctg cagcgtgaca aagcccggct gctgcgggag 2641 aaagtggagt tcctgcgctc cctgcgacag atgaagcaga aggtccagag cctgtaccag 2701 gaggtgtttg ggcggctgcg agatgagaac ggacgaccct actcgcccag tcagtatgcg 2761 ctccagtacg ccggggacgg cagtgtcctc ctcatccccc gcacgatggc cgaccagcag 2821 gcccggcggc aggagaggaa gccaaaggac cggagaaagt gagcctgggg aagaaggggg 2881 tttgaagccc accaagaccg aaactggaga agggctggac ctggacctgg acctggacct 2941 acagcgggga cttaaatgcc ttcttatcca atatatcttc tcagatggga tgactgcggg 3001 tcagtgtaca ggaagaggca ggcactggct ggctcagctc cactcgggtg gagtggaagt 3061 ggccagacca tttagacgga cagggtcctc accctacccc tttcctgtga ggcaggggtg 3121 gtggtggagt tgctggaggt agaggagcta tgtggagcaa aggccgacag aggggaagga 3181 atggacctgt gagaggaagg gaaggtggca gaaagtctca tttcaggaag gaggggcggt 3241 gttaactctt tctgctcctt gcattttgac atccctgaag gggagctctt ggatatcatt 3301 ggccatgttt caatcgaatg gagccactgg gccccaacac tggctttgag atttagagtc 3361 aaagggtaga gtgaacagga aagggtcacg tggtcccatg ttgcaacagc cccaacatca 3421 cgcatgtcat tcactgcctt gccactccat ctccctccgt gctccagcca cccctgagct 3481 gaggctccca ttgtctccat cagagcctgc atgtgtatgc cgtcctcccc tggtccggtg 3541 tttgtgttcc ccacccctca cagactgcct gagctcttct gtaagctggg gtagggtgat 3601 ggcagtgctc cgggaactgg gcctgcagcc ttcctcttct gggactgctg tgaggcagag 3661 gaatgatgga gaatctagtg tagcagcctc caggcaggat tcagcacaac actggggagt 3721 cacccttccc tcgggcctct gcctaccaac aactgggctt atcactggga aaacacaaaa 3781 aaaaaaaaaa a //