LOCUS       BC005044                1663 bp    mRNA    linear   HUM 03-OCT-2003
DEFINITION  Homo sapiens nuclear factor (erythroid-derived 2), 45kDa, mRNA
            (cDNA clone MGC:12809 IMAGE:4040433), complete cds.
ACCESSION   BC005044
VERSION     BC005044.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1663)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1663)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 18 Row: o Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 5453773.
FEATURES             Location/Qualifiers
     source          1..1663
                     /db_xref="H-InvDB:HIT000032178"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:12809 IMAGE:4040433"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1663
                     /gene="NFE2"
                     /gene_synonym="NF-E2"
                     /gene_synonym="p45"
                     /db_xref="GeneID:4778"
                     /db_xref="MIM:601490"
     CDS             264..1385
                     /gene="NFE2"
                     /gene_synonym="NF-E2"
                     /gene_synonym="p45"
                     /codon_start=1
                     /product="NFE2 protein"
                     /protein_id="AAH05044.1"
                     /db_xref="GeneID:4778"
                     /db_xref="MIM:601490"
                     /translation="MSPCPPQQSRNRVIQLSTSELGEMELTWQEIMSITELQGLNAPS
                     EPSFEPQAPAPYLGPPPPTTYCPCSIHPDSGFPLPPPPYELPASTSHVPDPPYSYGNM
                     AIPVSKPLSLSGLLSEPLQDPLALLDIGLPAGPPKPQEDPESDSGLSLNYSDAESLEL
                     EGTEAGRRRSEYVEMYPVEYPYSLMPNSLAHSNYTLPAAETPLALEPSSGPVRAKPTA
                     RGEAGSRDERRALAMKIPFPTDKIVNLPVDDFNELLARYPLTESQLALVRDIRRRGKN
                     KVAAQNCRKRKLETIVQLERELERLTNERERLLRARGEADRTLEVMRQQLTELYRDIF
                     QHLRDESGNSYSPEEYALQQAADGTIFLVPRGTKMEATD"
BASE COUNT          381 a          513 c          443 g          326 t
ORIGIN      
        1 gtgcgcctgc ttggggctcc tgtgctcagc tcagcctgag cttccacact cagcgctcag
       61 caatggcccg ggggcggggc gcggtcctcg cagattctca aaggtagccg ggatcctcgt
      121 ccagcagtgt cagctcaggc tcagcctccc cagagacaac accgggagcc tcatctctct
      181 cctcaccctg ctgtgactcc accacaggtt tctagagcca tctgggcttt ccgggaacct
      241 ggaccagact ctggcccagt aggatgtccc cgtgtcctcc ccagcagagc aggaacaggg
      301 tgatacagct gtccacttca gagctaggag agatggaact gacttggcag gagatcatgt
      361 ccatcaccga gctgcagggt ctgaatgctc caagtgagcc atcatttgag ccccaagccc
      421 cagctccata ccttggacct ccaccaccca caacttactg cccctgctca atccacccag
      481 attctggctt cccacttcct ccaccacctt atgagctccc agcatccaca tcccatgtcc
      541 cagatccccc atactcctat ggcaacatgg ccataccagt ctccaagcca ctgagcctct
      601 caggcctgct cagtgagccg ctccaagacc ccttagccct cctggacatt gggctgccag
      661 cagggccacc taagccccaa gaagacccag aatccgactc aggattatcc ctcaactata
      721 gcgatgctga atctcttgag ctggagggga cagaggctgg tcggcggcgc agcgaatatg
      781 tagagatgta cccagtggag tacccctact cactcatgcc caactccttg gcccactcca
      841 actatacctt gccagctgct gagaccccct tggccttaga gccctcctca ggccctgtgc
      901 gggctaagcc cactgcacgg ggggaggcag ggagtcggga tgaacgtcgg gccttggcca
      961 tgaagattcc ttttcctacg gacaagattg tcaacttgcc ggtagatgac tttaatgagc
     1021 tattggcaag gtacccgctg acagagagcc agctagcgct agtccgggac atccgacgac
     1081 ggggcaaaaa caaggtggca gcccagaact gccgcaagag gaagctggaa accattgtgc
     1141 agctggagcg ggagctggag cggctgacca atgaacggga gcggcttctc agggcccgcg
     1201 gggaggcaga ccggaccctg gaggtcatgc gccaacagct gacagagctg taccgtgaca
     1261 ttttccagca ccttcgggat gaatcaggca acagctactc tcctgaagag tacgcgctgc
     1321 aacaggctgc cgatgggacc atcttccttg tgccccgggg gaccaagatg gaggccacag
     1381 actgagctgg cccagagggg tggaactgct gatgggattt ccttcattcc cttctgataa
     1441 aggtactccc caaccctgag tcccagaagg agctgagttc tctagaccag aagaggatga
     1501 caatggcaac aagtgtttgg aagttccaag gtgtgttcaa agaggcttgc cttgagggag
     1561 ggctggaatc tgtcttccct gactcggctc ctcaggtctt tagcctccac cttgtctaag
     1621 ctttggtcta taaagtgcgc tacagaaaaa aaaaaaaaaa aaa
//