LOCUS       BC012151                4594 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens nuclear transcription factor, X-box binding 1, mRNA
            (cDNA clone MGC:20369 IMAGE:4558442), complete cds.
ACCESSION   BC012151
VERSION     BC012151.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4594)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4594)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald,  Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: d Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 22212923.
FEATURES             Location/Qualifiers
     source          1..4594
                     /db_xref="H-InvDB:HIT000035694"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:20369 IMAGE:4558442"
                     /tissue_type="Eye, retinoblastoma"
                     /clone_lib="NIH_MGC_16"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..4594
                     /gene="NFX1"
                     /gene_synonym="MGC20369"
                     /gene_synonym="NFX2"
                     /db_xref="GeneID:4799"
                     /db_xref="HGNC:HGNC:7803"
                     /db_xref="MIM:603255"
     CDS             27..3389
                     /gene="NFX1"
                     /gene_synonym="MGC20369"
                     /gene_synonym="NFX2"
                     /codon_start=1
                     /product="nuclear transcription factor, X-box binding 1"
                     /protein_id="AAH12151.1"
                     /db_xref="GeneID:4799"
                     /db_xref="HGNC:HGNC:7803"
                     /db_xref="MIM:603255"
                     /translation="MAEAPPVSGTFKFNTDAAEFIPQEKKNSGLNCGTQRRLDSNRIG
                     RRNYSSPPPCHLSRQVPYDEISAVHQHSYHPSGSKPKSQQTSFQSSPCNKSPKSHGLQ
                     NQPWQKLRNEKHHIRVKKAQSLAEQTSDTAGLESSTRSESGTDLREHSPSESEKEVVG
                     ADPRGAKPKKATQFVYSYGRGPKVKGKLKCEWSNRTTPKPEDAGPESTKPVGVFHPDS
                     SEASSRKGVLDGYGARRNEQRRYPQKRPPWEVEGARPRPGRNPPKQEGHRHTNAGHRN
                     NMGPIPKDDLNERPAKSTCDSENLAVINKSSRRVDQEKCTVRRQDPQVVSPFSRGKQN
                     HVLKNVETHTGSLIEQLTTEKYECMVCCELVRVTAPVWSCQSCYHVFHLNCIKKWARS
                     PASQADGQSGWRCPACQNVSAHVPNTYTCFCGKVKNPEWSRNEIPHSCGEVCRKKQPG
                     QDCPHSCNLLCHPGPCPPCPAFMTKTCECGRTRHTVRCGQAVSVHCSNPCENILNCGQ
                     HQCAELCHGGQCQPCQIILNQVCYCGSTSRDVLCGTDVGKSDGFGDFSCLKICGKDLK
                     CGNHTCSQVCHPQPCQQCPRLPQLVRCCPCGQTPLSQLLELGSSSRKTCMDPVPSCGK
                     VCGKPLPCGSLDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATF
                     MCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGLHRCEEPCHRGNCQTCWQ
                     ASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFL
                     TQKWCMGKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPC
                     TTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGRRKEMVICSEASSTYQRIAAI
                     SMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDP
                     FNIRSSGSKFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDH
                     RRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRGKSVCPPTTLTGVLEREMQARPPPPI
                     PHHRHQSDKNPGSSNLQKITKEPIIDYFDVQD"
BASE COUNT         1315 a         1064 c         1092 g         1123 t
ORIGIN      
        1 gctcgatcta ggttctgcgg cacgggatgg cggaggcgcc tcctgtctca ggtactttta
       61 aattcaatac agatgctgct gaattcattc ctcaggagaa aaaaaattct ggtctaaatt
      121 gtgggactca aaggagacta gactctaata ggattggtag aagaaattac agttcaccac
      181 ctccctgtca cctttccagg caggtccctt atgatgaaat ctctgctgtt catcagcata
      241 gttatcatcc gtcaggaagc aaacctaaga gtcagcagac gtctttccag tcctctcctt
      301 gtaataaatc gcccaagagc catggccttc agaatcaacc ttggcagaaa ttgaggaatg
      361 agaagcacca tatcagagtc aagaaagcac agagtcttgc tgagcagacc tcagatacag
      421 ctggattaga gagctcgacc agatcagaga gtgggacaga cctcagagag catagtcctt
      481 ctgagagtga gaaggaagtt gtgggtgcag atcccagggg agcaaaaccc aaaaaagcaa
      541 cacagtttgt atacagctat ggtagaggac caaaagtcaa ggggaaactc aaatgtgaat
      601 ggagtaaccg aacaactcca aaaccggagg atgctggacc cgaaagtacc aaacctgtgg
      661 gggttttcca ccctgactct tcagaggcat cctctagaaa aggagtattg gatgggtatg
      721 gagccagacg aaatgagcag agaagatacc cacagaaaag gcctccctgg gaagtggagg
      781 gggccaggcc acgaccaggc agaaatccac caaaacagga gggccaccga catacaaacg
      841 caggacacag aaacaacatg ggccccattc caaaggatga cctcaatgaa agaccagcaa
      901 aatctacctg tgacagtgag aacttggcag tcatcaacaa gtcttccagg agggttgacc
      961 aagagaaatg cactgtacgg aggcaggatc ctcaagtagt atctcctttc tcccgaggca
     1021 aacagaacca tgtgctaaag aatgtggaaa cgcacacagg ttctctaatt gaacaactaa
     1081 caacagaaaa atacgagtgc atggtgtgct gtgaattggt tcgtgtcacg gccccagtgt
     1141 ggagttgtca gagctgttac catgtgtttc atttgaactg cataaagaaa tgggcaaggt
     1201 ctccagcatc tcaagcagat ggccagagtg gttggaggtg ccctgcctgt cagaatgttt
     1261 ctgcacatgt tcctaatacc tacacttgtt tctgtggcaa ggtaaagaat cctgagtgga
     1321 gcagaaatga aattccacat agctgtggtg aggtttgtag aaagaaacag cctggccagg
     1381 actgcccaca ttcctgtaac cttctctgcc atccaggacc ctgcccaccc tgccctgcct
     1441 ttatgacaaa aacatgtgaa tgtggacgaa ccaggcacac agttcgctgt ggtcaggctg
     1501 tctcagtcca ctgttctaac ccatgtgaga atattttgaa ctgtggtcag caccagtgtg
     1561 ctgagctgtg ccatgggggt cagtgccagc cttgccagat cattttgaac caggtatgct
     1621 attgcggcag cacctcccga gatgtgttat gtggaaccga tgtaggaaag tctgatggat
     1681 ttggggattt cagctgttta aagatatgtg gcaaggactt gaaatgcggt aaccatacat
     1741 gttcgcaagt gtgccaccct cagccctgcc agcaatgccc acggctcccc cagctggtgc
     1801 gctgttgccc ctgtggccaa actcctctca gccaattgct agaacttgga agtagtagtc
     1861 ggaaaacatg catggaccct gtgccttcat gtggaaaagt gtgcggcaag cctctgcctt
     1921 gtggttcctt agatttcatt catacctgtg aaaagctctg ccatgaagga gactgtggac
     1981 catgctctcg cacatcagtt atttcctgca gatgctcttt cagaacaaag gagcttccat
     2041 gtaccagtct caaaagtgaa gatgctacat ttatgtgtga caagcggtgt aacaagaaac
     2101 ggttgtgtgg acggcataaa tgtaatgaga tatgctgtgt ggataaggag cacaagtgtc
     2161 ctttgatttg tgggaggaaa ctccgttgtg gccttcatag gtgtgaagaa ccttgtcatc
     2221 gtggaaactg ccagacatgc tggcaagcca gttttgatga attaacctgc cattgtggtg
     2281 catcagtgat ttaccctcca gttccctgtg gtactaggcc ccctgaatgt acccaaacct
     2341 gcgctagagt ccatgagtgt gaccatccag tatatcattc ttgtcatagt gaggagaagt
     2401 gtcccccttg cactttccta actcagaagt ggtgcatggg caagcatgag tttcggagca
     2461 acatcccctg tcacctggtt gatatctctt gcggattacc ctgcagtgcc acgctaccat
     2521 gtgggatgca caaatgtcag agactctgtc acaaagggga gtgtcttgtg gatgagccct
     2581 gcaagcagcc ctgcaccacc cccagagctg actgtggtca cccgtgtatg gcaccctgcc
     2641 ataccagctc accctgccct gtgactgctt gtaaagctaa ggtagagcta cagtgtgaat
     2701 gtggacgaag aaaagagatg gtgatttgct ctgaagcatc tagtacttat caaagaatag
     2761 ctgcaatctc catggcctct aagataacag acatgcagct tggaggttca gtggagatca
     2821 gcaagttaat taccaaaaag gaagttcatc aagccaggct ggagtgtgat gaggagtgtt
     2881 cagccttgga aaggaaaaag agattagcag aggcatttca tatcagtgag gattctgatc
     2941 ctttcaatat acgttcttca gggtcaaaat tcagtgatag tttgaaagaa gatgccagga
     3001 aggacttaaa gtttgtcagt gacgttgaga aggaaatgga aaccctcgtg gaggccgtga
     3061 ataagggaaa gaatagtaag aaaagccaca gcttccctcc catgaacaga gaccaccgcc
     3121 ggatcatcca tgacttggcc caagtttatg gcctggagag cgtgagctat gacagtgaac
     3181 cgaagcgcaa tgtggtggtc actgccatca gggggaagtc cgtttgtcct cctaccacgc
     3241 tgacaggtgt gcttgaaagg gaaatgcagg cacggcctcc accaccgatt cctcatcaca
     3301 gacatcagtc agacaagaat cctgggagca gtaatttaca gaaaataacc aaggagccaa
     3361 taattgacta ttttgacgtc caggactaag aagatcatga tgcacttaga taaaagaatg
     3421 attaggtata gtggagactt atttgccagc agataaatca tgcccgttcc cctctgcctg
     3481 gcagaatcac agtctcacat actgtcttgt actgacacat ccaaagcatg agtgtgtcag
     3541 aaatcccttg tctattcctg tctgtataaa gtgtttcatt atgaccagat ctctgattgt
     3601 atggtcacta ggtatgcaat cacgcattca aagaggctct ttacaccatc actgtgattg
     3661 ctctgagagt tgagggacta ttgggcttta tttggacaaa ccaaactttt agcctgaaac
     3721 caactttatg ccactaagtc atagcctcag ttgtcccagt tatttgtcct cctgaaaatg
     3781 cctgaaacat cagacagaca ttgcttgctt tacccaaact gatcaaaatc tttaggagca
     3841 caaatgaatt ttttagtctg aaataccaaa taatgaattg gtataccata tccggaatca
     3901 cacatgttat cttaaaccca gccatcatac ctaagtcttt tgccaaaacc tctcataggt
     3961 atatctagct gaacttattt tggcattttc aatgtgatca gttctagacc tagaaggggg
     4021 tcaggctgct ttacagaatt ctatttcctt aagtccctgg cacttctcat accacatcac
     4081 tgaacctgtt cagtaacaat cagtttggcc gtcccccatg atggtaggaa atatagagag
     4141 caagttcttc tgccagggtc acactgtggt ctctgaactg accagtatat ccctaactcc
     4201 tctttgatag agaaagagtc tcaaatggac aactgtcctg tgttgctttc cctaggcctt
     4261 cagcagccta ttggctctcc ctgcctctga gctctggact ctgtttgaat attccaagta
     4321 gtatatggac agtccagggc ttatgcccag cagcccactg gaggcattct tcaggctcct
     4381 ttaaggcagg tgcattgata gttccattag tgtgaccctt gcattggcac ccctccagcc
     4441 tggaggccag gcttccagca acttccttct gccctagagc aagccatgag ccccagagca
     4501 gtagcaggag acttgagaag tagagtgaca aaaacaagca cttaattaaa ttataaaatt
     4561 taactttaaa aaaaaaaaaa aaaaaaaaaa aaaa
//