LOCUS       BC012151                4594 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens nuclear transcription factor, X-box binding 1, mRNA
            (cDNA clone MGC:20369 IMAGE:4558442), complete cds.
ACCESSION   BC012151
VERSION     BC012151.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4594)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4594)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang,
            Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney,
            Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John,
            Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong,
            Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio,
            Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin,
            Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran,
            Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai,
            George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt,
            Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: d Column: 4
            This clone was selected for full length sequencing because it passed
            the following selection criteria: matched mRNA gi: 22212923.
FEATURES             Location/Qualifiers
     source          1..4594
                     /db_xref="H-InvDB:HIT000035694"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:20369 IMAGE:4558442"
                     /tissue_type="Eye, retinoblastoma"
                     /clone_lib="NIH_MGC_16"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..4594
                     /gene="NFX1"
                     /gene_synonym="MGC20369"
                     /gene_synonym="NFX2"
                     /db_xref="GeneID:4799"
                     /db_xref="HGNC:HGNC:7803"
                     /db_xref="MIM:603255"
     CDS             27..3389
                     /gene="NFX1"
                     /gene_synonym="MGC20369"
                     /gene_synonym="NFX2"
                     /codon_start=1
                     /product="nuclear transcription factor, X-box binding 1"
                     /protein_id="AAH12151.1"
                     /db_xref="GeneID:4799"
                     /db_xref="HGNC:HGNC:7803"
                     /db_xref="MIM:603255"
                     /translation="MAEAPPVSGTFKFNTDAAEFIPQEKKNSGLNCGTQRRLDSNRIG
                     RRNYSSPPPCHLSRQVPYDEISAVHQHSYHPSGSKPKSQQTSFQSSPCNKSPKSHGLQ
                     NQPWQKLRNEKHHIRVKKAQSLAEQTSDTAGLESSTRSESGTDLREHSPSESEKEVVG
                     ADPRGAKPKKATQFVYSYGRGPKVKGKLKCEWSNRTTPKPEDAGPESTKPVGVFHPDS
                     SEASSRKGVLDGYGARRNEQRRYPQKRPPWEVEGARPRPGRNPPKQEGHRHTNAGHRN
                     NMGPIPKDDLNERPAKSTCDSENLAVINKSSRRVDQEKCTVRRQDPQVVSPFSRGKQN
                     HVLKNVETHTGSLIEQLTTEKYECMVCCELVRVTAPVWSCQSCYHVFHLNCIKKWARS
                     PASQADGQSGWRCPACQNVSAHVPNTYTCFCGKVKNPEWSRNEIPHSCGEVCRKKQPG
                     QDCPHSCNLLCHPGPCPPCPAFMTKTCECGRTRHTVRCGQAVSVHCSNPCENILNCGQ
                     HQCAELCHGGQCQPCQIILNQVCYCGSTSRDVLCGTDVGKSDGFGDFSCLKICGKDLK
                     CGNHTCSQVCHPQPCQQCPRLPQLVRCCPCGQTPLSQLLELGSSSRKTCMDPVPSCGK
                     VCGKPLPCGSLDFIHTCEKLCHEGDCGPCSRTSVISCRCSFRTKELPCTSLKSEDATF
                     MCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGLHRCEEPCHRGNCQTCWQ
                     ASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSCHSEEKCPPCTFL
                     TQKWCMGKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGECLVDEPCKQPC
                     TTPRADCGHPCMAPCHTSSPCPVTACKAKVELQCECGRRKEMVICSEASSTYQRIAAI
                     SMASKITDMQLGGSVEISKLITKKEVHQARLECDEECSALERKKRLAEAFHISEDSDP
                     FNIRSSGSKFSDSLKEDARKDLKFVSDVEKEMETLVEAVNKGKNSKKSHSFPPMNRDH
                     RRIIHDLAQVYGLESVSYDSEPKRNVVVTAIRGKSVCPPTTLTGVLEREMQARPPPPI
                     PHHRHQSDKNPGSSNLQKITKEPIIDYFDVQD"
BASE COUNT     1315 a   1064 c   1092 g   1123 t
ORIGIN
        1 gctcgatcta ggttctgcgg cacgggatgg cggaggcgcc tcctgtctca ggtactttta
       61 aattcaatac agatgctgct gaattcattc ctcaggagaa aaaaaattct ggtctaaatt
      121 gtgggactca aaggagacta gactctaata ggattggtag aagaaattac agttcaccac
      181 ctccctgtca cctttccagg caggtccctt atgatgaaat ctctgctgtt catcagcata
      241 gttatcatcc gtcaggaagc aaacctaaga gtcagcagac gtctttccag tcctctcctt
      301 gtaataaatc gcccaagagc catggccttc agaatcaacc ttggcagaaa ttgaggaatg
      361 agaagcacca tatcagagtc aagaaagcac agagtcttgc tgagcagacc tcagatacag
      421 ctggattaga gagctcgacc agatcagaga gtgggacaga cctcagagag catagtcctt
      481 ctgagagtga gaaggaagtt gtgggtgcag atcccagggg agcaaaaccc aaaaaagcaa
      541 cacagtttgt atacagctat ggtagaggac caaaagtcaa ggggaaactc aaatgtgaat
      601 ggagtaaccg aacaactcca aaaccggagg atgctggacc cgaaagtacc aaacctgtgg
      661 gggttttcca ccctgactct tcagaggcat cctctagaaa aggagtattg gatgggtatg
      721 gagccagacg aaatgagcag agaagatacc cacagaaaag gcctccctgg gaagtggagg
      781 gggccaggcc acgaccaggc agaaatccac caaaacagga gggccaccga catacaaacg
      841 caggacacag aaacaacatg ggccccattc caaaggatga cctcaatgaa agaccagcaa
      901 aatctacctg tgacagtgag aacttggcag tcatcaacaa gtcttccagg agggttgacc
      961 aagagaaatg cactgtacgg aggcaggatc ctcaagtagt atctcctttc tcccgaggca
     1021 aacagaacca tgtgctaaag aatgtggaaa cgcacacagg ttctctaatt gaacaactaa
     1081 caacagaaaa atacgagtgc atggtgtgct gtgaattggt tcgtgtcacg gccccagtgt
     1141 ggagttgtca gagctgttac catgtgtttc atttgaactg cataaagaaa tgggcaaggt
     1201 ctccagcatc tcaagcagat ggccagagtg gttggaggtg ccctgcctgt cagaatgttt
     1261 ctgcacatgt tcctaatacc tacacttgtt tctgtggcaa ggtaaagaat cctgagtgga
     1321 gcagaaatga aattccacat agctgtggtg aggtttgtag aaagaaacag cctggccagg
     1381 actgcccaca ttcctgtaac cttctctgcc atccaggacc ctgcccaccc tgccctgcct
     1441 ttatgacaaa aacatgtgaa tgtggacgaa ccaggcacac agttcgctgt ggtcaggctg
     1501 tctcagtcca ctgttctaac ccatgtgaga atattttgaa ctgtggtcag caccagtgtg
     1561 ctgagctgtg ccatgggggt cagtgccagc cttgccagat cattttgaac caggtatgct
     1621 attgcggcag cacctcccga gatgtgttat gtggaaccga tgtaggaaag tctgatggat
     1681 ttggggattt cagctgttta aagatatgtg gcaaggactt gaaatgcggt aaccatacat
     1741 gttcgcaagt gtgccaccct cagccctgcc agcaatgccc acggctcccc cagctggtgc
     1801 gctgttgccc ctgtggccaa actcctctca gccaattgct agaacttgga agtagtagtc
     1861 ggaaaacatg catggaccct gtgccttcat gtggaaaagt gtgcggcaag cctctgcctt
     1921 gtggttcctt agatttcatt catacctgtg aaaagctctg ccatgaagga gactgtggac
     1981 catgctctcg cacatcagtt atttcctgca gatgctcttt cagaacaaag gagcttccat
     2041 gtaccagtct caaaagtgaa gatgctacat ttatgtgtga caagcggtgt aacaagaaac
     2101 ggttgtgtgg acggcataaa tgtaatgaga tatgctgtgt ggataaggag cacaagtgtc
     2161 ctttgatttg tgggaggaaa ctccgttgtg gccttcatag gtgtgaagaa ccttgtcatc
     2221 gtggaaactg ccagacatgc tggcaagcca gttttgatga attaacctgc cattgtggtg
     2281 catcagtgat ttaccctcca gttccctgtg gtactaggcc ccctgaatgt acccaaacct
     2341 gcgctagagt ccatgagtgt gaccatccag tatatcattc ttgtcatagt gaggagaagt
     2401 gtcccccttg cactttccta actcagaagt ggtgcatggg caagcatgag tttcggagca
     2461 acatcccctg tcacctggtt gatatctctt gcggattacc ctgcagtgcc acgctaccat
     2521 gtgggatgca caaatgtcag agactctgtc acaaagggga gtgtcttgtg gatgagccct
     2581 gcaagcagcc ctgcaccacc cccagagctg actgtggtca cccgtgtatg gcaccctgcc
     2641 ataccagctc accctgccct gtgactgctt gtaaagctaa ggtagagcta cagtgtgaat
     2701 gtggacgaag aaaagagatg gtgatttgct ctgaagcatc tagtacttat caaagaatag
     2761 ctgcaatctc catggcctct aagataacag acatgcagct tggaggttca gtggagatca
     2821 gcaagttaat taccaaaaag gaagttcatc aagccaggct ggagtgtgat gaggagtgtt
     2881 cagccttgga aaggaaaaag agattagcag aggcatttca tatcagtgag gattctgatc
     2941 ctttcaatat acgttcttca gggtcaaaat tcagtgatag tttgaaagaa gatgccagga
     3001 aggacttaaa gtttgtcagt gacgttgaga aggaaatgga aaccctcgtg gaggccgtga
     3061 ataagggaaa gaatagtaag aaaagccaca gcttccctcc catgaacaga gaccaccgcc
     3121 ggatcatcca tgacttggcc caagtttatg gcctggagag cgtgagctat gacagtgaac
     3181 cgaagcgcaa tgtggtggtc actgccatca gggggaagtc cgtttgtcct cctaccacgc
     3241 tgacaggtgt gcttgaaagg gaaatgcagg cacggcctcc accaccgatt cctcatcaca
     3301 gacatcagtc agacaagaat cctgggagca gtaatttaca gaaaataacc aaggagccaa
     3361 taattgacta ttttgacgtc caggactaag aagatcatga tgcacttaga taaaagaatg
     3421 attaggtata gtggagactt atttgccagc agataaatca tgcccgttcc cctctgcctg
     3481 gcagaatcac agtctcacat actgtcttgt actgacacat ccaaagcatg agtgtgtcag
     3541 aaatcccttg tctattcctg tctgtataaa gtgtttcatt atgaccagat ctctgattgt
     3601 atggtcacta ggtatgcaat cacgcattca aagaggctct ttacaccatc actgtgattg
     3661 ctctgagagt tgagggacta ttgggcttta tttggacaaa ccaaactttt agcctgaaac
     3721 caactttatg ccactaagtc atagcctcag ttgtcccagt tatttgtcct cctgaaaatg
     3781 cctgaaacat cagacagaca ttgcttgctt tacccaaact gatcaaaatc tttaggagca
     3841 caaatgaatt ttttagtctg aaataccaaa taatgaattg gtataccata tccggaatca
     3901 cacatgttat cttaaaccca gccatcatac ctaagtcttt tgccaaaacc tctcataggt
     3961 atatctagct gaacttattt tggcattttc aatgtgatca gttctagacc tagaaggggg
     4021 tcaggctgct ttacagaatt ctatttcctt aagtccctgg cacttctcat accacatcac
     4081 tgaacctgtt cagtaacaat cagtttggcc gtcccccatg atggtaggaa atatagagag
     4141 caagttcttc tgccagggtc acactgtggt ctctgaactg accagtatat ccctaactcc
     4201 tctttgatag agaaagagtc tcaaatggac aactgtcctg tgttgctttc cctaggcctt
     4261 cagcagccta ttggctctcc ctgcctctga gctctggact ctgtttgaat attccaagta
     4321 gtatatggac agtccagggc ttatgcccag cagcccactg gaggcattct tcaggctcct
     4381 ttaaggcagg tgcattgata gttccattag tgtgaccctt gcattggcac ccctccagcc
     4441 tggaggccag gcttccagca acttccttct gccctagagc aagccatgag ccccagagca
     4501 gtagcaggag acttgagaag tagagtgaca aaaacaagca cttaattaaa ttataaaatt
     4561 taactttaaa aaaaaaaaaa aaaaaaaaaa aaaa
//