LOCUS       BC051193                3878 bp    mRNA    linear   HUM 19-MAR-2009
DEFINITION  Homo sapiens nuclear transcription factor, X-box binding-like 1,
            mRNA (cDNA clone MGC:57228 IMAGE:5271401), complete cds.
ACCESSION   BC051193
VERSION     BC051193.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3878)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3878)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (14-APR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 106 Row: k Column: 20
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 37674231
            The stop codon of the CDS annotated on this record is located > 55
            bases upstream of a splice junction, and therefore the mRNA is
            predicted to be subject to nonsense-mediated mRNA decay (NMD).
FEATURES             Location/Qualifiers
     source          1..3878
                     /db_xref="H-InvDB:HIT000053591"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:57228 IMAGE:5271401"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3878
                     /gene="NFXL1"
                     /gene_synonym="HOZFP"
                     /gene_synonym="URCC5"
                     /db_xref="GeneID:152518"
                     /db_xref="HGNC:HGNC:18726"
     CDS             29..2230
                     /gene="NFXL1"
                     /gene_synonym="HOZFP"
                     /gene_synonym="URCC5"
                     /codon_start=1
                     /product="NFXL1 protein"
                     /protein_id="AAH51193.1"
                     /db_xref="GeneID:152518"
                     /db_xref="HGNC:HGNC:18726"
                     /translation="MEASWRQVAGGRGRSRGRATAAPSGNGVHLRGAGGGREKGSVGA
                     VPSGTSPGGVATTAAAGSRHSPAGSQALQTTAASELMSQKKFEEIKKANQAAARKLVE
                     EQFSSSSEEGDEDFEGKQGKILANTFITYTTQTDGDTRELERTKQYVNEAFQAGAMTC
                     LICIASVKRNQAVWSCSGCFCIFHMPCIQKWAKDSQFLVSSVTDDDFGKKDCPWPCPK
                     CRFEYKRSETPSRYYCYCGKVEDPPLDPWLVPHSCGQVCEREFKPPCGHKCLLLCHPG
                     PCPPCPKMVTTTCYCKKAKPIPRRCSAKEWSCQLPCGQKLLCGQHKCENPCHAGSCQP
                     CPRVSRQKCVCGKKVAERSCASPLWHCDQVCGKTLPCGNHTCEQVCHVGACGECPRSG
                     KRFCPCQKSKFSLPCTEDVPTCGDSCDKVLECGIHRCSQRCHRGPCETCRQEVEKHCR
                     CGKHTKRMPCHKPYLCETKCVKMRDCQKHQCRRKCCPGNCPPCDQNCGRTLGCRNHKC
                     PSVCHRGSCYPCPETVDVKCNCGNTKVTVPCGRERTTRPPKCKEQCSRPPTCHHTSQE
                     KHRCHFGSCPPCHQPCQKVLEKCGHLCPAPCHDQALIKQTGRHQPTGPWEQPSEPAFI
                     QTALPCPPCQVPIPMECLGKHEVSPLPCHAVGPYSCKRVCGRILDCQNHTCMKECHKV
                     TKTDGCTGKNKIGKLKLCDLSMILQQESSKSESTVPDLLTPSPMCSSTHVI"
BASE COUNT     1212 a    714 c    875 g   1077 t
ORIGIN
        1 ttgtctgttg ggggcgtgcg cagtcgggat ggaagcttcc tggcgccagg tggccggtgg
       61 ccgaggccga tcccggggac gggccactgc cgccccctca ggaaatggag tccatctccg
      121 cggcgccgga ggagggcgag agaaggggtc ggtgggcgca gttccttctg gcaccagtcc
      181 cggaggagtc gcgaccacgg cggctgcagg gagcaggcac agccccgcag gatcccaagc
      241 cctgcagact accgcagcca gcgagctaat gtctcagaaa aaatttgaag aaatcaagaa
      301 agctaaccaa gctgcagcca gaaaacttgt tgaagaacag tttagctctt catctgaaga
      361 aggagatgaa gattttgaag gaaaacaggg aaaaatactt gcaaatacgt ttataacata
      421 cactactcag acagatggag atacacgtga attagagcga acaaaacaat atgtaaatga
      481 agcttttcaa gcaggggcta tgacatgcct aatttgtatt gcttcggtga agagaaacca
      541 agcagtttgg agctgttcgg gatgtttctg tatatttcac atgccctgta tccagaagtg
      601 ggctaaagac agccagtttc ttgtatcttc tgtgactgat gatgattttg gaaagaaaga
      661 ttgtccctgg ccttgtccaa aatgtaggtt tgaatacaaa cgatctgaaa cacctagtag
      721 gtactattgc tattgtggaa aagtagaaga tccaccttta gatccgtggc ttgtgcctca
      781 ttcatgtggc caagtatgtg agcgtgaatt taaacctcct tgtggccata aatgtttact
      841 cctctgtcat ccaggtccct gccctccttg tccaaagatg gtcacaacta cttgttactg
      901 taagaaagca aaacctatcc ctcgtaggtg cagtgccaag gaatggtctt gtcagctgcc
      961 atgtggacag aagttgcttt gtgggcaaca taagtgtgaa aatccttgtc atgcaggaag
     1021 ctgtcagcct tgtccaagag ttagtagaca aaagtgtgtc tgtggcaaaa aagtagctga
     1081 aagaagttgt gcaagtccac tatggcactg tgatcaagta tgtggaaaaa cactgccatg
     1141 tggtaatcac acatgtgagc aagtttgcca tgttggtgct tgtggagaat gtcctcgatc
     1201 tgggaaaagg ttctgtccat gtcagaaatc aaagttttct ttgccttgta cagaagatgt
     1261 accaacttgt ggagacagtt gtgacaaagt acttgaatgc ggaatccata gatgttcaca
     1321 gcgttgtcac cgaggtccct gtgaaacatg tagacaagaa gtggaaaagc attgtcgctg
     1381 tggaaagcat acaaaacgaa tgccttgtca taaaccttat ctgtgtgaaa ctaagtgtgt
     1441 taagatgcgt gactgtcaga agcatcaatg tagaagaaag tgttgccctg gaaactgtcc
     1501 accttgtgat caaaactgtg gacggacttt aggatgtaga aaccataagt gtccatctgt
     1561 ctgtcacaga ggcagttgct atccctgccc agaaactgta gatgtgaagt gtaattgtgg
     1621 caatacaaag gtgacagtgc cctgtggccg agaacgtacc acaagaccac ccaagtgcaa
     1681 ggagcaatgc agtcgaccac caacttgtca tcatacaagt caagaaaaac atcgctgtca
     1741 ctttggttct tgtccaccat gtcatcaacc ttgccaaaaa gttttggaga aatgtggtca
     1801 cttgtgtcct gctccgtgtc atgatcaagc gttaataaag cagactggca ggcaccagcc
     1861 tacaggccct tgggaacagc cttctgagcc agcatttatt cagactgcat taccgtgtcc
     1921 tccatgtcaa gttcctattc ctatggaatg tcttgggaaa catgaggtga gtccactacc
     1981 atgccatgct gtaggaccct actcttgtaa aagagtttgt ggaagaatct tggattgtca
     2041 gaatcacaca tgtatgaaag aatgccacaa agtaaccaaa actgatggct gcactggaaa
     2101 aaacaagata gggaaactga agctctgtga tttgtccatg attctgcaac aggagagtag
     2161 caaatcagaa agtactgtgc ctgatctttt aactcccagc ccaatgtgct cttctactca
     2221 tgttatttaa atagaagatt tttgtaggtg taccacatag gaagagcaaa tgaaataaac
     2281 tctttaggct ggcccagaat gccttcattg tgaggaaggg tgctccaagt cacggccact
     2341 aggttgtctt cacccatgta ttttgcgatg tcaccctgga gaatgtccac cttgtgttca
     2401 gatgcttaga ataaaatgtc actgtaagat cacaagcctg tatgtggaat gtagaaaaat
     2461 aaccacagct gatgtaaatg aaaagaacct cctcagttgt tgcaaaaatc agtgccctaa
     2521 agagcttcct tgtggtcata gatgcaaaga gatgtgtcat cctggtgaat gtccctttaa
     2581 ctgcaaccag aaggtaaaac ttagatgtcc ttgtaaaaga ataaaaaagg aattgcagtg
     2641 caacaaagta cgtgaaaatc aggtttcaat agaatgtgac acaacgtgca aggaaatgaa
     2701 gcggaaagca tctgagataa aagaagcaga agccaaagct gctcttgaag aagaaaaacg
     2761 aagacaacag gctgaactag aagcttttga aaacagactg aagggtcgtc ggaagaagaa
     2821 caggaaaaga gatgaagtgg cagttgagct atcactatgg caaaaacata aatattatct
     2881 catttcagtg tgtggagttg tggttgtagt gtttgcctgg tacatcaccc atgatgtcaa
     2941 ttaaaaaaag ttttgatctt ttaatgtaac tcagattgga tttagataag ttgttaaatt
     3001 tgaaatatta gaaaatgtat attatagaac atgatatata tttacattca tctctgtatt
     3061 ctctcagctg ttgttagaag gacagaatgt taaactttat cttaattagt atactagaaa
     3121 gggcagtata ctactgttta aagtgaaggc atgactgaaa ctaaaatatt tcataaggct
     3181 tagctagagg cagagtaacg tgtttttgtt cattgggctt ccttgtactt agttttttca
     3241 tttaataatt caaaccaaca cttttaaaaa aataattcag atgagactga gccatatctg
     3301 cagtaagaga aatatttctt aatgttttgg ttacttatga tagagtactt ttcttgttac
     3361 tgttaacttt gtgcttttta aaaaaagtga ttctctaaca gacctcttaa attgtgacat
     3421 gaaggtatgt aattagattt cagaaattgg tttattagtg aggaattttt atcaataaat
     3481 gtcatggggc gtgttcttca gaatatatag ttattttcaa caaatgccag gctagattcc
     3541 tcacatgtgg ctatttctta tgtaagaagc ttttaactga agttggcatg tttcgtaaaa
     3601 cttgcgtgtc ttttaaaaat aataaaagga agatgagtat ttatgaagaa tatgtgctga
     3661 caacagggct tatgaggtct atgtacctta atctcgtttc tccttaccac aatcttaaat
     3721 agatttcagc tgaaaataat cagttcttat gaaaacaaat agagaaatat cagtaagtca
     3781 aatctgtttg aattataatt cctttcaaat agttttgcta tttaatttat atgattaatg
     3841 ttttcattaa aatttttgat accaaaaaaa aaaaaaaa
//