LOCUS BC045620 6591 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens nucleoporin 214kDa, mRNA (cDNA clone MGC:39489 IMAGE:5268515), complete cds. ACCESSION BC045620 VERSION BC045620.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6591) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 6591) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 63 Row: g Column: 10 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24497442. FEATURES Location/Qualifiers source 1..6591 /db_xref="H-InvDB:HIT000098081" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:39489 IMAGE:5268515" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..6591 /gene="NUP214" /gene_synonym="CAIN" /gene_synonym="CAN" /gene_synonym="D9S46E" /gene_synonym="N214" /db_xref="GeneID:8021" /db_xref="HGNC:HGNC:8064" /db_xref="MIM:114350" CDS 89..6361 /gene="NUP214" /gene_synonym="CAIN" /gene_synonym="CAN" /gene_synonym="D9S46E" /gene_synonym="N214" /codon_start=1 /product="nucleoporin 214kDa" /protein_id="AAH45620.2" /db_xref="GeneID:8021" /db_xref="HGNC:HGNC:8064" /db_xref="MIM:114350" /translation="MGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAV SNKYGLVFAGGASGLQIFPTKNLLIQNKPGDDPNKIVDKVQGLLVPMKFPIHHLALSC DNLTLSACMMSSEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPT VPSMVAVCLADGSIAVLQVTETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQ YLPTLQEKKVIPCPPFYESDHPVRVLDVLWIGTYVFAIVYAAADGTLETSPDVVMALL PKKEEKHPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVSILARQSD QINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTNQVEITISDEKTLPPAPVLMLL STDGVLCPFYMINQNPGVKSLIKTPERLSLEGERQPKSPGSTPTTPTSSQAPQKLDAS AAAAPASLPPSSPAAPIATFSLLPAGGAPTVFSFGSSSLKSSATVTGEPPSYSSGSDS SKAAPGPGPSTFSFVPPSKASLAPTPAASPVAPSAASFSFGSSGFKPTLESTPVPSVS APNIAMKSSFPPSTSAVKVNLSEKFTAAATSTPVSSSQSAPPMSPFSSASKPAASGPL SHPTPLSAPPSSVPLKSSVLPSPSGRSAQGSSSPVPSMVQKSPRITPPAAKPGSPQAK SLQPAVAEKQGHQWKDSDPVMAGIGEEIAHFQKELEELKARTSKACFQVGTSEEMKML RTESDDLHTFLLEIKETTESLHGDISSLKTTLLEGFAGVEEAREQNERNRDSGYLHLL YKRPLDPKSEAQLQEIRRLHQYVKFAVQDVNDVLDLEWDQHLEQKKKQRHLLVPERET LFNTLANNREIINQQRKRLNHLVDSLQQLRLYKQTSLWSLSSAVPSQSSIHSFDSDLE SLCNALLKTTIESHTKSLPKVPAKLSPMKQAQLRNFLAKRKTPPVRSTAPASLSRSAF LSQRYYEDLDEVSSTSSVSQSLESEDARTSCKDDEAVVQAPRHAPVVRTPSIQPSLLP HAAPFAKSHLVHGSSPGVMGTSVATSASKIIPQGADSTMLATKTVKHGAPSPSHPISA PQAAAAAALRRQMASQAPAVNTLTESTLKNVPQVVNVQELKNNPATPSTAMGSSVPYS TAKTPHPVLTPVAANQAKQGSLINSLKPSGPTPASGQLSSGDKASGTAKIETAVTSTP SASGQFSKPFSFSPSGTGFNFGIITPTPSSNFTAAQGATPSTKESSQPDAFSSGGGSK PSYEAIPESSPPSGITSASNTTPGEPAASSSRPVAPSGTALSTTSSKLETPPSKLGEL LFPSSLAGETLGSFSGLRVGQADDSTKPTNKASSTSLTSTQPTKTSGVPSGFNFTAPP VLGKHTEPPVTSSATTTSVAPPAATSTSSTAVFGSLPVTSAGSSGVISFGGTSLSAGK TSFSFGSQQTNSTVPPSAPPPTTAATPLPTSFPTLSFGSLLSSATTPSLPMSAGRSTE EATSSALPEKPGDSEVSASAASLLEEQQSAQLPQAPPQTSDSVKKEPVLAQPAVSNSG TAASSTSLVALSAEATPATTGVPDARTEAVPPASSFSVPGQTAVTAAAISSAGPVAVE TSSTPIASSTTSIVAPGPSAEAAAFGTVTSGSSVFAQPPAASSSSAFNQLTNNTATAP SATPVFGQVAASTAPSLFGQQTGSTASTAAATPQVSSSGFSSPAFGTTAPGVFGQTTF GQASVFGQSASSAASVFSFSQPGFSSVPAFGQPASSTPTSTSGSVFGAASSTSSSSSF SFGQSSPNTGGGLFGQSNAPAFGQSPGFGQGGSVFGGTSAATTTAATSGFSFCQASGF GSSNTGSVFGQAASTGGIVFGQQSSSSSGNVFGSGNTGRGGGFFSGLGGKPSQDAANK NPFSSASGGFGSTATSNTSNLFGNSGAKTFGGFASSSFGEQKPTGTFSSGGGSVASQG FGFSSPNKTGGFGAAPVFGSPPTFGGSPGFGGVPAFGSAPAFTSPLGSTGGKVFGEGT AAASAGGFGFGSSSNTTSFGTLASQNAPTFGSLSQQTSGFGTQSSGFSGFGSGTGGFS FGSNNSSVQGFGGWRS" BASE COUNT 1591 a 1869 c 1545 g 1586 t ORIGIN 1 gaggaagttt gctgtcgagc ggcctgggtt ccgtgggcaa ggccgtggga ggcagcgttg 61 gctgcttcga cacactgagg gcggcgcgat gggagacgag atggatgcca tgattcccga 121 gcgggagatg aaggattttc agtttagagc gctaaagaag gtgagaatct ttgactcccc 181 tgaggaattg cccaaggaac gctcgagtct gcttgctgtg tccaacaaat atggtctggt 241 cttcgctggt ggagccagtg gcttgcagat ttttcctact aaaaatcttc ttattcaaaa 301 taaacccgga gatgatccca acaaaatagt tgataaagtc caaggcttgc tagttcctat 361 gaaattccca atccatcacc tggccttgag ctgtgataac ctcacactct ctgcgtgcat 421 gatgtccagt gaatatggtt ccattattgc tttttttgat gttcgcacat tctcaaatga 481 ggctaaacag caaaaacgcc catttgccta tcataagctt ttgaaagatg caggaggcat 541 ggtgattgat atgaagtgga accccactgt cccctccatg gtggcagttt gtctggctga 601 tggtagtatt gctgtcctgc aagtcacgga aacagtgaaa gtatgtgcaa ctcttccttc 661 cacggtagca gtaacctctg tgtgctggag ccccaaagga aagcagctgg cagtgggaaa 721 acagaatgga actgtggtcc agtatcttcc tactttgcag gaaaaaaaag tcattccttg 781 tcctccgttt tatgagtcag atcatcctgt cagagttctg gatgtgctgt ggattggtac 841 ctacgtcttc gccatagtgt atgctgctgc agatgggacc ctggaaacgt ctccagatgt 901 ggtgatggct ctactaccga aaaaagaaga aaagcaccca gagatatttg tgaactttat 961 ggagccctgt tatggcagct gcacggagag acagcatcat tactacctca gttacattga 1021 ggaatgggat ttagtgctgg cagcatctgc ggcttcaaca gaagttagta tccttgctcg 1081 acaaagtgat cagattaatt gggaatcttg gctactggag gattctagtc gagctgaatt 1141 gcctgtgaca gacaagagtg atgactcctt gcccatggga gttgtcgtag actatacaaa 1201 ccaagtggaa atcaccatca gtgatgaaaa gactcttcct cctgctccag ttctcatgtt 1261 actttcaaca gatggtgtgc tttgtccatt ttatatgatt aatcaaaatc ctggggttaa 1321 gtctctcatc aaaacaccag agcgactttc attagaagga gagcgacagc ccaagtcacc 1381 aggaagtact cccactaccc caacctcctc tcaagcccca cagaaactgg atgcttctgc 1441 agctgcagcc cctgcctctc tgccaccttc atcacctgct gctcccattg ccactttttc 1501 tttgcttcct gctggtggag cccccactgt gttctccttt ggttcttcat ctttgaagtc 1561 atcggctacg gtcactgggg agcccccttc atattccagt ggctccgaca gctccaaagc 1621 agccccaggc cctggcccat caaccttctc ttttgttccc ccttctaaag cctccctagc 1681 ccccacccct gcagcgtctc ctgtggctcc atcagctgct tcattctcct ttggatcatc 1741 tggttttaag cctaccctgg aaagcacacc agtgccaagt gtgtctgctc caaatatagc 1801 aatgaagtcc tccttcccac cctcaacctc tgctgtcaaa gtcaacctta gtgaaaagtt 1861 tactgctgca gctacctcta ctcctgttag tagctcccag agcgcacccc cgatgtcgcc 1921 attctcttct gcctccaagc cagctgcttc tggaccactc agccacccca cacctctctc 1981 agcaccacct agttccgtgc cattgaagtc ctcagtcttg ccctcaccat caggacgatc 2041 tgctcagggc agttcaagcc cagtgccctc aatggtacag aaatcaccca ggataacccc 2101 tccagcggca aagccaggct ctccccaggc aaagtcactt cagcctgctg ttgcagaaaa 2161 gcagggacat cagtggaaag attcagatcc tgtaatggct ggaattgggg aggagattgc 2221 acactttcag aaggagttgg aagagttaaa agcccgaact tccaaagcct gtttccaagt 2281 gggcacttct gaggagatga agatgctgcg aacagaatca gatgacttgc atacctttct 2341 tttggagatt aaagagacca cagagtcgct tcatggagat ataagtagcc tgaaaacaac 2401 tttacttgag ggctttgctg gtgttgagga agccagagaa caaaatgaaa gaaatcgtga 2461 ctctggttat ctgcatttgc tttataaaag accactggat cccaagagtg aagctcagct 2521 tcaggaaatt cggcgccttc atcagtatgt gaaatttgct gtccaagatg tgaatgatgt 2581 tctagacttg gagtgggatc agcatctgga acaaaagaaa aaacaaaggc acctgcttgt 2641 gccagagcga gagacactgt ttaacaccct agccaacaat cgggaaatca tcaaccaaca 2701 gaggaagagg ctgaatcacc tggtggatag tcttcagcag ctccgccttt acaaacagac 2761 ttccctgtgg agcctgtcct cggctgttcc ttcccagagc agcattcaca gttttgacag 2821 tgacctggaa agcctgtgca atgctttgtt gaaaaccacc atagaatctc acaccaaatc 2881 cttgcccaaa gtaccagcca aactgtcccc catgaaacag gcacaactga gaaacttctt 2941 ggccaagagg aagaccccac cagtgagatc cactgctcca gccagcctgt ctcgatcagc 3001 ctttctgtct cagagatatt atgaagactt ggatgaagtc agctcaacgt catctgtctc 3061 ccagtctctg gagagtgaag atgcacggac gtcctgtaaa gatgacgagg cagtggttca 3121 ggcccctcgg cacgcccccg tggttcgcac tccttccatc cagcccagtc tcttgcccca 3181 tgcagcacct tttgctaaat ctcacctggt tcatggttct tcacctggtg tgatgggaac 3241 ttcagtggct acatctgcta gcaaaattat tcctcaaggg gccgatagca caatgcttgc 3301 cacgaaaacc gtgaaacatg gtgcacctag tccttcccac cccatctcag ccccgcaggc 3361 agctgccgca gcagcactca ggcggcagat ggccagtcag gcaccagctg taaacacttt 3421 gactgaatca acgttgaaga atgtccctca agtggtaaat gtgcaggaat tgaagaataa 3481 ccctgcaacc ccttctacag ccatgggttc ttcagtgccc tactccacag ccaaaacacc 3541 tcacccagtg ttgaccccag tggctgctaa ccaagccaag caggggtctc taataaattc 3601 ccttaagcca tctgggccta caccagcatc cggtcagtta tcatctggtg acaaagcttc 3661 agggacagcc aagatagaaa cagctgtgac ttcaacccca tctgcttctg ggcagttcag 3721 caagcctttc tcattttctc catcagggac tggctttaat tttgggataa tcacaccaac 3781 accgtcttct aatttcactg ctgcacaagg ggcaacaccc tccactaaag agtcaagcca 3841 gccggacgca ttctcatctg gtgggggaag caaaccttct tatgaggcca ttcctgaaag 3901 ctcacctccc tcaggaatca catccgcatc aaacaccacc ccaggagaac ctgccgcatc 3961 tagcagcaga cctgtggcac cttctggaac tgctctttcc accacctcta gtaagctgga 4021 aaccccaccg tccaagctgg gagagcttct gtttccaagt tctttggctg gagagactct 4081 gggaagtttt tcaggactgc gggttggcca agcagatgat tctacaaaac caaccaataa 4141 ggcttcatcc acaagcctaa ctagtaccca gccaaccaag acgtcaggcg tgccctcagg 4201 gtttaatttt actgcccccc cggtgttagg gaagcacacg gagccccctg tgacatcctc 4261 tgcaaccacc acctcagtag caccaccagc agccaccagc acttcctcaa ctgccgtttt 4321 tggcagtctg ccagtcacca gtgcaggatc ctctggggtc atcagttttg gtgggacatc 4381 tctaagtgct ggcaagacta gtttttcatt tggaagccaa cagaccaata gcacagtgcc 4441 cccatctgcc ccaccaccaa ctacagctgc cactcccctt ccaacatcat tccccacatt 4501 gtcatttggt agcctcctga gttcagcaac taccccctcc ctgcctatgt ccgctggcag 4561 aagcacagaa gaggccactt catcagcttt gcctgagaag ccaggtgaca gtgaggtctc 4621 agcatcagca gcctcacttc tagaggagca acagtcagcc cagcttcccc aggctcctcc 4681 gcaaacttct gactctgtta aaaaagaacc tgttcttgcc cagcctgcag tcagcaactc 4741 tggcactgca gcatctagta ctagtcttgt agcactttct gcagaggcta ccccagccac 4801 cacgggggtc cctgatgcca ggacggaggc agtaccacct gcttcctcct tttctgtgcc 4861 tgggcagact gctgtcacag cagctgctat ctcaagtgca ggccctgtgg ccgtcgaaac 4921 atcaagtacc cccatagcct ccagcaccac gtccattgtt gctcccggcc catctgcaga 4981 ggcagcagca tttggtaccg tcacttctgg ctcatccgtc tttgctcagc ctcctgctgc 5041 cagttctagc tcagctttca accagctcac caacaacaca gccactgccc cctctgccac 5101 gcccgtgttt gggcaagtgg cagccagcac cgcaccaagt ctgtttgggc agcagactgg 5161 tagcacagcc agcacagcag ctgccacacc acaggtcagc agctcagggt ttagcagccc 5221 agcttttggt accacagccc caggggtctt tggacagaca accttcgggc aggcctcagt 5281 ctttgggcag tcggcgagca gtgctgcaag tgtcttttcc ttcagtcagc ctgggttcag 5341 ttccgtgcct gccttcggtc agcctgcttc ctccactccc acatccacca gtggaagtgt 5401 ctttggtgcc gcctcaagta ccagtagctc cagttccttc tcatttggac agtcttctcc 5461 caacacagga ggggggctgt ttggccaaag caacgctcct gcttttgggc agagtcctgg 5521 ctttggacag ggaggctctg tctttggtgg tacctcagct gccaccacaa cagcagcaac 5581 ctctgggttc agcttttgcc aagcttcagg ttttgggtct agtaatactg gttctgtgtt 5641 tggtcaagca gccagtactg gtggaatagt ctttggccag caatcatcct cttccagtgg 5701 taacgtgttt gggtctggaa acactggaag agggggaggt ttcttcagtg gccttggagg 5761 aaaacccagt caggatgcag ccaacaaaaa cccattcagc tcggccagtg ggggctttgg 5821 atccacagct acctcaaata cctctaacct atttggaaac agtggggcca agacatttgg 5881 tggatttgcc agctcgtcgt ttggagagca gaaacccact ggcactttca gctctggagg 5941 aggaagtgtg gcatcccaag gctttgggtt ttcctctcca aacaaaacag gtggcttcgg 6001 tgctgctcca gtgtttggca gccctcctac ttttggggga tcccctgggt ttggaggggt 6061 gccagcattc ggttcagccc cagcctttac aagccctctg ggctcgacgg gaggcaaagt 6121 gttcggagag ggcactgcag ctgccagcgc aggaggattc gggtttggga gcagcagcaa 6181 caccacatcc ttcggcacgc tcgcgagtca gaatgccccc actttcggat cactgtccca 6241 acagacttct ggttttggga cccagagtag cggattctct ggttttggat caggcacagg 6301 agggttcagc tttgggtcaa ataactcgtc tgtccagggt tttggtggct ggcgaagctg 6361 agggcgtgtc agcaggcctt tcgatccctg ggaccaaccg catcctcagc ttcttccccg 6421 agaaatgctg gagcaggctg ttcagaccga cgttgccatc aaaacacata cacccagaaa 6481 gaaacaacag aaaccaaaac tcacaaggcg catgattact tgttttatat ttcatgttgg 6541 gttttccctc ccactattaa acagtctgtt tccgtaaaaa aaaaaaaaaa a //