LOCUS BC008972 1118 bp mRNA linear HUM 23-JUN-2006 DEFINITION Homo sapiens centromere protein N, mRNA (cDNA clone MGC:9498 IMAGE:3048151), complete cds. ACCESSION BC008972 VERSION BC008972.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1118) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1118) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (30-MAY-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 12 Row: a Column: 2 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 39725678. FEATURES Location/Qualifiers source 1..1118 /db_xref="H-InvDB:HIT000034117" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:9498 IMAGE:3048151" /tissue_type="Cervix, carcinoma" /clone_lib="NIH_MGC_12" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1118 /gene="CENPN" /gene_synonym="BM039" /gene_synonym="CENP-N" /gene_synonym="FLJ13607" /gene_synonym="FLJ22660" /db_xref="GeneID:55839" /db_xref="HGNC:HGNC:30873" CDS 84..1103 /gene="CENPN" /gene_synonym="BM039" /gene_synonym="CENP-N" /gene_synonym="FLJ13607" /gene_synonym="FLJ22660" /codon_start=1 /product="CENPN protein" /protein_id="AAH08972.1" /db_xref="GeneID:55839" /db_xref="HGNC:HGNC:30873" /translation="MDETVAEFIKRTILKIPMNELTTILKAWDFLSENQLQTVNFRQR KESVVQHLIHLCEEKRASISDAALLDIIYMQFHQHQKVWDVFQMSKGPGEDVDLFDMK QFKNSFKKILQRALKNVTVSFRETEENAVWIRIAWGTQYTKPNQYKPTYVVYYSQTPY AFTSSSMLRRNTPLLGQALTIASKHHQIVKMDLRSRYLDSLKAIVFKQYNQTFETHNS TTPLQERSLGLDINMDSRIIHENIVEKERVQRITQETFGDYPQPQLEFAQYKLETKFK SGLNGSILAEREEPLRCLIKFSSPHLLEALKSLAPAGIADAPLSPLLTCIPNKRMNYF KIRDK" BASE COUNT 376 a 226 c 242 g 274 t ORIGIN 1 gcaattggtg gctttgaagg cgcggcgagc gggaacagct cttgaggagt gagactgcag 61 gagatgtggg ccgtgccaaa gagatggatg agactgttgc tgagttcatc aagaggacca 121 tcttgaaaat ccccatgaat gaactgacaa caatcctgaa ggcctgggat tttttgtctg 181 aaaatcaact gcagactgta aatttccgac agagaaagga atctgtagtt cagcacttga 241 tccatctgtg tgaggaaaag cgtgcaagta tcagtgatgc tgccctgtta gacatcattt 301 atatgcaatt tcatcagcac cagaaagttt gggatgtttt tcagatgagt aaaggaccag 361 gtgaagatgt tgaccttttt gatatgaaac aatttaaaaa ttcgttcaag aaaattcttc 421 agagagcatt aaaaaatgtg acagtcagct tcagagaaac tgaggagaat gcagtctgga 481 ttcgaattgc ctggggaaca cagtacacaa agccaaacca gtacaaacct acctacgtgg 541 tgtactactc ccagactccg tacgccttca cgtcctcctc catgctgagg cgcaatacac 601 cgcttctggg tcaggcgctg acaattgcta gcaaacacca tcagattgtg aaaatggacc 661 tgagaagtcg gtatctggac tctcttaagg ctattgtttt taaacagtat aatcagacct 721 ttgaaactca caactctacg acacctctac aggaaagaag ccttggacta gatataaata 781 tggattcaag gatcattcat gaaaacatag tagaaaaaga gagagtccaa cgaataactc 841 aagaaacatt tggagattat cctcaaccac aactagaatt tgcacaatat aagcttgaaa 901 cgaaattcaa aagtggttta aatgggagca tcttggctga gagggaagaa cccctccgat 961 gcctaataaa gttctctagc ccacatcttc tggaagcatt gaaatcctta gcaccagcgg 1021 gtattgcaga tgctccactt tctccactgc tcacttgcat acccaacaag agaatgaatt 1081 attttaaaat tagagataaa taaaaaaaaa aaaaaaaa //