LOCUS BC143951 3499 bp mRNA linear HUM 08-JAN-2009 DEFINITION Homo sapiens ecotropic viral integration site 1, mRNA (cDNA clone MGC:177486 IMAGE:9052469), complete cds. ACCESSION BC143951 VERSION BC143951.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3499) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3499) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (12-JUN-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Mike Brownstein, NIMH cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRCB Plate: 16 Row: B Column: 18. FEATURES Location/Qualifiers source 1..3499 /db_xref="H-InvDB:HIT000502390" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:177486 IMAGE:9052469" /tissue_type="Lung, PCR rescued clones" /clone_lib="NIH_MGC_298" /note="Vector: pCR4-TOPO; Clone identification sequence tag: GCTCACGC" gene 1..3499 /gene="EVI1" /gene_synonym="AML1-EVI-1" /gene_synonym="EVI-1" /gene_synonym="MDS1-EVI1" /gene_synonym="PRDM3" /db_xref="GeneID:2122" /db_xref="HGNC:HGNC:3498" /db_xref="MIM:165215" CDS 114..3464 /gene="EVI1" /gene_synonym="AML1-EVI-1" /gene_synonym="EVI-1" /gene_synonym="MDS1-EVI1" /gene_synonym="PRDM3" /codon_start=1 /product="EVI1 protein" /protein_id="AAI43952.1" /db_xref="GeneID:2122" /db_xref="HGNC:HGNC:3498" /db_xref="MIM:165215" /translation="MILDEFYNVKFCIDASQPDVGSWLKYIRFAGCYDQHNLVACQIN DQIFYRVVADIAPGEELLLFMKSEDYPHETMAPDIHEERQYRCEDCDQLFESKAELAD HQKFPCSTPHSAFSMVEEDFQQKLESENDLQEIHTIQECKECDQVFPDLQSLEKHMLS HTEEREYKCDQCPKAFNWKSNLIRHQMSHDSGKHYECENCAKQVFTDPSNLQRHIRSQ HVGARAHACPECGKTFATSSGLKQHKHIHSSVKPFICEVCHKSYTQFSNLCRHKRMHA DCRTQIKCKDCGQMFSTTSSLNKHRRFCEGKNHFAAGGFFGQGISLPGTPAMDKTSMV NMSHANPGLADYFGANRHPAGLTFPTAPGFSFSFPGLFPSGLYHRPPLIPASSPVKGL SSTEQTNKSQSPLMTHPQILPATQDILKALSKHPSVGDNKPVELQPERSSEERPFEKI SDQSESSDLDDVSTPSGSDLETTSGSDLESDIESDKEKFKENGKMFKDKVSPLQNLAS INNKKEYSNHSIFSPSLEEQTAVSGAVNDSIKAIASIAEKYFGSTGLVGLQDKKVGAL PYPSMFPLPFFPAFSQSMYPFPDRDLRSLPLKMEPQSPGEVKKLQKGSSESPFDLTTK RKDEKPLTPVPSKPPVTPATSQDQPLDLSMGSRSRASGTKLTEPRKNHVFGGKKGSNV ESRPASDGSLQHARPTPFFMDPIYRVEKRKLTDPLEALKEKYLRPSPGFLFHPQFQLP DQRTWMSAIENMAEKLESFSALKPEASELLQSVPSMFNFRAPPNALPENLLRKGKERY TCRYCGKIFPRSANLTRHLRTHTGEQPYRCKYCDRSFSISSNLQRHVRNIHNKEKPFK CHLCDRCFGQQTNLDRHLKKHENGNMSGTATSSPHSELESTGAILDDKEDAYFTEIRN FIGNSNHGSQSPRNVEERMNGSHFKDEKALVTSQNSDLLDDEEVEDEVLLDEEDEDND ITGKTGKEPVTSNLHEGNPEDDYEETSALEMSCKTSPVRYKEEEYKSGLSALDHIRHF TDSLKMRKMEDNQYSEAELSSFSTSHVPEELKQPLHRKSKSQAYAMMLSLSDKESLHS TSHSSSNVWHSMARAAAESSAIQSISHV" BASE COUNT 1061 a 799 c 791 g 848 t ORIGIN 1 caacatcgtg tgctgcttcg cgagaaagtc acattcggac cctttggcta gattgcttat 61 tcatagggct tcttgactaa agcccttgga gcactgggtt tttcttgaag tatatgatct 121 tagacgaatt ttacaatgtg aagttctgca tagatgccag tcaaccagat gttggaagct 181 ggctcaagta cattagattc gctggctgtt atgatcagca caaccttgtt gcatgccaga 241 taaatgatca gatattctat agagtagttg cagacattgc gccgggagag gagcttctgc 301 tgttcatgaa gagcgaagac tatccccatg aaactatggc gccggatatc cacgaagaac 361 ggcaatatcg ctgcgaagac tgtgaccagc tctttgaatc taaggctgaa ctagcagatc 421 accaaaagtt tccatgcagt actcctcact cagcattttc aatggttgaa gaggactttc 481 agcaaaaact cgaaagcgag aatgatctcc aagagataca cacgatccag gagtgtaagg 541 aatgtgacca agtttttcct gatttgcaaa gcctggagaa acacatgctg tcacatactg 601 aagagaggga atacaagtgt gatcagtgtc ccaaggcatt taactggaag tccaatttaa 661 ttcgccacca gatgtcacat gacagtggaa agcactatga atgtgaaaac tgtgccaagc 721 aggttttcac ggaccctagc aaccttcagc ggcacattcg ctctcagcat gtcggtgccc 781 gggcccatgc atgcccggag tgtggcaaaa cgtttgccac ttcgtcgggc ctcaaacaac 841 acaagcacat ccacagcagt gtgaagccct ttatctgtga ggtctgccat aaatcctata 901 ctcagttttc aaacctttgc cgtcataagc gcatgcatgc tgattgcaga acccaaatca 961 agtgcaaaga ctgtggacaa atgttcagca ctacgtcttc cttaaataaa cacaggaggt 1021 tttgtgaggg caagaaccat tttgcggcag gtggattttt tggccaaggc atttcacttc 1081 ctggaacccc agctatggat aaaacgtcca tggttaatat gagtcatgcc aacccgggcc 1141 ttgctgacta ttttggcgcc aataggcatc ctgctggtct tacctttcca acagctcctg 1201 gattttcttt tagcttccct ggtctgtttc cttccggctt gtaccacagg cctcctttga 1261 tacctgctag ttctcctgtt aaaggactat caagtactga acagacaaac aaaagtcaaa 1321 gtcccctcat gacacatcct cagatactgc cagctacaca ggatattttg aaggcactat 1381 ctaaacaccc atctgtaggg gacaataagc cagtggagct ccagcccgag aggtcctctg 1441 aagagaggcc ctttgagaaa atcagtgacc agtcagagag tagtgacctt gatgatgtca 1501 gtacaccaag tggcagtgac ctggaaacaa cctcgggctc tgatctggaa agtgacattg 1561 aaagtgataa agagaaattt aaagaaaatg gtaaaatgtt caaagacaaa gtaagccctc 1621 ttcagaatct ggcttcaata aataataaga aagaatacag caatcattcc attttctcac 1681 catctttaga ggagcagact gcggtgtcag gagctgtgaa tgattctata aaggctattg 1741 cttctattgc tgaaaaatac tttggttcaa caggactggt ggggctgcaa gacaaaaaag 1801 ttggagcttt accttaccct tccatgtttc ccctcccatt ttttccagca ttctctcaat 1861 caatgtaccc atttcctgat agagacttga gatcgttacc tttgaaaatg gaaccccaat 1921 caccaggtga agtaaagaaa ctgcagaagg gcagctctga gtcccccttt gatctcacca 1981 ctaagcgaaa ggatgagaag cccttgactc cagtcccctc caagcctcca gtgacacctg 2041 ccacaagcca agaccagccc ctggatctaa gtatgggcag taggagtaga gccagtggga 2101 caaagctgac tgagcctcga aaaaaccacg tgtttggggg aaaaaaagga agcaacgtcg 2161 aatcaagacc tgcttcagat ggttccttgc agcatgcaag acccactcct ttctttatgg 2221 accctattta cagagtagag aaaagaaaac taactgaccc acttgaagct ttaaaagaga 2281 aatacttgag gccttctcca ggattcttgt ttcacccaca attccaactg cctgatcaga 2341 gaacttggat gtcagctatt gaaaacatgg cagaaaagct agagagcttc agtgccctga 2401 aacctgaggc cagtgagctc ttacagtcag tgccctctat gttcaacttc agggcgcctc 2461 ccaatgccct gccagagaac cttctgcgga agggaaagga gcgctatacc tgcagatact 2521 gtggcaagat ttttccaagg tctgcaaacc taacacggca cttgagaacc cacacaggag 2581 agcagcctta cagatgcaaa tactgtgaca gatcatttag catatcttct aacttgcaaa 2641 ggcatgttcg caacatccac aataaagaga agccatttaa gtgtcactta tgtgataggt 2701 gttttggtca acaaaccaat ttagacagac acctaaagaa acatgagaat gggaacatgt 2761 ccggtacagc aacatcgtcg cctcattctg aactggaaag tacaggtgcg attctggatg 2821 acaaagaaga tgcttacttc acagaaattc gaaatttcat tgggaacagc aaccatggca 2881 gccaatctcc caggaatgtg gaggagagaa tgaatggcag tcattttaaa gatgaaaagg 2941 ctttggtgac cagtcaaaat tcagacttgc tggatgatga agaagttgaa gatgaggtgt 3001 tgttagatga ggaggatgaa gacaatgata ttactggaaa aacaggaaag gaaccagtga 3061 caagtaattt acatgaagga aaccctgagg atgactatga agaaaccagt gccctggaga 3121 tgagttgcaa gacatcccca gtgaggtata aagaggaaga atataaaagt ggactttctg 3181 ctctagatca tataaggcac ttcacagata gcctcaaaat gaggaaaatg gaagataatc 3241 aatattctga agctgagctg tcttctttta gtacttccca tgtgccagag gaacttaagc 3301 agccgttaca cagaaagtcc aaatcgcagg catatgctat gatgctgtca ctgtctgaca 3361 aggagtccct ccattctaca tcccacagtt cttccaacgt gtggcacagt atggccaggg 3421 ctgcggcgga atccagtgct atccagtcca taagccacgt atgacgttat caaggttgac 3481 cagagtggga ccaagtcca //