LOCUS       BC004178                2126 bp    mRNA    linear   HUM 06-JUN-2006
DEFINITION  Homo sapiens tRNA splicing endonuclease 2 homolog (S. cerevisiae),
            mRNA (cDNA clone MGC:2776 IMAGE:2959536), complete cds.
ACCESSION   BC004178
VERSION     BC004178.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2126)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2126)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC004178.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 2 Row: a Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 21314730.
FEATURES             Location/Qualifiers
     source          1..2126
                     /db_xref="H-InvDB:HIT000031757"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2776 IMAGE:2959536"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2126
                     /gene="TSEN2"
                     /gene_synonym="MGC2776"
                     /gene_synonym="MGC4440"
                     /gene_synonym="SEN2"
                     /gene_synonym="SEN2L"
                     /db_xref="GeneID:80746"
                     /db_xref="HGNC:HGNC:28422"
                     /db_xref="MIM:608753"
     CDS             121..1518
                     /gene="TSEN2"
                     /gene_synonym="MGC2776"
                     /gene_synonym="MGC4440"
                     /gene_synonym="SEN2"
                     /gene_synonym="SEN2L"
                     /codon_start=1
                     /product="TSEN2 protein"
                     /protein_id="AAH04178.1"
                     /db_xref="GeneID:80746"
                     /db_xref="HGNC:HGNC:28422"
                     /db_xref="MIM:608753"
                     /translation="MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEM
                     INNNVIVRNAEDIEQLYGKGYFGKGILSRSRPSFTISDPKLVAKWKDMKTNMPIITSK
                     RYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKLNSGMVSN
                     MEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVREDASPLP
                     HVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAAPNE
                     ELVQRNRLICRRNPYRIFEYLQLSLEEAFFLVYALGCLSIYYEKEPLTIVKLWKAFTV
                     VQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFYHASYSVIIELVDDHFE
                     GSLRRPLSWKSLAALSRVSVNVSKELMLCYLIKPSTMTDKEMESPECMKRIKVQEVIL
                     SRWVSSRERSDQDDL"
BASE COUNT      625 a    440 c    517 g    544 t
ORIGIN
        1 cctgggcgag gaaagcgcgg ccctttccga gtttggtgtt ttgcagcgaa aggaaatctc
       61 gctcttccga aagtcctcca gggcgagaga ggaaagggcc tagaatacct cctctgaaaa
      121 atggcagaag cagttttcca tgccccaaag aggaaaagaa gagtgtatga gacttacgag
      181 tctccattgc caatcccttt tggtcaggac catggtcctc tgaaagaatt caagatattc
      241 cgtgctgaaa tgattaacaa caatgtgatt gtgaggaatg cggaggacat tgagcagctc
      301 tatgggaaag gttattttgg aaaaggtatt ctttcaagaa gccgtccaag cttcacaatt
      361 tcagatccta aactggttgc taaatggaaa gatatgaaga caaacatgcc tatcatcaca
      421 tcaaagaggt atcagcatag tgttgagtgg gcagcagagc tgatgcgtag acaggggcag
      481 gatgagagta cagtgcgcag aatcctcaag gattacacga aaccgcttga gcatcctcct
      541 gtgaaaagga atgaagaggc tcaagtgcat gacaagctta actctggaat ggtttccaac
      601 atggaaggca cagcaggggg agagagacct tctgtggtaa acggggactc tggaaagtca
      661 ggtggtgtgg gtgatccccg tgagccatta ggctgcctgc aggagggctc tggctgccac
      721 ccaacaacag agagctttga gaaaagcgtg cgagaggatg cctcacctct gccccatgtc
      781 tgttgctgca aacaagatgc tctcatcctc cagcgtggcc ttcatcatga agacggcagc
      841 cagcacatcg gcctcctgca tcctggggac agagggcctg accatgagta cgtgctggtc
      901 gaggaagcgg agtgtgccat gagcgagagg gaggctgccc caaatgagga attggtgcaa
      961 agaaacaggt taatatgcag aagaaatcca tataggatct ttgagtattt gcaactcagc
     1021 ctagaagagg cctttttctt ggtctatgct ctgggatgtt taagtattta ctatgagaag
     1081 gagcctttaa cgatagtgaa gctctggaaa gctttcactg tagttcagcc cacgttcaga
     1141 accacctaca tggcctacca ttactttcga agcaagggct gggtgcccaa agtgggactc
     1201 aagtacggga cagatttact gctatatcgg aaaggccctc cattttacca tgcaagttat
     1261 tctgtcatta tcgagctagt tgatgaccat tttgaaggct ctctccgcag gcctctcagt
     1321 tggaagtccc tggctgcctt gagcagagtt tccgttaatg tctctaagga acttatgctg
     1381 tgctatttga ttaaaccctc tactatgact gacaaggaaa tggagtcacc agaatgtatg
     1441 aaaaggatta aagttcagga ggtgattctg agtcgatggg tttcttcacg agagaggagt
     1501 gaccaagacg atctttaaca attcaacctc aaatttctaa tttcaccaac aactatttat
     1561 tgagggctag gtaaaaagtt ctttttgttg taatcgtcca ttaattcata agttttaaag
     1621 ggcatggtgc tcccagcacc agaaaactat cagtgttttt aaagataaat tacacaaggg
     1681 aggagaaaga tccctgtgct aggactgcag attctatact tgcgttggcc tctaactctc
     1741 caatccagag cctcctgcct ctggcgtcag tcttttccct catccactca ctggggagat
     1801 tggactagag gagtcctgag aggacacttc caacaagaga catttattct ctgattttac
     1861 ctgaaaatgg tagtagttta catttataca gtacagttta tgaagcactt tcatacgcag
     1921 gcatctcttg ttacctacat ctaagctgtt cccgaaagag tgttacagaa cacaacagta
     1981 ttgtacaata ttcgataagc atatcttcac tgcacttgtt ataaaaatga gtggtgaaat
     2041 aatgtttgga gacataatga aagcgattaa catttggcaa aatataataa agcctttttg
     2101 taattggaaa aaaaaaaaaa aaaaaa
//