LOCUS BC004178 2126 bp mRNA linear HUM 06-JUN-2006
DEFINITION Homo sapiens tRNA splicing endonuclease 2 homolog (S. cerevisiae),
mRNA (cDNA clone MGC:2776 IMAGE:2959536), complete cds.
ACCESSION BC004178
VERSION BC004178.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2126)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2126)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-MAR-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC004178.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 2 Row: a Column: 14
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 21314730.
FEATURES Location/Qualifiers
source 1..2126
/db_xref="H-InvDB:HIT000031757"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:2776 IMAGE:2959536"
/tissue_type="Colon, adenocarcinoma"
/clone_lib="NIH_MGC_15"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2126
/gene="TSEN2"
/gene_synonym="MGC2776"
/gene_synonym="MGC4440"
/gene_synonym="SEN2"
/gene_synonym="SEN2L"
/db_xref="GeneID:80746"
/db_xref="HGNC:HGNC:28422"
/db_xref="MIM:608753"
CDS 121..1518
/gene="TSEN2"
/gene_synonym="MGC2776"
/gene_synonym="MGC4440"
/gene_synonym="SEN2"
/gene_synonym="SEN2L"
/codon_start=1
/product="TSEN2 protein"
/protein_id="AAH04178.1"
/db_xref="GeneID:80746"
/db_xref="HGNC:HGNC:28422"
/db_xref="MIM:608753"
/translation="MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEM
INNNVIVRNAEDIEQLYGKGYFGKGILSRSRPSFTISDPKLVAKWKDMKTNMPIITSK
RYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKLNSGMVSN
MEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVREDASPLP
HVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAAPNE
ELVQRNRLICRRNPYRIFEYLQLSLEEAFFLVYALGCLSIYYEKEPLTIVKLWKAFTV
VQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFYHASYSVIIELVDDHFE
GSLRRPLSWKSLAALSRVSVNVSKELMLCYLIKPSTMTDKEMESPECMKRIKVQEVIL
SRWVSSRERSDQDDL"
BASE COUNT 625 a 440 c 517 g 544 t
ORIGIN
1 cctgggcgag gaaagcgcgg ccctttccga gtttggtgtt ttgcagcgaa aggaaatctc
61 gctcttccga aagtcctcca gggcgagaga ggaaagggcc tagaatacct cctctgaaaa
121 atggcagaag cagttttcca tgccccaaag aggaaaagaa gagtgtatga gacttacgag
181 tctccattgc caatcccttt tggtcaggac catggtcctc tgaaagaatt caagatattc
241 cgtgctgaaa tgattaacaa caatgtgatt gtgaggaatg cggaggacat tgagcagctc
301 tatgggaaag gttattttgg aaaaggtatt ctttcaagaa gccgtccaag cttcacaatt
361 tcagatccta aactggttgc taaatggaaa gatatgaaga caaacatgcc tatcatcaca
421 tcaaagaggt atcagcatag tgttgagtgg gcagcagagc tgatgcgtag acaggggcag
481 gatgagagta cagtgcgcag aatcctcaag gattacacga aaccgcttga gcatcctcct
541 gtgaaaagga atgaagaggc tcaagtgcat gacaagctta actctggaat ggtttccaac
601 atggaaggca cagcaggggg agagagacct tctgtggtaa acggggactc tggaaagtca
661 ggtggtgtgg gtgatccccg tgagccatta ggctgcctgc aggagggctc tggctgccac
721 ccaacaacag agagctttga gaaaagcgtg cgagaggatg cctcacctct gccccatgtc
781 tgttgctgca aacaagatgc tctcatcctc cagcgtggcc ttcatcatga agacggcagc
841 cagcacatcg gcctcctgca tcctggggac agagggcctg accatgagta cgtgctggtc
901 gaggaagcgg agtgtgccat gagcgagagg gaggctgccc caaatgagga attggtgcaa
961 agaaacaggt taatatgcag aagaaatcca tataggatct ttgagtattt gcaactcagc
1021 ctagaagagg cctttttctt ggtctatgct ctgggatgtt taagtattta ctatgagaag
1081 gagcctttaa cgatagtgaa gctctggaaa gctttcactg tagttcagcc cacgttcaga
1141 accacctaca tggcctacca ttactttcga agcaagggct gggtgcccaa agtgggactc
1201 aagtacggga cagatttact gctatatcgg aaaggccctc cattttacca tgcaagttat
1261 tctgtcatta tcgagctagt tgatgaccat tttgaaggct ctctccgcag gcctctcagt
1321 tggaagtccc tggctgcctt gagcagagtt tccgttaatg tctctaagga acttatgctg
1381 tgctatttga ttaaaccctc tactatgact gacaaggaaa tggagtcacc agaatgtatg
1441 aaaaggatta aagttcagga ggtgattctg agtcgatggg tttcttcacg agagaggagt
1501 gaccaagacg atctttaaca attcaacctc aaatttctaa tttcaccaac aactatttat
1561 tgagggctag gtaaaaagtt ctttttgttg taatcgtcca ttaattcata agttttaaag
1621 ggcatggtgc tcccagcacc agaaaactat cagtgttttt aaagataaat tacacaaggg
1681 aggagaaaga tccctgtgct aggactgcag attctatact tgcgttggcc tctaactctc
1741 caatccagag cctcctgcct ctggcgtcag tcttttccct catccactca ctggggagat
1801 tggactagag gagtcctgag aggacacttc caacaagaga catttattct ctgattttac
1861 ctgaaaatgg tagtagttta catttataca gtacagttta tgaagcactt tcatacgcag
1921 gcatctcttg ttacctacat ctaagctgtt cccgaaagag tgttacagaa cacaacagta
1981 ttgtacaata ttcgataagc atatcttcac tgcacttgtt ataaaaatga gtggtgaaat
2041 aatgtttgga gacataatga aagcgattaa catttggcaa aatataataa agcctttttg
2101 taattggaaa aaaaaaaaaa aaaaaa
//