LOCUS       CCP46627.1   444 aa   PRT   BCT   27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable transposase protein.
ACCESSION   AL123456-3905
PROTEIN_ID  CCP46627.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A.,
            McLean J., Moule S., Murphy L., Oliver K., Osborne J.,
            Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K.,
            Skelton J., Squares R., Squares S., Sulston J.E., Taylor K.,
            Whitehead S., Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium
            tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4 (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC.
            Submitted on behalf of the Mycobacterium tuberculosis
            sequencing and mapping teams, Sanger Centre, Wellcome Trust
            Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique
            Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur
            Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5 (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015,
            Lausanne, Switzerland, and the Swiss Institute of
            Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4,
            SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619,
            gi:38490250, gi:38684030, gi:38490288, gi:41353667,
            gi:41353422, gi:41352756, gi:38490319, gi:41352785,
            gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website,
            Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch)
            (email: tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /locus_tag="Rv3798"
                     /note="Rv3798, (MTV026.03), len: 444 aa. Probable
                     transposase for insertion sequence element IS1557,
                     highly similar to Q60255 similar to transposase of
                     ISAE1 from Alcaligenes eutrophus H1-4 (fragment) from
                     dibenzofuran-degrading bacterium DPO360 (163 aa) FASTA
                     scores: opt: 767, E(): 3.2e-42, (67.25% identity in
                     168 aa overlap); and similar to P74920 transposase
                     from Thiobacillus ferrooxidans (404 aa), FASTA scores:
                     opt: 375, E(): 1.1e-16, (27.55% identity in 439 aa
                     overlap); Q48349 transposase from Alcaligenes
                     eutrophus (Ralstonia eutropha) (408 aa), FASTA scores:
                     opt: 324, E(): 2e-13, (3.9% identity in 369 aa
                     overlap); Q9FDC1|TNP transposase from Burkholderia
                     mallei (Pseudomonas mallei) (386 aa) FASTA scores:
                     opt: 282, E(): 9.8e-11, (25.85% identity in 391 aa
                     overlap); etc. C-terminal end identical to
                     O53804|Rv0741|MTV041.15 transposase from Mycobacterium
                     tuberculosis (104 aa), FASTA scores: opt: 582, E():
                     1.8e-30, (85.6% identity in 104 aa overlap). Belongs
                     to the transposase family 12."
                     /db_xref="EnsemblGenomes-Gn:Rv3798"
                     /db_xref="EnsemblGenomes-Tr:CCP46627"
                     /db_xref="GOA:P9WKH7"
                     /db_xref="InterPro:IPR002560"
                     /db_xref="InterPro:IPR029261"
                     /db_xref="InterPro:IPR032877"
                     /db_xref="UniProtKB/Swiss-Prot:P9WKH7"
BEGIN
        1 MRNVRLFRAL LGVDKRTVIE DIEFEEDDAG DGARVIARVR PRSAVLRRCG RCGRKASWYD
       61 RGAGLRQWRS LDWGTVEVFL EAEAPRVNCP THGPTVVAVP WARHHAGHTY AFDDTVAWLA
      121 VACSKTAVCE LMRIAWRTVG AIVARVWADT EKRIDRFANL RRIGIDEISY KRHHRYLTVV
      181 VDHDSGRLVW AAPGHDKATL GLFFDALGAE RAAQITHVSA DAADWIADVV TERCPDAIQC
      241 ADPFHVVAWA TEALDVERRR AWNDARAIAR TEPKWGRGRP GKNAAPRPGR ERARRLKGAR
      301 YALWKNPEDL TERQSAKLAW IAKTDPRLYR AYLLKESLRH VFSVKGEEGK QALDRWISWA
      361 QRCRIPVFVE LAARIKRHRV AIDAALDHGL SQGLIESTNT KIRLLTRIAF GFRSPQALIA
      421 LAMLTLAGHR PTLPGRHNHP QISQ
//