LOCUS CCP46461.1 248 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Possible transposase protein. ACCESSION AL123456-3739 PROTEIN_ID CCP46461.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv3638" /note="Rv3638, (MTCY15C10.14c), len: 248 aa. Possible transposase, highly similar to Q9RLQ8|ISTB ISTB protein from Mycobacterium bovis (266 aa), FASTA scores: opt: 784,E(): 4e-46, (78.0% identity in 259 aa overlap); and similar to others e.g. P15026|ISTB_PSEAE insertion sequence IS21 putative ATP-binding protein from Pseudomonas aeruginosa (265 aa), FASTA scores: opt: 420, E(): 2.2e-21, (38.8% identity in 255 aa overlap); Q45619|ISTB_BACST insertion sequence IS5376 putative ATP-binding protein from Bacillus stearothermophilus (251 aa), FASTA scores: opt: 402, E(): 3.6e-20, (34.5% identity in 232 aa overlap); P15026|ISTB_ECOLI ISTB protein from Escherichia coli (265 aa), FASTA scores: opt: 419, E(): 8e-23, (38.8% identity in 255 aa overlap); etc. C-terminus highly similar to C-terminus of P96287|Rv2944|MTCY24G1.05 hypothetical 25.5 KDA protein from Mycobacterium tuberculosis strain H37Rv (alias AAK47343|MT3016 IS1533, ORFB from Mycobacterium tuberculosis strain CDC1551) (238 aa), FASTA scores: opt: 784, E(): 3.6e-46, (87.4% identity in 135 aa overlap)." /db_xref="EnsemblGenomes-Gn:Rv3638" /db_xref="EnsemblGenomes-Tr:CCP46461" /db_xref="GOA:I6XHU7" /db_xref="InterPro:IPR002611" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR028350" /db_xref="UniProtKB/TrEMBL:I6XHU7" BEGIN 1 MAAKTATNSR DVAAELAYLT RALKAPTLRG AIEQLADRAR TKTWSYEEFL AACLQREVSA 61 RESHGGEGRI RAARFPSRKS LEEFDFDHAR GLKRDTIAHL GTLDFVTLAI GIAIRACQAG 121 HRVLFATASQ WVDRLAAAHH SGTLQSELIR LARYPLLVVD EVGYIPFEPE AANLFFQLVS 181 SRYERASLIV TSNKPFGRWG EVFGDDVVAA AMIDRLVHHA EVIALKGDSY RIKDRDLGRV 241 PTVTADDQ //