LOCUS AJF01447.1 2512 aa PRT BCT 20-JUN-2017 DEFINITION Mycobacterium tuberculosis H37RvSiena putative peptide synthetase Nrp (peptide synthase) protein. ACCESSION CP007027-103 PROTEIN_ID AJF01447.1 SOURCE Mycobacterium tuberculosis H37RvSiena ORGANISM Mycobacterium tuberculosis H37RvSiena Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 (bases 1 to 4410911) AUTHORS Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G. TITLE Genomic polymorphisms in a Laboratory Isolate of Mycobacterium tuberculosis Reference Strain H37Rv (ATCC27294) JOURNAL New Microbiol. 40 (1), 62-69 (2017) PUBMED 27819398 REFERENCE 2 (bases 1 to 4410911) AUTHORS Guerrini,V., Santoro,F. and Pozzi,G. TITLE Direct Submission JOURNAL Submitted (30-DEC-2013) Medical Biotechnology, University of Siena, viale Bracci, Policlinico Le Scotte, Siena 53100, Italy REFERENCE 3 (bases 1 to 4410911) AUTHORS Guerrini,V., Santoro,F. and Pozzi,G. TITLE Direct Submission JOURNAL Submitted (20-FEB-2015) Medical Biotechnology, University of Siena, viale Bracci, Policlinico Le Scotte, Siena 53100, Italy REMARK Protein update by submitter COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/ Annotation modified by submitter. ##Assembly-Data-START## Assembly Method :: Ray v. 2.3.1 Coverage :: 300 Sequencing Technology :: Illumina ##Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 12/30/2013 09:28:23 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 2.3 (rev. 422554) Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes :: 4,146 CDS :: 4,050 Pseudo Genes :: 42 CRISPR Arrays :: 2 rRNAs :: 3 (5S, 16S, 23S) tRNAs :: 45 ncRNA :: 6 Frameshifted Genes :: 34 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37RvSiena" /mol_type="genomic DNA" /strain="H37RvSiena" /db_xref="taxon:1437856" protein /gene="nrp" /locus_tag="Y980_0101" /experiment="EXISTENCE: identified in proteomics study" /inference="protein motif:PROSITE:PS00012" /inference="protein motif:PROSITE:PS00017" /inference="protein motif:PROSITE:PS00455" /note="Rv0101, (MTCY251.20) Probable nrp,peptide synthetase, similar to others e.g. AAD44234.1|AF143772_40|PstB peptide synthetase from Mycobacterium avium (2552 aa); 7476034|S77657 cyclic peptide synthetase from Mycobacterium leprae (1401 aa); part of CAB55600.1|AJ238027 peptide synthetase from Mycobacterium smegmatis (5990). Also similar to e.g. AAD56240.1|AF184977_1|AF184977 DhbF protein from Bacillus subtilis (2378 aa); SRF1_BACSU|P27206 surfactin synthetase subunit 1 (3587 aa): etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop), 2 x PS00455 Putative AMP-binding domain signature, and PS00012 Phosphopantetheine attachment site. Belongs to the ATP-dependent AMP-binding enzyme family. Thought to be not involved in mycobactin biosynthesis (see citation below)." /transl_table=11 /db_xref="GOA:Q10896" /db_xref="HSSP:O30409" /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010071" /db_xref="InterPro:IPR010080" /db_xref="InterPro:IPR013120" /db_xref="InterPro:IPR016040" /db_xref="InterPro:IPR020845" /db_xref="PDB:4DQV" /db_xref="UniProtKB/TrEMBL:Q10896" BEGIN 1 MHRVRLSRSQ RNLYNGVRQD NNPALYLIGK SYRFRRLELA RFLAALHATV LDNPVQLCVL 61 ENSGADYPDL VPRLRFGDIV RVGSADEHLQ STWCSGILGK PLVRHTVHTD PNGYVTGLDV 121 HTHHILLDGG ATGTIEADLA RYLTTDPAGE TPSVGAGLAK LREAHRRETA KVEESRGRLS 181 AVVQRELADE AYHGGHGHSV SDAPGTAAKG VLHESATICG NAFDAILTLS EAQRVPLNVL 241 VAAAAVAVDA SLRQNTETLL VHTVDNRFGD SDLNVATCLV NSVAQTVRFP PFASVSDVVR 301 TLDRGYVKAV RRRWLREEHY RRMYLAINRT SHVEALTLNF IREPCAPGLR PFLSEVPIAT 361 DIGPVEGMTV ASVLDEEQRT LNLAIWNRAD LPACKTHPKV AERIAAALES MAAMWDRPIA 421 MIVNDWFGIG PDGTRCQGDW PARQPSTPAW FLDSARGVHQ FLGRRRFVYP WVAWLVQRGA 481 APGDVLVFTD DDTDKTIDLL IACHLAGCGY SVCDTADEIS VRTNAITEHG DGILVTVVDV 541 AATQLAVVGH DELRKVVDER VTQVTHDALL ATKTAYIMPT SGTTGQPKLV RISHGSLAVF 601 CDAISRAYGW GAHDTVLQCA PLTSDISVEE IFGGAACGAR LVRSAAMKTG DLAALVDDLV 661 ARETTIVDLP TAVWQLLCAD GDAIDAIGRS RLRQIVIGGE AIRCSAVDKW LESAASQGIS 721 LLSSYGPTEA TVVATFLPIV CDQTTMDGAL LRLGRPILPN TVFLAFGEVV IVGDLVADGY 781 LGIDGDGFGT VTAADGSRRR AFATGDRVTV DAEGFPVFSG RKDAVVKISG KRVDIAEVTR 841 RIAEDPAVSD VAVELHSGSL GVWFKSQRTR EGEQDAAAAT RIRLVLVSLG VSSFFVVGVP 901 NIPRKPNGKI DSDNLPRLPQ WSAAGLNTAE TGQRAAGLSQ IWSRQLGRAI GPDSSLLGEG 961 IGSLDLIRIL PETRRYLGWR LSLLDLIGAD TAANLADYAP TPDAPTGEDR FRPLVAAQRP 1021 AAIPLSFAQR RLWFLDQLQR PAPVYNMAVA LRLRGYLDTE ALGAAVADVV GRHESLRTVF 1081 PAVDGVPRQL VIEARRADLG CDIVDATAWP ADRLQRAIEE AARHSFDLAT EIPLRTWLFR 1141 IADDEHVLVA VAHHIAADGW SVAPLTADLS AAYASRCAGR APDWAPLPVQ YVDYTLWQRE 1201 ILGDLDDSDS PIAAQLAYWE NALAGMPERL RLPTARPYPP VADQRGASLV VDWPASVQQQ 1261 VRRIARQHNA TSFMVVAAGL AVLLSKLSGS PDVAVGFPIA GRSDPALDNL VGFFVNTLVL 1321 RVNLAGDPSF AELLGQVRAR SLAAYENQDV PFEVLVDRLK PTRALTHHPL IQVMLAWQDN 1381 PVGQLNLGDL QATPMPIDTR TARMDLVFSL AERFSEGSEP AGIGGAVEYR TDVFEAQAID 1441 VLIERLRKVL VAVAAAPERT VSSIDALDGT ERARLDEWGN RAVLTAPAPT PVSIPQMLAA 1501 QVARIPEAEA VCCGDASMTY RELDEASNRL AHRLAGCGAG PGECVALLFE RCAPAVVAMV 1561 AVLKTGAAYL PIDPANPPPR VAFMLGDAVP VAAVTTAGLR SRLAGHDLPI IDVVDALAAY 1621 PGTPPPMPAA VNLAYILYTS GTTGEPKGVG ITHRNVTRLF ASLPARLSAA QVWSQCHSYG 1681 FDASAWEIWG ALLGGGRLVI VPESVAASPN DFHGLLVAEH VSVLTQTPAA VAMLPTQGLE 1741 SVALVVAGEA CPAALVDRWA PGRVMLNAYG PTETTICAAI SAPLRPGSGM PPIGVPVSGA 1801 ALFVLDSWLR PVPAGVAGEL YIAGAGVGVG YWRRAGLTAS RFVACPFGGS GARMYRTGDL 1861 VCWRADGQLE FLGRTDDQVK IRGYRIELGE VATALAELAG VGQAVVIARE DRPGDKRLVG 1921 YATEIAPGAV DPAGLRAQLA QRLPGYLVPA AVVVIDALPL TVNGKLDHRA LPAPEYGDTN 1981 GYRAPAGPVE KTVAGIFARV LGLERVGVDD SFFELGGDSL AAMRVIAAIN TTLNADLPVR 2041 ALLHASSTRG LSQLLGRDAR PTSDPRLVSV HGDNPTEVHA SDLTLDRFID ADTLATAVNL 2101 PGPSPELRTV LLTGATGFLG RYLVLELLRR LDVDGRLICL VRAESDEDAR RRLEKTFDSG 2161 DPELLRHFKE LAADRLEVVA GDKSEPDLGL DQPMWRRLAE TVDLIVDSAA MVNAFPYHEL 2221 FGPNVAGTAE LIRIALTTKL KPFTYVSTAD VGAAIEPSAF TEDADIRVIS PTRTVDGGWA 2281 GGYGTSKWAG EVLLREANDL CALPVAVFRC GMILADTSYA GQLNMSDWVT RMVLSLMATG 2341 IAPRSFYEPD SEGNRQRAHF DGLPVTFVAE AIAVLGARVA GSSLAGFATY HVMNPHDDGI 2401 GLDEYVDWLI EAGYPIRRID DFAEWLQRFE ASLGALPDRQ RRHSVLPMLL ASNSQRLQPL 2461 KPTRGCSAPT DRFRAAVRAA KVGSDKDNPD IPHVSAPTII NYVTNLQLLG LL //