LOCUS AZN92517.1 2272 aa PRT BCT 18-DEC-2018 DEFINITION Pseudomonas aeruginosa hypothetical protein protein. ACCESSION CP032541-2588 PROTEIN_ID AZN92517.1 SOURCE Pseudomonas aeruginosa ORGANISM Pseudomonas aeruginosa Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales; Pseudomonadaceae; Pseudomonas. REFERENCE 1 (bases 1 to 6264451) AUTHORS Valentine,M., Withers,R.W., Johnson,S.L., Niles,R. and Yu,H. TITLE Attenuated P. aeruginosa strain PAO1 with in-frame deletion of the following genes: toxA, plcH, phzM, wapR, and aroA JOURNAL Unpublished REFERENCE 2 (bases 1 to 6264451) AUTHORS Valentine,M., Withers,R.W., Johnson,S.L., Niles,R. and Yu,H. TITLE Direct Submission JOURNAL Submitted (25-SEP-2018) BioScience, Los Alamos National Laboratory, PO Box 1663 M888, Los Alamos, NM 87545, USA COMMENT Bacteria and source DNA available from Dr. Hongwei Yu. Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 20-SEP-2018 Assembly Method :: Bowtie2 v. 2018-09-20 Assembly Name :: PGN5 Genome Representation :: Full Expected Final Version :: Yes Reference-guided Assembly :: NC_002516.2 Genome Coverage :: 217.64x Sequencing Technology :: Illumina NextSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 09/26/2018 16:28:44 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.6 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,903 CDS (total) :: 5,840 Genes (coding) :: 5,779 CDS (coding) :: 5,779 Genes (RNA) :: 63 rRNAs :: 4 (5S) complete rRNAs :: 4 (5S) tRNAs :: 55 ncRNAs :: 4 Pseudo Genes (total) :: 61 Pseudo Genes (ambiguous residues) :: 12 of 61 Pseudo Genes (frameshifted) :: 22 of 61 Pseudo Genes (incomplete) :: 18 of 61 Pseudo Genes (internal stop) :: 13 of 61 Pseudo Genes (multiple problems) :: 4 of 61 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Pseudomonas aeruginosa" /mol_type="genomic DNA" /strain="PGN5" /db_xref="taxon:287" /country="USA: WV, Huntington" /collection_date="21-Nov-2017" /collected_by="Progenesis Technologies, LLC" /identified_by="Progenesis Technologies, LLC" /note="genotype: PAO1 toxA-, plcH-, phzM-, wapR-, aroA-" protein /locus_tag="PGN5_13025" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_251152.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 GSLDNSASGT LASQADMSLR LGGGALRNQQ DGLIFSQAGA LEVQAGSLDN RQGTLQAQGD 61 NRLRIGGALD NQAGRLDSRA GNLDLQSGSL XNGAGGVLNX AXGWLXLVTG LFDNSAGVTQ 121 AQSLEIRAGQ GVRNQQGHLS ALGGDNRIVT ADFDNQGGGL YASGLLSLDG QRFLNQGAAA 181 GQGGKVGAGR IDFSLAGALA NRFGQLESES ELHLRAAAID NSGGSLRALG RSGSTRLVAG 241 DLNNAYGVLE SANQDLDLQL GSLANAGGRI LHTGNGTFGL DSGQVIRAGG ELTTNGLLDI 301 RASEWTNSSV LQAGRLNLDI GTFRQTAEGK LLAVQSFTGR GGDWSNDGLL ASNGSLRLEL 361 SGGYRGNGRA TSLGDFALNA ASLDLGNAAS LAGGANVTLG AGNLLVNRGR ITAAGDLVAS 421 AASLNNYGTL GGGGNLRLNA PALLNERGLL FSGADMTLRA GDITNLYGDV YSLGRLDIAR 481 DDAGGWANRL ENISGNLEST GDMRFSVSSL LNRRETLEIE GDLQNSAIGV RCTGCQLSER 541 WGKTRSSSEL VWIREYKSTL GDSSAAASIT AGRDLLVVGA SLQNIASNIS AVRDATLSLS 601 NFENKGYALG EYAVRGVYSP PSKFGEELLM RILAYNAVND PSYGEGYAST GGRLPNIHYF 661 DKNFNEKVSP LEVIHGNGKN GGPGWHLYFG TLDVEYPDTD RWNKAIGRIP APNYSSKKTD 721 AIPDLLKGLA PLDELTINKG ANSTVGAVVQ AGGRVTVNAA ESFNNSVLQG FQAVQETQLP 781 HQDIAVSSTT SAVVTLKSQL PADLARQQIN PLTLPGFSLP QGQNGLFRLA SQGAQVNQAS 841 GALKSASDLT QSGHGVSVSA QTGSGASGWS TQARRVGDDR VTSLAGSAYQ GRVAEAIDAL 901 RASAPISGDG GNTGRFQAGE HQATTGLGGL VEGNASGHSG NGVILADLRG GLPSFSSLPA 961 SDHVQGTVPG HDGNGTILAN WQGAQATVQA SPSTVRVEGV VSSPGGNGSI LADLPAEQSS 1021 VQALPSAVRA QGSLPRLEER SALLAEPPVG QPALQTLPSV ARVEGVPSNA TPSNSHKYLI 1081 ETNPALTELK QFLNSDYLLG GLGINPDDSK KRLGDGLYEQ RLVREAIVQR TGQRFIAGLN 1141 SDEAMFRYLM DNAIASKDVL GLTPGVTLSA AQVAALTHDI VWLEEVEVNG EKVLAPVVYL 1201 AQAEGRLGPN GALIQGRDVN LITGGDLRNA GTLRAQNDLS ATAGNIDNSG LIEAGNRLDL 1261 LASGSIRNDQ GGIIAGREVS LSALTGDVIN ERTVTQHQSS YRGTGTTEAF ADSAARIEAA 1321 QKLTVSAGRD VANIGGVIDS KGDLALQGGR DVLVSAAVAE RGWTAGSQAY QTQTTQMGAE 1381 VVAGRDISVS AGRDISVVGS RIDARRDVTF EAGRDVGLVA AANEEHAYGK TKKVTFQDDK 1441 ITQQATRVDA GGDLAINAGQ DLRLVASQAS AGDEAYLVAG DKLELLAAND SSYYLYDKKS 1501 KGSFGSKKTR RDEITDVTAV GSQISSGGDL TLLSGGDQTY QGAKLESGND LAIVSGGAVT 1561 FEAVKDLHQE SHEKSKGDLA WQSSKGKGQT DETVRQSQIV AQGNLAIKAV EGLKIDLKHI 1621 DQKTVSQTID AMVQADPQLA WLKQMEQRGD VDWRRVQELH DSWKYSNSGL GVGAQLAIAI 1681 VVAYFTAGAA SAALGSMAGV GAGSGSMMAA AGSTAMVQAG TAVGTAAAGW ANAAGTAVAM 1741 GMASNGAIST INNRGNLGDV VKDVTSSDAL RGYVVAGTTA GLTAGVYDKW TSTQTGTSTA 1801 LPNTGAVAPA AGLGTWQGVG QFTSNQLLQN GTSVLLDRAL GGKGSLGDAL QNSLANAFAA 1861 YGFKLIGDTT HGVLDDGSLG KIGLHALMGG LAAEAVGGDF RTGALAAGVN EALVDSLAKQ 1921 YASLPIDDKK GLLIMSSQLI GVLAASTQGD ADAKSLQTGA WVAGNATQHN YLSHWQEEKK 1981 RQEVDGCKDK QLCKTGIEAK WAIISAQQDV GIVVGVGGGI GLSTAETAVG VYELVKNWRE 2041 TYAALEQLAT SPEFRQQFGD NYLKGLEERA AFLTQAYEDA GWQGSVTAGV EGGRFAAELV 2101 GVLTAVKGGA QITAKLPTAA KNLVNAIAES PVSGSMSSQL GAVGDLGRLG GGGKGYVDIL 2161 SHEAKQHILY GDKPGSGGHL WPGQAGKTVF PQNWSADKIV HEVGDIATSP STKWYAQTGT 2221 GGVYTSKGDP AKWVAYEVRD GVRMRVVYQP ATGKVITAFP DNAPIPPYKP IK //