LOCUS       BCA53091.1   801 aa   PRT   BCT   14-FEB-2020
DEFINITION  Nitrospira sp. KM1 arylsulfatase protein.
ACCESSION   AP022671-231
PROTEIN_ID  BCA53091.1
SOURCE      Nitrospira sp. KM1
  ORGANISM  Nitrospira sp. KM1
            Bacteria; Nitrospirota; Nitrospiria; Nitrospirales;
            Nitrospiraceae; Nitrospira.
REFERENCE   1 (bases 1 to 4509223)
  AUTHORS   Ishii,K., Fujitani,H. and Tsuneda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Kento Ishii
            Waseda University, Department of Life Science and Medical
            Bioscience; 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo 162-8480, Japan
REFERENCE   2
  AUTHORS   Fujitani,H., Momiuchi,K., Ishii,K., Kikuchi,S., Ushiki,N.,
            Sekiguchi,Y. and Tsuneda,S.
  TITLE     Genomic and physiological characteristics of a novel
            nitrite-oxidizing Nitrospira strain isolated from a drinking
            water treatment plant
  JOURNAL   Unpublished (2020)
COMMENT     Annotated by DFAST v.1.2.3 (https://dfast.nig.ac.jp)
            ##Genome-Assembly-Data-START##
            Assembly Method       :: SPAdes v. 2.5.0
            Genome Coverage       :: 120x
            Sequencing Technology :: Illumina Miseq
            ##Genome-Assembly-Data-END##
FEATURES    Qualifiers
  source    /db_xref="taxon:1936990"
            /mol_type="genomic DNA"
            /organism="Nitrospira sp. KM1"
            /strain="KM1"
  protein   /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
            /inference="similar to AA sequence:RefSeq:WP_011118806.1"
            /locus_tag="W02_02310"
            /transl_table=11
BEGIN
        1 MNTRALLAAL LVIVSSMLGW VLPSGLTIVL AENMKTDEAS PVADGTPTVL PRPDFRFPGS
       61 VGRTFLDSDP AQFPQPVQAP KGAPNVVLIL LDDAGFGQFS TFGGGVPSPT MDNLAAQGLR
      121 FNRFHTTALC SPTRAALITG RNHHSVATAG IQEMATGYEG YTGIIPRSTG TIGEVLRQNG
      181 YMTAWIGKNH NTPAWEVSAA GPFDRWVNGL GFDYFYGFNA GDMNHWNPVL YENRDLVPAS
      241 ADPNYHLTSD IADHAVAWVR KVKSIAPDRP FFLYVAPGAT HSPHQAPKEW IDKFKGKFDM
      301 GWDTYREETF ARQKKLGVIP ANAKLTTRSA GLPAWASLNA DQKRLYARMM EVFAGYGAHV
      361 DHHMGRLIEA VQQLPGAQDT VFIYIAGDNG SSAEGGIEGS INENLFFNGF PEKWEENLKV
      421 IDELGGPKHF NHFPSAWAHA MDTPFQWTKQ VASHFGGTRN PMIVSWPSRI KDGGGLRDQF
      481 LHVIDIVPTL YEIIGITSPM VLNGVEQKSI EGISFAYTLE DAKAKDRRTT QYFELGANRG
      541 IYHDGWMASA TSFAPWQPNR TGFDPDKQRW ELYNIDQDFT QADDLASVNP QKLRELQDLW
      601 WVEAAKHNVL PLDWRGVERL NAELMGRPDP GGNRKSYTYY PGQVGLPNDA APRLLNKSWT
      661 VTADLEVPEA GVEGMVVTHG GLVGGYGLYV RDGKPAFVYN YLSLDRTMIA SQKPLPHGKV
      721 QLKIDFDYKG RAGERGEPAF VTMSVNGTKV AEGQLPKTIP NQISLGEGLD IGEDVGSPVD
      781 FSYKLPFAFT GKIEKVTVEL K
//