LOCUS ABX20184.1 3833 aa PRT BCT 09-NOV-2020 DEFINITION Salmonella enterica subsp. arizonae serovar 62:z4,z23: - hypothetical protein protein. ACCESSION CP000880-230 PROTEIN_ID ABX20184.1 SOURCE Salmonella enterica subsp. arizonae serovar 62:z4,z23:- ORGANISM Salmonella enterica subsp. arizonae serovar 62:z4,z23:- Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Salmonella. REFERENCE 1 (bases 1 to 4600800) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Fulton,R., Chunyan,W., Wollam,A., Shah,N., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. CONSRTM The Salmonella enterica serovar Arizonae Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (02-NOV-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT Salmonella enterica subspecies IIIa (Arizonae) serovar 62:z4,z23:--Most bacteria in the species S. enterica belong to one of seven subspecies; all but subspecies I normally grow only in cold-blooded animals. Subspecies IIIa (S. Arizonae) is naturally found in reptiles, but also causes outbreaks of salmonellosis in turkeys and sheep and can occasionally produce both gastroenteritis and serious disseminated disease in humans. Many human infections can be traced to contact with reptiles or ingestion of various reptile products, particularly from rattlesnakes. Fewer than ten cases in humans are typically reported in the US each year. The strain of S. Arizonae (62:z4,z23:-) being sequenced is CDC346-86; it was named RSK2980 by R.K. Selander and is strain SARC5 of the Salmonella Reference C set. This serovar is of interest because of its taxonomic position. It appears to be the most divergent subspecies among the S. enterica. It can be obtained from the American Type Culture Collection as ATCC BAA-731, or the Salmonella Genetic Stock Centre as SGSC4693. The genome was sequenced to 8X coverage, using plasmid and fosmid libraries and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistries or covered by high quality data(i.e., phred quality >=30);an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Salmonella enterica subsp. arizonae serovar 62:z4,z23:-" /mol_type="genomic DNA" /strain="RSK2980" /serovar="62:z4,z23:-" /sub_species="arizonae" /culture_collection="ATCC:BAA-731" /db_xref="taxon:41514" protein /locus_tag="SARI_00238" /inference="protein motif:superfamily:IPR008957" /inference="protein motif:superfamily:IPR008964" /inference="protein motif:superfamily:IPR008979" /inference="protein motif:superfamily:IPR011048" /inference="protein motif:superfamily:IPR011049" /note="KEGG: chu:CHU_1906 3.9e-51 CHU large protein, possible SAP or adhesin AidA-related K01238; COG: COG5295 Autotransporter adhesin" /transl_table=11 /db_xref="InterPro:IPR008957" /db_xref="InterPro:IPR008964" /db_xref="InterPro:IPR008979" /db_xref="InterPro:IPR011048" /db_xref="InterPro:IPR011049" BEGIN 1 MRLLAVVSKL TGVSTTVESS AVTLNAPSIV KLSVARDEIS QLTRINQDLM VTLHSGETIT 61 IKNFYVTNDL GASQLVLAES DGTLWWVENP QAGLHFEQIA DINELLVTSG ASHETGGAVW 121 PWVLAGAVAA GGIAAIASSG GGDSHHSDSD NPLPDNTNPG GNPPDNTNPD GNPPDNDNPG 181 GTNPDGNNPD SSNPVDTTPP LAPSELLISA DGKTVSGQAE AGSTITIKDP SGNVVGEGKT 241 DSNGKFSIDL TTPHLSGEQL TVTATDDAGN TGPSATIDAP NIPFPDTPII TAAIDNADPL 301 TGTLSNNQFT NDSTPTLEGT GSAGAVIHIY ANGQEIGTTT VDASGNWRFS ITNALTDGEN 361 RFTAIATNVK GESSESASFT LTIDTLSPDA PIVELITDNT GLLTGPLQNN DRTDEAKPLF 421 SGQGEPGNTI TIKEGSTIIG SATVDENGRW TFMPTTPLSN GEHTFTVEQS DQAGNVSRVT 481 TTPTIIVDTT PPDAAAIDNV AKDGSTVSGT AEAGSTVSIY DPAGNYLGSA ITGDNNQFNI 541 TLNPAQTHGE RLEARIQDAV GNIGPVTEFT ASDSQYPAQP IILTVTDDAG AVTGLLKNGD 601 ATDDNRPTLS GTAEPGSTIS ISDNGFPVPT FPLVVADADG KWSFTPSLAL PDGDHVFTAT 661 ATNDRGTSGQ SVSFTIDIDT QPPVLENLAV SDIGDKLTGA TEAGSTVVIK DSQGNMLGSG 721 TAGDDGTFSI SISPAKINGE TLSISVIDKA ANSGPVETLN APDKTAPAAP NGLIVATDGL 781 SVSGQAEAGS TVTIRDSSNT VLGSAVANGN GQFIVPLNTA QTNGQALIAT ATDVAKNESA 841 AATVIAPDST APEMPKDVVI SEDGASISGT AEPGSAITIT TSDGTPLGSG KVDGEGHFTL 901 PLVPAQTNGE QVTVTATDGA NNVSPPATAQ APDITAPDKP IISQVLDDVE SFTGPLANGQ 961 TTNDSRPTLS GTAEAGARVE VFDNGVSLGL ATLQSNGSWT FTPLQNLSEG AHRLTVIATD 1021 AKGNASPAAS FDLVVDTQSP QQPVITFITD DAPGMLGSIA HLGLTNDNTP TINGTGEPGS 1081 TVHLYQNGAR IADIIVGSSG VWSYAYTTAS PLVDDTYTFT VTASDSNGNT TPFSTDFTIT 1141 IDTLVPAAPG IISVADGDGN TIDTNQLTQE SQPRLSGSGT AGDTIILYDN GNIIGQALVG 1201 ADGRWQFTPP AALGDGTHLL TARANDPAGN ESPESISFTL RVDTQAPDAP QIVSAAIAGG 1261 EGEVLLANGS ITNQRMPTLS GTGEPGAIIT LYNNGAVLDT VQVNPQGSWT YPLTSNLSEG 1321 LNVLTAIAMD AAGNSSPTSG VFSVTLDTLP PAQPDAPLIS DNVAPVIGNI GNNGATNDTT 1381 PTFSGTGEIG STIILYNNGS EIGRTTVGDN GSWNFTPAAL TPGTYIITVT ETDVAGNISP 1441 PSASVTFAID ITAPANPVIT FAEDNVGDVQ DNVASGATTD DNTPVIHGTG DIGSVITLYN 1501 GGNVLGVATV DETGTWTLPV TSALPDGTYT LTAIAADAAG NSSGVSNSFT LTVDTVPLQP 1561 PVVSEILDDV APVTGPLTDG AFTNDRTLTI NGSGENGSTV TIYDNGIEIG TALVSDGTWT 1621 FNTPVLSEAS HALTFSATDG AGNTTAQTQP IIITIDVTAP PAPSIQTVAD DGTRVAGLAD 1681 PYATVEIRNA DDILVGSAVA NATGEFAVTL SPAQTDGGTL TAIALDRAGN NGPATDFLAS 1741 DSGLPAVPVI TAIEDNVGSV QGNIAAGGAT DDATPTLRGT TDIGSTVEVF IDGDSAGFAT 1801 VDASGNWIFE IATPLSESAH SVTVQATNAK GQGGLSEPVR ITIDLSAPAQ PIITSATDDV 1861 PGVTGTLDNG ALTNDSRPTL NGTGEAGATI RILDNGVEIG SATVDQSGNW RFTPNAPLES 1921 NTHLFTAVAT DPAGNSGQPS DGFTLTIDAL APDVPVIISV IDDNNQPSIP VSPGQSTDDR 1981 QPILNGTGEP GATITIFDNG TPLGTTQVNE SGSWSFPVTS NLSEGSHDLT VSATDPAGNT 2041 SAVSTPWTIV VDITPPAIPV LTSVVDDRPG ITGDLVSGQL TNDATPTLNG RGEAGATITV 2101 YLDGNPTPIG TTTVNSDGTW SFTPQTPIAN GNHTLTLSAT DPAGNSSAVS SGFVLTIDAT 2161 PPAAPVIASV ADNTAPVTGI VPNGGSTNET RPTLAGTGEA GSTISIYNGS ALVGTAQVQA 2221 NGSWSFTPPT SLSAGVWNLT ATATDAAGNT SVASETRTFT IDTTAPAAPV ISTVYDGTGP 2281 ITGNLSPGQM TDEVRPVISG TREANTTIRL YDNGTLLAEI PADNSSSWRY TPDASLATGN 2341 HVITVIAVDA AGNASPVSDS VNFVVDTTPP LAPVITSVSD DQAPGLGTIA NGQSTNDPTP 2401 TFSGTAEAGA TITLYENGTV IGTTTAQPDG AWSVSTSTLA SGTHVITAVA TDAAGNGSPN 2461 STAFTLTVDT TAPQTPILTS VVDDVAGGAT GNLANGQITN DNRPTLNGTA EAGSVVSIYD 2521 GGTLLGVTTA DAGGAWSFAP TTGLSDGTRT LTVTATDPAG NVSPVTDSFT IVVDTLAPTV 2581 PLITSIVDDV PNNTGAIGNG QSTNDTLPTL NGTAEANSTV SIFDNGALVA TVTANANGNW 2641 SWTPTTALGQ GSHAYSVSAA DAAGNVSAAS QPTTIIVDTI APGAPGNLTI NATGNRVTGT 2701 AEAGSTVTIT TDTGVVLGTA TADGTGSFSA ALTPAQTNGQ PLLAFAQDKA GNTGITAGFT 2761 APDTRVPEAP IITTVVDDVG IYTGAIANGQ VTNDAQPTLN GTAQAGATVS IYNNGALLGT 2821 TTANASGNWS FTPAGNLTEG SHAFTATATN ANGTGSVSTA ATVIVDTQAP GTPSGTLSAD 2881 GGSLSGLAEA NSTVTVTLAG GITLTTTAGS NGAWSLTLPT KQIEGQLINV TATDAAGNAS 2941 GTLGITAPIL PLAARDNITS LDLTSTADTS TQNYSDYGLL LVGALGNVAS VLGNDIAQVE 3001 FTIAEGGTGD VTIDAAATGI VLSLLSTQEI VVQRYDTSLG AWTTIVNTAV GDFANLLTLT 3061 GSGVTLNLSG LGEGQYRVLT YNTSLLATGS YTSLDVDVHQ TSAGIISGPT MSTGNVMADD 3121 TAPTGTTVTA ITNANGVSTP VGAGGVDIQG QYGTLHINQD GSYTYTLTNP TAGYGHKESF 3181 TYTITQNGVG SSSAQLVINL GPAPVPGSVV ATDNNASLVF DTHVSYVNNG PSTQSGVTVL 3241 SVGLGNVLNA NLLDDMTNPI IFNVEEGATR TMTLQGTVGG VSLASSFDLY VYRFNDAIQQ 3301 YEQFRVQKGW INTLLLAGQS QPLTLTLPGG EYLFVLNTAS GISVLTGYTL TISQDHTYAV 3361 DSITTSTTGN VLTNDVAPAD ALLTEVNGVA ISATGTTNVN GLYGTLTIDA KGNYTYTLKN 3421 GVGADSIKTP DSFIYTVKAP NGDTDTASLN ITPTARALDA INDVSDTLSV ATLQDTAAWL 3481 DSSVGSASWG LLGKSGSGSG TFDVATGTVL KGASLVFDVS TLITLGNLNI SWAILENGTV 3541 IRNGTVPVAN ITLGGATVTV NLSGLELDAG TYTLNFTGTN TLAGAATITP RVIGTTVDLD 3601 NFETSGTHTV LGNIFDGSDA AGAMDQLNTV NTRLNISGYN GSASTLDAST NTTSATIQGH 3661 YGTLQINLDG AYTYTLNNGI AISSITSKEV FTYQLDDKMG HTDSATLTID MVPQIVSTNQ 3721 NDVLIGSAYG DTLIYHLLNG ADATGGNGTD RWQNFSTAQG DKIDIHELLT GWDHQAATLG 3781 NFVQVHTSGA NTVISVDRDG AGSAFKSTDL VTLENVQLTL NDLLQNNHLV IGG //