LOCUS BCA71502.1 447 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli SopA-central-domain-like hexapeptide repeat protein protein. ACCESSION AP022811-4826 PROTEIN_ID BCA71502.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yjgL" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:CBJ04108.1" /locus_tag="JE86ST02C_48250" /note="DFAST-ECOLI:CBJ04108.1 SopA-central-domain-like hexapeptide repeat protein [pid:100.0%, q_cov:100.0%, s_cov:100.0%, Eval:1.1e-260]" /note="MGA_4838" /transl_table=11 BEGIN 1 MSKISGWNFS QNITSADNCK QKNEDLDTWY VGMNDFARIA GGQNSRSNIL SPRAFLEFLA 61 KIFTLGYVDF SKRSNEAGRN MMAHIKSSSY IKNNDGSEIM KFVMNNPEGE RADLSKVEIE 121 ITLASAFNNG IREGHTVIIF TQPDGSTNRY EGKSFERKDE SSLHLITNKI LACYQREANK 181 EIARLLNIPQ ELNNSQDLNN SQVSCKDSVD STITDLLEKP LNNALLAIRK EHLLLMPYVC 241 NESISYLLGE KGILKEIDDL NAVNNYLLNN KKATDNEIND IKVNLSHILI DSLDDAKVNL 301 TPVIDSILEI FLKFPYINDV RILDWCFNKR MQYFGDSEKI KYACSVINHI DFSRDQSKDF 361 SCDQSKIKIA ETLFFNLDKE HYKNSRKLQE LIWDKLVAYV NDFNLSNQEK SRLILRLFDD 421 VKLLFNEVPV SILVNDIFFE GFFHEAT //