LOCUS       ACV54098.1 2099 aa PRT BCT 11-DEC-2013
DEFINITION  Eggerthella lenta DSM 2243 cell wall/surface repeat protein.
ACCESSION   CP001726-103
PROTEIN_ID  ACV54098.1
SOURCE      Eggerthella lenta DSM 2243
  ORGANISM  Eggerthella lenta DSM 2243
            Bacteria; Actinobacteria; Coriobacteriia; Eggerthellales;
            Eggerthellaceae; Eggerthella.
REFERENCE   1  (bases 1 to 3632260)
  AUTHORS   Saunders,E., Pukall,R., Abt,B., Lapidus,A., Glavina Del Rio,T.,
            Copeland,A., Tice,H., Cheng,J.F., Lucas,S., Chen,F., Nolan,M.,
            Bruce,D., Goodwin,L., Pitluck,S., Ivanova,N., Mavromatis,K.,
            Ovchinnikova,G., Pati,A., Chen,A., Palaniappan,K., Land,M.,
            Hauser,L., Chang,Y.J., Jeffries,C.D., Chain,P., Meincke,L.,
            Sims,D., Brettin,T., Detter,J.C., Goker,M., Bristow,J.,
            Eisen,J.A., Markowitz,V., Hugenholtz,P., Kyrpides,N.C.,
            Klenk,H.P. and Han,C.
  TITLE     Complete genome sequence of Eggerthella lenta type strain (IPP
            VPI 0255)
  JOURNAL   Stand Genomic Sci 1 (2), 174-182 (2009)
   PUBMED   21304654
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 3632260)
  AUTHORS   Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E.,
            Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Kyrpides,N.,
            Mavromatis,K., Ivanova,N., Ovchinnikova,G., Saunders,E.,
            Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L.,
            Markowitz,V., Cheng,J.-F., Hugenholtz,P., Woyke,T., Wu,D.,
            Pukall,R., Klenk,H.-P. and Eisen,J.A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     The complete genome of Eggerthella lenta DSM 2243
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 3632260)
  AUTHORS   Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E.,
            Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Kyrpides,N.,
            Mavromatis,K., Ivanova,N., Ovchinnikova,G., Saunders,E.,
            Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L.,
            Markowitz,V., Cheng,J.-F., Hugenholtz,P., Woyke,T., Wu,D.,
            Pukall,R., Klenk,H.-P. and Eisen,J.A.
  CONSRTM   US DOE Joint Genome Institute (JGI-PGF)
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-2009) US DOE Joint Genome Institute, 2800
            Mitchell Drive, Walnut Creek, CA 94598-1698, USA
COMMENT     URL -- http://www.jgi.doe.gov
            JGI Project ID: 4082980
            Source DNA and organism available from Hans-Peter Klenk at the
            German Collection of Microorganisms and Cell Cultures (DSMZ)
            (hans-peter.klenk@dsmz.de)
            Contacts: Jonathan A. Eisen (jaeisen@ucdavis.edu)
                      David Bruce (microbe@cuba.jgi-psf.org)
            Whole genome sequencing and draft assembly at JGI-PGF
            Finishing done by JGI-LANL
            Annotation by JGI-ORNL and JGI-PGF
            The JGI and collaborators endorse the principles for the
            distribution and use of large scale sequencing data adopted by
            the larger genome sequencing community and urge users of this
            data to follow them. It is our intention to publish the work of
            this project in a timely fashion and we welcome collaborative
            interaction on the project and analysis.
            (http://www.genome.gov/page.cfm?pageID=10506376).
            ##Metadata-START##
            Organism Display Name :: Eggerthella lenta VPI 0255, DSM 2243
            Culture Collection ID :: DSM 2243, ATCC 25559, JCM 9979
            GOLD Stamp ID         :: Gc01054
            Funding Program       :: DOE-GEBA 2007
            Number of Reads       :: 39464-Sanger, 471609-454
            Assembly Method       :: Newbler version 1.1.02.15, Phrap
            Sequencing Depth      :: 10.2x Sanger; 25.3x 454
            Gene Calling Method   :: Prodigal
            Isolation Site        :: Rectal tumor
            Collection Date       :: 1938
            Host Name             :: Homo sapiens
            Host Health           :: Patient with rectal tumor
            Body Sample Site      :: Blood
            Oxygen Requirement    :: Anaerobe
            Cell Shape            :: Rod-shaped
            Motility              :: Nonmotile
            Sporulation           :: Nonsporulating
            Temperature Range     :: Mesophile
            Temperature Optimum   :: 37C
            Gram Staining         :: gram+
            Biotic Relationship   :: Free living
            Diseases              :: Bacteremia
            Habitat               :: Blood, Host, Human intestinal microflora
            Phenotypes            :: Pathogen
            ##Metadata-END##
FEATURES             Qualifiers
     source          /organism="Eggerthella lenta DSM 2243"
                     /mol_type="genomic DNA"
                     /strain="DSM 2243"
                     /culture_collection="DSM:2243"
                     /type_material="type strain of Eggerthella lenta"
                     /db_xref="taxon:479437"
     protein         /locus_tag="Elen_0104"
                     /inference="protein motif:TFAM:TIGR02543"
                     /note="TIGRFAM: cell wall/surface repeat protein; KEGG:
                     mmw:Mmwyl1_0481 filamentous haemagglutinin outer
                     membrane protein"
                     /transl_table=11
                     /db_xref="InterPro:IPR013378"
BEGIN
        1 MLEKMLDLKN TFEGKFLAVL MSVVLVMSMT NILAFAGNEG QKDGSKTESA PTDQVVGESD
       61 KEAVDEAVQH GESAAAKDAD ASKTTPSQPL VSTTVDEAVV TFETQNAFVS VKDQLLSGTM
      121 LTTELHKELR FTASADTGFE LGAITAKNAA NADVPVTTQD GVSSIAAEYV DSTLVVSVVA
      181 AAVVSDEPEV ETTPITSDTK IEPGEADEKG EPEEPKSEEP ETDEPEAPVA DEDVVEVEAD
      241 VSNPAFEGYA QAGNVLVKVT AAEGVLPEGA TVQATRIERQ DVVDAVAERV ESQGKVLEDA
      301 IAIDVTLLDK DGNEIQPNGA LNVCFFDANV EGEEVGVYRV SDDASQVETI GARQADPAVQ
      361 SFDVDHFTIY VATGSNYAST GIKLDSSSIE VGETIKALGE RKWNSKGYSW TSSDASVAKV
      421 AFSKENRADI VGVSPGVATI TYSYKVGNRT YTDTAKVHVV PSVVKHTVSW YVNGDVSKST
      481 KVRDGEVPSY GGTPNRAGDW QYKQFVGWAT APNSKNYLTE GELPAVTEDV SYYAVFTSQA
      541 YFYFVLEGRS NTSTVAKDYM YAGEGTMIVP DGFNSGDRWY DGSNFSIADY IVSTPSDEAI
      601 RNGIRAAYAD YSPDWTYTID WTTLSVAGSS VDYRYNTFDY GKSMHTDGAL SINKDTTIGV
      661 TYLTQKPDGS VVTNSTSHDK NVAFGLNSTV NTDAPSFETD GYTYNSRVNH NGASYVFDGW
      721 YLDQSYTTKA PDSVSPSSSA SFYARYIANT KTLTYQANGG AFSDGTIQDK TAVQQVGARV
      781 SFISNPSRDG YVFTGWKDKD TGDVYSAGVS GMIMPDRDVT LVAQWQGVIP IKLLGDETKK
      841 TYSGTKQSYT GFTVSGLDMG SYTVSGVQAL AQGTDVGTYK GTIDYSGMRI FEKSSGSDVT
      901 NQFEVVEASE PTLIVEKAPI SIVTPDDSKL YDGAPLIATK GAELSGLVND ETATLIVIGS
      961 RTDVGTSDNA YQIDWSGSAK ESNYFIDEET IGTLEIAKRP VTVTAKGGEK QYDGKPLTAA
     1021 DTGYDIGGEG LVEGHEADVA LSGSQTAPGA SPATVESVAV KDGDVDVANN YDVATADGSL
     1081 KVTNRDAKYE VTLKANSSTG NVYDGTEKSA KGVVTDRFVI DDVEYAVSGY ETQDPAEVAA
     1141 GVYTNNVSGD FKVKDPAGND VTSEFAVHTE DGELEIAKRP VTVTAKGGEK QYDGKPLTAA
     1201 DTGYDIGGEG LVEGHEADVA LSGSQTAPGA SPATVESVAV KDGDVDVANN YDVATADGSL
     1261 KVTNRDAKYE VTLKANSSTG NVYDGTEKSA KGVVTDRFVI DDVEYAVSGY ETQDPAEVAA
     1321 GVYTNNVSGD FKVKDPAGND VTSEFAVHTE DGELEIAKRP VTVTAKGGEK QYDGKPLTAA
     1381 DTGYDIGGEG LVEGHEADVA LSGSQTAPGA SPATVESVAV KDGDVDVANN YDVATADGSL
     1441 KVTNRDAKYE VTLKANSSTG NVYDGTEKSA KGVVTDRFVI DDVEYAVSGY ETQDPAEVAA
     1501 GVYTNNVSGD FKVKDPAGND VTSEFAVHTE DGELEILPRE VTLASGSATK IYDGTALELP
     1561 DVTVGGDGFV GTEASVRATG SITDIGGPID NTIVVEPGEG FIAANYNVVY TVGKLTVTSA
     1621 SIDPDDPSYT GIQINNPYDV QYNGQEQRWI PSVFDRNNHK LVTGTDYTVS YSQDVRNVGV
     1681 VTVTITGIGN YMGTVERAYN ITPAPAVIRV NDSSKAYGEA DPGFTGAVEG LFGDDLLGDI
     1741 AYSRTNIDEE VGNYSDVLTA TVGNLNGNYT YTVEPGNFSI VPAGGNVVTI DATGLTKTYD
     1801 GQPVSVVAEA SVDGSALLYS VDGSTWSDAN PEFTNAGTYT VYVKATHDGY EESAPVSATV
     1861 VINPAPVTIA VADASKVAGA DDPAFSGTVE GLVAEGDLGD INYVRPGGEE AAGVYVGALT
     1921 ALYTQNGNYR VAVLNGTFTI TAAPVTPPTP PTPPTSPTPT PLPTPGTVPP DSPIAPVVTP
     1981 IVDALQGAAE AVIGDNETPL AEPRETEIGD NDTPLASHDH ASCWVHWYII LGIIVTALYG
     2041 ACVALRRGLF SRKLKKYEDG LTGGGDPAPG APSIGDDASA PIAPKGAPAG ATLAAGLGE
//