LOCUS AEO98071.1 5427 aa PRT HTG 28-NOV-2011 DEFINITION Emiliania huxleyi virus 203 hypothetical protein. ACCESSION JF974291-61 PROTEIN_ID AEO98071.1 SOURCE Emiliania huxleyi virus 203 ORGANISM Emiliania huxleyi virus 203 Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota; Megaviricetes; Algavirales; Phycodnaviridae; Coccolithovirus. REFERENCE 1 (bases 1 to 400520) AUTHORS Nissimov,J.I., Worthy,C.A., Rooks,P., Napier,J.A., Kimmance,S.A., Henn,M.R., Ogata,H. and Allen,M.J. TITLE Draft Genome Sequence of the Coccolithovirus Emiliania huxleyi Virus 203 JOURNAL J. Virol. 85 (24), 13468-13469 (2011) PUBMED 22106382 REFERENCE 2 (bases 1 to 400520) AUTHORS Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B., Nusbaum,C. and Birren,B. CONSRTM The Broad Institute Genome Sequencing Platform TITLE The Genome Sequence of Emiliania huxleyi virus 203 JOURNAL Unpublished REFERENCE 3 (bases 1 to 400520) AUTHORS Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C., Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C., Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E., Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S., Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L., Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T., Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B., Nusbaum,C. and Birren,B.
CONSRTM The Broad Institute Genome Sequencing Platform TITLE Direct Submission JOURNAL Submitted (22-NOV-2010) Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA COMMENT This genome was sequenced by the Broad Institute and co-owned with CAMERA. ##Metadata-START## Isolation method :: triple-plaque-purified Template preparation method :: CsCl-gradient Phage/virus type :: DNA:ds Viral DNA Phage/virus "hybrid" information :: not applicable Phage/virus taxonomy :: Coccolithoviridae Phage/virus strain :: Strain 203 Morphology :: Icosahedral Latitude :: 50.006N Longitude :: 4.3145W Depth (m) :: 15.00 Collection date :: 27-Jul-2003 Sample collection site :: English Channel Filter fraction (um) :: 0.45 Volume filtered (L) :: 1.0 Habitat description :: Collected from natural E. huxleyi bloom Other metadata available :: http://www.westernchannelobservatory.org.uk/data.php References :: PMID: 17359269; PMID: 16099989; PMID: 12209309; PMID: 16553948 Lab Host :: Emiliania huxleyi 1516 ##Metadata-END## * NOTE: This is a 'working draft' sequence. It currently * consists of 7 contigs. The true order of the pieces * is not known and their order in this sequence record is * arbitrary. Gaps between the contigs are represented as * runs of N, but the exact sizes of the gaps are unknown. * This record will be updated with the finished sequence * as soon as it is available and the accession number will * be preserved. * 1 20862: contig of 20862 bp in length * 20863 20962: gap of unknown length * 20963 45640: contig of 24678 bp in length * 45641 45740: gap of unknown length * 45741 78538: contig of 32798 bp in length * 78539 78638: gap of unknown length * 78639 123169: contig of 44531 bp in length * 123170 123269: gap of unknown length * 123270 178489: contig of 55220 bp in length * 178490 178589: gap of unknown length * 178590 257650: contig of 79061 bp in length * 257651 257750: gap of unknown length * 257751 400520: contig of 142770 bp in length.
FEATURES Qualifiers source /organism="Emiliania huxleyi virus 203" /mol_type="genomic DNA" /strain="203" /isolation_source="English Channel" /db_xref="taxon:181212" /lab_host="Emiliania huxleyi 1516" /lat_lon="50.006 N 4.3145 W" /collection_date="27-Jul-2003" protein /locus_tag="ELVG_00396" BEGIN 1 MILQKSRKLV DQLINDPLGE MLLIRKSDCN FLKCKNKQVM ITILILLLHC AYVSAQWYWM 61 TGNALPGRTI TASTNGFSDF KENAIDGNII SRYSSRDGRP EPSPFWSVTF DQYILMRYII 121 VFLRGPISGE PDYFYSQRMN GVTVKITDDS AVPGGTFEYS VLNDCLDSAL EPQCNELPCS 181 YCEANNKNNI YDNGEKSRTW GWQGLGNGRV ARTITITSPN GVSTQPMSFQ ELYIWGVVSD 241 FPSPPPPPPP PPLPPPPPPP SPPPPPPPSP PPPPPPPPPP PPSPPPPPPP PLPTPTGRSC 301 FSYAIGPYTR GNDVTPVGEN ILEKTIFVPS DTIIVMYTQA VRQNSLRTDI ALYVNNVDTH 361 RSMATTATLG WQTATVLWSA IYPKGYYEFR IVGVGSIEWG IGEYAQMTLM TFPLSIPGIA 421 LYTADSPNLC PGMGATTRDI ASTTVTLTET SVLAMFAQSI TPETNGIYEL YVDSTLIAKS 481 NWAWHSPDYV GGRYWNAVSI VQYKQYNAGT YTVKVAIRDA SGVGDFGCNG RWQRVLNVVV 541 LPIGTSGLNL QTKAEDFQGS IALLPNENIL SYTVNVPVES TLLVAGSVLG DPGGAGSGNQ 601 LRYDYQLASN NPDCLSKTSI YNIRPEALRS GQVFSVCDAV GSTQFDLRPV FNGYSGTPVK 661 ESTYINTLVI PATYCDVPVA DVLPCNQLRV GDYPGNDVGN VGGVPSPSAC AAYCRNTLDA 721 AYFGYNIDDN HCYCKDARTS ENTNAGFVSG RTCYGESFAY YYTFPLDEHE WEVVGTSATA 781 PIMTRCGGHT ILGGFDAFGT GVTLSKKLID ITDHSGMIIK FDFMQIDSWD SEDAILTVDG 841 VEVWRQTFTQ GSSNICGFGW NDQYYPNIQV KFSHSRTTAD IIFSTTLDGA ADNEAWGIQN 901 IQIILTFDPV YPSGLPSSPN HWFRNDNADL NTIQWALVNP NTKYCNGFYA GTDRSFSECR 961 DVCANDPTCT HFARNGDFNE IGFCALYAAK TCEQSTDGDT TVIAFTKVFV WPDYVGELPG 1021 ATIQGILPTN IYVDGTNGAN SKFTAVGGSV DTAIRFGGII GADFTICSLT RYGTGTRSTI 1081 LRGETGNWLH GHHQGNVGKM YYENWITPSI GFQPSYLSQD DWIVSCGTSG VNPVYTECTN 1141 TRSGGKFGQT TSKISVNFNP FDSDSDFEVA EILTWPRVLS EGEMIDASTY LAGTVMGKYF 1201 SCNPPSPPPS PPPPYPPPPS PPPPSPPPPS LPPPSTPPPP PLPLTPGGWK GQYVSIFPPD 1261 ETLWTGTDEV TKCGPYVILG GYNVLGVGDT LSRTYTSMPH HVELLIQFDF MKIDSWDNED 1321 AILTVDGVEV WRKTYDGSGT ELCGNGNRAE IYDAGIRVQF THTSDTAALV FSTTLSSAAS 1381 DESWGIQNVV VNLLTDEYNT CDQWCKLEGL CTNDYHYIMV LGVRKYVYCI FDDNSRGIDV 1441 MDTTGLTTKN 
NLHPNSCPEG MNIWVPRSNS FVQTLETSLS FRPKTVGIYG IANGCGGCSS 1501 NAMNSDNAAQ AAHWKAVSPI RTPWFMRAVP YQEPNGDYTA GDWLHISTTV SDFDADGYYF 1561 NDRTDGYPES RYYCSTNSYE LDYPANLPTG ANAWYKQGTM NVDSPTWKLL HTDGFCSSGY 1621 YAGDAEYTGL TLEGCAEICA SEPQCGFFYW RDSGITCSRY DTRTCPFAAA PVGNTGSAYA 1681 KKYMWADATG NGNTAMIGGS DASVVDTLTL TPGENGAWLP MTVLRGTTTT TVDFGPVIVT 1741 DFTICSLTRY IGGSNKRILD GAGVNWLHGH HGGNLGKAYY GAWKTYSGSI TESKTEWLVM 1801 CGSNGVTEMY ANCEPRRTED GGTSPTSLTI NNGVYANTQP SDFEVSEIIT WPRELSETEM 1861 KAAVDYMLHE VLGQSTCTVT CEDMLTYRSL VPECTSTGGD ALDQPTCDAS YQRTGSGPTE 1921 VTDSFQVCQY VNVACTVADT IEDCKIPPFC QTFSLRIEPQ ANSYRIRIDL TNYGAGEYIV 1981 SADFYFSDDF DGDAWIAHST WYDTSSPDST KLEGRTASTP SSQWRSITAP KTVSFNPSHM 2041 YWFLGFPVQS TTGYVWVTNV QVTAPNGDLL IPDGTFPNGE DLGQYDPPGD PSEFHSIVPS 2101 CANAPTPPAV APPPLPPLPP RNGIFEFTCN FNTGDYVNNC PGVNLVTNVE KDLRYVNLAV 2161 TEDTTGYITY TLPDFPEITE IIVRATTFIS SVNNAGQSLM ISVAQTENQG LSQTNNNMAS 2221 QFLTRPSSGY DYAFRHYLRS NPGFRSRRHN RTNIKYSDLF DKFVEFYSRF VYTDTVLRGY 2281 INYGNFQEAV SNADESRGIL PMPKEDKQIL TISAESTGDT YLIRDFTIQV FYSEKSPPPP 2341 PPIPPPVPPR PPLTGNLPGK NIYARYQMGG FSLTPDNDIL SMNFELLGEY CCRADTYTLV 2401 TIGTTDKTTC ESTCRTDETC DAYAISGCSN ANDQDCTGTC YNYREMTGTK YTDFCSDDSL 2461 NGNAWCYIRS APSFGMWPDV SGNDFHATIS SPGAYAVTAD GNGATNQVTA LGGSTTTKVN 2521 FGSIIPSTFT ICSVTRYTNI NSRERILDGK GANWLHGHGD KLRGVAFYGE WKTPTTSISG 2581 PLYDWLVMCS SNGDVEIYAN CEDRRTNNGG TSPTELTIND GAFSVDSSDF EIVELVVWNR 2641 ILSDGEHLQA TAYLYNDILG VGTCTSPPPT PPPLPPLSIG NGIASYGCDF RNGKEDHFYY 2701 SCSPDNGFIP RSQLNSPSQS KIAESHIKLV QDSAPGTTGG VRVATQMSSR FSAIKSIVIR 2761 ASFFISRDTG ADQVGFLTHS NDANRNTDIC NNPNDNDINA ACINTYFSGS ATQSQLETNA 2821 VGTVKLVDFD DNPYLNQGTF KTLELIYDGT DVTGSWAHPD GPPIATSTPG TIPVNTLDYV 2881 TFGAWTGGTR NTFSIEWFTI QVEGEPIYNT CTDWCVIGNQ CSDGIQFINV NGKSIQVRCT 2941 YDGLIGVDTI WIQDGISTNR YDEPNSCPSG MDIWVPRSNP FVQSIVPYYY NQKLYQTESF 3001 SSDQTWLNRF VPTVQECYNE IISRPTCHQS YFNWAARGDG NCGCINIASN PLDNIATAYE 3061 IDIYKIVDIT SASGDIGVLF GMYGIADGCG TCSTSAMNSD SADKVAHWSA VSNPGIPWFV 3121 RATPYIEPNG DYIEGSWLSG 
KFLDADGLQF NDNGAGYAFT SYICSTNYWP PSPPSPPALP 3181 SSENCLNQQT DCTASPPECA VDTTYFRLAS CRTVTWTCSI TYGYVNGLSY LSSADENPTR 3241 NQYDYTRIFI GETMVSETRN DVSSSPRIIF DNVENPVVEI IAVSGVPVAE QHMLVSSVCL 3301 FQTTSPSPPP PPPPSPPPPC HQVWFDTNLP GGDLPGYPVT EVTYSSCAHR CELEPALFFT 3361 FKKSDKSCWC KDSYTEQQTD VDYVSGQSCL KPPSPPPPSP PPPLPPLSPL LSGTLVIDGC 3421 NMNNVRCEPA SFNTGSIRCC RDSGIPLGIS VCLGAGSGGS EFLPPIGLGV TGSKDASLYT 3481 TSLVCTTLGY RLCTILELST PNGGACSSGC GYNTQSEHTM ISSDPCSPPS PPPPSPPPSP 3541 PPSPPPPSPP PLLPIIPNRC RASNMLAHTI PQIDAPFSGS SILYPSSPME STFVETDEQK 3601 TPRANASVSL DRKYAVSNAG GTVIARVQTK TFATMTVTSN KHPPVRHDTI VRTPMAWVDR 3661 IRIRVAVQTK DQYGSPETLG PFSVSMRIDD SSNGLTGSGT CASQIGWSGN KYTMYCSMTT 3721 SPPGNWFPLG GVADVTTELT VGGSLVNTVV DTDAVQFITP PVWYNLVYRS DGEQNNRPSP 3781 TGYDPVTDKM FATLQVSPIY GDETFDVFIY TSTLTFPVNA WRIELRYDTT KLDYVSYSSS 3841 GKFQAPLVDP AASGVTIFSA GLLCGTGCNA GQLDEVTGDV IYLAKVTLRP KSGQPIGGIN 3901 TGMYPYASEI INYGGGDIHR STTGKMFDTR EGLQTTGQIT IQDIVPVGIF SYPPTITKPI 3961 GELFNSAHLY GTTAEYQLAV YQFNSDDRHN NGNNVISIGV SPSCSYSSGA NPSIIDSITN 4021 CNIVMTDSQT SSANDFGVIV QVISESITMT RTVLFDVYSP QTITITADDT TLNRLLDING 4081 ATFEPGPEPC VSVYQTAKLR VDVDGVDYTT QVSFVPDDTN VIRMLSGLDA TNIMRGIQPG 4141 TTALRLYTGA ANSISMTVSD AGVRAVAVNA RIVTDVSFIS STQPPELSPS YEYPGEVYST 4201 VSYQNEMNSE GDFGYIFTTV EWDDGHRTDA GFLDELGTLT YTRQVDSIEI DESGEYPMVT 4261 VAVNAVQMCI DQGIEVSYNW CGTTMITPQY IPIFIDLPIP VSLTLDIVQS RLTSEINDAR 4321 LSPINIPTSS SMILTVTFMD TTTGELTYRE LTDDTRVTYT PISSCASVDV NNDVQIVNND 4381 CVGSNAEISA TITIDGNGIT GSGVVSVVGI DNAQSNVEFT KYPSGGVAAS TIGKIECTNY 4441 YHSINARLTI MLTDNAVYYI TTQATYSSNN AGVASVSGRR VLGIADGIAS ITANFGSSTS 4501 ASATVTVSNS VTNSVSSITW NIPSLSGSTL SGIVDSSHPA AMVVTFVDGL IFNMPSGVPG 4561 IPDIDEMMSF ESDTTFAVTT DSSGTLQLHA NSVANVIVTS TLTCRPAVTD SNTIQANLKA 4621 DSLDVDLGQQ SGLQFVPISN GQTVDIHVYA NPGSDHYLRS ISMYIDVSDE TIIDPSSAIW 4681 TDPPSPQFPV TVSTNIPNED KRLMQLSGAA APTAFSNLGE VFLGTLRVTS ISNGDTFVTG 4741 QILSMQAVRT LNCNIGAAEP PTCLTSTTDP QIVAGSGVIR VGASTTLTNT QYNTLSSSIV 4801 YASRRLSECD PCGNEADRVA GDVNADCKLL 
SSDASALQAF ILARQDFENT GIGDDPLESY 4861 APNGQNCDWL KQQLNPDLNK FTEEDGYDGN AIGKPKIDAL DVISLIRTEV GFYRYMVYRS 4921 GLSAIDEEIK STCTNPTGHF NLTINSLKQS DGIGGMVDAD PDYVDVIVEL RIEPSRNSVF 4981 AIVGTIIEDS ENFPPSFPPY PYGTEDQTPL VVSAASLGNG AFGITISVEY SAIETSFYAA 5041 VLVETKNAQF EKTNPTSYQS HLGSSLKPYS DSGIEFNPLI GSYQSIRLNA SQTCIPDTPL 5101 PPPPSPPPPS PPPPPSPPPS PPPPSPPPSP PPPSPPPYPP CHEFDDETRI VGYDVPGSSI 5161 IVASSAYDCS IQCENEHGSV SGYFTHHKTD GDCECKTWPP PEAFELNVNY ISGQTCIKPP 5221 SPPPPSPPPP SPPPSPPPPS PPPSPPPSPP PPSPPPPSPP PPSPPPPSPP PSPPPPSPPP 5281 PSPPPPSPTP PSPPPPSPPP PSPPPPSPPP PSPPPPSPPA PSPPPPSPLP PPSPPPPSPP 5341 DRTSTSGTKA TIGSVAGGMS GIIILATIFA YRKIRSKKYR DFGSTADHME QGTYRITTKR 5401 GGRGKMRDDS LPIAHQPSRF LSRGRFH //