LOCUS       AEO98071.1              5427 aa    PRT              HTG 28-NOV-2011
DEFINITION  Emiliania huxleyi virus 203 hypothetical protein protein.
ACCESSION   JF974291-61
PROTEIN_ID  AEO98071.1
SOURCE      Emiliania huxleyi virus 203
  ORGANISM  Emiliania huxleyi virus 203
            Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota;
            Megaviricetes; Algavirales; Phycodnaviridae; Coccolithovirus.
REFERENCE   1  (bases 1 to 400520)
  AUTHORS   Nissimov,J.I., Worthy,C.A., Rooks,P., Napier,J.A., Kimmance,S.A.,
            Henn,M.R., Ogata,H. and Allen,M.J.
  TITLE     Draft Genome Sequence of the Coccolithovirus Emiliania huxleyi
            Virus 203
  JOURNAL   J. Virol. 85 (24), 13468-13469 (2011)
   PUBMED   22106382
REFERENCE   2  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     The Genome Sequence of Emiliania huxleyi virus 203
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     Direct Submission
  JOURNAL   Submitted (22-NOV-2010) Broad Institute of MIT and Harvard, 7
            Cambridge Center, Cambridge, MA 02142, USA
COMMENT     This genome was sequenced by the Broad Institute and co-owned with
            CAMERA.
            
            ##Metadata-START##
            Isolation method                 :: triple-plaque-purified
            Template preparation method      :: CsCl-gradient
            Phage/virus type                 :: DNA:ds Viral DNA
            Phage/virus "hybrid" information :: not applicable
            Phage/virus taxonomy             :: Coccolithoviridae
            Phage/virus strain               :: Strain 203
            Morphology                       :: Icosahedral
            Latitude                         :: 50.006N
            Longitude                        :: 4.3145W
            Depth (m)                        :: 15.00
            Collection date                  :: 27-Jul-2003
            Sample collection site           :: English Channel
            Filter fraction (um)             :: 0.45
            Volume filtered (L)              :: 1.0
            Habitat description              :: Collected from natural E.
                                                huxleyi bloom
            Other metadata available         :: http://www.westernchannelobserv
                                                atory.org.uk/data.php
            References                       :: PMID: 17359269; PMID: 16099989;
                                                PMID: 12209309; PMID: 16553948
            Lab Host                         :: Emiliania huxleyi 1516
            ##Metadata-END##
            * NOTE: This is a 'working draft' sequence. It currently
            * consists of 7 contigs. The true order of the pieces
            * is not known and their order in this sequence record is
            * arbitrary. Gaps between the contigs are represented as
            * runs of N, but the exact sizes of the gaps are unknown.
            * This record will be updated with the finished sequence
            * as soon as it is available and the accession number will
            * be preserved.
            *        1    20862: contig of 20862 bp in length
            *    20863    20962: gap of unknown length
            *    20963    45640: contig of 24678 bp in length
            *    45641    45740: gap of unknown length
            *    45741    78538: contig of 32798 bp in length
            *    78539    78638: gap of unknown length
            *    78639   123169: contig of 44531 bp in length
            *   123170   123269: gap of unknown length
            *   123270   178489: contig of 55220 bp in length
            *   178490   178589: gap of unknown length
            *   178590   257650: contig of 79061 bp in length
            *   257651   257750: gap of unknown length
            *   257751   400520: contig of 142770 bp in length.
FEATURES             Qualifiers
     source          /organism="Emiliania huxleyi virus 203"
                     /mol_type="genomic DNA"
                     /strain="203"
                     /isolation_source="English Channel"
                     /db_xref="taxon:181212"
                     /lab_host="Emiliania huxleyi 1516"
                     /lat_lon="50.006 N 4.3145 W"
                     /collection_date="27-Jul-2003"
     protein         /locus_tag="ELVG_00396"
BEGIN
        1 MILQKSRKLV DQLINDPLGE MLLIRKSDCN FLKCKNKQVM ITILILLLHC AYVSAQWYWM
       61 TGNALPGRTI TASTNGFSDF KENAIDGNII SRYSSRDGRP EPSPFWSVTF DQYILMRYII
      121 VFLRGPISGE PDYFYSQRMN GVTVKITDDS AVPGGTFEYS VLNDCLDSAL EPQCNELPCS
      181 YCEANNKNNI YDNGEKSRTW GWQGLGNGRV ARTITITSPN GVSTQPMSFQ ELYIWGVVSD
      241 FPSPPPPPPP PPLPPPPPPP SPPPPPPPSP PPPPPPPPPP PPSPPPPPPP PLPTPTGRSC
      301 FSYAIGPYTR GNDVTPVGEN ILEKTIFVPS DTIIVMYTQA VRQNSLRTDI ALYVNNVDTH
      361 RSMATTATLG WQTATVLWSA IYPKGYYEFR IVGVGSIEWG IGEYAQMTLM TFPLSIPGIA
      421 LYTADSPNLC PGMGATTRDI ASTTVTLTET SVLAMFAQSI TPETNGIYEL YVDSTLIAKS
      481 NWAWHSPDYV GGRYWNAVSI VQYKQYNAGT YTVKVAIRDA SGVGDFGCNG RWQRVLNVVV
      541 LPIGTSGLNL QTKAEDFQGS IALLPNENIL SYTVNVPVES TLLVAGSVLG DPGGAGSGNQ
      601 LRYDYQLASN NPDCLSKTSI YNIRPEALRS GQVFSVCDAV GSTQFDLRPV FNGYSGTPVK
      661 ESTYINTLVI PATYCDVPVA DVLPCNQLRV GDYPGNDVGN VGGVPSPSAC AAYCRNTLDA
      721 AYFGYNIDDN HCYCKDARTS ENTNAGFVSG RTCYGESFAY YYTFPLDEHE WEVVGTSATA
      781 PIMTRCGGHT ILGGFDAFGT GVTLSKKLID ITDHSGMIIK FDFMQIDSWD SEDAILTVDG
      841 VEVWRQTFTQ GSSNICGFGW NDQYYPNIQV KFSHSRTTAD IIFSTTLDGA ADNEAWGIQN
      901 IQIILTFDPV YPSGLPSSPN HWFRNDNADL NTIQWALVNP NTKYCNGFYA GTDRSFSECR
      961 DVCANDPTCT HFARNGDFNE IGFCALYAAK TCEQSTDGDT TVIAFTKVFV WPDYVGELPG
     1021 ATIQGILPTN IYVDGTNGAN SKFTAVGGSV DTAIRFGGII GADFTICSLT RYGTGTRSTI
     1081 LRGETGNWLH GHHQGNVGKM YYENWITPSI GFQPSYLSQD DWIVSCGTSG VNPVYTECTN
     1141 TRSGGKFGQT TSKISVNFNP FDSDSDFEVA EILTWPRVLS EGEMIDASTY LAGTVMGKYF
     1201 SCNPPSPPPS PPPPYPPPPS PPPPSPPPPS LPPPSTPPPP PLPLTPGGWK GQYVSIFPPD
     1261 ETLWTGTDEV TKCGPYVILG GYNVLGVGDT LSRTYTSMPH HVELLIQFDF MKIDSWDNED
     1321 AILTVDGVEV WRKTYDGSGT ELCGNGNRAE IYDAGIRVQF THTSDTAALV FSTTLSSAAS
     1381 DESWGIQNVV VNLLTDEYNT CDQWCKLEGL CTNDYHYIMV LGVRKYVYCI FDDNSRGIDV
     1441 MDTTGLTTKN NLHPNSCPEG MNIWVPRSNS FVQTLETSLS FRPKTVGIYG IANGCGGCSS
     1501 NAMNSDNAAQ AAHWKAVSPI RTPWFMRAVP YQEPNGDYTA GDWLHISTTV SDFDADGYYF
     1561 NDRTDGYPES RYYCSTNSYE LDYPANLPTG ANAWYKQGTM NVDSPTWKLL HTDGFCSSGY
     1621 YAGDAEYTGL TLEGCAEICA SEPQCGFFYW RDSGITCSRY DTRTCPFAAA PVGNTGSAYA
     1681 KKYMWADATG NGNTAMIGGS DASVVDTLTL TPGENGAWLP MTVLRGTTTT TVDFGPVIVT
     1741 DFTICSLTRY IGGSNKRILD GAGVNWLHGH HGGNLGKAYY GAWKTYSGSI TESKTEWLVM
     1801 CGSNGVTEMY ANCEPRRTED GGTSPTSLTI NNGVYANTQP SDFEVSEIIT WPRELSETEM
     1861 KAAVDYMLHE VLGQSTCTVT CEDMLTYRSL VPECTSTGGD ALDQPTCDAS YQRTGSGPTE
     1921 VTDSFQVCQY VNVACTVADT IEDCKIPPFC QTFSLRIEPQ ANSYRIRIDL TNYGAGEYIV
     1981 SADFYFSDDF DGDAWIAHST WYDTSSPDST KLEGRTASTP SSQWRSITAP KTVSFNPSHM
     2041 YWFLGFPVQS TTGYVWVTNV QVTAPNGDLL IPDGTFPNGE DLGQYDPPGD PSEFHSIVPS
     2101 CANAPTPPAV APPPLPPLPP RNGIFEFTCN FNTGDYVNNC PGVNLVTNVE KDLRYVNLAV
     2161 TEDTTGYITY TLPDFPEITE IIVRATTFIS SVNNAGQSLM ISVAQTENQG LSQTNNNMAS
     2221 QFLTRPSSGY DYAFRHYLRS NPGFRSRRHN RTNIKYSDLF DKFVEFYSRF VYTDTVLRGY
     2281 INYGNFQEAV SNADESRGIL PMPKEDKQIL TISAESTGDT YLIRDFTIQV FYSEKSPPPP
     2341 PPIPPPVPPR PPLTGNLPGK NIYARYQMGG FSLTPDNDIL SMNFELLGEY CCRADTYTLV
     2401 TIGTTDKTTC ESTCRTDETC DAYAISGCSN ANDQDCTGTC YNYREMTGTK YTDFCSDDSL
     2461 NGNAWCYIRS APSFGMWPDV SGNDFHATIS SPGAYAVTAD GNGATNQVTA LGGSTTTKVN
     2521 FGSIIPSTFT ICSVTRYTNI NSRERILDGK GANWLHGHGD KLRGVAFYGE WKTPTTSISG
     2581 PLYDWLVMCS SNGDVEIYAN CEDRRTNNGG TSPTELTIND GAFSVDSSDF EIVELVVWNR
     2641 ILSDGEHLQA TAYLYNDILG VGTCTSPPPT PPPLPPLSIG NGIASYGCDF RNGKEDHFYY
     2701 SCSPDNGFIP RSQLNSPSQS KIAESHIKLV QDSAPGTTGG VRVATQMSSR FSAIKSIVIR
     2761 ASFFISRDTG ADQVGFLTHS NDANRNTDIC NNPNDNDINA ACINTYFSGS ATQSQLETNA
     2821 VGTVKLVDFD DNPYLNQGTF KTLELIYDGT DVTGSWAHPD GPPIATSTPG TIPVNTLDYV
     2881 TFGAWTGGTR NTFSIEWFTI QVEGEPIYNT CTDWCVIGNQ CSDGIQFINV NGKSIQVRCT
     2941 YDGLIGVDTI WIQDGISTNR YDEPNSCPSG MDIWVPRSNP FVQSIVPYYY NQKLYQTESF
     3001 SSDQTWLNRF VPTVQECYNE IISRPTCHQS YFNWAARGDG NCGCINIASN PLDNIATAYE
     3061 IDIYKIVDIT SASGDIGVLF GMYGIADGCG TCSTSAMNSD SADKVAHWSA VSNPGIPWFV
     3121 RATPYIEPNG DYIEGSWLSG KFLDADGLQF NDNGAGYAFT SYICSTNYWP PSPPSPPALP
     3181 SSENCLNQQT DCTASPPECA VDTTYFRLAS CRTVTWTCSI TYGYVNGLSY LSSADENPTR
     3241 NQYDYTRIFI GETMVSETRN DVSSSPRIIF DNVENPVVEI IAVSGVPVAE QHMLVSSVCL
     3301 FQTTSPSPPP PPPPSPPPPC HQVWFDTNLP GGDLPGYPVT EVTYSSCAHR CELEPALFFT
     3361 FKKSDKSCWC KDSYTEQQTD VDYVSGQSCL KPPSPPPPSP PPPLPPLSPL LSGTLVIDGC
     3421 NMNNVRCEPA SFNTGSIRCC RDSGIPLGIS VCLGAGSGGS EFLPPIGLGV TGSKDASLYT
     3481 TSLVCTTLGY RLCTILELST PNGGACSSGC GYNTQSEHTM ISSDPCSPPS PPPPSPPPSP
     3541 PPSPPPPSPP PLLPIIPNRC RASNMLAHTI PQIDAPFSGS SILYPSSPME STFVETDEQK
     3601 TPRANASVSL DRKYAVSNAG GTVIARVQTK TFATMTVTSN KHPPVRHDTI VRTPMAWVDR
     3661 IRIRVAVQTK DQYGSPETLG PFSVSMRIDD SSNGLTGSGT CASQIGWSGN KYTMYCSMTT
     3721 SPPGNWFPLG GVADVTTELT VGGSLVNTVV DTDAVQFITP PVWYNLVYRS DGEQNNRPSP
     3781 TGYDPVTDKM FATLQVSPIY GDETFDVFIY TSTLTFPVNA WRIELRYDTT KLDYVSYSSS
     3841 GKFQAPLVDP AASGVTIFSA GLLCGTGCNA GQLDEVTGDV IYLAKVTLRP KSGQPIGGIN
     3901 TGMYPYASEI INYGGGDIHR STTGKMFDTR EGLQTTGQIT IQDIVPVGIF SYPPTITKPI
     3961 GELFNSAHLY GTTAEYQLAV YQFNSDDRHN NGNNVISIGV SPSCSYSSGA NPSIIDSITN
     4021 CNIVMTDSQT SSANDFGVIV QVISESITMT RTVLFDVYSP QTITITADDT TLNRLLDING
     4081 ATFEPGPEPC VSVYQTAKLR VDVDGVDYTT QVSFVPDDTN VIRMLSGLDA TNIMRGIQPG
     4141 TTALRLYTGA ANSISMTVSD AGVRAVAVNA RIVTDVSFIS STQPPELSPS YEYPGEVYST
     4201 VSYQNEMNSE GDFGYIFTTV EWDDGHRTDA GFLDELGTLT YTRQVDSIEI DESGEYPMVT
     4261 VAVNAVQMCI DQGIEVSYNW CGTTMITPQY IPIFIDLPIP VSLTLDIVQS RLTSEINDAR
     4321 LSPINIPTSS SMILTVTFMD TTTGELTYRE LTDDTRVTYT PISSCASVDV NNDVQIVNND
     4381 CVGSNAEISA TITIDGNGIT GSGVVSVVGI DNAQSNVEFT KYPSGGVAAS TIGKIECTNY
     4441 YHSINARLTI MLTDNAVYYI TTQATYSSNN AGVASVSGRR VLGIADGIAS ITANFGSSTS
     4501 ASATVTVSNS VTNSVSSITW NIPSLSGSTL SGIVDSSHPA AMVVTFVDGL IFNMPSGVPG
     4561 IPDIDEMMSF ESDTTFAVTT DSSGTLQLHA NSVANVIVTS TLTCRPAVTD SNTIQANLKA
     4621 DSLDVDLGQQ SGLQFVPISN GQTVDIHVYA NPGSDHYLRS ISMYIDVSDE TIIDPSSAIW
     4681 TDPPSPQFPV TVSTNIPNED KRLMQLSGAA APTAFSNLGE VFLGTLRVTS ISNGDTFVTG
     4741 QILSMQAVRT LNCNIGAAEP PTCLTSTTDP QIVAGSGVIR VGASTTLTNT QYNTLSSSIV
     4801 YASRRLSECD PCGNEADRVA GDVNADCKLL SSDASALQAF ILARQDFENT GIGDDPLESY
     4861 APNGQNCDWL KQQLNPDLNK FTEEDGYDGN AIGKPKIDAL DVISLIRTEV GFYRYMVYRS
     4921 GLSAIDEEIK STCTNPTGHF NLTINSLKQS DGIGGMVDAD PDYVDVIVEL RIEPSRNSVF
     4981 AIVGTIIEDS ENFPPSFPPY PYGTEDQTPL VVSAASLGNG AFGITISVEY SAIETSFYAA
     5041 VLVETKNAQF EKTNPTSYQS HLGSSLKPYS DSGIEFNPLI GSYQSIRLNA SQTCIPDTPL
     5101 PPPPSPPPPS PPPPPSPPPS PPPPSPPPSP PPPSPPPYPP CHEFDDETRI VGYDVPGSSI
     5161 IVASSAYDCS IQCENEHGSV SGYFTHHKTD GDCECKTWPP PEAFELNVNY ISGQTCIKPP
     5221 SPPPPSPPPP SPPPSPPPPS PPPSPPPSPP PPSPPPPSPP PPSPPPPSPP PSPPPPSPPP
     5281 PSPPPPSPTP PSPPPPSPPP PSPPPPSPPP PSPPPPSPPA PSPPPPSPLP PPSPPPPSPP
     5341 DRTSTSGTKA TIGSVAGGMS GIIILATIFA YRKIRSKKYR DFGSTADHME QGTYRITTKR
     5401 GGRGKMRDDS LPIAHQPSRF LSRGRFH
//