LOCUS       AEO98423.1               681 aa    PRT              HTG 28-NOV-2011
DEFINITION  Emiliania huxleyi virus 203 hypothetical protein protein.
ACCESSION   JF974291-413
PROTEIN_ID  AEO98423.1
SOURCE      Emiliania huxleyi virus 203
  ORGANISM  Emiliania huxleyi virus 203
            Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota;
            Megaviricetes; Algavirales; Phycodnaviridae; Coccolithovirus.
REFERENCE   1  (bases 1 to 400520)
  AUTHORS   Nissimov,J.I., Worthy,C.A., Rooks,P., Napier,J.A., Kimmance,S.A.,
            Henn,M.R., Ogata,H. and Allen,M.J.
  TITLE     Draft Genome Sequence of the Coccolithovirus Emiliania huxleyi
            Virus 203
  JOURNAL   J. Virol. 85 (24), 13468-13469 (2011)
   PUBMED   22106382
REFERENCE   2  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     The Genome Sequence of Emiliania huxleyi virus 203
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     Direct Submission
  JOURNAL   Submitted (22-NOV-2010) Broad Institute of MIT and Harvard, 7
            Cambridge Center, Cambridge, MA 02142, USA
COMMENT     This genome was sequenced by the Broad Institute and co-owned with
            CAMERA.
            
            ##Metadata-START##
            Isolation method                 :: triple-plaque-purified
            Template preparation method      :: CsCl-gradient
            Phage/virus type                 :: DNA:ds Viral DNA
            Phage/virus "hybrid" information :: not applicable
            Phage/virus taxonomy             :: Coccolithoviridae
            Phage/virus strain               :: Strain 203
            Morphology                       :: Icosahedral
            Latitude                         :: 50.006N
            Longitude                        :: 4.3145W
            Depth (m)                        :: 15.00
            Collection date                  :: 27-Jul-2003
            Sample collection site           :: English Channel
            Filter fraction (um)             :: 0.45
            Volume filtered (L)              :: 1.0
            Habitat description              :: Collected from natural E.
                                                huxleyi bloom
            Other metadata available         :: http://www.westernchannelobserv
                                                atory.org.uk/data.php
            References                       :: PMID: 17359269; PMID: 16099989;
                                                PMID: 12209309; PMID: 16553948
            Lab Host                         :: Emiliania huxleyi 1516
            ##Metadata-END##
            * NOTE: This is a 'working draft' sequence. It currently
            * consists of 7 contigs. The true order of the pieces
            * is not known and their order in this sequence record is
            * arbitrary. Gaps between the contigs are represented as
            * runs of N, but the exact sizes of the gaps are unknown.
            * This record will be updated with the finished sequence
            * as soon as it is available and the accession number will
            * be preserved.
            *        1    20862: contig of 20862 bp in length
            *    20863    20962: gap of unknown length
            *    20963    45640: contig of 24678 bp in length
            *    45641    45740: gap of unknown length
            *    45741    78538: contig of 32798 bp in length
            *    78539    78638: gap of unknown length
            *    78639   123169: contig of 44531 bp in length
            *   123170   123269: gap of unknown length
            *   123270   178489: contig of 55220 bp in length
            *   178490   178589: gap of unknown length
            *   178590   257650: contig of 79061 bp in length
            *   257651   257750: gap of unknown length
            *   257751   400520: contig of 142770 bp in length.
FEATURES             Qualifiers
     source          /organism="Emiliania huxleyi virus 203"
                     /mol_type="genomic DNA"
                     /strain="203"
                     /isolation_source="English Channel"
                     /db_xref="taxon:181212"
                     /lab_host="Emiliania huxleyi 1516"
                     /lat_lon="50.006 N 4.3145 W"
                     /collection_date="27-Jul-2003"
     protein         /locus_tag="ELVG_00122"
BEGIN
        1 MGEHSSASDH VKQIDDKFTR ALGHASKTVK KNQIVFETPK NKRGSKQFTE EYRHVSKIEE
       61 ELPPHDSELW ANPYETMEYF HAYLGEHFNI EDNEKNITCA SREPTIDMWQ LAATIPLLRV
      121 DELLVIAGTG TGKSWVMSQA ARMFMIKQYL KQQSDITDIV SRDPSHVIYV LRDDKAKQQQ
      181 YVEFMKNPSI IGAAEKYFTE ADKIKFRRGQ SDPSQNPVVD TRFSPRRGLV KIVTFMSYAQ
      241 AGNFLELHGE SAFNDAILIV DEIHEIQIAA EAAGATWKES VERFEQYLSN RVESENKGLL
      301 LALTATPYKS LDGFVRLMNY FSPIGTEPLS MEEFSEQEID PVLGEQMKLC GIDRTFYNDI
      361 STGRLNGFRF VFYSIELSDQ VYAQWFAQPE IIKIHVPTDR SEADVSWIMG IANVKNNRYL
      421 AGHKNIGMDQ KKKWTPRGLN WSGRQRSIAI TTADYVNIKI QEDGIEKTII FFPTDVGAKA
      481 FSKFISAYQP NIDVIYISNE DTKNDVETKK SMFDISDDNT MLVTNSDKFG TGHTFADPVM
      541 LSNGLSMGPK HIYYIQPSTY AGMIQIEGRA RRRCLHAGWG DILPKIKRSI ILPVATVDNQ
      601 EMETCYSIMK QVTDLEKPFV ESIDPAIFNA SYTKHAFWSR RPLGLLYAEK EDPSPNKKYK
      661 IGKRNKQMNI WDIIAHYLGI S
//