LOCUS       AEO98393.1              1240 aa    PRT              HTG 28-NOV-2011
DEFINITION  Emiliania huxleyi virus 203 hypothetical protein protein.
ACCESSION   JF974291-383
PROTEIN_ID  AEO98393.1
SOURCE      Emiliania huxleyi virus 203
  ORGANISM  Emiliania huxleyi virus 203
            Viruses; Varidnaviria; Bamfordvirae; Nucleocytoviricota;
            Megaviricetes; Algavirales; Phycodnaviridae; Coccolithovirus.
REFERENCE   1  (bases 1 to 400520)
  AUTHORS   Nissimov,J.I., Worthy,C.A., Rooks,P., Napier,J.A., Kimmance,S.A.,
            Henn,M.R., Ogata,H. and Allen,M.J.
  TITLE     Draft Genome Sequence of the Coccolithovirus Emiliania huxleyi
            Virus 203
  JOURNAL   J. Virol. 85 (24), 13468-13469 (2011)
   PUBMED   22106382
REFERENCE   2  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     The Genome Sequence of Emiliania huxleyi virus 203
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 400520)
  AUTHORS   Henn,M.R., Allen,M., Levin,J., Malboeuf,C., Casali,M., Russ,C.,
            Lennon,N., Chapman,S.B., Erlich,R., Young,S.K., Yandava,C.,
            Zeng,Q., Alvarado,L., Anderson,S., Berlin,A., Chen,Z., Freedman,E.,
            Gellesch,M., Goldberg,J., Green,L., Griggs,A., Gujja,S.,
            Heilman,E.R., Heiman,D., Hollinger,A., Howarth,C., Larson,L.,
            Mehta,T., Pearson,M., Roberts,A., Ryan,E., Saif,S., Shea,T.,
            Shenoy,N., Sisk,P., Stolte,C., Sykes,S., White,J., Haas,B.,
            Nusbaum,C. and Birren,B.
  CONSRTM   The Broad Institute Genome Sequencing Platform
  TITLE     Direct Submission
  JOURNAL   Submitted (22-NOV-2010) Broad Institute of MIT and Harvard, 7
            Cambridge Center, Cambridge, MA 02142, USA
COMMENT     This genome was sequenced by the Broad Institute and co-owned with
            CAMERA.
            
            ##Metadata-START##
            Isolation method                 :: triple-plaque-purified
            Template preparation method      :: CsCl-gradient
            Phage/virus type                 :: DNA:ds Viral DNA
            Phage/virus "hybrid" information :: not applicable
            Phage/virus taxonomy             :: Coccolithoviridae
            Phage/virus strain               :: Strain 203
            Morphology                       :: Icosahedral
            Latitude                         :: 50.006N
            Longitude                        :: 4.3145W
            Depth (m)                        :: 15.00
            Collection date                  :: 27-Jul-2003
            Sample collection site           :: English Channel
            Filter fraction (um)             :: 0.45
            Volume filtered (L)              :: 1.0
            Habitat description              :: Collected from natural E.
                                                huxleyi bloom
            Other metadata available         :: http://www.westernchannelobserv
                                                atory.org.uk/data.php
            References                       :: PMID: 17359269; PMID: 16099989;
                                                PMID: 12209309; PMID: 16553948
            Lab Host                         :: Emiliania huxleyi 1516
            ##Metadata-END##
            * NOTE: This is a 'working draft' sequence. It currently
            * consists of 7 contigs. The true order of the pieces
            * is not known and their order in this sequence record is
            * arbitrary. Gaps between the contigs are represented as
            * runs of N, but the exact sizes of the gaps are unknown.
            * This record will be updated with the finished sequence
            * as soon as it is available and the accession number will
            * be preserved.
            *        1    20862: contig of 20862 bp in length
            *    20863    20962: gap of unknown length
            *    20963    45640: contig of 24678 bp in length
            *    45641    45740: gap of unknown length
            *    45741    78538: contig of 32798 bp in length
            *    78539    78638: gap of unknown length
            *    78639   123169: contig of 44531 bp in length
            *   123170   123269: gap of unknown length
            *   123270   178489: contig of 55220 bp in length
            *   178490   178589: gap of unknown length
            *   178590   257650: contig of 79061 bp in length
            *   257651   257750: gap of unknown length
            *   257751   400520: contig of 142770 bp in length.
FEATURES             Qualifiers
     source          /organism="Emiliania huxleyi virus 203"
                     /mol_type="genomic DNA"
                     /strain="203"
                     /isolation_source="English Channel"
                     /db_xref="taxon:181212"
                     /lab_host="Emiliania huxleyi 1516"
                     /lat_lon="50.006 N 4.3145 W"
                     /collection_date="27-Jul-2003"
     protein         /locus_tag="ELVG_00092"
BEGIN
        1 MAKKSGKTSN KKKSSDDTFT YILIGLIIIG IGYYYYYYYY NGDYDPAQIG GTSEVIGTIQ
       61 EDDTDDTDDT PAPIITRPNE PPKPIEKPKP VWLFKDGARN MIVLGETPIA SSAYGVAPAK
      121 LNQRAAHNMD YCIKSCKSLS ANVAAISPLR AMTLPSCQCY KVQNKVKLAA DMYSISKIAY
      181 LSNVTEEISN DCSNPAACAP QRSSQTSVVT KTCNTAASKN MYGYHQYHAT DSLINNTLIH
      241 IWFDTGTKKL MYKRRNNELD TGSLVYGTPA GVSFGTQYTV FKNSMRLTAN FRLGLVYSGT
      301 PPIGHKSNNA SYSRARINGI WYDWQSIIQC AVLRDARTGK ITNMDIDEYS FYTDKHKDVR
      361 VADRKMTESP DEMVMFEHNT KYYVFINRMA IESSIDNLSA IPIYPQVTYN QAHGDPSLFH
      421 YGTLKIEQVI NWVDSSTRDV WLLVHAPSGT MQGHTFKPDS NTMYTYYVKN TATPDNCLSC
      481 QVRERDVCPD NTILIGCGFG QQGRCGTCYM SSTEHGMATR NDKIDDSPCG NCQIGRLMVK
      541 TNTGKIWTDA NRRILKNAVP SHKNINDIFE GPFVNDDKGR SGKYSIEDGM KNAQLYRRIV
      601 KAARDTGRTN PTAQPAYCNL ATGSIKNDND TNTQMATGSN TWKQFKNFAY ISTLNAESYG
      661 PAICQLGTSA PNKFDGGFCT IGYADVHLNN FALDAARQQF PSYSDGGNRD NAGCIAALDR
      721 DHARGKYRHY AQNAAKNMDV TVKWTLVPRL MFPHSSPPIY NPYIGIQFLQ LTLTVKIYNK
      781 AYTDGAVEVN ENQFAEFNKG DKNYMFKFTD PGKAQNFTNT KLIVMPPYSD IRVGFNKLYR
      841 VVHNVNVDFL DFIPSLSGFS SGCVVTDVTP ANMMEDKKKY TPNDKEQTYV KSNIVKSVPN
      901 STCDKNNCMQ FDTVNHHIVP HQRRTIRGKP SFTRCYNLKS GPCRDHGHYV AWHLPRIVAE
      961 CRVMGAKPGD AFIEHHSGCQ HEGTHPVRTD CGPVNGSTKD YRVWEGIVEE KPPKITGNYN
     1021 GYSGSMKSKD VDNGCHPNDC MISTKHGQVA IQHLNIGDHV YTPSGYEPII GFFDKNNNKT
     1081 AEYYEIALEN NEIITVSRHH AIPIDGTLTD PSLIKSGDIV NTLHGITTVT SNTMISKQGA
     1141 HHFIVPSGLY YIDNVVCSDY TMHVPLIVFN IVHMYILARY NMGIPIIYRE QSILSPYWPY
     1201 HILGKMNAST TTYNACSIIF VPIIIVTEFI LSIYVNLCKK
//