LOCUS QIX13890.1 4405 aa PRT VRL 11-APR-2020 DEFINITION Severe acute respiratory syndrome coronavirus 2 ORF1a polyprotein protein. ACCESSION MT322412-2 PROTEIN_ID QIX13890.1 SOURCE Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ORGANISM Severe acute respiratory syndrome coronavirus 2 Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae; Betacoronavirus; Sarbecovirus; Severe acute respiratory syndrome-related coronavirus. REFERENCE 1 (bases 1 to 29859) AUTHORS Fink,L. TITLE Direct Submission JOURNAL Submitted (08-APR-2020) Department of General Services, Division of Consolidated Laboratory Services, 600 N 5th Street, Richmond, VA 23219, USA COMMENT ##Assembly-Data-START## Assembly Method :: minimap2 v. 2.17; ivar v. 1.1 Coverage :: 3631x Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Qualifiers source /organism="Severe acute respiratory syndrome coronavirus 2" /mol_type="genomic RNA" /isolate="SARS-CoV-2/human/USA/VA-DCLS-0024/2020" /host="Homo sapiens" /db_xref="taxon:2697049" /country="USA: VA" /collection_date="2020-03" /note="EPI_ISL_420628" protein /gene="ORF1ab" BEGIN 1 MESLVPGFNE KTHVQLSLPV LQVRDVLVRG FGDSVEEVLS EARQHLKDGT CGLVEVEKGV 61 LPQLEQPYVF IKRSDARTAP HGHVMVELVA ELEGIQYGRS GETLGVLVPH VGEIPVAYRK 121 VLLRKNGNKG AGGHSYGADL KSFDLGDELG TDPYEDFQEN WNTKHSSGVT RELMRELNGG 181 AYTRYVDNNF CGPDGYPLEC IKDLLARAGK ASCTLSEQLD FIDTKRGVYC CREHEHEIAW 241 YTERSEKSYE LQTPFEIKLA KKFDTFNGEC PNFVFPLNSI IKTIQPRVEK KKLDGFMGRI 301 RSVYPVASPN ECNQMCLSTL MKCDHCGETS WQTGDFVKAT CEFCGTENLT KEGATTCGYL 361 PQNAVVKIYC PACHNSEVGP EHSLAEYHNE SGLKTILRKG GRTIAFGGCV FSYVGCHNKC 421 AYWVPRASAN IGCNHTGVVG EGSEGLNDNL LEILQKEKVN INIVGDFKLN EEIAIILASF 481 SASTSAFVET VKGLDYKAFK QIVESCGNFK VTKGKAKKGA WNIGEQKSIL SPLYAFASEA 541 ARVVRSIFSR TLETAQNSVR VLQKAAITIL DGISQYSLRL IDAMMFTSDL ATNNLVVMAY 601 ITGGVVQLTS QWLTNIFGTV YEKLKPVLDW LEEKFKEGVE FLRDGWEIVK FISTCACEIV 661 GGQIVTCAKE IKESVQTFFK LVNKFLALCA DSIIIGGAKL KALNLGETFV THSKGLYRKC 721 VKSREETGLL MPLKAPKEII FLEGETLPTE VLTEEVVLKT GDLQPLEQPT SEAVEAPLVG 781 TPVCINGLML LEIKDTEKYC ALAPNMMVTN NTFTLKGGAP TKVTFGDDTV IEVQGYKSVN 841 ITFELDERID KVLNEKCSAY TVELGTEVNE FACVVADAVI KTLQPVSELL TPLGIDLDEW 901 SMATYYLFDE SGEFKLASHM YCSFYPPDED EEEGDCEEEE FEPSTQYEYG TEDDYQGKPL 961 EFGATSAALQ PEEEQEEDWL DDDSQQTVGQ QDGSEDNQTT TIQTIVEVQP QLEMELTPVV 1021 QTIEVNSFSG YLKLTDNVYI KNADIVEEAK KVKPTVVVNA ANVYLKHGGG VAGALNKATN 1081 NAMQVESDDY IATNGPLKVG GSCVLSGHNL AKHCLHVVGP NVNKGEDIQL LKSAYENFNQ 1141 HEVLLAPLLS AGIFGADPIH SLRVCVDTVR TNVYLAVFDK NLYDKLVSSF LEMKSEKQVE 1201 QKIAEIPKEE VKPFITESKP SVEQRKQDDK KIKACVEEVT TTLEETKFLT ENLLLYIDIN 1261 GNLHPDSATL VSDIDITFLK KDAPYIVGDV VQEGVLTAVV IPTKKAGGTT EMLAKALRKV 1321 PTDNYITTYP GQGLNGYTVE EAKTVLKKCK SAFYILPSII SNEKQEILGT VSWNLREMLA 1381 HAEETRKLMP VCVETKAIVS TIQRKYKGIK IQEGVVDYGA RFYFYTSKTT VASLINTLND 1441 LNETLVTMPL GYVTHGLNLE EAARYMRSLK VPATVSVSSP DAVTAYNGYL TSSSKTPEEH 1501 FIETISLAGS YKDWSYSGQS TQLGIEFLKR GDKSVYYTSN PTTFHLDGEV ITFDNLKTLL 1561 SLREVRTIKV FTTVDNINLH TQVVDMSMTY GQQFGPTYLD GADVTKIKPH NSHEGKTFYV 1621 LPNDDTLRVE AFEYYHTTDP SFLGRYMSAL NHTKKWKYPQ VNGLTSIKWA DNNCYLATAL 1681 LTLQQIELKF NPPALQDAYY RARAGEAANF CALILAYCNK TVGELGDVRE TMSYLFQHAN 1741 LDSCKRVLNV VCKTCGQQQT TLKGVEAVMY MGTLSYEQFK KGVQIPCTCG KQATKYLVQQ 1801 ESPFVMMSAP PAQYELKHGT FTCASEYTGN YQCGHYKHIT SKETLYCIDG ALLTKSSEYK 1861 GPITDVFYKE NSYTTTIKPV TYKLDGVVCT EIDPKLDNYY KKDNSYFTEQ PIDLVPNQPY 1921 PNASFDNFKF VCDNIKFADD LNQLTGYKKP ASRELKVTFF PDLNGDVVAI DYKHYTPSFK 1981 KGAKLLHKPI VWHVNNATNK ATYKPNTWCI RCLWSTKPVE TSNSFDVLKS EDAQGMDNLA 2041 CEDLKPVSEE VVENPTIQKD VLECNVKTTE VVGDIILKPA NNSLKITEEV GHTDLMAAYV 2101 DNSSLTIKKP NELSRVLGLK TLATHGLAAV NSVPWDTIAN YAKPFLNKVV STTTNIVTRC 2161 LNRVCTNYMP YFFTLLLQLC TFTRSTNSRI KASMPTTIAK NTVKSVGKFC LEASFNYLKS 2221 PNFSKLINII IWFLLLSVCL GSLIYSTAAL GVLMSNLGMP SYCTGYREGY LNSTNVTIAT 2281 YCTGSIPCSV CLSGLDSLDT YPSLETIQIT ISSFKWDLTA FGLVAEWFLA YILFTRFFYV 2341 LGLAAIMQLF FSYFAVHFIS NSWLMWLIIN LVQMAPISAM VRMYIFFASF YYVWKSYVHV 2401 VDGCNSSTCM MCYKRNRATR VECTTIVNGV RRSFYVYANG GKGFCKLHNW NCVNCDTFCA 2461 GSTFISDEVA RDLSLQFKRP INPTDQSSYI VDSVTVKNGS IHLYFDKAGQ KTYERHSLSH 2521 FVNLDNLRAN NTKGSLPINV IVFDGKSKCE ESSAKSASVY YSQLMCQPIL LLDQALVSDV 2581 GDSAEVAVKM FDAYVNTFSS TFNVPMEKLK TLVATAEAEL AKNVSLDNVL STFISAARQG 2641 FVDSDVETKD VVECLKLSHQ SDIEVTGDSC NNYMLTYNKV ENMTPRDLGA CIDCSARHIN 2701 AQVAKSHNIA LIWNVKDFMS LSEQLRKQIR SAAKKNNLPF KLTCATTRQV VNVVTTKIAL 2761 KGGKIVNNWL KQLIKVTLVF LFVAAIFYLI TPVHVMSKHT DFSSEIIGYK AIDGGVTRDI 2821 ASTDTCFANK HADFDTWFSQ RGGSYTNDKA CPLIAAVITR EVGFVVPGLP GTILRTTNGD 2881 FLHFLPRVFS AVGNICYTPS KLIEYTDFAT SACVLAAECT IFKDASGKPV PYCYDTNVLE 2941 GSVAYESLRP DTRYVLMDGS IIQFPNTYLE GSVRVVTTFD SEYCRHGTCE RSEAGVCVST 3001 SGRWVLNNDY YRSLPGVFCG VDAVNLLTNM FTPLIQPIGA LDISASIVAG GIVAIVVTCL 3061 AYYFMRFRRA FGEYSHVVAF NTLLFLMSFT VLCLTPVYSF LPGVYSVIYL YLTFYLTNDV 3121 SFLAHIQWMV MFTPLVPFWI TIAYIICIST KHFYWFFSNY LKRRVVFNGV SFSTFEEAAL 3181 CTFLLNKEMY LKLRSDVLLP LTQYNRYLAL YNKYKYFSGA MDTTSYREAA CCHLAKALND 3241 FSNSGSDVLY QPPQTSITSA VLQSGFRKMA FPSGKVEGCM VQVTCGTTTL NGLWLDDVVY 3301 CPRHVICTSE DMLNPNYEDL LIRKSNHNFL VQAGNVQLRV IGHSMQNCVL KLKVDTANPK 3361 TPKYKFVRIQ PGQTFSVLAC YNGSPSGVYQ CAMRPNFTIK GSFLNGSCGS VGFNIDYDCV 3421 SFCYMHHMEL PTGVHAGTDL EGNFYGPFVD RQTAQAAGTD TTITVNVLAW LYAAVINGDR 3481 WFLNRFTTTL NDFNLVAMKY NYEPLTQDHV DILGPLSAQT GIAVLDMCAS LKELLQNGMN 3541 GRTILGSALL EDEFTPFDVV RQCSGVTFQS AVKRTIKGTH HWLLLTILTS LLVLVQSTQW 3601 SLFFFLYENA FLPFAMGIIA MSAFAMMFVK HKHAFLCLFL LPSLATVAYF NMVYMPASWV 3661 MRIMTWLDMV DTSLSGFKLK DCVMYASAVV LLILMTARTV YDDGARRVWT LMNVLTLVYK 3721 VYYGNALDQA ISMWALIISV TSNYSGVVTT VMFLARGIVF MCVEYCPIFF ITGNTLQCIM 3781 LVYCFLGYFC TCYFGLFCLL NRYFRLTLGV YDYLVSTQEF RYMNSQGLLP PKNSIDAFKL 3841 NIKLLGVGGK PCIKVATVQS KMSDVKCTSV VLLSVLQQLR VESSSKLWAQ CVQLHNDILL 3901 AKDTTEAFEK MVSLLSVLLS MQGAVDINKL CEEMLDNRAT LQAIASEFSS LPSYAAFATA 3961 QEAYEQAVAN GDSEVVLKKL KKSLNVAKSE FDRDAAMQRK LEKMADQAMT QMYKQARSED 4021 KRAKVTSAMQ TMLFTMLRKL DNDALNNIIN NARDGCVPLN IIPLTTAAKL MVVIPDYNTY 4081 KNTCDGTTFT YASALWEIQQ VVDADSKIVQ LSEISMDNSP NLAWPLIVTA LRANSAVKLQ 4141 NNELSPVALR QMSCAAGTTQ TACTDDNALA YYNTTKGGRF VLALLSDLQD LKWARFPKSD 4201 GTGTIYTELE PPCRFVTDTP KGPKVKYLYF IKGLNNLNRG MVLGSLAATV RLQAGNATEV 4261 PANSTVLSFC AFAVDAAKAY KDYLASGGQP ITNCVKMLCT HTGTGQAITV TPEANMDQES 4321 FGGASCCLYC RCHIDHPNPK GFCDLKGKYV QIPTTCANDP VGFTLKNTVC TVCGMWKGYG 4381 CSCDQLREPM LQSADAQSFL NGFAV //