LOCUS NHPE01000070 7150 bp DNA linear BCT 15-AUG-2017 DEFINITION Halorubrum sp. Eb13 contig0125, whole genome shotgun sequence. ACCESSION NHPE01000070 NHPE01000000 VERSION NHPE01000070.1 DBLINK BioProject: PRJNA232799 BioSample: SAMN02716040 KEYWORDS WGS. SOURCE Halorubrum sp. Eb13 ORGANISM Halorubrum sp. Eb13 Archaea; Euryarchaeota; Halobacteria; Haloferacales; Halorubraceae; Halorubrum. REFERENCE 1 (bases 1 to 7150) AUTHORS Fullmer,M.S., Soucy,S.M., Swithers,K.S., Makkay,A.M., Wheeler,R., Ventosa,A., Gogarten,J.P. and Papke,R.T. TITLE Population and genomic analysis of the genus Halorubrum JOURNAL Front Microbiol 5, 140 (2014) PUBMED 24782836 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 7150) AUTHORS Fullmer,M.S. TITLE Direct Submission JOURNAL Submitted (18-MAY-2017) Molecular & Cell Biology, University of Connecticut, 91 N Eagleville, Storrs, CT 06269, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC NGS Cell v. 6.0.5 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 41x Sequencing Technology :: Illumina MiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/23/2017 11:52:44 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,573 CDS (total) :: 3,522 Genes (coding) :: 3,400 CDS (coding) :: 3,400 Genes (RNA) :: 51 rRNAs :: 1, 1, 1 (5S, 16S, 23S) complete rRNAs :: 1, 1, 1 (5S, 16S, 23S) tRNAs :: 46 ncRNAs :: 2 Pseudo Genes (total) :: 122 Pseudo Genes (ambiguous residues) :: 4 of 122 Pseudo Genes (frameshifted) :: 53 of 122 Pseudo Genes (incomplete) :: 76 of 122 Pseudo Genes (internal stop) :: 24 of 122 Pseudo Genes (multiple problems) :: 26 of 122 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7150 /organism="Halorubrum sp. Eb13" /mol_type="genomic DNA" /strain="Eb13" /isolation_source="endorheic salt-lake" /db_xref="taxon:1383843" /geo_loc_name="Iran: Aran-Bidgol" /collection_date="2009" gene 210..1388 /locus_tag="DJ75_07975" CDS 210..1388 /locus_tag="DJ75_07975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017344068.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydroxymethylbilane synthase" /protein_id="OYR45330.1" /translation="MTETLRLATRGSDLALRQAGTVRDALSSRRRDVELRRVETRGDQ IPDELIHRLGKTGAFVRSLDEEVLGGDADLAVHSLKDVPTEGMDDMVVAGVPERAPSG DVLVHPDGVGIDDLPSGAVIGTGSLRRTAQLKAARPDLVVEPLRGNVDTRIEKLLAPG LQAEHERRLIASGEASAMTAEESGADDAEDGAEDADSDDPEIDEEFDRTVEEWFDSLS DLERSAMERKVETEYDAVVLAEAGLRRSDLFHEIPTTRLPREEFVPAAGQGAIAVTAT DPDVIEDVRSAVDHPRTRVAVTVERTILGELNGGCVAPIGVSALVQGEHVHTRVRVLS TDGTEEVADTRDLPIRSHAKAAESFAADLADRGAADLIAAAREEAEVDQGTRQEADDE " gene 1381..2196 /locus_tag="DJ75_07980" CDS 1381..2196 /locus_tag="DJ75_07980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008007216.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uroporphyrinogen-III C-methyltransferase" /protein_id="OYR45331.1" /translation="MSEAKSEGDDGGESADIGTVYLVGSGPGDPDLMTVKAARLIESA DVVLHDKLPGPEILGEIPAEKREDVGKRAGGEWTPQEYTNRRLVELAREGKSVVRLKG GDPFVFGRGGEEAEHLADAGVPFEVVPGVTSAIAGPAVAGIPVTHRDHASSVSFVTGH EDPTKEESAVDWDALAATGGTIVVLMGVGKLPAYAAELRDAGLAGDTPVALVERATWP GMRVATGTLDSIVEVRDEAGIEPPAITVIGDVAAARDRVVEFLRNDGGRGEEA" gene 2193..2945 /locus_tag="DJ75_07985" CDS 2193..2945 /locus_tag="DJ75_07985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008001056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="uroporphyrinogen-III synthase" /protein_id="OYR45332.1" /translation="MSDAPRVAVFRPADERIDDAAALLESLGAEPVADPMLAVEATGA VPAAAPYVVLTSKTGVELAAEAGWEPGDATLVAIGPATAAAAREAGWTVDVVPDEYTS AGLVEALSGRVAGEAVEVARSDHGSDVLIEGLRDADATVNETVLYRLTRPEGAGESTE RAAAGDLDAAAFTSSLTVTHFLEAADERGIREAAIAGLDDATVGAIGPPTAETAAEHG IDVDVVPGDADFAALAEAVVAAATDGDVGPDR" gene 3104..3505 /locus_tag="DJ75_07990" CDS 3104..3505 /locus_tag="DJ75_07990" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008441647.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="OYR45333.1" /translation="MQPSERWSLRTDDEVLVVEFPHGTGLSPADSEALLDRWRTVTAD RAVDAVVILVRTSRPCSDAGRRALRESAQIAVARGVDRFAVVGERSKRRYLKRTIDVE GVDTEAFNDTDPALEWAKRPSTAVPSAETSS" gene complement(3580..4662) /locus_tag="DJ75_07995" CDS complement(3580..4662) /locus_tag="DJ75_07995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004596994.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="OYR45334.1" /translation="MTDRPRTTGDDRSQDTPADGFGRREFVALGAGASATLLAGCAGD DALPSSDGSDGSDGSDGSQTLTGNFRLLISDAPADIGDFDQLNVTLDEARIFEARDED DDEDDDDEDAEPDEQETDGEDDGSEEAGNATEAENDPPNGTADEDDGEGDAEDDEEGD AEDDGNEDEEDEDGDAEERGFTVVDLDGATVDLTRVIEEDAMAVFDGEIPAGSYEKIE LSVSAVEGIVDGGEVDVKLPSEKLRITNGFEVTPDEPVSFVFDINVVKRGPNNGYILK PVISGSGVAGRDVDVNEIDDDDDEDESAADAEEADGSDSDEDESAADTEEADGSDGDE DGSDGDGAAENETDGAGNETAAGSDS" gene complement(4793..6118) /locus_tag="DJ75_08000" CDS complement(4793..6118) /locus_tag="DJ75_08000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_008007571.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ethylammeline chlorohydrolase" /protein_id="OYR45335.1" /translation="MLIAGTVIADPETVVPDGAVVVEGATIAAVGAAAALREEYPDHE RREVDLVAPGLIGGHVHSVQSLGRGIADDAALLDWLFDAVLPMEAAMDAEATRAAAEL GYLECLESGTTTVVDHLSVNHAEQAFEAAIDTGIRARLGKVLMDRDSPDGLLEDTDAA LAESEALVEEYHGAAGGRIRYAVTPRFAVTCTEACLRGCRELADRHEGVTIHTHASEN EDEIEAVAADTGRRNVLWLDEVGLTGPDVTLAHCVHTDEREREVLAETDTVVTHCPSS NMKLASGIAPVHDYLDRGIAVALGNDGPPCNNTLDPFTEMRQASLLGKVDARDPTRLP AATVLEMATTNGARAAGFDRLGALREGHRADVIGLTTDRTRATPIHDPLSHLVYAAHG DDVVFTMVDGEVRYEGGEHVGIDADAVRERATRQAKRVVDEAGIDTADP" gene complement(6302..6718) /locus_tag="DJ75_08005" CDS complement(6302..6718) /locus_tag="DJ75_08005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004597089.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="OYR45336.1" /translation="MSLPTFEKKRLIGLVAVLSFGLTSLFAVLLPGALGPLIPATFIL GFFVIIPLVLLLGEDFPLVESGDAAGASAATTAAAGDPLETLRERYATGEIGEEEFER RLDRLLETEDLKGRIDADAADRRDSRDRRERETELE" gene 6836..>7150 /locus_tag="DJ75_08010" CDS 6836..>7150 /locus_tag="DJ75_08010" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006630388.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="non-canonical purine NTP pyrophosphatase" /protein_id="OYR45337.1" /translation="MLRYVTTNPGKVREAERYLPDGSVERLDFDYTEIQADGLGPIAA RGAREAYRHAGESVLVDDAGLFVEGLDGFPGPYSSYVEETLGVERVHEIAAGLDDRRA AFR" BASE COUNT 952 a 2562 c 2578 g 1058 t ORIGIN 1 cctccccaac ctcggcggac ggcccggcgc gggcggggcc gcgccggacc gccacctccc 61 tcgtgcggtc tgctcgcggc ctcacggccg ctcgcaggcg cacgccggtt ctgtttataa 121 attgtcgcgg cggcgctctc cttccgtccg cgttccggcg cgacgcccga actaatgccc 181 ttttgaacgc ggggaccgtc cggtcaggta tgaccgagac cctcaggctc gccacgcgcg 241 ggtcggacct cgccctccgg caggccggga ccgtccgcga cgcgttgtcg agccgccgac 301 gcgacgtgga actccgccgc gtcgagaccc gcggcgacca gatccccgac gagctgatcc 361 accggctcgg gaagaccggc gccttcgtcc gatcgctcga cgaggaggtg ctcggcggcg 421 acgccgacct cgccgtccac tccctgaagg acgtgccgac cgaggggatg gacgacatgg 481 tcgtcgccgg cgtccccgag cgcgcgccct cgggcgacgt gctcgtccac cccgacgggg 541 tcgggatcga tgacctgccc tcgggcgcgg tgatcgggac gggctcgctc cggcggaccg 601 cccagctcaa agccgcgcgc ccggacctcg tcgtcgagcc gctccgcggc aacgtcgaca 661 cccggatcga gaagctgctc gcgcccggcc tccaggcgga acacgagcgc cggctgatcg 721 cctcgggcga ggcgagcgcg atgaccgccg aggagagcgg tgctgacgac gccgaagacg 781 gcgccgagga cgccgacagc gacgaccccg agatcgacga ggagttcgac cgcaccgtcg 841 aggagtggtt cgactcgctc tccgacctgg agcgctcggc gatggagcgg aaggtcgaaa 901 ccgagtacga cgccgtcgtc ctcgcggagg cagggctccg gcgctccgac ctgttccacg 961 agattcccac gacccggctt ccccgcgagg agttcgttcc cgcggcgggc cagggcgcca 1021 tcgcggtgac cgcgaccgac ccggacgtga tcgaggacgt gcgctccgcg gtcgaccacc 1081 cccggacccg ggtggcggtc accgtcgagc ggacgatcct cggggaactc aacggcggct 1141 gcgtcgcccc catcggcgtc tcggcgctcg tgcagggcga gcacgtccac acccgcgtcc 1201 gcgtgctctc taccgacggc accgaggagg tggcggacac ccgcgacctc ccgatccggt 1261 cgcacgcgaa ggcggccgag tcgttcgcgg cggacctggc ggaccgtggc gcggccgatc 1321 tgatcgccgc cgcccgcgag gaggccgaag tggaccaggg gacccggcag gaggcggacg 1381 atgagtgagg cgaaaagcga gggcgacgac ggcggggaat cggccgacat cggcaccgtc 1441 tacctcgtcg gctcggggcc gggcgacccg gacctcatga ccgtgaaggc cgcccggctg 1501 atcgagtcgg ccgacgtggt cctccacgac aagctccccg gcccggagat actcggagag 1561 atcccggccg agaagcgcga ggacgtcggc aagcgcgccg gcggcgagtg gaccccccag 1621 gagtacacga accgacggct ggtcgaactc gcgcgcgagg ggaagtccgt cgtccggctc 1681 aagggcggcg acccgttcgt cttcggccgg ggcggcgaag aggccgaaca cctcgccgac 1741 gccggggtcc ccttcgaggt cgtcccgggc gtcacctcgg cgatcgcggg gcccgcagtc 1801 gccgggatcc ccgtgacgca ccgcgatcac gcctcctccg tctccttcgt cacgggccac 1861 gaggacccga cgaaggagga gtcggcggtc gactgggacg cgctcgccgc gaccggcggg 1921 acgatcgtcg tgctgatggg cgtcggcaag ctgcccgcgt acgccgccga gctccgcgac 1981 gccgggctcg ccggcgacac ccccgtcgcg ctggtggagc gcgcgacgtg gccggggatg 2041 cgcgtcgcga ccggcaccct cgactcgatc gtcgaggtcc gcgacgaggc cgggatcgag 2101 ccgcccgcga tcaccgtgat cggcgacgtg gccgcggccc gggaccgggt cgtcgagttc 2161 ctccgaaacg acggcgggcg gggtgaggag gcgtgagcga cgccccccgc gtcgccgtct 2221 tccgcccggc cgacgagcgg atagacgacg ccgcggcgct gctggagtcg ctcggcgccg 2281 agccggtcgc cgacccgatg ctcgcggtcg aagcgacggg ggcggtgccg gccgcggcgc 2341 cgtacgtcgt cctcacgagc aagaccggcg tcgagctcgc ggccgaggcg ggctgggagc 2401 cgggcgacgc gacgctcgtc gccatcggcc ccgcgaccgc cgccgcggcg cgcgaggccg 2461 gctggaccgt cgacgtcgtc cccgacgagt acacctcggc ggggctggtc gaggcgctct 2521 ccgggcgcgt cgccggcgag gccgtcgagg tggcccgctc ggaccacggc agcgacgtgc 2581 tgatcgaggg cctccgcgac gccgacgcga cggtcaacga gacggtgctg taccggctga 2641 cacggccgga gggggcgggc gagtccaccg agcgggccgc cgcgggcgac ctcgacgccg 2701 ccgcgttcac ctcgtcgctc acggtgactc acttcctcga ggccgccgac gagcggggga 2761 tccgcgaggc ggcgatcgcc ggcctcgacg acgcgaccgt gggcgcgatc ggcccgccga 2821 ccgcggagac ggcggcggaa cacgggatcg acgtcgacgt cgttccgggg gacgccgact 2881 tcgcggccct cgccgaggcg gtcgtggcgg cggcgacgga cggtgacgtc ggtccggacc 2941 ggtagttctc gggaccgata cgttcgctcg cttcgaccac acggccggga atccgtcgct 3001 cggtgcgctt ccgacgcgat ccgccgcgtc cgccctgtcg caaggaatca gatcagataa 3061 tttggaacgc acatttatta tccgacaggt gagaagcaat gacatgcaac cgagcgagcg 3121 ctggtcgctg cggaccgacg acgaggtgct agtcgtcgag ttcccacacg gaacggggct 3181 tagccccgcc gacagcgagg cgctgctcga tcggtggcgg acggtcaccg ccgaccgcgc 3241 ggtcgacgcc gtcgtgatcc tcgtgcggac gagccggccc tgctccgacg ccgggcggcg 3301 ggcgctccgg gagtcggccc agatcgccgt cgcccgcggc gtggaccggt tcgccgtcgt 3361 cggcgagcgc tcgaagcgcc gctacctcaa gcggactatc gacgtcgaag gggtcgacac 3421 cgaggcgttc aacgacaccg accccgcgct ggagtgggcg aagcgtccgt cgaccgccgt 3481 cccttccgcc gagacgtcgt cgtaggcggg tcctcgtttc aaatagaagc cggtgccggt 3541 tcgagcgacg ccggccttgc ccgaatcgct gtaccgcggt tagctgtcgc tgccggcggc 3601 ggtctcgttt ccggcaccgt cggtttcgtt ttcggcggcg ccgtcgccgt cggacccgtc 3661 ctcgtcgccg tccgacccgt ccgcttcctc ggtgtcggcg gcggattcgt cctcgtcgct 3721 gtccgacccg tccgcttcct cggcgtcggc ggcggattcg tcctcgtcgt catcgtcgtc 3781 gatctcgttg acgtcgacgt cccggcccgc gaccccgctc ccggagatca cgggtttgag 3841 gatgtagccg ttgttcggac cgcgcttgac gacgttgatg tcgaagacga agctgactgg 3901 ttcgtcgggc gtgacctcga acccgttcgt gatccggagc ttctcgctcg ggagcttcac 3961 gtcgacttcc ccgccgtcga cgatcccctc gacggcggag acggagagct cgatcttctc 4021 gtagcttccc gccggaattt caccgtcgaa cacggccata gcgtcctcct cgatcacccg 4081 cgtgaggtcg acagtggcgc cgtcgaggtc gacgacggtg aagccgcgct cctccgcgtc 4141 tccgtcttcg tcctcctcgt cttcgttccc gtcgtcctcc gcatcgcctt cctcgtcgtc 4201 ctccgcatcg ccttccccgt cgtcctcgtc cgcggttccg ttcggcgggt cgttctccgc 4261 ttcggtcgcg ttcccggcct cttccgagcc gtcgtcctca ccgtctgtct cttgttcgtc 4321 gggctccgcg tcctcgtcgt catcgtcttc gtcgtcgtcc tcgtcccgtg cctcgaagat 4381 ccgagcctcg tcgagggtga cgttcagctg atcgaagtcg ccgatgtcgg ccggcgcgtc 4441 gctgatgagg agccggaaat taccggtgag cgtctgcgac ccgtcggagc cgtccgagcc 4501 gtccgagccg tccgacgagg ggagcgcatc gtcgccggcg cagccggcga gtagcgtcgc 4561 gctcgcgccc gcgccgagcg ccacgaactc acgtcgtccg aagccgtcgg cgggggtgtc 4621 ctgagagcgg tcgtcacctg tcgtccgcgg tcggtccgtc atggtggttc acccaacgcg 4681 ccccggacgg gtatataacg gagattctca gcccgaatta atgcggatca aaccgctttc 4741 aatcgtctgg attagccgga tagacaccga cgagagacgt ctgagcgacc gttcacggat 4801 cggccgtgtc gatgcccgct tcgtcgacca cgcgcttcgc ctgtcgggtc gcgcgctcgc 4861 ggaccgcgtc ggcgtcgatc ccgacgtgct cgccgccctc gtaccggacc tcgccgtcga 4921 ccatcgtgaa gaccacgtcg tcgccgtgag cggcgtacac gagatgcgag agcgggtcgt 4981 ggatcggcgt ggcgcgcgtc cggtcggtgg tgagcccgat cacgtcggcc cggtggccct 5041 ctcgaagggc gccgagtcgg tcgaagccgg cggcgcgcgc gccgttcgtc gtcgccatct 5101 ccaacacggt cgcggcgggg agccgggtcg ggtcgcgcgc gtcgaccttc ccgaggaggc 5161 tcgcctgccg catctcggtg aacgggtcga gcgtgttgtt gcagggcggg ccgtcgttgc 5221 cgagcgcgac cgcgatcccc cggtcgaggt agtcgtggac cggggcgatc ccggaggcga 5281 gcttcatgtt cgaggagggg cagtgggtga cgaccgtgtc ggtctcggcg agcacctctc 5341 gctcgcgctc gtctgtgtgg acgcagtggg cgagcgtcac gtccggtccc gtgagcccga 5401 cctcgtcgag ccacaggacg tttcgtcggc ccgtgtcggc cgctaccgcc tcgatctcgt 5461 cctcgttctc gctggcgtgg gtgtggatcg tcacgccctc gtggcggtcg gcgagctcgc 5521 ggcagccccg caggcacgcc tccgtgcagg tgacggcgaa ccggggggtg accgcgtacc 5581 ggatccgacc gccggcggcg ccgtggtact cctcgacgag cgcctcgctc tcggcgagcg 5641 cggcgtccgt gtcctctaag agtccgtccg gggagtcccg gtccatcagc accttcccga 5701 gccgcgcgcg gatccccgtg tctatcgcgg cctcgaacgc ctgctccgcg tggttgacgg 5761 agaggtggtc gacgacggtc gtcgtcccgc tctcgaggca ttcgaggtac ccgagctcgg 5821 cggcggcgcg ggtcgcctcg gcgtccatcg cggcctccat cgggagcacg gcgtcgaaca 5881 gccagtcgag cagggcggcg tcgtcggcga tcccccggcc gagcgactgc accgagtgca 5941 cgtggccccc gatcaacccg ggcgcgacga ggtcgacctc gcggcgctcg tggtcggggt 6001 actcctcgcg gagggcggcg gcggcgccga ccgcggcgat cgtcgcgccc tcgacgacga 6061 ccgcgccgtc ggggacgacg gtctccggat cggcgatgac ggttccggcg atcaacatgt 6121 ccgtccgttc ctgatggtcg attcttaaga gggtgtcctt cgcgggtcgg cggcgctggt 6181 cgcgggtcgg cggcgagcgg cgcggatcgg cggcgagcgg cgcggatcgg cggcgctggt 6241 cgcgggtcgg cggcgagcga cggcgaccga cggagcggtc gacgccgcgt tctcccgtcc 6301 gctactccag ttccgtctcg cgctcgcgac ggtcacggga gtcgcgtcgg tccgcggcgt 6361 cggcgtcgat tcggcccttc agatcctcgg tctccagcag tcggtcgagc cggcgctcga 6421 actcctcctc gccgatctcg ccggtggcgt accgctcgcg gagcgtctcc aggggatccc 6481 cggccgccgc ggtcgtcgcg gccgacgcac cggccgcgtc cccggactcg acgagcggga 6541 agtcctcgcc gagcagcagc acgagcggga tgatcacgaa gaagccgagg atgaacgtgg 6601 cggggatcag cggccccaac gcccccggga ggaggacggc gaacagcgag gtgagcccga 6661 acgagaggac cgcgacgagc cctatcagcc gctttttctc gaacgtcggg agggacatac 6721 gcgggagatt ccgacgaacc aacaaaagcg tactgccggc ggtccgccga gcgggtcgac 6781 cgcggcggcg cccgcgaccg acggctattc gtttcggccg cccccagccg gatccgtgct 6841 cagatacgtg acgacgaacc ccgggaaagt gcgtgaggcg gagcggtacc tgccggacgg 6901 ctcggtggag cgactcgact tcgactacac ggagatccag gccgacgggc tcgggccgat 6961 cgccgcgcgg ggcgcgcggg aggcgtaccg ccacgcgggc gagtcggtgc tcgtcgacga 7021 cgcggggctg ttcgttgagg ggctcgacgg gttccccggt ccgtactcct cgtacgtgga 7081 ggagacgctc ggggtcgagc gcgtccacga gatcgccgcc ggcctcgacg accgccgggc 7141 cgcgttccgc //