LOCUS DQIR01312624 3409 bp RNA linear TSA 28-FEB-2019 DEFINITION TPA_asm: Sus scrofa Susscr4EVm002634t2, transcribed RNA sequence. ACCESSION DQIR01312624 VERSION DQIR01312624.1 DBLINK BioProject: PRJNA480168 BioSample: SAMEA3303526, SAMEA3303527, SAMEA3303528, SAMEA3303529, SAMEA3303531, SAMEA3303532, SAMEA4447539, SAMEA4447549, SAMEA4447554, SAMN02925400, SAMN02925407, SAMN05952593, SAMN05952594, SAMN05952595, SAMN05952596, SAMN05952612, SAMN05952613, SAMN05952614, SAMN05952615, SAMN05952626, SAMN07956242, SAMN07956243, SAMN07956244, SAMN07956245, SAMN07956246, SAMN07956247 Sequence Read Archive: ERR789444, ERR789445, ERR789446, ERR789447, ERR789449, ERR789450, ERR972387, ERR972388, ERR972389, SRR1519321, SRR1519322, SRR5027048, SRR5027049, SRR5027050, SRR5027051, SRR5027060, SRR5027061, SRR5027062, SRR5027063, SRR5027064, SRR6236876, SRR6236879, SRR6236881, SRR6236882, SRR6236888, SRR6236889 KEYWORDS TSA; Transcriptome Shotgun Assembly; Third Party Data; TPA; TPA:assembly. SOURCE Sus scrofa (pig) ORGANISM Sus scrofa Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; Sus. REFERENCE 1 (bases 1 to 3409) AUTHORS Gilbert,D.G. TITLE Genes of the pig, Sus scrofa, reconstructed with EvidentialGene JOURNAL PeerJ 7, e6374 (2019) PUBMED 30723633 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 3409) AUTHORS Gilbert,D.G. TITLE Direct Submission JOURNAL Submitted (28-JUL-2018) Indiana University, IN, USA COMMENT RNA-Seq data of Sus_scrofa are assembled with various RNA assemblers, using multiple options. EvidentialGene tr2aacds pipeline is used to process the many resulting assemblies by coding sequences, translated proteins, and gene evidence, then classify/reduce to a biologically informative transcriptome of primary and alternate transcripts. ##Assembly-Data-START## Assembly Method :: EvidentialGene v2018.06.18 Assembly Name :: pig4321ew.evigene Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..3409 /organism="Sus scrofa" /mol_type="transcribed RNA" /submitter_seqid="Susscr4EVm002634t2" /db_xref="taxon:9823" CDS 96..2873 /inference="similar to AA sequence:GenBank:NP_001230261.1" /note="original_id:Susscrtrtrin1a_sBn2l1SRR1519321trinLocD N40567c0g2t1; located at chr3:18098111-18119537:-, 99 pct aligned, 19 exons; protein is 100 pct similar to human18nc:NP_001230261.1 gene.; db_xref_other:GenBank:NP_001230261.1; db_xref_other:GenBank:XP_005224910.1; db_xref_other:pig18ncrf:XM_003354543.4" /codon_start=1 /product="seizure 6-like protein 2 isoform 5 precursor" /protein_id="HDC68102.1" /translation="MGTPRAQHPPPPQLLFLFLLSCPWIQGLPLKEEEALLEPGSETP TVASEALAELLHGALLRRGPEMGYLPGSDPDPTLATPPAGQTLAAPSLPRATEPGTGP LTTAVTPKGGRGAGPTAPELLTPPPGTTAPPLPGPASPGPPLGPGPEGGEEETTTTII TTTTVTTTVTSPVLCNNNISEGEGHVESPDLGSPASRTLGLLDCTYSIHVYPGYGIEI QVQTLNLSREEELLVLAGGGSPGLAPRLLANSSMLGEGQVLRSPTNRLLLHFQSPRVP RGGGFRIHYQAYLLSCGFPPRPAHGDVSVTDLHPGGTATFHCDSGYQLQGEETLICLN GTRPAWSSEPPSCMASCGGTIHNATLGRIVSPEPGGAAGPNLTCRWVIEAAEGRRLHL HFERVSLDEDNDRLMVRSGGSPLSPVIYDSDMDDVPERGLISDAQSLYVELLSETPAN PLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPGTLATFSCLPGYALEPPGPPSAI ECVDPTEPHWNDTEPACKAMCGGELSEPAGVVLSPDWPQSYSPGQDCVWGLHVQEEKR ILLQVEILNVREGDMLTLFDGDGPSARVLAQLRGPQPRRRLLSSGPDLTLQFQAPPGP PNPGLGQGFVLHFKEVPRNDTCPELPPPEWGWRTASHGDLIRGTVLTYQCEPGYELLG SDILTCQWDLSWSAAPPACQKIMTCADPGEITNGHRTTSDAGFPVGSHVQYRCLPGYS LEGAAVLTCYSRDTGTPKWSDRVPKCALKYEPCLNPGVPENGYQTLYKHHYQAGESLR FFCYEGFELIGEVTITCVPGHPSQWTSQPPLCKVAYEELLDNRKLEVTQTTDPSRQLE GGNLALAILLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSHSYSPITVESDFSNPLYE AGDTREYEVSI" BASE COUNT 648 a 1161 c 965 g 635 t ORIGIN 1 cgacaccttt ctcagcttct ccctcccgca gctccagtgc ccaagcctcg gctttccagg 61 agaggcacgg aagagagaaa tcggtgtgag tcgccatggg gactcccagg gcccagcatc 121 caccgcctcc ccagctgctg ttcctatttc tgctgagctg tccctggatt cagggtctgc 181 ccctgaagga agaggaggca ctgctggagc ctggaagcga gacccccaca gtagcctctg 241 aggccttggc ggagctgctc cacggggccc tgctgaggag gggcccagag atgggctacc 301 tgcccggatc tgatccagac cccacactag ccacccctcc agccggccag actcttgccg 361 caccctcgct gccacgggcc actgagccag gcacagggcc tctgactaca gccgtaaccc 421 ctaagggggg caggggagca ggccccaccg cgccggagct gctgaccccg cccccaggaa 481 ctacggcccc gccccttccc gggcccgcct ccccaggacc gcccctcggg cccgggcccg 541 agggaggaga ggaggagacc acgaccacca tcatcaccac gacgactgtc accaccacgg 601 tgaccagccc agttctgtgt aataacaaca tctccgaggg cgaagggcat gtggagtctc 661 cagatttagg gagcccagcc agccgcacct tggggctcct ggactgcacg tacagcatcc 721 atgtctaccc cggctacggc attgagatcc aggtgcagac gctgaacctg tctcgggagg 781 aggaactcct ggtgctggct ggtggggggt ccccaggcct agccccccga ctcctggcta 841 actcctccat gctgggagaa ggacaggtcc ttcggagtcc tacgaaccgt ctgctcctgc 901 acttccagag cccacgggtc ccaaggggcg gtggcttcag gatccactat caggcctacc 961 tcttgagctg tggcttccct ccccggccag cccatgggga tgtgagtgtg acagaccttc 1021 accctggagg cactgccacc ttccactgtg attcgggcta ccagctgcag ggtgaggaga 1081 cccttatctg cctcaatggc acccggccag cctggagcag tgaacccccc agctgcatgg 1141 cttcctgcgg tggcaccatc cacaacgcta cacttggccg catcgtgtct ccggagcctg 1201 ggggagctgc aggccccaac ctcacctgcc gttgggtcat tgaagcagca gagggacgcc 1261 ggctacacct gcacttcgaa agggtctcgc tggatgagga caatgaccgg ctgatggtgc 1321 gctcaggggg cagtcctcta tccccagtga tctatgactc ggacatggac gatgtcccag 1381 aacgaggtct catcagtgat gcccagtccc tctatgtgga gctgctttca gagacacctg 1441 ccaatcccct gctgctaagc ctccgatttg aagcttttga ggaagatcgc tgcttcgccc 1501 ccttcctggc acatggcaat gtcaccacca cggaccctga gtaccgccca ggaacactgg 1561 ccaccttctc ctgcctccca ggatatgccc tagagccccc tggccccccc agtgccatcg 1621 aatgtgtgga tcccacagaa ccccactgga atgacacaga gccagcctgc aaggccatgt 1681 gtggagggga gctgtcagag ccggccggcg tggtcctctc tcccgactgg ccccagagct 1741 atagccctgg ccaagactgt gtgtggggtc tgcacgtcca ggaagagaag cgcatcttgc 1801 tccaagttga gatcttgaat gtacgcgaag gggatatgct gacgttgttc gacggggacg 1861 gtcccagtgc ccgagtcctg gcccaactgc ggggacctca gccgcgccgc cgcctccttt 1921 cctctgggcc ggacctcacg ctgcagttcc aggcaccgcc cggaccccca aacccgggcc 1981 tgggccaggg tttcgtgttg cacttcaaag aggtcccgag gaacgacacg tgccccgagc 2041 tgccgcctcc ggagtggggc tggaggacag cttcccatgg ggacctgatc cggggcacgg 2101 tgcttaccta ccagtgcgag ccaggctacg agctgctcgg ctccgacatt ctcacctgcc 2161 agtgggacct gtcctggagc gctgcgcctc ccgcctgcca aaagatcatg acttgtgctg 2221 accctgggga gatcaccaac ggacaccgta ccacctcgga tgctggcttc cctgttggct 2281 ctcacgtcca gtaccgctgt ctgcccgggt acagcctgga gggggcggcc gtactcacct 2341 gctacagccg ggacacgggc acgcccaagt ggagcgaccg ggtccccaag tgcgccttga 2401 agtacgagcc gtgcctgaac cccggggtcc cagagaatgg ctatcagaca ttgtacaagc 2461 atcactacca ggcgggcgag tctctgcgct tcttctgcta tgagggcttt gagctcattg 2521 gcgaggtcac catcacctgt gtgcccggcc acccctcgca gtggaccagc cagcccccac 2581 tctgcaaagt ggcctatgag gagctcctgg acaaccgaaa actggaagtg acccagacca 2641 ccgacccgtc gcggcagctg gaagggggca acctcgcctt ggccattctg ctgcccctgg 2701 gcttggtcat cgtcctgggc agtggcgttt acatttacta caccaagcta cagggaaaat 2761 ccctcttcgg cttctccggt tcccattcct acagccctat cacggtggag tcagacttca 2821 gcaatccgct gtatgaagct ggggatacac gggagtatga agtttccatc tgaaccccaa 2881 gactaaggct gcaggaccca ggacgcccct cctctcctca ttctgagcag ggaggagtag 2941 gacctggtct ctggctccct tcctccctgc tgtgtaaata gtctcctggt cccacagggg 3001 ggctttgatg gccctgggga ccctaccata aataaaccag caacctgtgc ccaaagcctc 3061 ctcttctcag ttgccaaaca aggggcctgc ccgccccccc aacctgcttt tggaccctgg 3121 gaggggaact cagcccccgc tgtaactgct gggcccctcc agcccagggc tttctctaag 3181 gactgctcca gagcttggct gttccctggg ccagcaaggc tctgccctgc ccggatgctc 3241 tgcccagggc ctaacctctg acccttggag ggtcagaaga agagggatga atgggagagc 3301 tgggacaagg cccctgcccc cttcctgcct tctcctcgac ccgcaatctc cccatctttg 3361 cttctggatt tttgtttttg agcaataaac agaaaatcac caaaaaaaa //