LOCUS JV044285 3464 bp mRNA linear TSA 04-NOV-2014 DEFINITION TSA: Macaca mulatta Mamu_415346 mRNA sequence. ACCESSION JV044285 VERSION JV044285.1 DBLINK BioProject: PRJNA77627 Sequence Read Archive: SRR331898 KEYWORDS TSA; Transcriptome Shotgun Assembly. SOURCE Macaca mulatta (Rhesus monkey) ORGANISM Macaca mulatta Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. REFERENCE 1 (bases 1 to 3464) AUTHORS Zimin,A.V., Cornish,A.S., Maudhoo,M.D., Gibbs,R.M., Zhang,X., Pandey,S., Meehan,D.T., Wipfler,K., Bosinger,S.E., Johnson,Z.P., Tharp,G.K., Marcais,G., Roberts,M., Ferguson,B., Fox,H.S., Treangen,T., Salzberg,S.L., Yorke,J.A. and Norgren,R.B. Jr. TITLE A new rhesus macaque assembly and annotation for next-generation sequencing analyses JOURNAL Biol. Direct 9 (1), 20 (2014) PUBMED 25319552 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 3464) AUTHORS Pandey,S., Maudhoo,M.D., Guda,C., Ferguson,B., Fox,H. and Norgren,R.B. TITLE Direct Submission JOURNAL Submitted (17-APR-2012) Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, 985805 Nebraska Medical Center, Omaha, NE 68198-5805, USA COMMENT All reads were aligned with the human RefSeq mRNA sequences using BLAST. For read lengths of 76 bp, if the alignment length was 70 or less, the read was filtered from the input file. For read lengths of 100 bp, if the alignment length was 90 or less, the read was filtered from the input file. For paired end sequences, if one sequence was filtered, its pair was also filtered. Velveth was performed as recommended. kmer was set at 29 for single end sequences and 31 for paired end sequences. The output files from Velveth were passed to Velvetg. After performing Velvetg, this output file was passed to Oases where the final assembly was performed. Default parameters were used for Velvetg and Oases. ##Assembly-Data-START## Assembly Method :: Velvet v.1.1.05; Oases v.0.1.22 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..3464 /organism="Macaca mulatta" /mol_type="mRNA" /isolation_source="001T-NHP" /db_xref="taxon:9544" /tissue_type="testis" /geo_loc_name="China" gene <1..>3464 /gene="FAM115A" CDS 142..2907 /gene="FAM115A" /codon_start=1 /product="hypothetical protein LOC9747 isoform 1" /protein_id="AFI34356.1" /translation="MATPSAAFEALMNGVTSWDVPEDAVPCELLLIGEASFPVMVNDM GQVLIAASSYGRGRLVVMSHEDYLVEAQLTPFLLNAVGWLCSSPGAPIGVHPSLAPLA KILEGSGMDAKIEPEVKDSLGVYCIDAYNETMTEKLVKFMKRGGGLLIGGQAWDWANQ GDDERVLFTFPGNLVTSVAGIYFTDNKGDTSFFKVSKKMPKIPVLVSCEDDLSEDREE LLHGISELDISNSDCFPSQLLVHGALAFPLGLDSYHGCVIAAARYGRGRVVVTGHKVL FTVGKLGPFLLNAVRWLDGGRRGKIVVQTELRTLSGLLAVGGIDTSIEPNLTSDASVY CFEPVSEVGVKELQEFVAEGGGLFVGAQAWWWAFKNPGVSPLARFPGNLLLNPFGISI TSQSLNPGPFRTPKAGIRTYHFRSTLAEFQVIMGRKRGNVEKGWLAKLGPDGAAFLQI PAEEIPAYMSVHRLLRKLLSRYRLPVATRENPVINDCCRGAMLSLATGLAHSGSDLSL LVPEIEDMYSSPYLRPSESPITVEVNCTNPGTRYCWMSTGLYIPGRQIIEVSLPEAAA SADLKIQIGCHTDDLTRASKLFRGPLVINRCCLDKPTKSITCLWGGLLYIIVPQNSKL GSVPITVKGAVHAPYYKLGETTLEEWKRRIQENPGPWGELATDNIILTVPTANLRTLE NPEPLLRLWDEVMQAVARLGAEPFPLRLPQRIVADVQISVGWMHAGYPIMCHLESVQE LINEKLIRTKGLWGPVHELGRNQQRQEWEFPPHTTEATCNLWCVYVHETVLGIPRSRA NIALWPPVREKRVRIYLSKGPNVKNWNAWTALETYLQLQEAFGWEPFIRLFTEYRNQT NLPTDNVDKMNLWVKMFSHQVQKNLAPFFEAWAWPIQKEVATSLAYLPEWKENIMKLY LLTQMPH" BASE COUNT 852 a 905 c 937 g 770 t ORIGIN 1 gccgctccta gcaggttcct actgccccga acccgcgctg cagggaacag cggggcaaac 61 agtgagtggg gttcagcgta gactctggac caggagaggc ccgcggtgac caaggcctgg 121 gctccggaaa ccaacagagc catggcgact ccttctgctg ccttcgaggc ccttatgaat 181 ggcgtgacaa gctgggatgt acccgaagat gccgttccat gtgaactgct tcttattgga 241 gaggcttcat ttcctgtgat ggtgaatgac atgggccagg tcctcattgc tgcctcctcc 301 tatggccgcg gccgcctggt ggtcatgtcc catgaggact acctggtgga agcccaactc 361 acgccctttc tcctgaacgc agtggggtgg ctttgctctt cccctggggc tcccattggt 421 gtacacccat ccctggcacc tttggccaaa atcctcgagg gctctggaat ggatgcaaag 481 attgagccag aagtgaaaga ctccctgggg gtttactgta ttgatgccta caacgaaacc 541 atgacagaaa agctggtcaa gttcatgaaa cgtggtggcg gcttgctcat aggaggacaa 601 gcctgggact gggccaacca gggtgatgat gaaagggttc tcttcacatt ccctgggaac 661 ctcgtgacca gtgtggctgg catttacttt actgacaaca aaggggacac gagtttcttt 721 aaagtctcca agaagatgcc caagatccca gtgttggtta gttgtgaaga tgacctctcc 781 gaggacagag aggagcttct gcatgggatt tcagagctgg acatcagcaa ctcggattgt 841 ttcccatccc agctgctagt gcatggggct ttagcctttc ctctagggtt agattcctac 901 catggctgtg ttatagcggc tgcccgctat ggccggggcc gggtggttgt gactggccat 961 aaggtattat tcactgttgg taaactgggc ccctttctgc tcaatgccgt ccgctggctg 1021 gatgggggcc gcagaggcaa gatcgtggtg cagacagagc tgagaaccct gagtggcctc 1081 ctcgcagtgg ggggcataga caccagcatc gagcccaatc tgaccagtga tgcaagtgtc 1141 tattgctttg aacccgtgag cgaagtgggg gtcaaagaac tgcaggagtt tgtagcagag 1201 ggtggcgggc tgtttgttgg agcccaagcc tggtggtggg ccttcaagaa tcccggagtg 1261 tcccctttgg ctcgattccc aggaaacctc ctcctcaacc cctttggcat cagcattaca 1321 agccaaagcc tcaatccagg gccctttcgt actcctaaag cagggataag gacctatcac 1381 ttccgctcca ccttggccga gttccaggtt ataatgggca ggaagagagg aaatgtggaa 1441 aagggctggt tggcaaagct gggaccagat ggtgcagctt tcctgcagat tcccgccgaa 1501 gagatccctg cctacatgtc tgtgcatcga ctcctgagga agctgctaag tcgataccgg 1561 ctcccagtag caacccgaga gaaccctgtt atcaatgact gctgcagggg tgctatgctt 1621 tccctggcca cagggctggc ccactccgga agtgacctct ctctgttagt cccagaaatt 1681 gaagatatgt acagcagccc ctatctgcgc ccctcagaat ctcctatcac cgtcgaggtc 1741 aactgcacca atccaggcac cagatattgc tggatgagta ctgggctcta catacctgga 1801 aggcaaatta tagaagtctc actgcctgaa gctgctgcct ctgctgacct gaagatacag 1861 atcggctgcc acacagatga cctgaccagg gccagcaagc ttttccgagg cccactcgta 1921 attaaccggt gctgcttgga caaacccaca aaatcgatca cgtgcctctg gggtggactc 1981 ctctatataa ttgtgcctca gaacagcaaa ctgggttctg tgcccatcac cgtgaagggg 2041 gctgtgcatg ctccatacta caagctgggg gagaccaccc tggaggagtg gaagaggcgt 2101 atccaggaga atccagggcc ctggggagag ctggccacgg acaacatcat tctgaccgtg 2161 ccgaccgcaa atcttcgtac tctggagaac cctgagccgc tgctccgcct ctgggacgag 2221 gtgatgcagg ctgtggcgcg actgggagct gagcccttcc ctctgcgcct gcctcagagg 2281 attgttgccg acgtgcagat ctcagtgggc tggatgcatg cagggtaccc catcatgtgc 2341 catctggagt cagtgcagga gctcatcaac gagaagctca tcagaaccaa ggggctgtgg 2401 ggccccgtcc acgagctggg ccgcaaccag cagcggcagg agtgggagtt cccaccacac 2461 accaccgagg ccacctgcaa cctgtggtgt gtgtatgtgc atgagacggt cttgggcatt 2521 cctcgaagcc gtgccaatat tgctctgtgg cccccagttc gggagaaaag agtcagaatc 2581 tacctgagca agggtcccaa tgtgaaaaac tggaatgcat ggaccgcact ggaaacgtat 2641 ttacagctac aggaagcctt tggttgggag ccattcatcc gtctcttcac cgagtacagg 2701 aaccagacca acttacccac agataatgtt gacaaaatga atctgtgggt caagatgttc 2761 tcccaccaag tgcagaagaa cctggctccg ttctttgagg cctgggcctg gcccatccag 2821 aaggaagtgg ctaccagcct ggcctatctg cctgaatgga aggaaaatat tatgaaattg 2881 tacctcctca cacagatgcc ccactgaaat tgaagtcaag aaatgcaaaa aggaattgac 2941 acatctcagc aaagaaaact gaagggattt ggttataagt ggagaagatc tcagcacatt 3001 tctggaagag agaaagtgga tgaaacatga taatgaaaga gtgaagaacc tttcagataa 3061 aatataagct gatctgaaca acataacccc aaagagactt gagcacctga aaagtttgtc 3121 tactgaagaa ttactccggt tgtaaaattg agccctctcc tactcttccc caactcttat 3181 gcacagaaca caggcagtct ccactattga tacccgttaa aaatgtcttg ttattgttgc 3241 tgttgtttgt ataaaggaat tgaacagact gcctgcagag acataagagt tggtgctcca 3301 gaaaaaaatt tctcttctag tggaaataag tgcccccagc cagtagctga ttctctactg 3361 acagctgaga cctcctactc aagtgcctgt tgccttcagg tatagaagag atttgctgaa 3421 gaaacagacc taaccgtaca acagcagagg aaacccatgc tgac //