LOCUS AY061361 3981 bp mRNA linear INV 08-NOV-2001 DEFINITION Drosophila melanogaster LD28662 full length cDNA. ACCESSION AY061361 VERSION AY061361.1 KEYWORDS FLI_CDNA. SOURCE Drosophila melanogaster (fruit fly) ORGANISM Drosophila melanogaster Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. REFERENCE 1 (bases 1 to 3981) AUTHORS Stapleton,M., Brokstein,P., Hong,L., Agbayani,A., Carlson,J., Champe,M., Chavez,C., Dorsett,V., Farfan,D., Frise,E., George,R., Gonzalez,M., Guarin,H., Li,P., Liao,G., Miranda,A., Mungall,C.J., Nunoo,J., Pacleb,J., Paragas,V., Park,S., Phouanenavong,S., Wan,K., Yu,C., Lewis,S.E., Rubin,G.M. and Celniker,S. TITLE Direct Submission JOURNAL Submitted (30-OCT-2001) Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA COMMENT Sequence submitted by: Berkeley Drosophila Genome Project Lawrence Berkeley National Laboratory Berkeley, CA 94720 This clone was sequenced as part of a high-throughput process to sequence clones from Drosophila Gene Collection 1 (Rubin et al., Science 2000). The sequence has been subjected to integrity checks for sequence accuracy, presence of a polyA tail and contiguity within 100 kb in the genome. Thus we believe the sequence to reflect accurately this particular cDNA clone. However, there are artifacts associated with the generation of cDNA clones that may have not been detected in our initial analyses such as internal priming, priming from contaminating genomic DNA, retained introns due to reverse transcription of unspliced precursor RNAs, and reverse transcriptase errors that result in single base changes. For further information about this sequence, including its location and relationship to other sequences, please visit our Web site (http://fruitfly.berkeley.edu) or send email to cdna@fruitfly.berkeley.edu. FEATURES Location/Qualifiers source 1..3981 /organism="Drosophila melanogaster" /mol_type="mRNA" /strain="y; cn bw sp" /db_xref="taxon:7227" /map="78A4-78A5" gene 1..3981 /gene="pap" /note="alignment with genomic scaffold AE003593" /db_xref="FLYBASE:FBgn0024200" CDS 372..3194 /gene="pap" /note="Longest ORF" /codon_start=1 /product="LD28662p" /protein_id="AAL28909.1" /db_xref="FLYBASE:FBgn0024200" /translation="MSRQRRIFGNNGAPTASLASIANVLEFMDAHDVISLALEQSRLA FENQRMDNMMDFHGNGSSSSHQQQQLTAFHAPPPALRHKLAGIGAGRLTVHKWPYLPV GFTRSNKEIVRTMNAIQPMLQNAFHCKSRGGSGSKDASSYNTVSGPLTWRQFHRLAGR ASGQCEPQPIPSVVVGYEKDWISVAPHSIHYWDKFLLEPYSYARDVVYVVVCPDNEHV VNCTRSYFRELSSTYEMCKLGKHTPIRGWDGFLQVGAARNNVPADRETTPLDDWLRTL EHAALAEQIRRYAVAFIHQLAPYLSRVPNDKTLLNPPDGSGNSHSKGGSSCSSNSSSV SGLPGGDLPTDNIKLEPGTEPQVQPMETNEIKQEPGVGKGGTAAGETKPTLILGDPLG MGETLEDINPSAIVLYVVNPFTFASDSCELERLALIALLRCYAELLKAVPDSVRSQMN IQIISLESVMELGPCGNRKRFSDEIRCLALNIFSQCRRHLVHAQSVKSLTGFGTAANM EAFLKTKDEPNRRAYKMYTAPFVLAPMHERNDKTDFSRSAGSMHGQNEHRYSVMYCNY CLSEDQAWLLATATDERGEMLEKICINIDVPNRARRRKAPARYVALKKLMDFIMGIIS QTSQMWRLVIGRIGRIGHSELKSWSFLLSKQQLQKASKQFKDMCKQCTLMYPPTILSA CLVTLEPDAKLRVMPDQFTPDERFSQISMQNPLATPQDVTCTHILVFPTSAVCAPFTR QFQNEPQVDDDFLTFEEEGNEDFSDADIGDLFWDTHMDRVSNHGSPGRMDDNRSWQSA GGNNFKCTPPQEVEEVGSLNQQPISVGYMVSTAPTGRMPAWFWSACPHLEDVCPVFLK TALHLHVPSIQSADDILNSTNAHQSGNDHPLDSNLTADVLRFVLEGYNALSWLALDSN THDRLSCLPINVQTLMDLYYLTAAIA" BASE COUNT 1137 a 953 c 976 g 915 t ORIGIN 1 ccgggtatgt ggacgatgat cccgtggagt gtacgtgtgg tttcagtgca gtggttaatc 61 gaagactctc tcatcgcgct ggactctttt acgaagatga ggtggagatc acgggcatcg 121 cggatgatcc gggcaggaat aagcagccca cgttgctcag tatcatccag agtcttagca 181 ggaaaaacca aaacaagcag ggacctggag aaacaagctc tgcattggat aaaattggag 241 cgggaggatt gcctaatgga caactggagc aattgggtca tgctgtattt gacctgctat 301 tggatcagtg ctctatcatc caaacatcta gcagttcggt gcacagagca cttcagtctc 361 atcggaggcg aatgtcccgc caacgaagga tcttcggtaa caatggagct cccactgcct 421 cattggcatc aattgcaaat gttctagagt ttatggatgc acacgacgtt ataagtttgg 481 ccctggagca gtcgagattg gcattcgaaa accagcggat ggacaacatg atggacttcc 541 atgggaatgg tagcagtagc tcgcatcagc agcagcaact gacagctttc catgcgccac 601 cacctgcttt gcgtcacaag ctagcaggta ttggtgctgg gcggttgacg gttcacaagt 661 ggccatatct tcccgtcggt ttcacccgta gcaataagga gattgtacgc accatgaatg 721 ctatacagcc gatgctgcag aatgcattcc attgcaaatc caggggcggc tcaggttcca 781 aggatgccag ttcgtacaat acggtgagtg gaccgctaac ctggcgccaa ttccatcgac 841 tggcaggccg tgcatccgga caatgtgagc cgcaaccgat accatccgtg gtggtgggct 901 acgagaagga ttggatctcg gtggcaccgc actccatcca ctattgggat aagtttctcc 961 tggaaccgta ctcctatgcc agggatgttg tatacgtggt agtgtgtccg gacaacgagc 1021 atgtggtaaa ttgtactcgc agctatttcc gggaactaag cagcacctac gagatgtgca 1081 agctgggcaa acacacaccc atccgaggat gggatggttt ccttcaagta ggcgctgctc 1141 gcaacaacgt gcctgcggat cgggaaacaa ctcccttgga tgattggctg cgtaccctgg 1201 aacatgctgc tctggcggag caaatccgtc ggtatgcagt tgcctttatt caccagttgg 1261 ctccatacct aagtcgagtg ccaaatgata aaacgctgtt gaatccacca gatggcagtg 1321 gcaactcaca ctctaaagga ggaagctctt gctcctcgaa tagctcttca gtcagcggat 1381 tgccaggcgg ggacttaccc acggataata taaagctgga gccaggtaca gaaccacaag 1441 tccaaccgat ggagactaac gaaataaaac aggaacccgg agtgggaaaa ggaggcactg 1501 cagcaggtga aaccaaacca actctgatcc tcggagatcc gttgggaatg ggcgagactt 1561 tggaggacat caacccgtca gccattgtcc tttatgtggt caatccgttc actttcgcct 1621 cggatagctg cgagctggag cgtctagctc tgattgctct acttcgttgc tatgcggaac 1681 tgctgaaagc agtcccagat tcagtgcgat cccagatgaa catacagatt atatcgctag 1741 aatcggtaat ggaactgggt ccgtgcggca atcgaaagcg tttctcggac gagattaggt 1801 gcctggctct aaacatattt tcgcagtgcc ggcgacatct ggtgcatgcc caatccgtta 1861 aaagtcttac tggctttggt acggcggcta atatggaggc gtttctcaag acaaaggatg 1921 aacccaatcg ccgggcctac aagatgtaca cggctccctt tgttctggca ccaatgcacg 1981 agaggaatga caaaacggac ttctccagat cagcgggaag tatgcatggc cagaatgaac 2041 atcgttattc ggttatgtat tgcaattact gcctaagtga ggatcaggct tggcttttgg 2101 ccaccgccac tgatgaaagg ggcgagatgc tggagaagat ctgcattaat attgatgtgc 2161 caaatcgtgc tcggaggaga aaagcccctg ctcgctatgt ggccctaaag aaactgatgg 2221 acttcatcat gggaatcatc tcgcagacat cgcaaatgtg gcgtttggtt atcggacgca 2281 ttggaaggat cggacacagt gaactgaagt cgtggagttt tttgctcagc aagcaacagc 2341 tgcaaaaggc atccaagcaa tttaaggata tgtgtaagca atgtacattg atgtatccac 2401 ccacgatcct gagtgcttgc cttgtgacct tggagccgga tgccaagctg cgcgtgatgc 2461 ccgaccagtt tacgccagat gagcgcttct cacagatttc catgcagaac ccgttggcca 2521 ctccacagga cgtgacctgc acccacatcc tggtctttcc cactagcgcc gtctgtgcgc 2581 cttttacgcg ccagttccaa aacgagccgc aagtagatga cgactttcta acattcgagg 2641 aagagggcaa cgaggacttc agcgatgcgg acattggaga tctcttctgg gacactcata 2701 tggacagagt atccaatcat ggtagtcccg gtcgcatgga tgacaatcgg agttggcaga 2761 gcgctggcgg taataatttc aagtgcacgc cgccccagga agtagaggag gttggttcac 2821 tcaaccagca gcccatatca gtgggttaca tggtgtccac ggcgccaacc ggtcgcatgc 2881 ctgcttggtt ctggtccgcc tgtccgcatc tagaagacgt ttgtccggtg tttctgaaga 2941 cagccctcca tctccatgtg cccagcatac agtctgccga cgatattctc aactcgacca 3001 atgcccatca gtcaggcaac gatcatccgc ttgactccaa tctcacagcg gatgtgctgc 3061 gcttcgtcct cgagggatac aatgcccttt cctggctggc gctcgactcc aacacacacg 3121 atcgtctctc ctgtctgccc atcaatgtcc agacgctaat ggatctgtac tatctgacgg 3181 ctgcaatagc ctaaggtccg ggatctcaga tcctttggat catctaaaaa actaaatcaa 3241 tcaaacgcag tcgcatgctt ccgtttgcag aacttccact agctaactag ctatatcaca 3301 tcttcaaaga acaacagaat acagaaccga aaggatctga agctgaatca acaccaagta 3361 accctgaata gaagttaaca gaatcgaaaa acaacaacag aacaaccaga gcaatagcag 3421 cagcagttaa cgcaacgcac agattactta gcataatcaa caggaaaact aaacataaac 3481 cgatgaaccg atgaaagcgg cggataaaac attatcgtct accgataagt cttgaacagc 3541 aacagcatta ctatttatta gtcgtcgaga atttgtaaaa gtgtcaagtg caagaattgt 3601 ttgtttaatt ttatttttat atatatacat aggctggcgt tcgattagtg tttaatattt 3661 ataaatatac aaaggggagc gaatgagata gcaaaaagca aacaacaaca aacaaccaca 3721 cacatataag catatagttc atatatatac aatagcggag tggagttaac gcgagaaaca 3781 tattagcata agtaaaaaag tttgtacata tttacaacca attcgccgaa agcagcaaac 3841 cagcagcaac aacaaagcaa aacaaaagtg taattataca attagaaatt agtttacaac 3901 aaattggagg agcagaagag gagaccctat ttaaatctaa taaattaaat caataaattg 3961 tgtaaaaaaa aaaaaaaaaa a //