LOCUS BC065502 2709 bp mRNA linear HUM 12-FEB-2004 DEFINITION Homo sapiens WD repeat and FYVE domain containing 3, mRNA (cDNA clone IMAGE:4418749), partial cds. ACCESSION BC065502 VERSION BC065502.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2709) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2709) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (26-JAN-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 134 Row: i Column: 4 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis. FEATURES Location/Qualifiers source 1..2709 /db_xref="H-InvDB:HIT000261910" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4418749" /tissue_type="Duodenum, adenocarcinoma" /clone_lib="NIH_MGC_88" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..2709 /gene="WDFY3" /gene_synonym="ALFY" /gene_synonym="KIAA0993" /gene_synonym="MGC16461" /gene_synonym="ZFYVE25" /db_xref="GeneID:23001" CDS <1..2339 /gene="WDFY3" /gene_synonym="ALFY" /gene_synonym="KIAA0993" /gene_synonym="MGC16461" /gene_synonym="ZFYVE25" /codon_start=3 /product="WDFY3 protein" /protein_id="AAH65502.1" /db_xref="GeneID:23001" /translation="FRNLAKPMGAQTDERLAQYKKRYKDWEDPNGETPAYHYGTHYSS AMIVASYLVRMEPFTQIFLRLQGGHFDLADRMFHSVREAWYSASKHNMADVKELIPEF FYLPEFLFNSNNFDLGCKQNGTKLGDVILPPWAKGDPREFIRVHREALECDYVSAHLH EWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQVDIYNINDPLKETATIGFINNFGQIPK QLFKKPHPPKRVRSRLNGDNAGISVLPGSTSDKIFFHHLDNLRPSLTPVKELKEPVGQ IVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCRLGTYESDKAMTVYECLSEWGQI LCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTVTLKQALLGHTDTVTCATASLAYH IIVSGSRDRTCIIWDLNKLSFLTQLRGHRAPVSALCINELTGDIVSCAGTYIHVWSIN GNPIVSVNTFTGRSQQIICCCMSEMNEWDTQNVIVTGHSDGVVRFWRMEFLQVPETPA PEPAEVLEMQEDCPEAQIGQEAQDEDSSDSEADEQSISQDPKDTPSQPSSTSHRPRAA SCRATAAWCTDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHLQGPLSHPHPNPI EVRNYSRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPAEVTALGISKDHSRILVGDS RGRVFSWSVSDQPGRSAADHWVKDEGGDSCSGCSVRFSLTERRHHCRNCGQLFCQKCS RFQSEIKRLKISSPVRVCQNCYYNLQHERGSEDGPRNC" misc_feature 3..686 /gene="WDFY3" /gene_synonym="ALFY" /gene_synonym="KIAA0993" /gene_synonym="MGC16461" /gene_synonym="ZFYVE25" /note="Beach; Region: Beige/BEACH domain" /db_xref="CDD:pfam02138" misc_feature 894..1499 /gene="WDFY3" /gene_synonym="ALFY" /gene_synonym="KIAA0993" /gene_synonym="MGC16461" /gene_synonym="ZFYVE25" /note="WD40; Region: WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly" /db_xref="CDD:cd00200" misc_feature 2100..2303 /gene="WDFY3" /gene_synonym="ALFY" /gene_synonym="KIAA0993" /gene_synonym="MGC16461" /gene_synonym="ZFYVE25" /note="FYVE; Region: Protein present in Fab1, YOTB, Vac1, and EEA1" /db_xref="CDD:smart00064" BASE COUNT 767 a 647 c 652 g 643 t ORIGIN 1 cgtttagaaa cctggctaag ccaatgggag cacaaacaga tgaacgatta gctcagtata 61 agaagcggta taaagactgg gaggatccta atggagaaac tcctgcatac cactatggga 121 cccactattc atctgcaatg attgtggcct cataccttgt aaggatggag cctttcacac 181 agatattctt aaggctacag ggtggccact ttgacctggc tgaccggatg tttcacagtg 241 tgcgcgaggc ctggtattca gcgtcaaagc acaatatggc agatgtaaaa gaacttatcc 301 cagagttctt ttatttacca gaattcctgt tcaattccaa caactttgat ctaggctgta 361 aacaaaatgg caccaagctt ggagatgtta tccttccacc ctgggcaaaa ggggacccac 421 gagaattcat cagagtccat cgtgaggctt tggagtgtga ttacgtgagt gcccatctac 481 atgagtggat tgacttaatc ttcggttata aacagcaagg ccctgctgca gtagaagctg 541 taaatgtctt ccatcatctt ttttatgagg gtcaagtgga tatctacaac atcaatgacc 601 cactaaagga gacagccaca attgggttca ttaataactt cggtcagatc cctaaacagt 661 tatttaaaaa acctcatcca ccaaagcgag tgagaagtcg actcaatgga gacaatgcag 721 gaatctctgt cctaccagga tctacaagtg acaagatctt ttttcatcat ctagacaact 781 tgaggccttc tctaacacct gtaaaagaac tcaaagaacc tgtaggacaa atcgtatgta 841 cagataaagg tattcttgcg gtggaacaga ataaggttct tatcccacca acctggaata 901 aaacttttgc ttggggctat gcagacctca gttgcagact gggaacctat gagtcagaca 961 aggccatgac tgtttatgaa tgcttgtctg agtggggcca gattctctgt gcaatctgcc 1021 ccaaccccaa gctggtcatc acgggtggaa caagcacggt tgtgtgtgtg tgggagatgg 1081 gcacctccaa agaaaaggcc aagaccgtca ccctcaaaca ggccttactg ggccacactg 1141 ataccgtcac ctgcgccaca gcatcattag cctatcacat aattgtcagt gggtcccgtg 1201 atcgaacctg tatcatttgg gatttgaaca aactgtcatt tctaacccag cttcgagggc 1261 atcgagctcc agtttctgct ctttgtatca atgaattaac aggggacatt gtgtcctgcg 1321 ctggcacata tatccatgtg tggagcatca atgggaaccc tatcgtgagt gtcaacacgt 1381 tcacaggtag gagccagcag atcatctgct gctgcatgtc ggagatgaac gaatgggaca 1441 cgcagaacgt catagtgaca ggacactcag atggagtggt tcggttttgg agaatggaat 1501 ttttgcaagt tcctgaaaca ccagctcctg agcctgctga agtcctagaa atgcaggaag 1561 actgtccaga agcacaaata gggcaggaag cccaagacga ggacagcagt gattcagaag 1621 cagatgagca gagcatcagc caggacccta aggacactcc aagccaaccc agcagcacca 1681 gccacaggcc ccgggcagcc tcctgccgcg caacagccgc ctggtgtact gacagtggct 1741 ctgacgactc cagacgctgg tccgaccagc tcagtctaga tgagaaagac ggcttcatat 1801 ttgtgaacta ttcagagggc cagaccagag cccatctgca gggccccctt agccaccccc 1861 accccaatcc cattgaggtg cggaattaca gcagattgaa acctgggtac cgatgggaac 1921 ggcagctggt gttcaggagt aagctgacta tgcacacagc ctttgatcga aaggacaatg 1981 cacacccagc tgaggtcact gccttgggca tctccaagga tcacagtagg atcctcgttg 2041 gtgacagtcg aggccgagtt ttcagctggt ctgtgagtga ccagccaggc cgttctgctg 2101 ctgatcactg ggtgaaggat gaaggtggtg acagctgctc aggctgctcg gtgaggtttt 2161 cactcacaga aagacgacac cattgcagga actgtggtca gctcttctgc cagaagtgca 2221 gtcgctttca atctgaaatc aaacgcttga aaatctcatc cccggtgcgt gtttgtcaga 2281 actgttatta taacttacag catgagagag gttcagaaga tgggcctcga aattgttgaa 2341 gattcaacaa gctgagtgga gaccatggtc tgtagacccc ttcccgattc tcctgtccca 2401 gcttggaagg cattgaaaac agtctccgtt tacacatctc ttcataccac gtgtttgaag 2461 tgttaaaatt caaagggatc attgaataaa acgggtgtag agtacaggaa tggggcagac 2521 gcgattcagg tgaacagcac aagaagaata tgaggtggtt cctaggagca acactttcga 2581 cctccagttc tccctgatga cagtagctgt ctccaagaga aaaatcctca cttattaact 2641 ctcttttctt gcatctcatt tttatagagc tactcatcct tatttggaaa aaccaaaaaa 2701 aaaaaaaaa //