LOCUS BC065502 2709 bp mRNA linear HUM 12-FEB-2004
DEFINITION Homo sapiens WD repeat and FYVE domain containing 3, mRNA (cDNA
clone IMAGE:4418749), partial cds.
ACCESSION BC065502
VERSION BC065502.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2709)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2709)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (26-JAN-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 134 Row: i Column: 4
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis.
FEATURES Location/Qualifiers
source 1..2709
/db_xref="H-InvDB:HIT000261910"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4418749"
/tissue_type="Duodenum, adenocarcinoma"
/clone_lib="NIH_MGC_88"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..2709
/gene="WDFY3"
/gene_synonym="ALFY"
/gene_synonym="KIAA0993"
/gene_synonym="MGC16461"
/gene_synonym="ZFYVE25"
/db_xref="GeneID:23001"
CDS <1..2339
/gene="WDFY3"
/gene_synonym="ALFY"
/gene_synonym="KIAA0993"
/gene_synonym="MGC16461"
/gene_synonym="ZFYVE25"
/codon_start=3
/product="WDFY3 protein"
/protein_id="AAH65502.1"
/db_xref="GeneID:23001"
/translation="FRNLAKPMGAQTDERLAQYKKRYKDWEDPNGETPAYHYGTHYSS
AMIVASYLVRMEPFTQIFLRLQGGHFDLADRMFHSVREAWYSASKHNMADVKELIPEF
FYLPEFLFNSNNFDLGCKQNGTKLGDVILPPWAKGDPREFIRVHREALECDYVSAHLH
EWIDLIFGYKQQGPAAVEAVNVFHHLFYEGQVDIYNINDPLKETATIGFINNFGQIPK
QLFKKPHPPKRVRSRLNGDNAGISVLPGSTSDKIFFHHLDNLRPSLTPVKELKEPVGQ
IVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSCRLGTYESDKAMTVYECLSEWGQI
LCAICPNPKLVITGGTSTVVCVWEMGTSKEKAKTVTLKQALLGHTDTVTCATASLAYH
IIVSGSRDRTCIIWDLNKLSFLTQLRGHRAPVSALCINELTGDIVSCAGTYIHVWSIN
GNPIVSVNTFTGRSQQIICCCMSEMNEWDTQNVIVTGHSDGVVRFWRMEFLQVPETPA
PEPAEVLEMQEDCPEAQIGQEAQDEDSSDSEADEQSISQDPKDTPSQPSSTSHRPRAA
SCRATAAWCTDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHLQGPLSHPHPNPI
EVRNYSRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPAEVTALGISKDHSRILVGDS
RGRVFSWSVSDQPGRSAADHWVKDEGGDSCSGCSVRFSLTERRHHCRNCGQLFCQKCS
RFQSEIKRLKISSPVRVCQNCYYNLQHERGSEDGPRNC"
misc_feature 3..686
/gene="WDFY3"
/gene_synonym="ALFY"
/gene_synonym="KIAA0993"
/gene_synonym="MGC16461"
/gene_synonym="ZFYVE25"
/note="Beach; Region: Beige/BEACH domain"
/db_xref="CDD:pfam02138"
misc_feature 894..1499
/gene="WDFY3"
/gene_synonym="ALFY"
/gene_synonym="KIAA0993"
/gene_synonym="MGC16461"
/gene_synonym="ZFYVE25"
/note="WD40; Region: WD40 domain, found in a number of
eukaryotic proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly"
/db_xref="CDD:cd00200"
misc_feature 2100..2303
/gene="WDFY3"
/gene_synonym="ALFY"
/gene_synonym="KIAA0993"
/gene_synonym="MGC16461"
/gene_synonym="ZFYVE25"
/note="FYVE; Region: Protein present in Fab1, YOTB, Vac1,
and EEA1"
/db_xref="CDD:smart00064"
BASE COUNT 767 a 647 c 652 g 643 t
ORIGIN
1 cgtttagaaa cctggctaag ccaatgggag cacaaacaga tgaacgatta gctcagtata
61 agaagcggta taaagactgg gaggatccta atggagaaac tcctgcatac cactatggga
121 cccactattc atctgcaatg attgtggcct cataccttgt aaggatggag cctttcacac
181 agatattctt aaggctacag ggtggccact ttgacctggc tgaccggatg tttcacagtg
241 tgcgcgaggc ctggtattca gcgtcaaagc acaatatggc agatgtaaaa gaacttatcc
301 cagagttctt ttatttacca gaattcctgt tcaattccaa caactttgat ctaggctgta
361 aacaaaatgg caccaagctt ggagatgtta tccttccacc ctgggcaaaa ggggacccac
421 gagaattcat cagagtccat cgtgaggctt tggagtgtga ttacgtgagt gcccatctac
481 atgagtggat tgacttaatc ttcggttata aacagcaagg ccctgctgca gtagaagctg
541 taaatgtctt ccatcatctt ttttatgagg gtcaagtgga tatctacaac atcaatgacc
601 cactaaagga gacagccaca attgggttca ttaataactt cggtcagatc cctaaacagt
661 tatttaaaaa acctcatcca ccaaagcgag tgagaagtcg actcaatgga gacaatgcag
721 gaatctctgt cctaccagga tctacaagtg acaagatctt ttttcatcat ctagacaact
781 tgaggccttc tctaacacct gtaaaagaac tcaaagaacc tgtaggacaa atcgtatgta
841 cagataaagg tattcttgcg gtggaacaga ataaggttct tatcccacca acctggaata
901 aaacttttgc ttggggctat gcagacctca gttgcagact gggaacctat gagtcagaca
961 aggccatgac tgtttatgaa tgcttgtctg agtggggcca gattctctgt gcaatctgcc
1021 ccaaccccaa gctggtcatc acgggtggaa caagcacggt tgtgtgtgtg tgggagatgg
1081 gcacctccaa agaaaaggcc aagaccgtca ccctcaaaca ggccttactg ggccacactg
1141 ataccgtcac ctgcgccaca gcatcattag cctatcacat aattgtcagt gggtcccgtg
1201 atcgaacctg tatcatttgg gatttgaaca aactgtcatt tctaacccag cttcgagggc
1261 atcgagctcc agtttctgct ctttgtatca atgaattaac aggggacatt gtgtcctgcg
1321 ctggcacata tatccatgtg tggagcatca atgggaaccc tatcgtgagt gtcaacacgt
1381 tcacaggtag gagccagcag atcatctgct gctgcatgtc ggagatgaac gaatgggaca
1441 cgcagaacgt catagtgaca ggacactcag atggagtggt tcggttttgg agaatggaat
1501 ttttgcaagt tcctgaaaca ccagctcctg agcctgctga agtcctagaa atgcaggaag
1561 actgtccaga agcacaaata gggcaggaag cccaagacga ggacagcagt gattcagaag
1621 cagatgagca gagcatcagc caggacccta aggacactcc aagccaaccc agcagcacca
1681 gccacaggcc ccgggcagcc tcctgccgcg caacagccgc ctggtgtact gacagtggct
1741 ctgacgactc cagacgctgg tccgaccagc tcagtctaga tgagaaagac ggcttcatat
1801 ttgtgaacta ttcagagggc cagaccagag cccatctgca gggccccctt agccaccccc
1861 accccaatcc cattgaggtg cggaattaca gcagattgaa acctgggtac cgatgggaac
1921 ggcagctggt gttcaggagt aagctgacta tgcacacagc ctttgatcga aaggacaatg
1981 cacacccagc tgaggtcact gccttgggca tctccaagga tcacagtagg atcctcgttg
2041 gtgacagtcg aggccgagtt ttcagctggt ctgtgagtga ccagccaggc cgttctgctg
2101 ctgatcactg ggtgaaggat gaaggtggtg acagctgctc aggctgctcg gtgaggtttt
2161 cactcacaga aagacgacac cattgcagga actgtggtca gctcttctgc cagaagtgca
2221 gtcgctttca atctgaaatc aaacgcttga aaatctcatc cccggtgcgt gtttgtcaga
2281 actgttatta taacttacag catgagagag gttcagaaga tgggcctcga aattgttgaa
2341 gattcaacaa gctgagtgga gaccatggtc tgtagacccc ttcccgattc tcctgtccca
2401 gcttggaagg cattgaaaac agtctccgtt tacacatctc ttcataccac gtgtttgaag
2461 tgttaaaatt caaagggatc attgaataaa acgggtgtag agtacaggaa tggggcagac
2521 gcgattcagg tgaacagcac aagaagaata tgaggtggtt cctaggagca acactttcga
2581 cctccagttc tccctgatga cagtagctgt ctccaagaga aaaatcctca cttattaact
2641 ctcttttctt gcatctcatt tttatagagc tactcatcct tatttggaaa aaccaaaaaa
2701 aaaaaaaaa
//