LOCUS BC136602 4008 bp mRNA linear HUM 11-FEB-2009 DEFINITION Homo sapiens zinc finger protein 804A, mRNA (cDNA clone MGC:168215 IMAGE:9020592), complete cds. ACCESSION BC136602 VERSION BC136602.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4008) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4008) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-MAR-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Ambion cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: LLDM Plate: 670 Row: o Column: 22 single stranded. FEATURES Location/Qualifiers source 1..4008 /db_xref="H-InvDB:HIT000500612" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:168215 IMAGE:9020592" /tissue_type="Testicle, PCR rescued clones" /clone_lib="NIH_MGC_382" /lab_host="DH10B" /note="Vector: pCR4-TOPO with reversed insert; Clone identification sequence tag: CCGATCCT" gene 1..4008 /gene="ZNF804A" /db_xref="GeneID:91752" /db_xref="HGNC:HGNC:21711" /db_xref="MIM:612282" CDS 250..3882 /gene="ZNF804A" /codon_start=1 /product="zinc finger protein 804A" /protein_id="AAI36603.1" /db_xref="GeneID:91752" /db_xref="HGNC:HGNC:21711" /db_xref="MIM:612282" /translation="MECYYIVISSTHLSNGHFRNIKGVFRGPLSKNGNKTLDYAEKEN TIAKALEDLKANFYCELCDKQYYKHQEFDNHINSYDHAHKQRLKELKQREFARNVASK SRKDERKQEKALQRLHKLAELRKETVCAPGSGPMFKSTTVTVRENCNEISQRVVVDSV NNQQDFKYTLIHSEENTKDATTVAEDPESANNYTAKNNQVGDQAQGIHRHKIGFSFAF PKKASVKLESSAAAFSEYSDDASVGKGFSRKSRFVPSACHLQLSSPTDVLLSSEEKTN SFHPPEAMCRDKETVQTQEIKEVSSEKDALLLPSFCKFQLQLSSDADNCQNSVPLADQ IPLESVVINEDIPVSGNSFELLGNKSTVLDMSNDCISVQATTEENVKHNEASTTEVEN KNGPETLAPSNTEEVNITIHKKTNFCKRQCEPFVPVLNKHRSTVLQWPSEMLVYTTTK PSISYSCNPLCFDFKSTKVNNNLDKNKPDLKDLCSQQKQEDICMGPLSDYKDVSTEGL TDYEIGSSKNKCSQVTPLLADDILSSSCDSGKNKNTGQRYKNISCKIRETEKYNFTKS QIKQDTLDEKYNKIRLKETHEYWFHKSRRKKKRKKLCQHHHMEKTKESETRCKMEAEN SYTENAGKYLLEPISEKQYLAAEQLLDSHQLLDKRPKSESISLSDNEEMCKTWNTEYN TYDTISSKNHCKKNTTILLNGQSNAKMIHSGKHNLTYSRTYCCWKTKMSSCSQDHRSL VLQNDMKRMSQNQAVKRGYNSVMNESERFYRKRRQHSHSYSSDESLNRQNHLPEEFLR PPSTSVAPCKPKKKRRRKRGRFHPGFETLELKENTDYPVKDNSSLNPLDRLISEDKKE KMKPQEVAKIERNSEQTNQLRNKLSFHPNNLLPSETNGETEHLEMETTSGELSDVSND PTTSVCVASAPTKEAIDNTLLEHKERSENINLNEKQIPFQVPNIERNFRQSQPKSYLC HYELAEALPQGKMNETPTEWLRYNSGILNTQPPLPFKEAHVSGHTFVTAEQILAPLAL PEQALLIPLENHDKFKNVPCEVYQHILQPNMLANKVKFTFPPAALPPPSTPLQPLPLQ QSLCSTSVTTIHHTVLQQHAAAAAAAAAAAAAGTFKVLQPHQQFLSQIPALTRTSLPQ LSVGPVGPRLCPGNQPTFVAPPQMPIIPASVLHPSHLAFPSLPHALFPSLLSPHPTVI PLQPLF" BASE COUNT 1359 a 872 c 795 g 982 t ORIGIN 1 cgacgcgaat ctgaggagaa acaggagcga gagactgagg ggagagcgcg gcgagcatgc 61 ggaggcgggg gagcctcggc gctcaccaca gaggggtaca gtgagccagt ctccagagga 121 cgtgccgggg gtggctgcgt gccctcgtgg cgggttccca gcccaccgtc gccggccccg 181 gcgcgctgcg gctgtgggcg cggggtgcgt ggaagcggcg gctgcggcgg aggaggcggc 241 ggctgcccca tggagtgtta ctacattgtc atcagctcca cgcatctcag caacggacac 301 tttcgcaaca tcaagggagt tttccggggc cctctcagca agaacgggaa caaaactctg 361 gactatgctg agaaggaaaa taccatagca aaagctctgg aagatctgaa ggcaaatttt 421 tactgtgaac tctgtgacaa gcagtactat aagcaccagg agtttgacaa tcacattaat 481 tcatatgacc atgctcacaa gcagaggctc aaggaactga aacaaaggga atttgctcga 541 aatgtagcat ctaaatccag gaaagatgaa agaaaacagg aaaaggcact ccaacgcctg 601 cacaagctgg ctgagctaag aaaggaaact gtatgtgctc ctggaagtgg ccccatgttc 661 aaatcaacaa ctgttactgt gagagaaaac tgtaatgaaa tttcccaacg agttgttgtg 721 gattcagtta ataaccagca agatttcaaa tatactttga ttcatagtga agagaatact 781 aaagatgcta ccactgttgc tgaagatcca gaaagtgcaa ataattatac agcaaaaaat 841 aaccaagttg gggatcaagc ccaggggatt cacagacaca aaatcggctt ttcttttgca 901 tttccaaaga aagcgtccgt gaagctagag tcctcagctg cagccttctc tgaatacagt 961 gatgatgcct cagtgggaaa aggatttagc agaaaaagta gatttgtccc cagtgcttgt 1021 catcttcaac tatcttcacc aacagatgtg cttttgagtt ctgaggagaa aactaactct 1081 tttcatccac cagaggcaat gtgcagagac aaagaaactg ttcaaactca agagataaaa 1141 gaagtctcta gtgaaaaaga tgcattatta ttaccttcat tttgcaagtt tcaacttcag 1201 ttatcttctg atgcagataa ttgtcaaaat tcagtcccat tagcagatca aataccacta 1261 gagagtgttg ttattaatga agacatacct gttagtggta acagttttga gttgttagga 1321 aataaatcca cagttcttga catgtctaat gattgcatat ctgtgcaagc taccacagag 1381 gaaaatgtta agcataacga ggcatccaca actgaggttg aaaataaaaa tggtcccgag 1441 acattggccc cttcaaatac tgaagaggtt aacataacta tacataagaa aacaaatttc 1501 tgcaaaagac aatgtgagcc atttgtacct gtccttaaca aacacagatc tacagttctt 1561 cagtggccat cagaaatgct ggtttataca actacgaaac catcaatttc ctatagctgt 1621 aatcctctat gttttgactt caagtctact aaagtaaata ataatctaga taaaaataag 1681 ccagacttaa aagatctttg ttctcagcag aagcaggaag acatttgcat gggaccactt 1741 tcagattaca aggatgtatc tacagaagga ctcactgatt atgaaattgg aagtagcaaa 1801 aataaatgca gccaagtcac tcctcttttg gctgatgata ttctctccag tagttgtgat 1861 tctggaaaaa ataagaacac gggtcagagg tataaaaaca tttcctgtaa gatcagagaa 1921 acagaaaagt ataattttac taaaagtcaa ataaaacagg acactctaga tgaaaaatac 1981 aacaaaataa ggttgaaaga gacccatgaa tactggttcc ataaaagtag aagaaagaaa 2041 aagagaaaaa agttatgtca gcatcatcat atggagaaaa ccaaagaatc agaaactcgc 2101 tgcaaaatgg aagcagagaa tagttacact gaaaatgctg ggaaatatct attggaacca 2161 atttcagaaa agcagtattt agctgcagag caattattag actcacatca gttacttgat 2221 aaaaggccca aatcagaatc catatcctta agtgacaatg aagaaatgtg taaaacatgg 2281 aatactgaat acaacactta tgatactatc agttctaaaa accactgtaa aaagaacaca 2341 acaatacttt taaatggaca atcaaatgca aaaatgatac attctgggaa acataattta 2401 acatattcta gaacttactg ttgttggaaa accaaaatgt caagctgtag tcaggatcac 2461 agaagcttag ttcttcaaaa tgatatgaaa cgcatgagtc agaatcaggc tgttaaaaga 2521 ggttacaatt ctgtcatgaa tgaatcagaa agattctatc gaaaacgtag acaacattca 2581 cattcttatt cttcagatga aagtttaaat cgacagaatc atttaccaga agaatttttg 2641 aggccaccaa gtacttcagt tgctccctgc aagcctaaaa agaaacggag gcgaaaaaga 2701 ggcagattcc accccggatt tgaaacttta gaactcaaag aaaatacaga ttatcccgtg 2761 aaagacaatt cttccttaaa tcctctggat aggttaataa gtgaagacaa aaaagagaaa 2821 atgaaaccac aagaagttgc aaaaatcgaa aggaactcag aacaaacaaa ccaattaaga 2881 aacaaactgt ctttccaccc taacaatctc cttccttctg aaaccaatgg tgaaactgag 2941 catttagaaa tggagaccac ttctggtgaa ttgtcagatg tttccaatga tcccaccaca 3001 tctgtctgtg tagctagtgc cccaacaaaa gaagcaattg acaataccct gcttgaacac 3061 aaagaaagaa gtgagaatat aaatcttaat gaaaagcaaa ttccttttca ggtgcctaat 3121 attgaaagga actttagaca gtcacagcct aaatcctatc tttgccatta tgaactggct 3181 gaggcccttc cacaaggaaa gatgaatgag acaccaactg agtggctgcg ttataattca 3241 ggaatcctta acacacaacc accattacca ttcaaagaag cacatgtcag tggtcatact 3301 tttgtaacag ctgagcaaat cctggctcca ttagctttac cagagcaagc attattgatc 3361 ccactagaaa accatgacaa attcaaaaat gtaccatgtg aggtctacca gcacattctg 3421 cagccaaaca tgctggccaa caaggttaaa tttacctttc ctccagctgc cctcccaccc 3481 cctagcacac ctctgcagcc tttgcctttg cagcagtcct tatgttctac ctctgtaacc 3541 actatccatc acactgtttt gcagcagcac gctgcagctg ctgcagctgc agctgcagcc 3601 gcagctgcag gaacctttaa agtgcttcag ccacaccaac agtttctttc ccaaatccca 3661 gctctcacca gaacctcatt acctcagctc tcagtaggac cagtaggacc gaggctttgt 3721 cctgggaacc agccaacttt tgttgctcct cctcagatgc caatcattcc agcttccgtt 3781 cttcatccta gccatctggc tttcccatct ttaccccatg cactctttcc ttcactgctt 3841 tccccacacc ctactgtcat ccctttgcaa cctctcttct agtcatcacc ataatgggaa 3901 aaaaatactc ttgtgaaaac tattgctata tgcgttaagt gttcatctat gtgggtacat 3961 ggctatttaa ctggtggaaa taaactggcc gatacatggc gtcattgg //