LOCUS BC140826 4684 bp mRNA linear HUM 12-MAY-2008 DEFINITION Homo sapiens zinc finger protein 804B, mRNA (cDNA clone MGC:176503 IMAGE:9021694), complete cds. ACCESSION BC140826 VERSION BC140826.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4684) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4684) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-MAY-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Mike Brownstein, NIMH cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: LLDM Plate: 673 Row: m Column: 20. FEATURES Location/Qualifiers source 1..4684 /db_xref="H-InvDB:HIT000501573" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:176503 IMAGE:9021694" /tissue_type="Pooled, cerebellum, kidney, placenta, testis, lung, colon, liver, heart, thyroid, bladder, uterus, PCR rescued clones" /clone_lib="NIH_MGC_363" /lab_host="DH10B" /note="Vector: pCR-XL-TOPO; Clone identification sequence tag: TACCGCAG" gene 1..4684 /gene="ZNF804B" /db_xref="GeneID:219578" /db_xref="HGNC:HGNC:21958" CDS 610..4659 /gene="ZNF804B" /codon_start=1 /product="zinc finger protein 804B" /protein_id="AAI40827.1" /db_xref="GeneID:219578" /db_xref="HGNC:HGNC:21958" /translation="MACYLVISSRHLSNGHYRGIKGVFRGPLCKNGSPSPDFAEKKST AKALEDVKANFYCELCDKQYHKHQEFDNHINSYDHAHKQRLKELKQREFARNVASKSW KDEKKQEKALKRLHQLAELRQQSECVSGNGPAYKAPRVAIEKQLQQGIFPIKNGRKVS CMKSALLLKGKNLPRIISDKQRSTMPNRHQLQSDRRCLFGNQVLQTSSDLSNANHRTG VSFTFSKKVHLKLESSASVFSENTEETHDCNKSPIYKTKQTADKCKCCRFANKDTHLT KEKEVNISPSHLESVLHNTISINSKILQDKHDSIDETLEDSIGIHASFSKSNIHLSDV DFTPTSREKETRNTLKNTLENCVNHPCQANASFSPPNIYNHSDARISECLDEFSSLEP SEQKSTVHLNPNSRIENREKSLDKTERVSKNVQRLVKEACTHNVASKPLPFLHVQSKD GHTTLQWPTELLLFTKTEPCISYGCNPLYFDFKLSRNTKEDHNLEDLKTELGKKPLEL KTKRESQVSGLTEDQQKLIQEDYQYPKPKTMIANPDWEKFQRKYNLDYSDSEPNKSEY TFSANDLEMKNPKVPLYLNTSLKDCAGKNNSSENKLKEASRAHWQGCRKAVLNDIDED LSFPSYISRFKKHKLIPCSPHLEFEDERQFNCKSSPCTVGGHSDHGKDFSVILKSNHI SMTSKVSGCGNQRYKRYSPQSCLSRYSSSLDTSPSSMSSLRSTCSSHRFNGNSRGNLL CFHKREHHSVERHKRKCLKHNCFYLSDDITKSSQMQSEPQKERNCKLWESFKNEKYSK RRYCHCRERQKLGKNQQQFSGLKSTRIIYCDSNSQISCTGSSKKPPNCQGTQHDRLDS YSIEKMYYLNKSKRNQESLGSPHICDLGKVRPMKCNSGNISCLLKNCSSGPSETTESN TAEGERTPLTAKILLERVQAKKCQEQSSNVEISSNSCKSELEAPSQVPCTIQLAPSGC NRQALPLSEKIQYASESRNDQDSAIPRTTEKDKSKSSHTNNFTILADTDCDNHLSKGI IHLVTESQSLNIKRDATTKEQSKPLISEIQPFIQSCDPVPNEFPGAFPSNKYTGVTDS TETQEDQINLDLQDVSMHINHVEGNINSYYDRTMQKPDKVEDGLEMCHKSISPPLIQQ PITFSPDEIDKYKILQLQAQQHMQKQLLSKHLRVLPAAGPTAFSPASTVQTVPVHQHT SITTIHHTFLQHFAVSASLSSHSSHLPIAHLHPLSQAHFSPISFSTLTPTIIPAHPTF LAGHPLHLVAATPFHPSHITLQPLPPTAFIPTLFGPHLNPATTSIIHLNPLIQPVFQG QDFCHHSCSSQMQQLNEVKEALNVSTHLN" BASE COUNT 1517 a 1137 c 923 g 1107 t ORIGIN 1 cccaactccc tgtcctcacc ccggcccgcc cgtctcctcc tccgccctct cccgactgca 61 gcgggggcag cgctttggcg ctcccaagga ctccccgcca ctctccccac aaagcagcgg 121 cggtggcggc ggctgctgct ggatcttcac actgcagcca gcagcccagg acgcccccgg 181 ccggacgatg ggtacccgcg cctgagcatc ccccgggcag gcgccaggcg cgaaggcagg 241 gggcgggagg aaaggggcgg gggaattcct gattccctgg tggaccctgg aagttgtcct 301 taaataaata tatcgctggc ccgcggttga gcagccacct cgtcagagca gcatgtggac 361 tggctcgccg ggtcccctcc gtgctctgtg ctgtcgccgc cgccgcctct gtcagagcag 421 cagctgtcgg cagcaggagc cccgcacggg gcgcggagca gggacgcgct gccaccgcct 481 ccccctgcgt cctgctggcc gcgtcttctc gggaggtggt agtcgctgtt gccgctgaga 541 aacccgcccg ctttccacgg ctggtcgcct ggtgaggagt tgagactctg cgcctccgcc 601 cggacccaca tggcttgtta cctggtcatc agttcgagac atctcagcaa tgggcactac 661 cggggcatta aaggagtctt caggggaccc ctgtgcaaga acggatctcc ctctccggat 721 tttgcagaaa agaagtccac agcaaaggcc ctggaagatg taaaggcaaa cttttactgt 781 gaattatgtg acaagcagta tcacaaacac caggagtttg acaatcatat taattcttat 841 gaccatgctc ataagcagag actgaaagaa ttaaagcaac gggaatttgc tcgaaatgta 901 gcttctaagt catggaaaga tgagaaaaaa caagaaaaag cacttaaacg acttcatcag 961 ctggctgagt taaggcagca atctgaatgt gtttctggaa atggaccagc atacaaagcc 1021 cccagggtag ccatagaaaa gcaactccag caaggaattt tccccattaa gaatggcaga 1081 aaggtatcat gcatgaagag tgctcttctc cttaaaggaa aaaatctccc cagaatcata 1141 tccgataaac agcggtccac catgccaaat cgacaccaat tacaatcaga caggcgttgt 1201 ttgtttggaa atcaggtact gcaaacatct tcagatctca gcaatgcaaa tcacagaaca 1261 ggagtatcat ttactttttc caaaaaagtg cacctaaaat tagaatcttc agcatcagtt 1321 ttcagtgaga acacagaaga aacccatgat tgtaacaagt cacccattta taaaacaaaa 1381 caaactgcag ataagtgcaa gtgctgcagg tttgcaaata aagatacaca ccttaccaag 1441 gaaaaagagg taaatatctc accaagccat ctggaaagtg ttttacacaa taccatctcc 1501 ataaactcta aaattttgca agacaaacac gactctattg atgagacact agaagattca 1561 attggcattc atgcttcatt ctctaaatct aacattcatc tttcagatgt agattttact 1621 cctaccagca gagaaaaaga aactagaaat acattgaaga acactttaga aaattgtgtt 1681 aatcacccat gccaagcaaa tgcttccttc agcccaccaa acatttacaa ccatagtgat 1741 gccaggatat ctgaatgcct ggatgagttt tcatcactgg agccaagtga acaaaagagt 1801 acagtgcatc tgaatccaaa ttccagaata gagaacagag aaaaatcttt agataaaaca 1861 gaaagagtta gcaaaaatgt tcaaagactt gtaaaagaag catgtaccca taatgtggca 1921 tctaaaccac taccttttct ccacgttcaa agcaaggatg gccacaccac tcttcaatgg 1981 cctacggaac ttctgctctt tacaaaaaca gaaccctgta tctcttatgg ctgcaaccca 2041 ctgtattttg attttaagct ttctcggaac acaaaggaag accacaatct agaggactta 2101 aaaacagaat tgggtaagaa gcccttggaa ttgaagacta aaagagagag ccaagtctca 2161 ggtttaactg aagaccaaca aaaattgatc caagaagatt atcaatatcc gaaaccaaag 2221 acgatgatag ctaatccgga ttgggaaaaa ttccagagga aatataattt ggactacagt 2281 gattctgagc caaataagag tgaatatact ttcagtgcaa atgatttgga aatgaaaaat 2341 cctaaagtgc ctctttacct caacacatct ctaaaggatt gtgctggaaa gaataatagt 2401 agtgagaaca aacttaagga agcttcaagg gcccattggc aaggctgcag aaaggcagtt 2461 ctaaatgata tagatgagga cctatctttt ccttcctaca tctctaggtt taaaaagcat 2521 aaattgattc cctgcagtcc tcatttggaa tttgaagatg aaagacaatt caactgcaag 2581 tccagtcctt gtacagtagg gggtcacagt gaccatggga aagacttcag tgtaattttg 2641 aagagtaacc acatcagcat gaccagcaag gtttccggat gtggaaacca aagatacaag 2701 agatactctc cacagtcatg tttgagtaga tattcttcct ctttggacac atcccctagc 2761 agcatgtcta gcttgagaag tacttgttca agtcatagat tcaatggtaa tagcagaggt 2821 aatttgctct gcttccataa aagagaacac cactcagttg aaaggcacaa acggaaatgt 2881 ctaaagcaca actgcttcta cttgtctgat gatataacaa agagcagcca aatgcagtct 2941 gaaccacaga aagagaggaa ctgcaaattg tgggaatcat ttaaaaatga aaaatactca 3001 aaacgtagat attgtcactg cagagaaaga caaaaactgg gcaaaaatca acaacaattt 3061 tcagggctaa aatctacgag aatcatctat tgtgattcta actcacagat ttcctgtact 3121 ggaagcagta aaaaaccacc taattgccag ggaactcagc acgacagatt ggactcttac 3181 tcaatagaga aaatgtatta cttgaataaa agcaagagaa atcaagagtc tttgggcagc 3241 cctcacattt gtgatctggg aaaagtcagg cccatgaagt gtaactccgg gaatatcagc 3301 tgccttctaa agaactgttc cagtggccct tcagaaacca cagaatcaaa cactgcagaa 3361 ggagagagga cccctctaac agcaaaaatc cttttagaaa gagtacaagc caagaaatgt 3421 caagaacaat caagtaatgt tgagatctct tcaaacagtt gtaaaagtga attagaggct 3481 ccttcgcaag tcccatgcac aattcaactt gcaccatcag gctgtaacag acaagcattg 3541 cctttgtctg aaaaaataca gtatgcaagt gagagcagaa atgatcaaga cagtgcaatt 3601 ccaaggacta cggagaaaga caaaagcaaa agttcacaca caaataattt tacaatttta 3661 gcagacactg attgtgataa ccatctttct aaaggtataa ttcacctagt aacagagtct 3721 cagtcactaa acataaaaag ggatgcaaca acaaaagaac aatcaaaacc tttaattagt 3781 gaaatccaac cttttattca aagctgtgac ccagtaccaa atgaattccc tggtgctttt 3841 ccgtctaata aatatactgg tgtgactgat tcaacagaga cccaagaaga ccaaataaat 3901 ctagacttac aggatgtaag catgcatata aatcatgtag agggaaatat aaactcttac 3961 tatgacagaa ctatgcagaa acctgacaaa gtcgaagacg gattagaaat gtgtcataaa 4021 tctatctctc cccctttaat tcaacagccc ataacatttt ctcctgacga aatagataaa 4081 tataagatcc tacagctaca agcccagcag catatgcaga agcaactcct atcaaagcat 4141 cttcgagttt tgcctgctgc agggcctact gccttctctc cggcctcaac cgtacagaca 4201 gttccagttc accagcacac ttctatcacc accatccacc acacgttcct gcagcatttt 4261 gctgtttctg cttccttaag ttctcatagc agtcacctcc ctattgctca tctacatcct 4321 ctttcacagg cacatttcag tcctatttca ttttcgactc tgactccaac cattatccct 4381 gcacacccca ctttcttagc aggtcatccc ctgcatttag tagctgctac ccccttccac 4441 ccatctcaca taacacttca gcctctgccc cctacagcat ttattcctac attgtttggt 4501 cctcacttaa atccagccac aacttctatc atccacttga atcctttaat ccaaccagta 4561 ttccaaggtc aagatttttg ccatcattct tgctctagcc agatgcaaca gctaaatgaa 4621 gtgaaagagg ccttaaatgt gtccacacac ttgaactaat aagtgttaaa gcccctcctg 4681 tgga //