LOCUS BC018612 2902 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens zinc finger protein 473, mRNA (cDNA clone MGC:20009
IMAGE:4634056), complete cds.
ACCESSION BC018612
VERSION BC018612.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2902)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2902)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (03-DEC-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 29 Row: i Column: 24
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 54792149.
FEATURES Location/Qualifiers
source 1..2902
/db_xref="H-InvDB:HIT000038366"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20009 IMAGE:4634056"
/tissue_type="Eye, retinoblastoma"
/clone_lib="NIH_MGC_16"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2902
/gene="ZNF473"
/gene_synonym="HZFP100"
/gene_synonym="ZN473"
/db_xref="GeneID:25888"
/db_xref="HGNC:HGNC:23239"
CDS 161..2776
/gene="ZNF473"
/gene_synonym="HZFP100"
/gene_synonym="ZN473"
/codon_start=1
/product="zinc finger protein 473"
/protein_id="AAH18612.1"
/db_xref="GeneID:25888"
/db_xref="HGNC:HGNC:23239"
/translation="MAEEFVTLKDVGMDFTLGDWEQLGLEQGDTFWDTALDNCQDLFL
LDPPRPNLTSHPDGSEDLEPLAGGSPEATSPDVTETKNSPLMEDFFEEGFSQEIIEML
SKDGFWNSNFGEACIEDTWLDSLLGDPESLLRSDIATNGESPTECKSHELKRGLSPVS
TVSTGEDSMVHNVSEKTLTPAKSKEYRGEFFSYSDHSQQDSVQEGEKPYQCSECGKSF
SGSYRLTQHWITHTREKPTVHQECEQGFDRNASLSVYPKTHTGYKFYVCNEYGTTFSQ
STYLWHQKTHTGEKPCKSQDSDHPPSHDTQPGEHQKTHTDSKSYNCNECGKAFTRIFH
LTRHQKIHTRKRYECSKCQATFNLRKHLIQHQKTHAAKTTSECQECGKIFRHSSLLIE
HQALHAGEEPYKCNERGKSFRHNSTLKIHQRVHSGEKPYKCSECGKAFHRHTHLNEHR
RIHTGYRPHKCQECVRSFSRPSHLMRHQAIHTAEKPYSCAECKETFSDNNRLVQHQKM
HTVKTPYECQECGERFICGSTLKCHESVHAREKQGFFVSGKILDQNPEQKEKCFKCNK
CEKTFSCSKYLTQHERIHTRGVKPFECDQCGKAFGQSTRLIHHQRIHSRVRLYKWGEQ
GKAISSASLIKLQSFHTKEHPFKCNECGKTFSHSAHLSKHQLIHAGENPFKCSKCDRV
FTQRNYLVQHERTHARKKPLVCNECGKTFRQSSCLSKHQRIHSGEKPYVCDYCGKAFG
LSAELVRHQRIHTGEKPYVCQECGKAFTQSSCLSIHRRVHTGEKPYRCGECGKAFAQK
ANLTQHQRIHTGEKPYSCNVCGKAFVLSAHLNQHLRVHTQETLYQCQRCQKAFRCHSS
LSRHQRVHNKQQYCL"
BASE COUNT 853 a 710 c 718 g 621 t
ORIGIN
1 gctgcgagga ggcgcgtgtg cggggagttg aatctcccgc tcccttgagg ctggggttgc
61 gtctgttgac gcggccgact acaatcccga gccctgccag ccgggaacac ggaggggaag
121 gaggaggagc ttaaaagagg ctactgaacc ccagttggcc atggctgagg aatttgtgac
181 cctcaaggat gtcggcatgg acttcacctt gggagactgg gagcagctcg ggctggaaca
241 gggggacacg ttctgggaca cagcgttgga caattgccag gacctcttcc tgctggaccc
301 cccaagaccc aacctgacct cccacccaga tggcagtgaa gatctggagc ctctggcagg
361 aggaagccca gaagcaacaa gccctgatgt gactgagacc aagaactctc ctctgatgga
421 ggatttcttc gaagaaggat tctcccagga gattatagag atgttatcca aggatggctt
481 ctggaactcc aatttcggag aagcctgtat agaggacacc tggttagata gtttgctagg
541 cgatccagaa agtcttctga ggtctgatat tgccaccaac ggggaaagtc ccacggaatg
601 caagagtcat gaattaaaga gaggactcag tcctgtgtcc accgtttcca cgggagaaga
661 ttccatggtg cataatgttt ctgaaaagac cctcacacca gctaagtcta aggaatatag
721 gggtgagttt ttctcctact ccgaccacag ccagcaggat tctgttcagg aaggggagaa
781 accatatcaa tgtagtgaat gtgggaaaag cttcagtggg agttaccgtc ttacccagca
841 ctggatcact catactaggg agaaacccac tgtccatcaa gagtgtgagc aaggttttga
901 ccggaatgct tccctttctg tgtatccgaa aactcacacg ggctacaaat tctatgtgtg
961 taatgaatat gggacaactt ttagtcagag tacatacctg tggcatcaga aaactcacac
1021 tggagaaaaa ccatgtaaga gtcaagatag tgaccaccca cccagtcatg acacacagcc
1081 tggtgagcat cagaaaactc acacagatag taagtcctac aactgtaacg aatgcggcaa
1141 ggcttttacc cggatcttcc accttactcg gcaccagaag atccacactc ggaaacgcta
1201 tgagtgttcc aagtgccagg cgaccttcaa cttgagaaaa cacctcatcc aacatcagaa
1261 aactcacgct gcaaaaacta cctctgagtg tcaggagtgt gggaagattt ttaggcacag
1321 ttcgctgctc attgaacacc aggctcttca tgctggagag gagccttata agtgtaacga
1381 acgtgggaaa tccttcaggc ataactctac cctaaagatc catcagaggg ttcacagtgg
1441 agagaagcct tacaaatgca gtgagtgtgg gaaggccttc caccggcaca ctcaccttaa
1501 tgaacatcgg cgaattcata caggctacag accccacaaa tgtcaggaat gcgtcaggag
1561 tttcagccgg ccctcacatc tgatgcgaca tcaggccatt cacaccgcag aaaagcccta
1621 tagctgtgct gaatgcaagg agactttcag cgataacaat cgccttgtgc aacaccagaa
1681 aatgcacact gtcaaaaccc catatgaatg tcaggagtgc ggagaacgct tcatttgcgg
1741 ctcaaccctg aagtgccacg agagtgttca cgccagagaa aaacaaggat tttttgtgag
1801 tgggaagatc ttggatcaga acccagaaca gaaagagaag tgctttaagt gtaacaaatg
1861 tgagaaaacc tttagctgca gcaaatacct aactcagcac gagaggattc acaccagggg
1921 agtgaagccc tttgaatgtg accagtgtgg gaaagccttt ggccaaagta ctcggctcat
1981 tcaccatcaa agaatccact ctagagtgag gctgtataaa tggggtgagc aagggaaagc
2041 catcagcagt gcctccctta tcaaacttca gtccttccac acaaaggagc acccttttaa
2101 atgtaacgaa tgcggaaaga ccttcagcca cagtgcacac ctctcaaaac atcagttaat
2161 tcacgctgga gagaatccct ttaaatgtag taagtgtgac agagtcttca cccagagaaa
2221 ctaccttgtt cagcatgagc gaactcatgc cagaaagaag ccgttggtgt gtaacgaatg
2281 cgggaaaacg ttccgtcaga gctcatgcct ttctaagcat cagagaattc actcaggtga
2341 gaagccctat gtatgtgatt actgcgggaa ggccttcggc ctgagtgctg agcttgtccg
2401 ccaccagaga attcacactg gagaaaagcc ttatgtttgt caggaatgcg ggaaagcctt
2461 cacccagagc tcatgccttt ctattcaccg gagagttcac actggggaga agccctacag
2521 atgtggtgaa tgtgggaaag cctttgccca gaaagcaaat ctaacacagc accagagaat
2581 tcacacaggg gagaagcctt actcctgtaa tgtgtgtggc aaagcttttg tcctcagtgc
2641 ccatctcaac cagcacctga gagttcacac ccaggagaca ctttatcagt gtcaacgttg
2701 ccagaaagcc tttcggtgcc actcgagcct cagccgccat cagcgtgtac acaacaagca
2761 gcaatactgc ctgtagccat tgggtggcag cagagtccca gaatatgaga ccgttactcg
2821 gatgttgaaa gttggaaact atcccattgc aagtttctct ccaaataaat gcatctaaag
2881 attgaaaaaa aaaaaaaaaa aa
//