LOCUS       BC008827                4022 bp    mRNA    linear   HUM 22-FEB-2007
DEFINITION  Homo sapiens zinc finger and SCAN domain containing 20, mRNA (cDNA
            clone MGC:13310 IMAGE:4110431), complete cds.
ACCESSION   BC008827
VERSION     BC008827.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4022)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4022)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (25-MAY-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC008827.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 17 Row: d Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34147716.
FEATURES Location/Qualifiers source 1..4022 /db_xref="H-InvDB:HIT000034001" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:13310 IMAGE:4110431" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..4022 /gene="ZSCAN20" /gene_synonym="KOX29" /gene_synonym="ZFP-31" /db_xref="GeneID:7579" /db_xref="HGNC:HGNC:13093" CDS 145..3078 /gene="ZSCAN20" /gene_synonym="KOX29" /gene_synonym="ZFP-31" /codon_start=1 /product="zinc finger and SCAN domain containing 20" /protein_id="AAH08827.1" /db_xref="GeneID:7579" /db_xref="HGNC:HGNC:13093" /translation="MAMALELQAQASPQPEPEELLIVKLELLVLEQFLTILPREVQTW VQARHPESGEEAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPW PEGQSQKKGVKNTCPDLPNHLNAEVAPQPLKESAVLTPRVPTLPKMGSVGDWEVTAES QEALGPGKHAEKELCKDPPGDDCGNSVCLGVPVSKPSNTSEKEQGPEFWGLSLINSGK RSTADYSLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESP FSEKLRTCHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCP FYEELEALVRARTAIRATDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAG LVNVESTQGPRIAGAPALFQSRIAGVHWGYEETKAFLAILSESPFSEKLRTCHQNSQV YRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSHPPGTCPFYEELDSLMRARAA VRAMGTVREAAGLPRCGQSSAETDAQEAWGEVANEDAVKPSTLCPKAPDMGFEMRHED EDQISEQDIFEGLPGALSKCPTEAVCQPLDWGEDSENENEDEGQWGNPSQEQWQESSS EEDLEKLIDHQGLYLAEKPYKCDTCMKSFSRSSHFIAHQRIHTGEKPYKCLECGKNFS DRSNLNTHQRIHTGEKPYKCLECGKSFSDHSNLITHQRIHTGEKPYKCGECWKSFNQS SNLLKHQRIHLGGNPDQCSEPGGNFAQSPSFSAHWRNSTEETAPEQPQSISKDLNSPG PHSTNSGEKLYECSECGRSFSKSSALISHQRIHTGEKPYECAECGKSFSKSSTLANHQ RTHTGEKPYKCVDCGKCFSERSKLITHQRVHTGEKPYKCLECGKFFRDRSNLITHQRI HTGEKPYKCRECGKCFNQSSSLIIHQRIHTGEKPYKCTECGKDFNNSSHFSAHRRTHA GGKAS" BASE COUNT 1135 a 925 c 1071 g 891 t ORIGIN 1 cttccgccct gttgtgaagt gggtgtctcg gtggagcctt ggggagcagt cccttttcta 61 ggagcctctt gaaggactca ccgtagatgc aggaagacat tggatgaggt cagcatagct 121 gaagtgaggt gtctgggtta gacaatggct atggccctgg aattgcaagc ccaggcatct 181 ccgcagccag agcctgaaga actcctgatt gtgaaactgg agctgctcgt gctggagcag 241 ttcctgacaa tcttgcctag ggaggtccag acctgggtgc 
aggcacgcca ccctgagagt 301 ggtgaggagg ctgtggcctt ggtggaggat tggcaccgag agaccaggac tgcaggacag 361 tcgggactgg aattgcatac agaagagacc aggcccttaa agacagggga agaagctcag 421 agcttccagc tgcagccagt ggatccctgg cctgagggac agtcccagaa gaagggggtg 481 aagaatacat gccctgacct tcccaatcac ctaaatgccg aggtggcacc acagcctttg 541 aaagagagtg ctgtcctcac tccccgagtc cctactctcc caaagatggg gagcgttgga 601 gattgggagg tgacagctga gtcccaggaa gccctgggcc ctggcaaaca tgctgagaag 661 gagctctgta aagacccccc aggagacgac tgtgggaaca gcgtgtgcct gggagttcca 721 gtttcaaaac caagtaatac ctccgagaaa gagcaaggac cagagttttg gggtctaagt 781 cttataaatt ctgggaaaag gagcactgca gattacagcc tggataatga gccagctcag 841 gcattgacct ggagggattc aagagcctgg gaggaacaat accagtggga tgtggaggac 901 atgaaggtgt caggtgttca ctggggctat gaggagacca agactttcct ggcaattttg 961 agtgaatctc ctttctctga aaagctccgg acttgtcacc agaaccgcca ggtatatcgg 1021 gccattgcag agcagctaag ggcaaggggc ttcctgcgga cactggagca atgtcgctat 1081 agggtcaaaa acctcctacg gaattaccgg aaagccaaga gcagccaccc accaggtacc 1141 tgccccttct atgaggagct ggaggccctg gtcagggctc ggacagccat cagagccaca 1201 gatggcccag gagaggccgt ggcacttccc aggctcgggg atagtgacgc agagatggat 1261 gagcaggagg aagggggctg ggatcctgaa gaaatggcag aagactgtaa cggtgctggc 1321 ctggtcaatg ttgagtctac ccaggggccc aggattgcag gggccccagc tctgttccag 1381 agtcgtattg caggtgtgca ctggggctat gaggagacca aggccttcct ggcaattctc 1441 agtgagtccc cattctcgga aaagcttcgt acctgtcacc agaacagcca ggtgtaccgg 1501 gccattgcag agcggctgtg tgctctgggc ttcctgcgga cactggagca gtgtcgctac 1561 agattcaaaa acctccttcg aagctaccgg aaagccaaga gcagccaccc accagggaca 1621 tgccctttct atgaggaact ggactcgctg atgagggctc gggctgcagt cagggccatg 1681 gggactgtcc gagaggctgc aggtctccct aggtgtgggc agagtagtgc tgagactgat 1741 gcccaggagg cctggggtga agtggccaat gaagatgctg tcaaaccttc aaccttgtgt 1801 cctaaagccc cagacatggg ttttgaaatg aggcatgagg atgaagacca gatttcagag 1861 caggacattt ttgagggttt gcctggagcc ttatcaaaat gtcctacaga agctgtttgc 1921 caacctcttg actggggaga agacagtgaa aatgaaaatg aagatgaagg gcagtgggga 
1981 aatccctcac aggaacagtg gcaagaaagt tcttctgaag aggacttaga aaaacttatt 2041 gaccatcaag gcctgtacct tgcggagaaa ccctacaagt gtgacacatg catgaagagc 2101 ttcagtcgga gctcccactt cattgcccat cagcgaatcc acacaggtga gaagccctac 2161 aaatgccttg aatgtggaaa aaactttagt gaccgctcta acctcaatac ccatcagaga 2221 atccacactg gagagaagcc ctataaatgc cttgaatgtg ggaaaagctt tagtgaccat 2281 tctaatctca tcactcacca gagaattcac acgggggaaa agccctataa atgtggagaa 2341 tgttggaaaa gcttcaacca gagctcaaac cttctgaaac atcagagaat ccacttggga 2401 ggaaatcctg accagtgtag tgagcctggg ggaaactttg cccaaagccc atcttttagt 2461 gctcactgga ggaattctac agaagagaca gctcctgaac aacctcaaag tatcagtaag 2521 gacttgaatt ctcctggacc acacagcaca aactcagggg agaaacttta tgagtgttct 2581 gaatgtggaa gaagcttctc taagagctct gccctcatta gtcaccaaag aatccatacg 2641 ggagagaaac catatgaatg tgccgaatgt gggaaaagct tcagtaagag ctccaccctg 2701 gccaaccacc agcgcaccca cactggagag aagccgtata aatgtgtgga ctgtgggaag 2761 tgcttcagtg agcgctccaa gctcatcaca caccagagag tgcacacagg agagaagccc 2821 tacaaatgcc ttgagtgtgg aaaattcttc cgtgaccgtt ctaacctcat tactcaccag 2881 aggattcata cgggagagaa gccgtataag tgcagagagt gtgggaaatg ctttaaccag 2941 agctccagtc ttattattca ccagagaatc cacacagggg agaaacccta caagtgcaca 3001 gagtgtggca aagacttcaa caacagttcc cacttcagtg ctcaccggag aacccatgca 3061 ggagggaagg cgtcgtaggg gacagtttcc tcaacaacaa aggaggactc aatgtatata 3121 tcttatatca taagatgtat gctagagaga aactttccaa tttttaagct tggtgtgtac 3181 ccagggaagt tatcttggta taaaccaggt aatttggaag tgaattacaa atactaagga 3241 tccagatttg aaggcacttt taagtgtaat ttgtttttct tctgtaaaga cccacacaga 3301 atcctgactg tccttgtatt tgctatcatg taagagctgt gtcagtattt gagccaaact 3361 ggcaacttac gagattagag ttaaatcagt ggtcctgagc agtgattcca aactctcact 3421 gtgccttcac accatccatg tgtaatccac cccatcagca gtggttctcc tactttctca 3481 gtgggggcat atcctcaagg gagaatgttg tgactctggt gagagaggag ttggcctaga 3541 gcaggccact gtgagctcag cacagagtag gaagaacagt gacatcatta caaaatcatc 3601 aatagcagca atagctggtg tttattgagc tgttgctctt tggcaagctc tgtgctaaga 3661 
actttgtata catcatctca tttaatcttc acaacggccc caggagataa gtactaactt 3721 tcttcccatt tcctagaggc ttgggaaatt aagtaactcg cactggtcac acatctgtaa 3781 gtgggagagg caggattcaa cagatctgtc tgtcttgagt ctatcgtgct cttattgcta 3841 tactgaattc ccattcatgt taggtaactt aggggccaga atctccatgc actttgtaga 3901 ccacatttcc tgttatgaaa tttccacatt ttgaattttt taaacaaata tggaaactga 3961 gaatccctaa ggattaaagg acttgggata tccagaaaaa aaaaaaaaaa aaaaaaaaaa 4021 aa //