LOCUS BC008827 4022 bp mRNA linear HUM 22-FEB-2007
DEFINITION Homo sapiens zinc finger and SCAN domain containing 20, mRNA (cDNA
clone MGC:13310 IMAGE:4110431), complete cds.
ACCESSION BC008827
VERSION BC008827.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4022)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4022)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (25-MAY-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC008827.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 17 Row: d Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34147716.
FEATURES Location/Qualifiers
source 1..4022
/db_xref="H-InvDB:HIT000034001"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:13310 IMAGE:4110431"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4022
/gene="ZSCAN20"
/gene_synonym="KOX29"
/gene_synonym="ZFP-31"
/db_xref="GeneID:7579"
/db_xref="HGNC:HGNC:13093"
CDS 145..3078
/gene="ZSCAN20"
/gene_synonym="KOX29"
/gene_synonym="ZFP-31"
/codon_start=1
/product="zinc finger and SCAN domain containing 20"
/protein_id="AAH08827.1"
/db_xref="GeneID:7579"
/db_xref="HGNC:HGNC:13093"
/translation="MAMALELQAQASPQPEPEELLIVKLELLVLEQFLTILPREVQTW
VQARHPESGEEAVALVEDWHRETRTAGQSGLELHTEETRPLKTGEEAQSFQLQPVDPW
PEGQSQKKGVKNTCPDLPNHLNAEVAPQPLKESAVLTPRVPTLPKMGSVGDWEVTAES
QEALGPGKHAEKELCKDPPGDDCGNSVCLGVPVSKPSNTSEKEQGPEFWGLSLINSGK
RSTADYSLDNEPAQALTWRDSRAWEEQYQWDVEDMKVSGVHWGYEETKTFLAILSESP
FSEKLRTCHQNRQVYRAIAEQLRARGFLRTLEQCRYRVKNLLRNYRKAKSSHPPGTCP
FYEELEALVRARTAIRATDGPGEAVALPRLGDSDAEMDEQEEGGWDPEEMAEDCNGAG
LVNVESTQGPRIAGAPALFQSRIAGVHWGYEETKAFLAILSESPFSEKLRTCHQNSQV
YRAIAERLCALGFLRTLEQCRYRFKNLLRSYRKAKSSHPPGTCPFYEELDSLMRARAA
VRAMGTVREAAGLPRCGQSSAETDAQEAWGEVANEDAVKPSTLCPKAPDMGFEMRHED
EDQISEQDIFEGLPGALSKCPTEAVCQPLDWGEDSENENEDEGQWGNPSQEQWQESSS
EEDLEKLIDHQGLYLAEKPYKCDTCMKSFSRSSHFIAHQRIHTGEKPYKCLECGKNFS
DRSNLNTHQRIHTGEKPYKCLECGKSFSDHSNLITHQRIHTGEKPYKCGECWKSFNQS
SNLLKHQRIHLGGNPDQCSEPGGNFAQSPSFSAHWRNSTEETAPEQPQSISKDLNSPG
PHSTNSGEKLYECSECGRSFSKSSALISHQRIHTGEKPYECAECGKSFSKSSTLANHQ
RTHTGEKPYKCVDCGKCFSERSKLITHQRVHTGEKPYKCLECGKFFRDRSNLITHQRI
HTGEKPYKCRECGKCFNQSSSLIIHQRIHTGEKPYKCTECGKDFNNSSHFSAHRRTHA
GGKAS"
BASE COUNT 1135 a 925 c 1071 g 891 t
ORIGIN
1 cttccgccct gttgtgaagt gggtgtctcg gtggagcctt ggggagcagt cccttttcta
61 ggagcctctt gaaggactca ccgtagatgc aggaagacat tggatgaggt cagcatagct
121 gaagtgaggt gtctgggtta gacaatggct atggccctgg aattgcaagc ccaggcatct
181 ccgcagccag agcctgaaga actcctgatt gtgaaactgg agctgctcgt gctggagcag
241 ttcctgacaa tcttgcctag ggaggtccag acctgggtgc aggcacgcca ccctgagagt
301 ggtgaggagg ctgtggcctt ggtggaggat tggcaccgag agaccaggac tgcaggacag
361 tcgggactgg aattgcatac agaagagacc aggcccttaa agacagggga agaagctcag
421 agcttccagc tgcagccagt ggatccctgg cctgagggac agtcccagaa gaagggggtg
481 aagaatacat gccctgacct tcccaatcac ctaaatgccg aggtggcacc acagcctttg
541 aaagagagtg ctgtcctcac tccccgagtc cctactctcc caaagatggg gagcgttgga
601 gattgggagg tgacagctga gtcccaggaa gccctgggcc ctggcaaaca tgctgagaag
661 gagctctgta aagacccccc aggagacgac tgtgggaaca gcgtgtgcct gggagttcca
721 gtttcaaaac caagtaatac ctccgagaaa gagcaaggac cagagttttg gggtctaagt
781 cttataaatt ctgggaaaag gagcactgca gattacagcc tggataatga gccagctcag
841 gcattgacct ggagggattc aagagcctgg gaggaacaat accagtggga tgtggaggac
901 atgaaggtgt caggtgttca ctggggctat gaggagacca agactttcct ggcaattttg
961 agtgaatctc ctttctctga aaagctccgg acttgtcacc agaaccgcca ggtatatcgg
1021 gccattgcag agcagctaag ggcaaggggc ttcctgcgga cactggagca atgtcgctat
1081 agggtcaaaa acctcctacg gaattaccgg aaagccaaga gcagccaccc accaggtacc
1141 tgccccttct atgaggagct ggaggccctg gtcagggctc ggacagccat cagagccaca
1201 gatggcccag gagaggccgt ggcacttccc aggctcgggg atagtgacgc agagatggat
1261 gagcaggagg aagggggctg ggatcctgaa gaaatggcag aagactgtaa cggtgctggc
1321 ctggtcaatg ttgagtctac ccaggggccc aggattgcag gggccccagc tctgttccag
1381 agtcgtattg caggtgtgca ctggggctat gaggagacca aggccttcct ggcaattctc
1441 agtgagtccc cattctcgga aaagcttcgt acctgtcacc agaacagcca ggtgtaccgg
1501 gccattgcag agcggctgtg tgctctgggc ttcctgcgga cactggagca gtgtcgctac
1561 agattcaaaa acctccttcg aagctaccgg aaagccaaga gcagccaccc accagggaca
1621 tgccctttct atgaggaact ggactcgctg atgagggctc gggctgcagt cagggccatg
1681 gggactgtcc gagaggctgc aggtctccct aggtgtgggc agagtagtgc tgagactgat
1741 gcccaggagg cctggggtga agtggccaat gaagatgctg tcaaaccttc aaccttgtgt
1801 cctaaagccc cagacatggg ttttgaaatg aggcatgagg atgaagacca gatttcagag
1861 caggacattt ttgagggttt gcctggagcc ttatcaaaat gtcctacaga agctgtttgc
1921 caacctcttg actggggaga agacagtgaa aatgaaaatg aagatgaagg gcagtgggga
1981 aatccctcac aggaacagtg gcaagaaagt tcttctgaag aggacttaga aaaacttatt
2041 gaccatcaag gcctgtacct tgcggagaaa ccctacaagt gtgacacatg catgaagagc
2101 ttcagtcgga gctcccactt cattgcccat cagcgaatcc acacaggtga gaagccctac
2161 aaatgccttg aatgtggaaa aaactttagt gaccgctcta acctcaatac ccatcagaga
2221 atccacactg gagagaagcc ctataaatgc cttgaatgtg ggaaaagctt tagtgaccat
2281 tctaatctca tcactcacca gagaattcac acgggggaaa agccctataa atgtggagaa
2341 tgttggaaaa gcttcaacca gagctcaaac cttctgaaac atcagagaat ccacttggga
2401 ggaaatcctg accagtgtag tgagcctggg ggaaactttg cccaaagccc atcttttagt
2461 gctcactgga ggaattctac agaagagaca gctcctgaac aacctcaaag tatcagtaag
2521 gacttgaatt ctcctggacc acacagcaca aactcagggg agaaacttta tgagtgttct
2581 gaatgtggaa gaagcttctc taagagctct gccctcatta gtcaccaaag aatccatacg
2641 ggagagaaac catatgaatg tgccgaatgt gggaaaagct tcagtaagag ctccaccctg
2701 gccaaccacc agcgcaccca cactggagag aagccgtata aatgtgtgga ctgtgggaag
2761 tgcttcagtg agcgctccaa gctcatcaca caccagagag tgcacacagg agagaagccc
2821 tacaaatgcc ttgagtgtgg aaaattcttc cgtgaccgtt ctaacctcat tactcaccag
2881 aggattcata cgggagagaa gccgtataag tgcagagagt gtgggaaatg ctttaaccag
2941 agctccagtc ttattattca ccagagaatc cacacagggg agaaacccta caagtgcaca
3001 gagtgtggca aagacttcaa caacagttcc cacttcagtg ctcaccggag aacccatgca
3061 ggagggaagg cgtcgtaggg gacagtttcc tcaacaacaa aggaggactc aatgtatata
3121 tcttatatca taagatgtat gctagagaga aactttccaa tttttaagct tggtgtgtac
3181 ccagggaagt tatcttggta taaaccaggt aatttggaag tgaattacaa atactaagga
3241 tccagatttg aaggcacttt taagtgtaat ttgtttttct tctgtaaaga cccacacaga
3301 atcctgactg tccttgtatt tgctatcatg taagagctgt gtcagtattt gagccaaact
3361 ggcaacttac gagattagag ttaaatcagt ggtcctgagc agtgattcca aactctcact
3421 gtgccttcac accatccatg tgtaatccac cccatcagca gtggttctcc tactttctca
3481 gtgggggcat atcctcaagg gagaatgttg tgactctggt gagagaggag ttggcctaga
3541 gcaggccact gtgagctcag cacagagtag gaagaacagt gacatcatta caaaatcatc
3601 aatagcagca atagctggtg tttattgagc tgttgctctt tggcaagctc tgtgctaaga
3661 actttgtata catcatctca tttaatcttc acaacggccc caggagataa gtactaactt
3721 tcttcccatt tcctagaggc ttgggaaatt aagtaactcg cactggtcac acatctgtaa
3781 gtgggagagg caggattcaa cagatctgtc tgtcttgagt ctatcgtgct cttattgcta
3841 tactgaattc ccattcatgt taggtaactt aggggccaga atctccatgc actttgtaga
3901 ccacatttcc tgttatgaaa tttccacatt ttgaattttt taaacaaata tggaaactga
3961 gaatccctaa ggattaaagg acttgggata tccagaaaaa aaaaaaaaaa aaaaaaaaaa
4021 aa
//