LOCUS BC005868 3669 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens zinc finger protein 551, mRNA (cDNA clone MGC:4079
IMAGE:3530863), complete cds.
ACCESSION BC005868
VERSION BC005868.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3669)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3669)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-APR-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC005868.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 11 Row: f Column: 1
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34147525.
FEATURES Location/Qualifiers
source 1..3669
/db_xref="H-InvDB:HIT000032487"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:4079 IMAGE:3530863"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3669
/gene="ZNF551"
/db_xref="GeneID:90233"
/db_xref="HGNC:HGNC:25108"
CDS 176..2140
/gene="ZNF551"
/codon_start=1
/product="zinc finger protein 551"
/protein_id="AAH05868.1"
/db_xref="GeneID:90233"
/db_xref="HGNC:HGNC:25108"
/translation="MAAVALRDSAQGMTFEDVAIYFSQEEWELLDESQRFLYCDVMLE
NFAHVTSLGYCHGMENEAIASEQSVSIQVRTSKGNTPTQKTHLSEIKMCVPVLKDILP
AAEHQTTSPVQKSYLGSTSMRGFCFSADLHQHQKHYNEEEPWKRKVDEATFVTGCRFH
VLNYFTCGEAFPAPTDLLQHEATPSGEEPHSSSSKHIQAFFNAKSYYKWGEYRKASSH
KHTLVQHQSVCSEGGLYECSKCEKAFTCKNTLVQHQQIHTGQKMFECSECEESFSKKC
HLILHKIIHTGERPYECSDREKAFIHKSEFIHHQRRHTGGVRHECGECRKTFSYKSNL
IEHQRVHTGERPYECGECGKSFRQSSSLFRHQRVHSGERPYQCCECGKSFRQIFNLIR
HRRVHTGEMPYQCSDCGKSFSCKSELIQHQRIHSGERPYECRECGKSFRQFSNLIRHR
SIHTGDRPYECSECEKSFSRKFILIQHQRVHTGERPYECSECGKSFTRKSDLIQHRRI
HTGTRPYECSECGKSFRQRSGLIQHRRLHTGERPYECSECGKSFSQSASLIQHQRVHT
GERPYECSECGKSFSQSSSLIQHQRGHTGERPYECSQCGKPFTHKSDLIQHQRVHTGE
RPYECSECGKSFSRKSNLIRHRRVHTEERP"
BASE COUNT 1048 a 794 c 892 g 935 t
ORIGIN
1 gccagtttct gcagtggagg tcgcgactga gggacgggac agagaagtcg cgaaagtggg
61 ccagaggttc tgcgacacca cctcgggtga gctgcgccag gcccgggata gggactgttg
121 tgttcgaatg cccgccccgg tcggccgccg ctccccgcct agtccacgga gctcaatggc
181 ggcagtcgcg ctgagggact cggctcaggg tatgaccttt gaggatgtgg ccatttattt
241 ctcccaagaa gagtgggagc tccttgatga gtctcagagg ttcctgtact gcgatgtgat
301 gctggagaac tttgcacatg taacatccct gggttattgc catggaatgg agaatgaggc
361 gatagcttct gagcagagtg tatctataca ggtcaggact tctaagggca atacacccac
421 ccagaaaact cacctcagtg agattaagat gtgtgtccca gtcttgaaag acattttgcc
481 tgcggctgag caccaaacca catcccctgt gcaaaagtca tacttgggta gcacaagcat
541 gagaggcttc tgcttcagtg ctgaccttca ccagcatcaa aagcattaca atgaagaaga
601 gccctggaaa aggaaggtgg atgaggctac atttgtgacc ggctgcagat tccatgtgtt
661 gaattatttc acctgtgggg aggccttccc agcccccacg gacctactcc aacacgaagc
721 cactcccagt ggtgaggagc cacacagtag cagcagcaag catatacagg catttttcaa
781 tgcaaaaagt tattacaagt ggggtgaata cagaaaagct tcaagccaca aacacacact
841 tgttcagcat cagagtgtct gttctgaagg agggctttat gagtgtagca aatgtgagaa
901 agccttcact tgcaagaaca cacttgttca gcaccagcaa attcacactg gacaaaagat
961 gtttgagtgt agtgaatgtg aggaatcctt tagcaaaaag tgccacctaa tcttacacaa
1021 gataattcac actggagaaa ggccttatga atgcagtgat cgtgagaaag cctttatcca
1081 taaatctgaa ttcattcacc accagagacg tcacactgga ggagtgcgtc atgagtgtgg
1141 tgaatgtagg aaaaccttta gctacaaatc taacctcatt gaacaccaga gagttcacac
1201 tggagaaagg ccttatgaat gtggcgagtg cgggaaatcc tttagacaaa gctctagcct
1261 ttttcgacac cagagagttc actctggaga aaggccttat cagtgctgtg agtgtgggaa
1321 atcctttaga caaatcttca atctcattcg acatagaaga gttcacactg gagaaatgcc
1381 ttatcagtgc agtgattgtg ggaaatcttt tagctgcaaa tcggaactca ttcaacacca
1441 gagaattcac agtggagaaa gaccttatga atgcagagaa tgtgggaaat cctttagaca
1501 attctctaac ctcattcgac accgcagcat tcacactggt gataggcctt atgagtgcag
1561 tgaatgtgag aaatccttta gccgcaaatt tatcctgatt caacaccaaa gagttcacac
1621 tggagaaaga ccttatgaat gcagtgaatg tggaaaatcc tttacccgca aatctgacct
1681 cattcaacac cggagaattc atactggcac aagaccttat gagtgcagtg aatgtggcaa
1741 atcttttaga cagcgctctg gcctcattca gcaccggaga cttcatactg gagaaaggcc
1801 ttatgaatgt agtgaatgtg gaaagtcttt tagccaaagt gctagcctca ttcaacacca
1861 gagagttcac actggagaaa ggccttatga atgtagtgaa tgtgggaaat cctttagcca
1921 gagctctagc ctcattcaac accagagagg tcacactgga gaaagacctt atgagtgcag
1981 tcaatgtggg aaacccttta cccacaaatc agaccttatt cagcaccaaa gagttcacac
2041 tggagaaagg ccttatgaat gcagtgaatg tgggaaatcc tttagccgca aatctaacct
2101 cattcgacat cggagagttc acactgaaga aaggccttaa atgtgaaggg aatgtgctat
2161 ttctttattc agtataatag cactggagga gactgtggta gccatcttcg taaatttaaa
2221 ctttgagcac ccacagtggg gtattcttca taagtttcag gtatgtggga agctctgagg
2281 aggttcattg taatttctaa tctgccgagg cctatagcct gatttatgtc actgccaatt
2341 tctgaggctg aagccatttc acatttcacc cctaccacct ggcaggtgca caccatgtgc
2401 atgagtcact tccccacagt gctcagagaa gcaaacctct gtactctccc atttgcttgg
2461 ggaaattatt aatagcccaa gcatgtaggg tcttcatgtc ttttctctga ctttagagta
2521 tacacctgac ccagttgtgg tccagagaat ctgagttgtg ctgtcagctt tctaaggatt
2581 acctttttaa ggaacattgt cggtctcaca tgatgcttgt aatgaatttt tccacattct
2641 gattcaccag cctggaaact gcttggtgca cattgcttta atttataatg tttttataaa
2701 ggttttaaaa aaatttacgc acctttaatc ttgggtgttc tattcattgc atagaatgac
2761 ttgtaagcag aatagtgatg tcagcatgaa ggggtgaata gattttgtgg agatctcaaa
2821 ctttctgtac tcagaaggga caatgtgatg gcagaagaca gctctttgtt ctgtattgct
2881 ctaattggcc tgctgcccaa ggcctcaaag taaagactgc caggctttgt gttcctgtat
2941 tgtggcctgg tgccattttt gtcaggtcag aggtcaccat gaaaaggagg cagttgtgac
3001 aatccccagt gcttctgtga acgtgaattt tgttcagttg tccacaggag atccacataa
3061 ttcctagcac tggtgagggg aatctgttgc ccaggtcctg caaaggagcc atgtgccatg
3121 gcatataagt cagtgtagac tggtgccatt tgttgtcaga ctataggtgt ggaggtgaaa
3181 ttacaggttc aacagtaatt gggacagaaa ctccaggtaa atggggagtg gagaagactg
3241 cagtaaatta gatggaatga ctcttctaaa agttcatcta caaattttcc agtgaatatg
3301 gttgtgtagg atcagtgcat acagaaattc tcaggatctt ctgtttacta tcgctgagat
3361 cattatcaga aaatagtctg gccgggcatg gtgtaatcct aacactttgg gaggccaagg
3421 cgggcggatt gcctgagctc aggagttcaa gaccagcctg ggcaacatgg tgaaatccca
3481 tgtctactaa aatacaaaaa attagccagg cgtggcagca tgcgcctata gtcccagcca
3541 cttgggaggc tgaggcagga gaatcgcttg aacccagaag gcggaagttg cagtgagctg
3601 agattgcacc actgcactcc agcccaggca acagagtgag actccgtctc caaaaaaaaa
3661 aaaaaaaaa
//