LOCUS BC014267 3460 bp mRNA linear HUM 11-AUG-2006
DEFINITION Homo sapiens CCCTC-binding factor (zinc finger protein), mRNA (cDNA
clone MGC:10345 IMAGE:3840827), complete cds.
ACCESSION BC014267
VERSION BC014267.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3460)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3460)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (17-SEP-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC014267.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 14 Row: m Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 62952500.
FEATURES Location/Qualifiers
source 1..3460
/db_xref="H-InvDB:HIT000036613"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:10345 IMAGE:3840827"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3460
/gene="CTCF"
/db_xref="GeneID:10664"
/db_xref="HGNC:HGNC:13723"
/db_xref="MIM:604167"
CDS 279..2462
/gene="CTCF"
/codon_start=1
/product="CCCTC-binding factor (zinc finger protein)"
/protein_id="AAH14267.1"
/db_xref="GeneID:10664"
/db_xref="HGNC:HGNC:13723"
/db_xref="MIM:604167"
/translation="MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQ
TDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVN
MEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPL
PEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEG
KDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCP
RRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTS
GELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTY
KLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKS
DLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQER
HMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTR
RNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNED
EEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIE
NIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR"
BASE COUNT 1009 a 772 c 892 g 787 t
ORIGIN
1 gtggagcgat taaaccgtgc gcggagctgc ttctttggcg gcagcggcgg cggcggtggc
61 cggtgcggac gcgcggagct cgccggagac gccgggtggc cggagccgtg gagcggcggc
121 ggagcgggcg ccgcgggggg tgtggcgcgg agaatgatta cggacctgaa gccaaagaac
181 aagatgcgct agtggacaga ttgctgacca ggggcttgag agctgggttc tattttccct
241 cctcaaactg actttgcagc cacggagagg caggggaaat ggaaggtgat gcagtcgaag
301 ccattgtgga ggagtccgaa acttttatta aaggaaagga gagaaagact taccagagac
361 gccgggaagg gggccaggaa gaagatgcct gccacttacc ccagaaccag acggatgggg
421 gtgaggtggt ccaggatgtc aacagcagtg tacagatggt gatgatggaa cagctggacc
481 ccacccttct tcagatgaag actgaagtaa tggagggcac agtggctcca gaagcagagg
541 ctgctgtgga cgatacccag attataactt tacaggttgt aaatatggag gaacagccca
601 taaacatagg agaacttcag cttgttcaag tacctgttcc tgtgactgta cctgttgcta
661 ccacttcagt agaagaactt cagggggctt atgaaaatga agtgtctaaa gagggccttg
721 cggaaagtga acccatgata tgccacaccc tacctttgcc tgaagggttt caggtggtta
781 aagtgggggc caatggagag gtggagacac tagaacaagg ggaacttcca ccccaggaag
841 atcctagttg gcaaaaagac ccagactatc agccaccagc caaaaaaaca aagaaaacca
901 aaaagagcaa actgcgttat acagaggagg gcaaagatgt agatgtgtct gtctacgatt
961 ttgaggaaga acagcaggag ggtctgctat cagaggttaa tgcagagaaa gtggttggta
1021 atatgaagcc tccaaagcca acaaaaatta aaaagaaagg tgtaaagaag acattccagt
1081 gtgagctttg cagttacacg tgtccacggc gttcaaattt ggatcgtcac atgaaaagcc
1141 acactgatga gagaccacac aagtgccatc tctgtggcag ggcattcaga acagtcaccc
1201 tcctgaggaa tcaccttaac acacacacag gtactcgtcc tcacaagtgc ccagactgcg
1261 acatggcctt tgtgaccagt ggagaattgg ttcggcatcg tcgttacaaa cacacccacg
1321 agaagccatt caagtgttcc atgtgcgatt acgccagtgt agaagtcagc aaattaaaac
1381 gtcacattcg ctctcatact ggagagcgtc cgtttcagtg cagtttgtgc agttatgcca
1441 gcagggacac atacaagctg aaaaggcaca tgagaaccca ttcaggggaa aagccttatg
1501 aatgttatat ttgtcatgct cggtttaccc aaagtggtac catgaagatg cacattttac
1561 agaagcacac agaaaatgtg gccaaatttc actgtcccca ctgtgacaca gtcatagccc
1621 gaaaaagtga tttgggtgtc cacttgcgaa agcagcattc ctatattgag caaggcaaga
1681 aatgccgtta ctgtgatgct gtgtttcatg agcgctatgc cctcatccag catcagaagt
1741 cacacaagaa tgagaagcgc tttaagtgtg accagtgtga ttacgcttgt agacaggaga
1801 ggcacatgat catgcacaag cgcacccaca ccggggagaa gccttacgcc tgcagccact
1861 gcgataagac cttccgccag aagcagcttc tcgacatgca cttcaagcgc tatcacgacc
1921 ccaacttcgt ccctgcggct tttgtctgtt ctaagtgtgg gaaaacattt acacgtcgga
1981 ataccatggc aagacatgct gataattgtg ctggcccaga tggcgtagag ggggaaaatg
2041 gaggagaaac gaagaagagt aaacgtggaa gaaaaagaaa gatgcgctct aagaaagaag
2101 attcctctga cagtgaaaat gctgaaccag atctggacga caatgaggat gaggaggagc
2161 ctgccgtaga aattgaacct gagccagagc ctcagcctgt gaccccagcc ccaccacccg
2221 ccaagaagcg gagaggacga ccccctggca gaaccaacca gcccaaacag aaccagccaa
2281 cagctatcat tcaggttgaa gaccagaata caggtgcaat tgagaacatt atagttgaag
2341 taaaaaaaga gccagatgct gagcccgcag agggagagga agaggaggcc cagccagctg
2401 ccacagatgc ccccaacgga gacctcacgc ccgagatgat cctcagcatg atggaccggt
2461 gatggcggag ccttgtgcgt cgccaggact tctctgggct gtgtttaaac ggcccgcatc
2521 ttaatttttc tcccttcttt ctttttttgg ctttgggaaa agcatcattt taccaaacat
2581 accgagaacg aaaacttcaa ggatgatgtt agaaaaaaat gtgatttaac tagaacttgc
2641 tgtctgatgt tagcaaatca tggaatgttc tgagtccctg agggtttact gtgaagtgct
2701 gaggacagtg ttgacaacta actcgttttc ctagatggaa acggagacat tgacccctcc
2761 ctccatgtgg taaaccactc cagaatggcc accaggcttc ccagagttct atggtcttct
2821 tcccaagaga gtttttaatt gtaaatgcat acttgggaag gacttagagt tttaaactgt
2881 tttttgcttt tgcttttccc tgactccctt tgcttggagt cagctgcaca ccagtagtat
2941 ggcatgctac gatcaggttc tgtcctgaaa gctttgcctc tttcttggca aagtttctgg
3001 tatggtcaag cttgtaaata acttttttta cattttaatc ttttccatta attaagaggt
3061 tgaaaagaag tgcagtgtaa gaaaacccag cattttaatt acttgcaaat taagttacca
3121 cagactctgt agtgtgtaaa tgttgacaag gaattggatc acaatcatgt agcagaatgg
3181 cacccagacc actgcccacc agtgacggac atgcacgtgg cagatcatga tttccagccc
3241 acggagccag catttgaacc ttgtataatt aactttcagt tatgatttcc catcgacatt
3301 ttctttgccc tgtttgtagc tgattgttgt gttttataaa tcttctgtta aggcagaagg
3361 gtgattatga gtggttcaca gcagccctta taagctgggc cagaaaattt cactaggtca
3421 gtaatttaaa ccttggatct tcaaaaaaaa aaaaaaaaaa
//