LOCUS       BC014267                3460 bp    mRNA    linear   HUM 11-AUG-2006
DEFINITION  Homo sapiens CCCTC-binding factor (zinc finger protein), mRNA (cDNA
            clone MGC:10345 IMAGE:3840827), complete cds.
ACCESSION   BC014267
VERSION     BC014267.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3460)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A.
            99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3460)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (17-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC014267.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAL Plate: 14 Row: m Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            62952500.
FEATURES             Location/Qualifiers
     source          1..3460
                     /db_xref="H-InvDB:HIT000036613"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:10345 IMAGE:3840827"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3460
                     /gene="CTCF"
                     /db_xref="GeneID:10664"
                     /db_xref="HGNC:HGNC:13723"
                     /db_xref="MIM:604167"
     CDS             279..2462
                     /gene="CTCF"
                     /codon_start=1
                     /product="CCCTC-binding factor (zinc finger protein)"
                     /protein_id="AAH14267.1"
                     /db_xref="GeneID:10664"
                     /db_xref="HGNC:HGNC:13723"
                     /db_xref="MIM:604167"
                     /translation="MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQ
                     TDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVN
                     MEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPL
                     PEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEG
                     KDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCP
                     RRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTS
                     GELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTY
                     KLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKS
                     DLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQER
                     HMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTR
                     RNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNED
                     EEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIE
                     NIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR"
BASE COUNT     1009 a    772 c    892 g    787 t
ORIGIN
        1 gtggagcgat taaaccgtgc gcggagctgc ttctttggcg gcagcggcgg cggcggtggc
       61 cggtgcggac gcgcggagct cgccggagac gccgggtggc cggagccgtg gagcggcggc
      121 ggagcgggcg ccgcgggggg tgtggcgcgg agaatgatta cggacctgaa gccaaagaac
      181 aagatgcgct agtggacaga ttgctgacca ggggcttgag agctgggttc tattttccct
      241 cctcaaactg actttgcagc cacggagagg caggggaaat ggaaggtgat gcagtcgaag
      301 ccattgtgga ggagtccgaa acttttatta aaggaaagga gagaaagact taccagagac
      361 gccgggaagg gggccaggaa gaagatgcct gccacttacc ccagaaccag acggatgggg
      421 gtgaggtggt ccaggatgtc aacagcagtg tacagatggt gatgatggaa cagctggacc
      481 ccacccttct tcagatgaag actgaagtaa tggagggcac agtggctcca gaagcagagg
      541 ctgctgtgga cgatacccag attataactt tacaggttgt aaatatggag gaacagccca
      601 taaacatagg agaacttcag cttgttcaag tacctgttcc tgtgactgta cctgttgcta
      661 ccacttcagt agaagaactt cagggggctt atgaaaatga agtgtctaaa gagggccttg
      721 cggaaagtga acccatgata tgccacaccc tacctttgcc tgaagggttt caggtggtta
      781 aagtgggggc caatggagag gtggagacac tagaacaagg ggaacttcca ccccaggaag
      841 atcctagttg gcaaaaagac ccagactatc agccaccagc caaaaaaaca aagaaaacca
      901 aaaagagcaa actgcgttat acagaggagg gcaaagatgt agatgtgtct gtctacgatt
      961 ttgaggaaga acagcaggag ggtctgctat cagaggttaa tgcagagaaa gtggttggta
     1021 atatgaagcc tccaaagcca acaaaaatta aaaagaaagg tgtaaagaag acattccagt
     1081 gtgagctttg cagttacacg tgtccacggc gttcaaattt ggatcgtcac atgaaaagcc
     1141 acactgatga gagaccacac aagtgccatc tctgtggcag ggcattcaga acagtcaccc
     1201 tcctgaggaa tcaccttaac acacacacag gtactcgtcc tcacaagtgc ccagactgcg
     1261 acatggcctt tgtgaccagt ggagaattgg ttcggcatcg tcgttacaaa cacacccacg
     1321 agaagccatt caagtgttcc atgtgcgatt acgccagtgt agaagtcagc aaattaaaac
     1381 gtcacattcg ctctcatact ggagagcgtc cgtttcagtg cagtttgtgc agttatgcca
     1441 gcagggacac atacaagctg aaaaggcaca tgagaaccca ttcaggggaa aagccttatg
     1501 aatgttatat ttgtcatgct cggtttaccc aaagtggtac catgaagatg cacattttac
     1561 agaagcacac agaaaatgtg gccaaatttc actgtcccca ctgtgacaca gtcatagccc
     1621 gaaaaagtga tttgggtgtc cacttgcgaa agcagcattc ctatattgag caaggcaaga
     1681 aatgccgtta ctgtgatgct gtgtttcatg agcgctatgc cctcatccag catcagaagt
     1741 cacacaagaa tgagaagcgc tttaagtgtg accagtgtga ttacgcttgt agacaggaga
     1801 ggcacatgat catgcacaag cgcacccaca ccggggagaa gccttacgcc tgcagccact
     1861 gcgataagac cttccgccag aagcagcttc tcgacatgca cttcaagcgc tatcacgacc
     1921 ccaacttcgt ccctgcggct tttgtctgtt ctaagtgtgg gaaaacattt acacgtcgga
     1981 ataccatggc aagacatgct gataattgtg ctggcccaga tggcgtagag ggggaaaatg
     2041 gaggagaaac gaagaagagt aaacgtggaa gaaaaagaaa gatgcgctct aagaaagaag
     2101 attcctctga cagtgaaaat gctgaaccag atctggacga caatgaggat gaggaggagc
     2161 ctgccgtaga aattgaacct gagccagagc ctcagcctgt gaccccagcc ccaccacccg
     2221 ccaagaagcg gagaggacga ccccctggca gaaccaacca gcccaaacag aaccagccaa
     2281 cagctatcat tcaggttgaa gaccagaata caggtgcaat tgagaacatt atagttgaag
     2341 taaaaaaaga gccagatgct gagcccgcag agggagagga agaggaggcc cagccagctg
     2401 ccacagatgc ccccaacgga gacctcacgc ccgagatgat cctcagcatg atggaccggt
     2461 gatggcggag ccttgtgcgt cgccaggact tctctgggct gtgtttaaac ggcccgcatc
     2521 ttaatttttc tcccttcttt ctttttttgg ctttgggaaa agcatcattt taccaaacat
     2581 accgagaacg aaaacttcaa ggatgatgtt agaaaaaaat gtgatttaac tagaacttgc
     2641 tgtctgatgt tagcaaatca tggaatgttc tgagtccctg agggtttact gtgaagtgct
     2701 gaggacagtg ttgacaacta actcgttttc ctagatggaa acggagacat tgacccctcc
     2761 ctccatgtgg taaaccactc cagaatggcc accaggcttc ccagagttct atggtcttct
     2821 tcccaagaga gtttttaatt gtaaatgcat acttgggaag gacttagagt tttaaactgt
     2881 tttttgcttt tgcttttccc tgactccctt tgcttggagt cagctgcaca ccagtagtat
     2941 ggcatgctac gatcaggttc tgtcctgaaa gctttgcctc tttcttggca aagtttctgg
     3001 tatggtcaag cttgtaaata acttttttta cattttaatc ttttccatta attaagaggt
     3061 tgaaaagaag tgcagtgtaa gaaaacccag cattttaatt acttgcaaat taagttacca
     3121 cagactctgt agtgtgtaaa tgttgacaag gaattggatc acaatcatgt agcagaatgg
     3181 cacccagacc actgcccacc agtgacggac atgcacgtgg cagatcatga tttccagccc
     3241 acggagccag catttgaacc ttgtataatt aactttcagt tatgatttcc catcgacatt
     3301 ttctttgccc tgtttgtagc tgattgttgt gttttataaa tcttctgtta aggcagaagg
     3361 gtgattatga gtggttcaca gcagccctta taagctgggc cagaaaattt cactaggtca
     3421 gtaatttaaa ccttggatct tcaaaaaaaa aaaaaaaaaa
//