LOCUS       BC014267                3460 bp    mRNA    linear   HUM 11-AUG-2006
DEFINITION  Homo sapiens CCCTC-binding factor (zinc finger protein), mRNA (cDNA
            clone MGC:10345 IMAGE:3840827), complete cds.
ACCESSION   BC014267
VERSION     BC014267.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3460)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3460)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (17-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC014267.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 14 Row: m Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 62952500.
FEATURES             Location/Qualifiers
     source          1..3460
                     /db_xref="H-InvDB:HIT000036613"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:10345 IMAGE:3840827"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3460
                     /gene="CTCF"
                     /db_xref="GeneID:10664"
                     /db_xref="HGNC:HGNC:13723"
                     /db_xref="MIM:604167"
     CDS             279..2462
                     /gene="CTCF"
                     /codon_start=1
                     /product="CCCTC-binding factor (zinc finger protein)"
                     /protein_id="AAH14267.1"
                     /db_xref="GeneID:10664"
                     /db_xref="HGNC:HGNC:13723"
                     /db_xref="MIM:604167"
                     /translation="MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQ
                     TDGGEVVQDVNSSVQMVMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVN
                     MEEQPINIGELQLVQVPVPVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPL
                     PEGFQVVKVGANGEVETLEQGELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEG
                     KDVDVSVYDFEEEQQEGLLSEVNAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCP
                     RRSNLDRHMKSHTDERPHKCHLCGRAFRTVTLLRNHLNTHTGTRPHKCPDCDMAFVTS
                     GELVRHRRYKHTHEKPFKCSMCDYASVEVSKLKRHIRSHTGERPFQCSLCSYASRDTY
                     KLKRHMRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKS
                     DLGVHLRKQHSYIEQGKKCRYCDAVFHERYALIQHQKSHKNEKRFKCDQCDYACRQER
                     HMIMHKRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTR
                     RNTMARHADNCAGPDGVEGENGGETKKSKRGRKRKMRSKKEDSSDSENAEPDLDDNED
                     EEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTNQPKQNQPTAIIQVEDQNTGAIE
                     NIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEMILSMMDR"
BASE COUNT         1009 a          772 c          892 g          787 t
ORIGIN      
        1 gtggagcgat taaaccgtgc gcggagctgc ttctttggcg gcagcggcgg cggcggtggc
       61 cggtgcggac gcgcggagct cgccggagac gccgggtggc cggagccgtg gagcggcggc
      121 ggagcgggcg ccgcgggggg tgtggcgcgg agaatgatta cggacctgaa gccaaagaac
      181 aagatgcgct agtggacaga ttgctgacca ggggcttgag agctgggttc tattttccct
      241 cctcaaactg actttgcagc cacggagagg caggggaaat ggaaggtgat gcagtcgaag
      301 ccattgtgga ggagtccgaa acttttatta aaggaaagga gagaaagact taccagagac
      361 gccgggaagg gggccaggaa gaagatgcct gccacttacc ccagaaccag acggatgggg
      421 gtgaggtggt ccaggatgtc aacagcagtg tacagatggt gatgatggaa cagctggacc
      481 ccacccttct tcagatgaag actgaagtaa tggagggcac agtggctcca gaagcagagg
      541 ctgctgtgga cgatacccag attataactt tacaggttgt aaatatggag gaacagccca
      601 taaacatagg agaacttcag cttgttcaag tacctgttcc tgtgactgta cctgttgcta
      661 ccacttcagt agaagaactt cagggggctt atgaaaatga agtgtctaaa gagggccttg
      721 cggaaagtga acccatgata tgccacaccc tacctttgcc tgaagggttt caggtggtta
      781 aagtgggggc caatggagag gtggagacac tagaacaagg ggaacttcca ccccaggaag
      841 atcctagttg gcaaaaagac ccagactatc agccaccagc caaaaaaaca aagaaaacca
      901 aaaagagcaa actgcgttat acagaggagg gcaaagatgt agatgtgtct gtctacgatt
      961 ttgaggaaga acagcaggag ggtctgctat cagaggttaa tgcagagaaa gtggttggta
     1021 atatgaagcc tccaaagcca acaaaaatta aaaagaaagg tgtaaagaag acattccagt
     1081 gtgagctttg cagttacacg tgtccacggc gttcaaattt ggatcgtcac atgaaaagcc
     1141 acactgatga gagaccacac aagtgccatc tctgtggcag ggcattcaga acagtcaccc
     1201 tcctgaggaa tcaccttaac acacacacag gtactcgtcc tcacaagtgc ccagactgcg
     1261 acatggcctt tgtgaccagt ggagaattgg ttcggcatcg tcgttacaaa cacacccacg
     1321 agaagccatt caagtgttcc atgtgcgatt acgccagtgt agaagtcagc aaattaaaac
     1381 gtcacattcg ctctcatact ggagagcgtc cgtttcagtg cagtttgtgc agttatgcca
     1441 gcagggacac atacaagctg aaaaggcaca tgagaaccca ttcaggggaa aagccttatg
     1501 aatgttatat ttgtcatgct cggtttaccc aaagtggtac catgaagatg cacattttac
     1561 agaagcacac agaaaatgtg gccaaatttc actgtcccca ctgtgacaca gtcatagccc
     1621 gaaaaagtga tttgggtgtc cacttgcgaa agcagcattc ctatattgag caaggcaaga
     1681 aatgccgtta ctgtgatgct gtgtttcatg agcgctatgc cctcatccag catcagaagt
     1741 cacacaagaa tgagaagcgc tttaagtgtg accagtgtga ttacgcttgt agacaggaga
     1801 ggcacatgat catgcacaag cgcacccaca ccggggagaa gccttacgcc tgcagccact
     1861 gcgataagac cttccgccag aagcagcttc tcgacatgca cttcaagcgc tatcacgacc
     1921 ccaacttcgt ccctgcggct tttgtctgtt ctaagtgtgg gaaaacattt acacgtcgga
     1981 ataccatggc aagacatgct gataattgtg ctggcccaga tggcgtagag ggggaaaatg
     2041 gaggagaaac gaagaagagt aaacgtggaa gaaaaagaaa gatgcgctct aagaaagaag
     2101 attcctctga cagtgaaaat gctgaaccag atctggacga caatgaggat gaggaggagc
     2161 ctgccgtaga aattgaacct gagccagagc ctcagcctgt gaccccagcc ccaccacccg
     2221 ccaagaagcg gagaggacga ccccctggca gaaccaacca gcccaaacag aaccagccaa
     2281 cagctatcat tcaggttgaa gaccagaata caggtgcaat tgagaacatt atagttgaag
     2341 taaaaaaaga gccagatgct gagcccgcag agggagagga agaggaggcc cagccagctg
     2401 ccacagatgc ccccaacgga gacctcacgc ccgagatgat cctcagcatg atggaccggt
     2461 gatggcggag ccttgtgcgt cgccaggact tctctgggct gtgtttaaac ggcccgcatc
     2521 ttaatttttc tcccttcttt ctttttttgg ctttgggaaa agcatcattt taccaaacat
     2581 accgagaacg aaaacttcaa ggatgatgtt agaaaaaaat gtgatttaac tagaacttgc
     2641 tgtctgatgt tagcaaatca tggaatgttc tgagtccctg agggtttact gtgaagtgct
     2701 gaggacagtg ttgacaacta actcgttttc ctagatggaa acggagacat tgacccctcc
     2761 ctccatgtgg taaaccactc cagaatggcc accaggcttc ccagagttct atggtcttct
     2821 tcccaagaga gtttttaatt gtaaatgcat acttgggaag gacttagagt tttaaactgt
     2881 tttttgcttt tgcttttccc tgactccctt tgcttggagt cagctgcaca ccagtagtat
     2941 ggcatgctac gatcaggttc tgtcctgaaa gctttgcctc tttcttggca aagtttctgg
     3001 tatggtcaag cttgtaaata acttttttta cattttaatc ttttccatta attaagaggt
     3061 tgaaaagaag tgcagtgtaa gaaaacccag cattttaatt acttgcaaat taagttacca
     3121 cagactctgt agtgtgtaaa tgttgacaag gaattggatc acaatcatgt agcagaatgg
     3181 cacccagacc actgcccacc agtgacggac atgcacgtgg cagatcatga tttccagccc
     3241 acggagccag catttgaacc ttgtataatt aactttcagt tatgatttcc catcgacatt
     3301 ttctttgccc tgtttgtagc tgattgttgt gttttataaa tcttctgtta aggcagaagg
     3361 gtgattatga gtggttcaca gcagccctta taagctgggc cagaaaattt cactaggtca
     3421 gtaatttaaa ccttggatct tcaaaaaaaa aaaaaaaaaa
//