LOCUS       BC066592                5164 bp    mRNA    linear   HUM 12-NOV-2007
DEFINITION  Homo sapiens cut-like homeobox 1, mRNA (cDNA clone MGC:75164
            IMAGE:5740343), complete cds.
ACCESSION   BC066592
VERSION     BC066592.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5164)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 5164)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 139 Row: j Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31652237.
FEATURES             Location/Qualifiers
     source          1..5164
                     /db_xref="H-InvDB:HIT000262266"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:75164 IMAGE:5740343"
                     /tissue_type="Duodenum, adenocarcinoma"
                     /clone_lib="NIH_MGC_88"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..5164
                     /gene="CUX1"
                     /gene_synonym="CASP"
                     /gene_synonym="CDP"
                     /gene_synonym="CDP/Cut"
                     /gene_synonym="CDP1"
                     /gene_synonym="Clox"
                     /gene_synonym="COY1"
                     /gene_synonym="CUX"
                     /gene_synonym="Cux/CDP"
                     /gene_synonym="GOLIM6"
                     /gene_synonym="Nbla10317"
                     /gene_synonym="p100"
                     /gene_synonym="p110"
                     /gene_synonym="p200"
                     /gene_synonym="p75"
                     /db_xref="GeneID:1523"
                     /db_xref="HGNC:HGNC:2557"
                     /db_xref="MIM:116896"
     CDS             21..4571
                     /gene="CUX1"
                     /gene_synonym="CASP"
                     /gene_synonym="CDP"
                     /gene_synonym="CDP/Cut"
                     /gene_synonym="CDP1"
                     /gene_synonym="Clox"
                     /gene_synonym="COY1"
                     /gene_synonym="CUX"
                     /gene_synonym="Cux/CDP"
                     /gene_synonym="GOLIM6"
                     /gene_synonym="Nbla10317"
                     /gene_synonym="p100"
                     /gene_synonym="p110"
                     /gene_synonym="p200"
                     /gene_synonym="p75"
                     /codon_start=1
                     /product="CUX1 protein"
                     /protein_id="AAH66592.1"
                     /db_xref="GeneID:1523"
                     /db_xref="HGNC:HGNC:2557"
                     /db_xref="MIM:116896"
                     /translation="MAANVGSMFQYWKRFDLQQLQRELDATATVLANRQDESEQSRKR
                     LIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLNVYKRLIDVPDP
                     VPALDLGQQLQLKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREY
                     EQTLKNQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKVQSLQTALEKT
                     RTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQ
                     LASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ
                     ISQLEQQLSAKNSTLKQLEEKLKGQADYEEVKKELNILKSMEFAPSEGAGTQDAAKPL
                     EVLLLEKNRSLQSENAALRISNSDLSGSARRKGKDQPESRRPGSLPAPPPSQLPRNPG
                     EQASNTNGTHQFSPAGLSQDFFSSSLASPSLPLASTGKFALNSLLQRQLMQSFYSKAM
                     QEAGSTSMIFSTGPYSTNSISSQSPLQQSPDVNGMAPSPSQSESAGSVSEGEEMDTAE
                     IARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQF
                     LSDEQNILALRSIQGRQRENPGQSLNRLFQEVPKRRNGSEGNITTRIRASETGSDEAI
                     KSILEQAKRELQVQKTAEPAQPSSASGSGNSDDAIRSILQQARREMEAQQAALDPALK
                     QAPLSQSDITILTPKLLSTSPMPTVSSYPPLAISLKKPSAAPEAGASALPNPPALKKE
                     AQDAPGLDPQGAADCAQGVLRQVKNEVGRSGAWKDHWWSAVQPERRNAASSEEAKAEE
                     TGGGKEKGSGGSGGGSQPRAERSQLQGPSSSEYWKEWPSAESPYSQSSELSLTGASRS
                     ETPQNSPLPSSPIVPMSKPTKPSVPPLTPEQYEVYMYQEVDTIELTRQVKEKLAKNGI
                     CQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWLNGELGQGVLPVQG
                     QQQGPVLHSVTSLQDPLQQGCVSSESTPKTSASCSPAPESPMSSSESVKSLTELVQQP
                     CPPIEASKDSKPPEPSDPPASDSQPTTPLPLSGHSALSIQELVAMSPELDTYGITKRV
                     KEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVRMQLWLNDPN
                     NVEKLMDMKRMEKKAYMKRRHSSVSDSQPCEPPSVGTEYSQGASPQPQHQLKKPRVVL
                     APEEKEALKRAYQQKPYPSPKTIEDLATQLNLKTSTVINWFHNYRSRIRRELFIEEIQ
                     AGSQGQAGASDSPSARSGRAAPSSEGDSCDGVEATEGPGSADTEEPKSQGEAEREEVP
                     RPAEQTEPPPSGTPGPDDARDDDHEGGPVEGPGPLPSPASATATAAPAAPEDAATSAA
                     AAPGEGPAAPSSAPPPSNSSSSSAPRRPSSLQSLFGLPEAAGARDSRDNPLRKKKAAN
                     LNSIIHRLEKAASREEPIEWEF"
BASE COUNT         1306 a         1614 c         1478 g          766 t
ORIGIN      
        1 ccgtctcaat atgtctcaag atggcggcca atgtgggatc gatgtttcaa tattggaagc
       61 gctttgattt acagcagctg cagagagaac tcgatgccac cgcaacggta ttggcgaacc
      121 ggcaggatga aagtgagcag tccagaaagc ggcttatcga acagagccgg gagttcaaga
      181 agaacactcc agaggatttg cgcaagcagg tagcgccgct gctgaagagt ttccaaggag
      241 agattgatgc actgagtaaa agaagcaagg aagctgaagc agctttcttg aatgtctaca
      301 aaagattgat tgacgtccca gatcccgtac cagctttgga tctcggacag caactccagc
      361 tcaaagtgca gcgcctgcac gatattgaaa cagagaacca gaaacttagg gaaactctgg
      421 aagaatacaa caaggaattt gctgaagtga aaaatcaaga ggttacgata aaagcactta
      481 aagagaaaat ccgagaatat gaacagacac tgaagaacca agccgaaacc atagctcttg
      541 agaaggaaca gaagttacag aatgactttg cagaaaagga gagaaagctg caggagacac
      601 agatgtccac cacctcaaag ctggaggaag ctgagcataa ggttcagagc ctacaaacag
      661 ccctggaaaa aactcgaaca gaattatttg acctgaaaac caaatacgat gaagaaacta
      721 ctgcaaaggc cgacgagatt gaaatgatca tgacggacct tgaaagggca aaccagaggg
      781 cagaggtggc tcagagagag gcggagacct taagggaaca gctctcatcg gccaatcact
      841 ccctccagct ggcctcacag atccagaagg caccagacgt ggagcaggcc atagaggtgc
      901 tgacccgctc cagcctagaa gttgagttgg ccgccaagga gcgggagatc gcacagctgg
      961 tggaggacgt gcagagactc caggccagcc tcaccaagct gcgggagaat tcggccagcc
     1021 agatctcaca gcttgagcag cagctgagcg ccaaaaacag cacactcaaa caactggaag
     1081 aaaaactcaa aggccaggct gactatgaag aggtgaagaa agagctgaac attctgaagt
     1141 ccatggagtt tgcaccgtcc gagggcgctg ggacacagga tgcggccaag cccctggagg
     1201 tgctgttgct ggagaagaac cgctcgctgc agtccgagaa cgccgcgctg cgcatctcca
     1261 acagcgacct gagcgggtca gccaggagga aagggaaaga ccagcctgaa agtcggcgcc
     1321 cgggatcttt gccggccccc cctccttctc agttgccccg caacccgggg gagcaggctt
     1381 ccaatactaa tggtacacac cagttctcac cagcggggtt aagtcaagac tttttcagct
     1441 catccctggc aagccccagc ctacccctgg cttctacagg aaaatttgca ctaaactctc
     1501 ttctccagcg gcagctaatg cagtccttct actccaaggc tatgcaggaa gccggaagca
     1561 caagcatgat tttttcaaca ggtccataca gcacaaactc catatcttcc caaagtccat
     1621 tacaacaaag cccagatgtc aatggcatgg ccccatcccc cagccagtca gaaagtgctg
     1681 ggagcgtctc cgagggcgag gagatggaca ctgcagaaat cgcccggcag gtcaaagagc
     1741 agctgattaa gcacaatatc ggacaacgta ttttcggaca ttatgtgttg ggactgtctc
     1801 aagggtccgt gagcgagatt ctggcccggc ccaagccatg gaataaactg actgttcgtg
     1861 gcaaggagcc atttcacaag atgaaacagt tcctctccga tgagcagaac atcctggccc
     1921 tccgtagcat ccaaggcaga caaagagaga atccaggcca gagcctgaac agactatttc
     1981 aggaagtacc gaaacgaaga aatgggtctg aaggtaacat caccacccgg atccgagcct
     2041 cggagactgg ctctgatgaa gccatcaagt ccatcctaga gcaagccaag agggagctcc
     2101 aagtgcagaa aactgcagag ccggcccagc cttcctccgc atccggcagc gggaactctg
     2161 atgacgccat ccgctccatc ctgcagcaag cccgccggga gatggaggcc cagcaggctg
     2221 ccctcgaccc tgccttaaag caggcaccac tgtcccagag tgacatcacc atcctcaccc
     2281 ccaagcttct gtccacctcg cccatgccca ccgtgtccag ctacccacct ctcgccatct
     2341 ccctgaagaa gccctccgca gctcctgagg ccggtgcctc tgctctgccg aaccccccgg
     2401 ccctcaaaaa ggaggcccag gacgcccccg ggctggaccc ccagggagca gccgattgtg
     2461 cacaaggggt cctgagacag gtgaaaaatg aggtgggccg cagcggtgcc tggaaggacc
     2521 actggtggag cgcggtgcag ccggagagaa gaaatgccgc ctcctccgag gaggccaagg
     2581 ccgaagaaac gggcggcggg aaagagaagg gcagcggtgg cagcggaggt ggcagccagc
     2641 ctcgggccga gcgcagtcag ctccagggac cctcgtcgtc agagtactgg aaggagtggc
     2701 ccagcgctga gtccccatac tcccagagct cagagctgag tctgaccggg gccagccgca
     2761 gcgagacacc acagaacagc cccctgccat cctccccgat cgtgcccatg tccaagccca
     2821 ccaagccctc ggtccccccg ctgacccccg agcagtacga ggtctacatg taccaggagg
     2881 tggacaccat cgagctcacc cggcaggtta aggaaaagct ggccaagaac ggcatctgcc
     2941 agagaatctt cggggagaag gtgctgggcc tgtcccaggg cagcgtcagc gacatgctgt
     3001 cccgaccgaa gccatggagc aagctgacgc agaaaggccg agaacccttc atccggatgc
     3061 agctctggct gaacggcgag ctaggccagg gtgttctacc cgtccagggc cagcagcaag
     3121 ggccagtcct ccactccgtg acatcgctcc aggacccgct gcagcagggc tgtgtgagct
     3181 cagaaagcac tccaaagacc tccgccagct gcagccctgc ccctgagtcc ccgatgagtt
     3241 ccagtgagtc ggtgaagagc ctgaccgagc tggtccagca gccctgtccc cccatcgagg
     3301 cgagcaagga cagcaagcca ccagagccca gtgacccgcc agcatccgac tcccagccca
     3361 caaccccgct gcctctctcc ggacactcgg ccctcagcat ccaagaatta gtagccatgt
     3421 ccccggagct ggacacctac ggcataacca agcgggtgaa ggaggtgctg acggacaaca
     3481 acctcggcca gcgcttattt ggggagacca tcttagggct cacccaaggc tctgtctctg
     3541 acctccttgc ccgccccaaa ccctggcata agctcagtct gaaaggacga gagcccttcg
     3601 tccggatgca gctgtggctg aacgacccca acaatgtgga gaagctgatg gacatgaaac
     3661 ggatggagaa gaaagcctac atgaagcggc ggcacagctc agtcagtgac agccagccct
     3721 gcgaaccgcc ctctgtcggc accgagtaca gccagggcgc cagcccccag ccccagcacc
     3781 agctgaagaa accccgggtg gtgctggctc cggaggagaa ggaggcgctg aaacgagcgt
     3841 atcagcaaaa gccatacccg tcaccaaaaa ccatcgaaga cctcgccacc cagctcaacc
     3901 tgaaaaccag caccgtcatc aactggttcc acaactacag gtctcggatc cgcagagaac
     3961 tgttcattga ggaaattcag gccgggagtc agggccaggc gggcgccagc gactcaccct
     4021 cggcccgcag cggccgggcg gcgcccagct cggagggcga cagctgcgac ggcgtggagg
     4081 ccactgaggg cccaggcagc gccgacaccg aggagcccaa gtctcaggga gaggccgagc
     4141 gggaggaggt gccgcggccg gcggagcaga cggagccgcc gccctcgggg accccgggcc
     4201 cggacgacgc ccgcgacgac gaccacgagg gaggccccgt ggaaggcccg gggcccctgc
     4261 ccagccccgc ctccgcgacc gccaccgccg cgcccgcggc ccccgaggac gccgctacct
     4321 cagccgccgc cgcgccgggg gagggccccg cggccccgag ctccgcgccg ccgcccagca
     4381 acagcagcag cagcagcgcc ccccgcaggc ccagctcgct gcagagcctt ttcggcctcc
     4441 ccgaggccgc gggcgcccgg gactcgcgcg acaaccccct gcgcaagaag aaggccgcga
     4501 acttgaacag catcatccac cgcctggaga aggccgccag ccgggaggaa cctatcgaat
     4561 gggagttctg aggggctgcg gccctggggc gggcagccag gctgggccgc aagggcctgg
     4621 acggggtcgg acggggcagg cgctgcggac accgtggcct gggcttggcc cgcggcctgc
     4681 accgaccccg ggccggacct aagcccgcag cccagacccc ctccacggtc cgcggcctgc
     4741 accgacccga ggcccagatc caaggccgcg gcccagaccc actctgcggc ccgggccgac
     4801 cctgcggcct ccaccaaccc cgcggcccag acccagcccg cggcctggac ccctggaccg
     4861 ctttgcgcac ttaccgccct gcgggccaca gggcaaaatc gccataggcc aaggtgcata
     4921 tagaaaacaa aggagcatta agcccaatct atgtcgtgtt ttcaaggaag aaaacggaaa
     4981 tgtgtggtcg agcttttttg taccctgaag tgtttttttt attgccctaa gtgatttcca
     5041 caggttctgg aataactctt acagctttgc cttgtgtcct cttgttccgt gtgggcttta
     5101 aaagaaaaaa aatcaaaccc acatattaaa agggggcttt ttatctgcca aaaaaaaaaa
     5161 aaaa
//