LOCUS       BC066592                5164 bp    mRNA    linear   HUM 12-NOV-2007
DEFINITION  Homo sapiens cut-like homeobox 1, mRNA (cDNA clone MGC:75164
            IMAGE:5740343), complete cds.
ACCESSION   BC066592
VERSION     BC066592.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5164)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 5164)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J.,
            Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 139 Row: j Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            31652237.
FEATURES             Location/Qualifiers
     source          1..5164
                     /db_xref="H-InvDB:HIT000262266"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:75164 IMAGE:5740343"
                     /tissue_type="Duodenum, adenocarcinoma"
                     /clone_lib="NIH_MGC_88"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..5164
                     /gene="CUX1"
                     /gene_synonym="CASP"
                     /gene_synonym="CDP"
                     /gene_synonym="CDP/Cut"
                     /gene_synonym="CDP1"
                     /gene_synonym="Clox"
                     /gene_synonym="COY1"
                     /gene_synonym="CUX"
                     /gene_synonym="Cux/CDP"
                     /gene_synonym="GOLIM6"
                     /gene_synonym="Nbla10317"
                     /gene_synonym="p100"
                     /gene_synonym="p110"
                     /gene_synonym="p200"
                     /gene_synonym="p75"
                     /db_xref="GeneID:1523"
                     /db_xref="HGNC:HGNC:2557"
                     /db_xref="MIM:116896"
     CDS             21..4571
                     /gene="CUX1"
                     /gene_synonym="CASP"
                     /gene_synonym="CDP"
                     /gene_synonym="CDP/Cut"
                     /gene_synonym="CDP1"
                     /gene_synonym="Clox"
                     /gene_synonym="COY1"
                     /gene_synonym="CUX"
                     /gene_synonym="Cux/CDP"
                     /gene_synonym="GOLIM6"
                     /gene_synonym="Nbla10317"
                     /gene_synonym="p100"
                     /gene_synonym="p110"
                     /gene_synonym="p200"
                     /gene_synonym="p75"
                     /codon_start=1
                     /product="CUX1 protein"
                     /protein_id="AAH66592.1"
                     /db_xref="GeneID:1523"
                     /db_xref="HGNC:HGNC:2557"
                     /db_xref="MIM:116896"
                     /translation="MAANVGSMFQYWKRFDLQQLQRELDATATVLANRQDESEQSRKR
                     LIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLNVYKRLIDVPDP
                     VPALDLGQQLQLKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREY
                     EQTLKNQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKVQSLQTALEKT
                     RTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQ
                     LASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ
                     ISQLEQQLSAKNSTLKQLEEKLKGQADYEEVKKELNILKSMEFAPSEGAGTQDAAKPL
                     EVLLLEKNRSLQSENAALRISNSDLSGSARRKGKDQPESRRPGSLPAPPPSQLPRNPG
                     EQASNTNGTHQFSPAGLSQDFFSSSLASPSLPLASTGKFALNSLLQRQLMQSFYSKAM
                     QEAGSTSMIFSTGPYSTNSISSQSPLQQSPDVNGMAPSPSQSESAGSVSEGEEMDTAE
                     IARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQF
                     LSDEQNILALRSIQGRQRENPGQSLNRLFQEVPKRRNGSEGNITTRIRASETGSDEAI
                     KSILEQAKRELQVQKTAEPAQPSSASGSGNSDDAIRSILQQARREMEAQQAALDPALK
                     QAPLSQSDITILTPKLLSTSPMPTVSSYPPLAISLKKPSAAPEAGASALPNPPALKKE
AQDAPGLDPQGAADCAQGVLRQVKNEVGRSGAWKDHWWSAVQPERRNAASSEEAKAEE TGGGKEKGSGGSGGGSQPRAERSQLQGPSSSEYWKEWPSAESPYSQSSELSLTGASRS ETPQNSPLPSSPIVPMSKPTKPSVPPLTPEQYEVYMYQEVDTIELTRQVKEKLAKNGI CQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWLNGELGQGVLPVQG QQQGPVLHSVTSLQDPLQQGCVSSESTPKTSASCSPAPESPMSSSESVKSLTELVQQP CPPIEASKDSKPPEPSDPPASDSQPTTPLPLSGHSALSIQELVAMSPELDTYGITKRV KEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVRMQLWLNDPN NVEKLMDMKRMEKKAYMKRRHSSVSDSQPCEPPSVGTEYSQGASPQPQHQLKKPRVVL APEEKEALKRAYQQKPYPSPKTIEDLATQLNLKTSTVINWFHNYRSRIRRELFIEEIQ AGSQGQAGASDSPSARSGRAAPSSEGDSCDGVEATEGPGSADTEEPKSQGEAEREEVP RPAEQTEPPPSGTPGPDDARDDDHEGGPVEGPGPLPSPASATATAAPAAPEDAATSAA AAPGEGPAAPSSAPPPSNSSSSSAPRRPSSLQSLFGLPEAAGARDSRDNPLRKKKAAN LNSIIHRLEKAASREEPIEWEF" BASE COUNT 1306 a 1614 c 1478 g 766 t ORIGIN 1 ccgtctcaat atgtctcaag atggcggcca atgtgggatc gatgtttcaa tattggaagc 61 gctttgattt acagcagctg cagagagaac tcgatgccac cgcaacggta ttggcgaacc 121 ggcaggatga aagtgagcag tccagaaagc ggcttatcga acagagccgg gagttcaaga 181 agaacactcc agaggatttg cgcaagcagg tagcgccgct gctgaagagt ttccaaggag 241 agattgatgc actgagtaaa agaagcaagg aagctgaagc agctttcttg aatgtctaca 301 aaagattgat tgacgtccca gatcccgtac cagctttgga tctcggacag caactccagc 361 tcaaagtgca gcgcctgcac gatattgaaa cagagaacca gaaacttagg gaaactctgg 421 aagaatacaa caaggaattt gctgaagtga aaaatcaaga ggttacgata aaagcactta 481 aagagaaaat ccgagaatat gaacagacac tgaagaacca agccgaaacc atagctcttg 541 agaaggaaca gaagttacag aatgactttg cagaaaagga gagaaagctg caggagacac 601 agatgtccac cacctcaaag ctggaggaag ctgagcataa ggttcagagc ctacaaacag 661 ccctggaaaa aactcgaaca gaattatttg acctgaaaac caaatacgat gaagaaacta 721 ctgcaaaggc cgacgagatt gaaatgatca tgacggacct tgaaagggca aaccagaggg 781 cagaggtggc tcagagagag gcggagacct taagggaaca gctctcatcg gccaatcact 841 ccctccagct ggcctcacag atccagaagg caccagacgt ggagcaggcc atagaggtgc 901 tgacccgctc cagcctagaa gttgagttgg ccgccaagga gcgggagatc gcacagctgg 961 tggaggacgt gcagagactc caggccagcc tcaccaagct gcgggagaat tcggccagcc 1021 agatctcaca gcttgagcag 
cagctgagcg ccaaaaacag cacactcaaa caactggaag 1081 aaaaactcaa aggccaggct gactatgaag aggtgaagaa agagctgaac attctgaagt 1141 ccatggagtt tgcaccgtcc gagggcgctg ggacacagga tgcggccaag cccctggagg 1201 tgctgttgct ggagaagaac cgctcgctgc agtccgagaa cgccgcgctg cgcatctcca 1261 acagcgacct gagcgggtca gccaggagga aagggaaaga ccagcctgaa agtcggcgcc 1321 cgggatcttt gccggccccc cctccttctc agttgccccg caacccgggg gagcaggctt 1381 ccaatactaa tggtacacac cagttctcac cagcggggtt aagtcaagac tttttcagct 1441 catccctggc aagccccagc ctacccctgg cttctacagg aaaatttgca ctaaactctc 1501 ttctccagcg gcagctaatg cagtccttct actccaaggc tatgcaggaa gccggaagca 1561 caagcatgat tttttcaaca ggtccataca gcacaaactc catatcttcc caaagtccat 1621 tacaacaaag cccagatgtc aatggcatgg ccccatcccc cagccagtca gaaagtgctg 1681 ggagcgtctc cgagggcgag gagatggaca ctgcagaaat cgcccggcag gtcaaagagc 1741 agctgattaa gcacaatatc ggacaacgta ttttcggaca ttatgtgttg ggactgtctc 1801 aagggtccgt gagcgagatt ctggcccggc ccaagccatg gaataaactg actgttcgtg 1861 gcaaggagcc atttcacaag atgaaacagt tcctctccga tgagcagaac atcctggccc 1921 tccgtagcat ccaaggcaga caaagagaga atccaggcca gagcctgaac agactatttc 1981 aggaagtacc gaaacgaaga aatgggtctg aaggtaacat caccacccgg atccgagcct 2041 cggagactgg ctctgatgaa gccatcaagt ccatcctaga gcaagccaag agggagctcc 2101 aagtgcagaa aactgcagag ccggcccagc cttcctccgc atccggcagc gggaactctg 2161 atgacgccat ccgctccatc ctgcagcaag cccgccggga gatggaggcc cagcaggctg 2221 ccctcgaccc tgccttaaag caggcaccac tgtcccagag tgacatcacc atcctcaccc 2281 ccaagcttct gtccacctcg cccatgccca ccgtgtccag ctacccacct ctcgccatct 2341 ccctgaagaa gccctccgca gctcctgagg ccggtgcctc tgctctgccg aaccccccgg 2401 ccctcaaaaa ggaggcccag gacgcccccg ggctggaccc ccagggagca gccgattgtg 2461 cacaaggggt cctgagacag gtgaaaaatg aggtgggccg cagcggtgcc tggaaggacc 2521 actggtggag cgcggtgcag ccggagagaa gaaatgccgc ctcctccgag gaggccaagg 2581 ccgaagaaac gggcggcggg aaagagaagg gcagcggtgg cagcggaggt ggcagccagc 2641 ctcgggccga gcgcagtcag ctccagggac cctcgtcgtc agagtactgg aaggagtggc 2701 ccagcgctga gtccccatac tcccagagct 
cagagctgag tctgaccggg gccagccgca 2761 gcgagacacc acagaacagc cccctgccat cctccccgat cgtgcccatg tccaagccca 2821 ccaagccctc ggtccccccg ctgacccccg agcagtacga ggtctacatg taccaggagg 2881 tggacaccat cgagctcacc cggcaggtta aggaaaagct ggccaagaac ggcatctgcc 2941 agagaatctt cggggagaag gtgctgggcc tgtcccaggg cagcgtcagc gacatgctgt 3001 cccgaccgaa gccatggagc aagctgacgc agaaaggccg agaacccttc atccggatgc 3061 agctctggct gaacggcgag ctaggccagg gtgttctacc cgtccagggc cagcagcaag 3121 ggccagtcct ccactccgtg acatcgctcc aggacccgct gcagcagggc tgtgtgagct 3181 cagaaagcac tccaaagacc tccgccagct gcagccctgc ccctgagtcc ccgatgagtt 3241 ccagtgagtc ggtgaagagc ctgaccgagc tggtccagca gccctgtccc cccatcgagg 3301 cgagcaagga cagcaagcca ccagagccca gtgacccgcc agcatccgac tcccagccca 3361 caaccccgct gcctctctcc ggacactcgg ccctcagcat ccaagaatta gtagccatgt 3421 ccccggagct ggacacctac ggcataacca agcgggtgaa ggaggtgctg acggacaaca 3481 acctcggcca gcgcttattt ggggagacca tcttagggct cacccaaggc tctgtctctg 3541 acctccttgc ccgccccaaa ccctggcata agctcagtct gaaaggacga gagcccttcg 3601 tccggatgca gctgtggctg aacgacccca acaatgtgga gaagctgatg gacatgaaac 3661 ggatggagaa gaaagcctac atgaagcggc ggcacagctc agtcagtgac agccagccct 3721 gcgaaccgcc ctctgtcggc accgagtaca gccagggcgc cagcccccag ccccagcacc 3781 agctgaagaa accccgggtg gtgctggctc cggaggagaa ggaggcgctg aaacgagcgt 3841 atcagcaaaa gccatacccg tcaccaaaaa ccatcgaaga cctcgccacc cagctcaacc 3901 tgaaaaccag caccgtcatc aactggttcc acaactacag gtctcggatc cgcagagaac 3961 tgttcattga ggaaattcag gccgggagtc agggccaggc gggcgccagc gactcaccct 4021 cggcccgcag cggccgggcg gcgcccagct cggagggcga cagctgcgac ggcgtggagg 4081 ccactgaggg cccaggcagc gccgacaccg aggagcccaa gtctcaggga gaggccgagc 4141 gggaggaggt gccgcggccg gcggagcaga cggagccgcc gccctcgggg accccgggcc 4201 cggacgacgc ccgcgacgac gaccacgagg gaggccccgt ggaaggcccg gggcccctgc 4261 ccagccccgc ctccgcgacc gccaccgccg cgcccgcggc ccccgaggac gccgctacct 4321 cagccgccgc cgcgccgggg gagggccccg cggccccgag ctccgcgccg ccgcccagca 4381 acagcagcag cagcagcgcc ccccgcaggc ccagctcgct 
gcagagcctt ttcggcctcc 4441 ccgaggccgc gggcgcccgg gactcgcgcg acaaccccct gcgcaagaag aaggccgcga 4501 acttgaacag catcatccac cgcctggaga aggccgccag ccgggaggaa cctatcgaat 4561 gggagttctg aggggctgcg gccctggggc gggcagccag gctgggccgc aagggcctgg 4621 acggggtcgg acggggcagg cgctgcggac accgtggcct gggcttggcc cgcggcctgc 4681 accgaccccg ggccggacct aagcccgcag cccagacccc ctccacggtc cgcggcctgc 4741 accgacccga ggcccagatc caaggccgcg gcccagaccc actctgcggc ccgggccgac 4801 cctgcggcct ccaccaaccc cgcggcccag acccagcccg cggcctggac ccctggaccg 4861 ctttgcgcac ttaccgccct gcgggccaca gggcaaaatc gccataggcc aaggtgcata 4921 tagaaaacaa aggagcatta agcccaatct atgtcgtgtt ttcaaggaag aaaacggaaa 4981 tgtgtggtcg agcttttttg taccctgaag tgtttttttt attgccctaa gtgatttcca 5041 caggttctgg aataactctt acagctttgc cttgtgtcct cttgttccgt gtgggcttta 5101 aaagaaaaaa aatcaaaccc acatattaaa agggggcttt ttatctgcca aaaaaaaaaa 5161 aaaa //