LOCUS BC066592 5164 bp mRNA linear HUM 12-NOV-2007
DEFINITION Homo sapiens cut-like homeobox 1, mRNA (cDNA clone MGC:75164
IMAGE:5740343), complete cds.
ACCESSION BC066592
VERSION BC066592.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5164)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5164)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (20-FEB-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 139 Row: j Column: 22
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31652237.
FEATURES Location/Qualifiers
source 1..5164
/db_xref="H-InvDB:HIT000262266"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:75164 IMAGE:5740343"
/tissue_type="Duodenum, adenocarcinoma"
/clone_lib="NIH_MGC_88"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..5164
/gene="CUX1"
/gene_synonym="CASP"
/gene_synonym="CDP"
/gene_synonym="CDP/Cut"
/gene_synonym="CDP1"
/gene_synonym="Clox"
/gene_synonym="COY1"
/gene_synonym="CUX"
/gene_synonym="Cux/CDP"
/gene_synonym="GOLIM6"
/gene_synonym="Nbla10317"
/gene_synonym="p100"
/gene_synonym="p110"
/gene_synonym="p200"
/gene_synonym="p75"
/db_xref="GeneID:1523"
/db_xref="HGNC:HGNC:2557"
/db_xref="MIM:116896"
CDS 21..4571
/gene="CUX1"
/gene_synonym="CASP"
/gene_synonym="CDP"
/gene_synonym="CDP/Cut"
/gene_synonym="CDP1"
/gene_synonym="Clox"
/gene_synonym="COY1"
/gene_synonym="CUX"
/gene_synonym="Cux/CDP"
/gene_synonym="GOLIM6"
/gene_synonym="Nbla10317"
/gene_synonym="p100"
/gene_synonym="p110"
/gene_synonym="p200"
/gene_synonym="p75"
/codon_start=1
/product="CUX1 protein"
/protein_id="AAH66592.1"
/db_xref="GeneID:1523"
/db_xref="HGNC:HGNC:2557"
/db_xref="MIM:116896"
/translation="MAANVGSMFQYWKRFDLQQLQRELDATATVLANRQDESEQSRKR
LIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLNVYKRLIDVPDP
VPALDLGQQLQLKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREY
EQTLKNQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKVQSLQTALEKT
RTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQ
LASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ
ISQLEQQLSAKNSTLKQLEEKLKGQADYEEVKKELNILKSMEFAPSEGAGTQDAAKPL
EVLLLEKNRSLQSENAALRISNSDLSGSARRKGKDQPESRRPGSLPAPPPSQLPRNPG
EQASNTNGTHQFSPAGLSQDFFSSSLASPSLPLASTGKFALNSLLQRQLMQSFYSKAM
QEAGSTSMIFSTGPYSTNSISSQSPLQQSPDVNGMAPSPSQSESAGSVSEGEEMDTAE
IARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEPFHKMKQF
LSDEQNILALRSIQGRQRENPGQSLNRLFQEVPKRRNGSEGNITTRIRASETGSDEAI
KSILEQAKRELQVQKTAEPAQPSSASGSGNSDDAIRSILQQARREMEAQQAALDPALK
QAPLSQSDITILTPKLLSTSPMPTVSSYPPLAISLKKPSAAPEAGASALPNPPALKKE
AQDAPGLDPQGAADCAQGVLRQVKNEVGRSGAWKDHWWSAVQPERRNAASSEEAKAEE
TGGGKEKGSGGSGGGSQPRAERSQLQGPSSSEYWKEWPSAESPYSQSSELSLTGASRS
ETPQNSPLPSSPIVPMSKPTKPSVPPLTPEQYEVYMYQEVDTIELTRQVKEKLAKNGI
CQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREPFIRMQLWLNGELGQGVLPVQG
QQQGPVLHSVTSLQDPLQQGCVSSESTPKTSASCSPAPESPMSSSESVKSLTELVQQP
CPPIEASKDSKPPEPSDPPASDSQPTTPLPLSGHSALSIQELVAMSPELDTYGITKRV
KEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREPFVRMQLWLNDPN
NVEKLMDMKRMEKKAYMKRRHSSVSDSQPCEPPSVGTEYSQGASPQPQHQLKKPRVVL
APEEKEALKRAYQQKPYPSPKTIEDLATQLNLKTSTVINWFHNYRSRIRRELFIEEIQ
AGSQGQAGASDSPSARSGRAAPSSEGDSCDGVEATEGPGSADTEEPKSQGEAEREEVP
RPAEQTEPPPSGTPGPDDARDDDHEGGPVEGPGPLPSPASATATAAPAAPEDAATSAA
AAPGEGPAAPSSAPPPSNSSSSSAPRRPSSLQSLFGLPEAAGARDSRDNPLRKKKAAN
LNSIIHRLEKAASREEPIEWEF"
BASE COUNT 1306 a 1614 c 1478 g 766 t
ORIGIN
1 ccgtctcaat atgtctcaag atggcggcca atgtgggatc gatgtttcaa tattggaagc
61 gctttgattt acagcagctg cagagagaac tcgatgccac cgcaacggta ttggcgaacc
121 ggcaggatga aagtgagcag tccagaaagc ggcttatcga acagagccgg gagttcaaga
181 agaacactcc agaggatttg cgcaagcagg tagcgccgct gctgaagagt ttccaaggag
241 agattgatgc actgagtaaa agaagcaagg aagctgaagc agctttcttg aatgtctaca
301 aaagattgat tgacgtccca gatcccgtac cagctttgga tctcggacag caactccagc
361 tcaaagtgca gcgcctgcac gatattgaaa cagagaacca gaaacttagg gaaactctgg
421 aagaatacaa caaggaattt gctgaagtga aaaatcaaga ggttacgata aaagcactta
481 aagagaaaat ccgagaatat gaacagacac tgaagaacca agccgaaacc atagctcttg
541 agaaggaaca gaagttacag aatgactttg cagaaaagga gagaaagctg caggagacac
601 agatgtccac cacctcaaag ctggaggaag ctgagcataa ggttcagagc ctacaaacag
661 ccctggaaaa aactcgaaca gaattatttg acctgaaaac caaatacgat gaagaaacta
721 ctgcaaaggc cgacgagatt gaaatgatca tgacggacct tgaaagggca aaccagaggg
781 cagaggtggc tcagagagag gcggagacct taagggaaca gctctcatcg gccaatcact
841 ccctccagct ggcctcacag atccagaagg caccagacgt ggagcaggcc atagaggtgc
901 tgacccgctc cagcctagaa gttgagttgg ccgccaagga gcgggagatc gcacagctgg
961 tggaggacgt gcagagactc caggccagcc tcaccaagct gcgggagaat tcggccagcc
1021 agatctcaca gcttgagcag cagctgagcg ccaaaaacag cacactcaaa caactggaag
1081 aaaaactcaa aggccaggct gactatgaag aggtgaagaa agagctgaac attctgaagt
1141 ccatggagtt tgcaccgtcc gagggcgctg ggacacagga tgcggccaag cccctggagg
1201 tgctgttgct ggagaagaac cgctcgctgc agtccgagaa cgccgcgctg cgcatctcca
1261 acagcgacct gagcgggtca gccaggagga aagggaaaga ccagcctgaa agtcggcgcc
1321 cgggatcttt gccggccccc cctccttctc agttgccccg caacccgggg gagcaggctt
1381 ccaatactaa tggtacacac cagttctcac cagcggggtt aagtcaagac tttttcagct
1441 catccctggc aagccccagc ctacccctgg cttctacagg aaaatttgca ctaaactctc
1501 ttctccagcg gcagctaatg cagtccttct actccaaggc tatgcaggaa gccggaagca
1561 caagcatgat tttttcaaca ggtccataca gcacaaactc catatcttcc caaagtccat
1621 tacaacaaag cccagatgtc aatggcatgg ccccatcccc cagccagtca gaaagtgctg
1681 ggagcgtctc cgagggcgag gagatggaca ctgcagaaat cgcccggcag gtcaaagagc
1741 agctgattaa gcacaatatc ggacaacgta ttttcggaca ttatgtgttg ggactgtctc
1801 aagggtccgt gagcgagatt ctggcccggc ccaagccatg gaataaactg actgttcgtg
1861 gcaaggagcc atttcacaag atgaaacagt tcctctccga tgagcagaac atcctggccc
1921 tccgtagcat ccaaggcaga caaagagaga atccaggcca gagcctgaac agactatttc
1981 aggaagtacc gaaacgaaga aatgggtctg aaggtaacat caccacccgg atccgagcct
2041 cggagactgg ctctgatgaa gccatcaagt ccatcctaga gcaagccaag agggagctcc
2101 aagtgcagaa aactgcagag ccggcccagc cttcctccgc atccggcagc gggaactctg
2161 atgacgccat ccgctccatc ctgcagcaag cccgccggga gatggaggcc cagcaggctg
2221 ccctcgaccc tgccttaaag caggcaccac tgtcccagag tgacatcacc atcctcaccc
2281 ccaagcttct gtccacctcg cccatgccca ccgtgtccag ctacccacct ctcgccatct
2341 ccctgaagaa gccctccgca gctcctgagg ccggtgcctc tgctctgccg aaccccccgg
2401 ccctcaaaaa ggaggcccag gacgcccccg ggctggaccc ccagggagca gccgattgtg
2461 cacaaggggt cctgagacag gtgaaaaatg aggtgggccg cagcggtgcc tggaaggacc
2521 actggtggag cgcggtgcag ccggagagaa gaaatgccgc ctcctccgag gaggccaagg
2581 ccgaagaaac gggcggcggg aaagagaagg gcagcggtgg cagcggaggt ggcagccagc
2641 ctcgggccga gcgcagtcag ctccagggac cctcgtcgtc agagtactgg aaggagtggc
2701 ccagcgctga gtccccatac tcccagagct cagagctgag tctgaccggg gccagccgca
2761 gcgagacacc acagaacagc cccctgccat cctccccgat cgtgcccatg tccaagccca
2821 ccaagccctc ggtccccccg ctgacccccg agcagtacga ggtctacatg taccaggagg
2881 tggacaccat cgagctcacc cggcaggtta aggaaaagct ggccaagaac ggcatctgcc
2941 agagaatctt cggggagaag gtgctgggcc tgtcccaggg cagcgtcagc gacatgctgt
3001 cccgaccgaa gccatggagc aagctgacgc agaaaggccg agaacccttc atccggatgc
3061 agctctggct gaacggcgag ctaggccagg gtgttctacc cgtccagggc cagcagcaag
3121 ggccagtcct ccactccgtg acatcgctcc aggacccgct gcagcagggc tgtgtgagct
3181 cagaaagcac tccaaagacc tccgccagct gcagccctgc ccctgagtcc ccgatgagtt
3241 ccagtgagtc ggtgaagagc ctgaccgagc tggtccagca gccctgtccc cccatcgagg
3301 cgagcaagga cagcaagcca ccagagccca gtgacccgcc agcatccgac tcccagccca
3361 caaccccgct gcctctctcc ggacactcgg ccctcagcat ccaagaatta gtagccatgt
3421 ccccggagct ggacacctac ggcataacca agcgggtgaa ggaggtgctg acggacaaca
3481 acctcggcca gcgcttattt ggggagacca tcttagggct cacccaaggc tctgtctctg
3541 acctccttgc ccgccccaaa ccctggcata agctcagtct gaaaggacga gagcccttcg
3601 tccggatgca gctgtggctg aacgacccca acaatgtgga gaagctgatg gacatgaaac
3661 ggatggagaa gaaagcctac atgaagcggc ggcacagctc agtcagtgac agccagccct
3721 gcgaaccgcc ctctgtcggc accgagtaca gccagggcgc cagcccccag ccccagcacc
3781 agctgaagaa accccgggtg gtgctggctc cggaggagaa ggaggcgctg aaacgagcgt
3841 atcagcaaaa gccatacccg tcaccaaaaa ccatcgaaga cctcgccacc cagctcaacc
3901 tgaaaaccag caccgtcatc aactggttcc acaactacag gtctcggatc cgcagagaac
3961 tgttcattga ggaaattcag gccgggagtc agggccaggc gggcgccagc gactcaccct
4021 cggcccgcag cggccgggcg gcgcccagct cggagggcga cagctgcgac ggcgtggagg
4081 ccactgaggg cccaggcagc gccgacaccg aggagcccaa gtctcaggga gaggccgagc
4141 gggaggaggt gccgcggccg gcggagcaga cggagccgcc gccctcgggg accccgggcc
4201 cggacgacgc ccgcgacgac gaccacgagg gaggccccgt ggaaggcccg gggcccctgc
4261 ccagccccgc ctccgcgacc gccaccgccg cgcccgcggc ccccgaggac gccgctacct
4321 cagccgccgc cgcgccgggg gagggccccg cggccccgag ctccgcgccg ccgcccagca
4381 acagcagcag cagcagcgcc ccccgcaggc ccagctcgct gcagagcctt ttcggcctcc
4441 ccgaggccgc gggcgcccgg gactcgcgcg acaaccccct gcgcaagaag aaggccgcga
4501 acttgaacag catcatccac cgcctggaga aggccgccag ccgggaggaa cctatcgaat
4561 gggagttctg aggggctgcg gccctggggc gggcagccag gctgggccgc aagggcctgg
4621 acggggtcgg acggggcagg cgctgcggac accgtggcct gggcttggcc cgcggcctgc
4681 accgaccccg ggccggacct aagcccgcag cccagacccc ctccacggtc cgcggcctgc
4741 accgacccga ggcccagatc caaggccgcg gcccagaccc actctgcggc ccgggccgac
4801 cctgcggcct ccaccaaccc cgcggcccag acccagcccg cggcctggac ccctggaccg
4861 ctttgcgcac ttaccgccct gcgggccaca gggcaaaatc gccataggcc aaggtgcata
4921 tagaaaacaa aggagcatta agcccaatct atgtcgtgtt ttcaaggaag aaaacggaaa
4981 tgtgtggtcg agcttttttg taccctgaag tgtttttttt attgccctaa gtgatttcca
5041 caggttctgg aataactctt acagctttgc cttgtgtcct cttgttccgt gtgggcttta
5101 aaagaaaaaa aatcaaaccc acatattaaa agggggcttt ttatctgcca aaaaaaaaaa
5161 aaaa
//