LOCUS       BC002767                2601 bp    mRNA    linear   HUM 06-APR-2004
DEFINITION  Homo sapiens LATS, large tumor suppressor, homolog 1 (Drosophila),
            mRNA (cDNA clone MGC:3608 IMAGE:3632571), complete cds.
ACCESSION   BC002767
VERSION     BC002767.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2601)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2601)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: o Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 10862687.
FEATURES             Location/Qualifiers
     source          1..2601
                     /db_xref="H-InvDB:HIT000258892"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3608 IMAGE:3632571"
                     /tissue_type="Uterus, endometrium adenocarcinoma"
                     /clone_lib="NIH_MGC_44"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2601
                     /gene="LATS1"
                     /gene_synonym="WARTS"
                     /gene_synonym="wts"
                     /db_xref="GeneID:9113"
                     /db_xref="MIM:603473"
     CDS             407..2479
                     /gene="LATS1"
                     /gene_synonym="WARTS"
                     /gene_synonym="wts"
                     /codon_start=1
                     /product="LATS1 protein"
                     /protein_id="AAH02767.1"
                     /db_xref="GeneID:9113"
                     /db_xref="MIM:603473"
                     /translation="MKRSEKPEGYRQMRPKTFPASNYTVSSRQMLQEIRESLRNLSKP
                     SDAAKAEHNMSKMSTEDPRQVRNPPKFGTHHKALQEIRNSLLPFANETNSSRSTSEVN
                     PQMLQDLQAAGFDEDMVIQALQKTNNRSIEAAIEFISKMSYQDPRREQMAAAAARPIN
                     ASMKPGNVQQSVNRKQSWKGSKESLVPQRHGPPLGESVAYHSESPNSQTDVGRPLSGS
                     GISAFVQAHPSNGQRVNPPPPPQVRSVTPPPPPRGQTPPPRGTTPPPPSWEPNSQTKR
                     YSGNMEYVISRISPVPPGAWQEGYPPPPLNTSPMNPPNQGQRGISSVPVGRQPIIMQS
                     SSKFNFPSGRPGMQNGTGQTDFMIHQNVVPAGTVNRQPPPPYPLTAANGQSPSALQTG
                     GSAAPSSYTNGSIPQSMMVPNRNSHNMELYNISVPGLQTNWPQSSSAPAQSSPSSGHE
                     IPTWQPNIPVRSNSFNNPLGNRASHSANSQPSATTVTAITPAPIQQPVKSMRVLKPEL
                     QTALAPTHPSWIPQPIQTVQPSPFPEGTASNVTVMPPVAEAPNYQGPPPPYPKHLLHQ
                     NPSVPPYESISKPSKEDQPSLPKEDESEKSYENVDSGDKEKKQITTSPITVRKNKKDE
                     ERRESRIQSYSPQAFKFFMEQHVENVLKSHQQRLHRKKQLENEMMRVKPFKMSIFILN
                     HLFAWCLF"
     misc_feature    707..820
                     /gene="LATS1"
                     /gene_synonym="WARTS"
                     /gene_synonym="wts"
                     /note="UBA; Region: UBA/TS-N domain. This small domain is
                     composed of three alpha helices. This family includes the
                     previously defined UBA and TS-N domains. The UBA-domain
                     (ubiquitin associated domain) is a novel sequence motif
                     found in several proteins having connections to ubiquitin
                     and the ubiquitination pathway. The structure of the UBA
                     domain consists of a compact three helix bundle. This
                     domain is found at the N terminus of EF-TS hence the name
                     TS-N. The structure of EF-TS is known and this domain is
                     implicated in its interaction with EF-TU. The domain has
                     been found in non EF-TS proteins such as alpha-NAC and
                     MJ0280"
                     /db_xref="CDD:pfam00627"
BASE COUNT      805 a    636 c    532 g    628 t
ORIGIN
        1 ggcacgaggc tgcagcggag tgcggcggcg gcgacactga gtggaaggca aaatggcggc
       61 ggcggcggcg gtggcctggt gttaagggga gagccaggtc cttacgaccc ctgggacggg
      121 ccgcgctggc ccgcggcagc ccccccgttc gtctccccgc tctgccccac cagggatact
      181 tggggttgct gggacggact ctggccgcct cagcgtccgc cctcaggccc gtggccgctg
      241 tccaggagct ctgctctccc ctccagagtt aattatttat attgtaaaga attttaacag
      301 tcctggggac ttccttgaag catcattttc acttttgctc agaagaaagc tctggatcta
      361 tcaaataaag aagtccttcg tgtgggctac atatatagat gttttcatga agaggagtga
      421 aaagccagaa ggatatagac aaatgaggcc taagaccttt cctgccagta actatactgt
      481 cagtagccgg caaatgttac aagaaattcg ggaatccctt aggaatttat ctaaaccatc
      541 tgatgctgct aaggctgagc ataacatgag taaaatgtca accgaagatc ctcgacaagt
      601 cagaaatcca cccaaatttg ggacgcatca taaagccttg caggaaattc gaaactctct
      661 gcttccattt gcaaatgaaa caaattcttc tcggagtact tcagaagtta atccacaaat
      721 gcttcaagac ttgcaagctg ctggatttga tgaggatatg gttatacaag ctcttcagaa
      781 aactaacaac agaagtatag aagcagcaat tgaattcatt agtaaaatga gttaccaaga
      841 tcctcgacga gagcagatgg ctgcagcagc tgccagacct attaatgcca gcatgaaacc
      901 agggaatgtg cagcaatcag ttaaccgcaa acagagctgg aaaggttcta aagaatcctt
      961 agttcctcag aggcatggcc cgccactagg agaaagtgtg gcctatcatt ctgagagtcc
     1021 caactcacag acagatgtag gaagaccttt gtctggatct ggtatatcag catttgttca
     1081 agctcaccct agcaacggac agagagtgaa ccccccacca ccacctcaag taaggagtgt
     1141 tactcctcca ccacctccaa gaggccagac tccccctcca agaggtacaa ctccacctcc
     1201 cccttcatgg gaaccaaact ctcaaacaaa gcgctattct ggaaacatgg aatacgtaat
     1261 ctcccgaatc tctcctgtcc cacctggggc atggcaagag ggctatcctc caccacctct
     1321 caacacttcc cccatgaatc ctcctaatca aggacagaga ggcattagtt ctgttcctgt
     1381 tggcagacaa ccaatcatca tgcagagttc tagcaaattt aactttccat cagggagacc
     1441 tggaatgcag aatggtactg gacaaactga tttcatgata caccaaaatg ttgtccctgc
     1501 tggcactgtg aatcggcagc caccacctcc atatcctctg acagcagcta atggacaaag
     1561 cccttctgct ttacaaacag ggggatctgc tgctccttcg tcatatacaa atggaagtat
     1621 tcctcagtct atgatggtgc caaacagaaa tagtcataac atggaactat ataacattag
     1681 tgtacctgga ctgcaaacaa attggcctca gtcatcttct gctccagccc agtcatcccc
     1741 gagcagtggg catgaaatcc ctacatggca acctaacata ccagtgaggt caaattcttt
     1801 taataaccca ttaggaaata gagcaagtca ctctgctaat tctcagcctt ccgctacaac
     1861 agtcactgca attacaccag ctcctattca acagcctgtg aaaagtatgc gtgtattaaa
     1921 accagagcta cagactgctt tagcacctac acacccttct tggataccac agccaattca
     1981 aactgttcaa cccagtcctt ttcctgaggg aaccgcttca aatgtgactg tgatgccacc
     2041 tgttgctgaa gctccaaact atcaaggacc accaccaccc tacccaaaac atctgctgca
     2101 ccaaaaccca tctgttcctc catacgagtc aatcagtaag cctagcaaag aggatcagcc
     2161 aagcttgccc aaggaagatg agagtgaaaa gagttatgaa aatgttgata gtggggataa
     2221 agaaaagaaa cagattacaa cttcacctat tactgttagg aaaaacaaga aagatgaaga
     2281 gcgaagggaa tctcgtattc aaagttattc tcctcaagca tttaaattct ttatggagca
     2341 acatgtagaa aatgtactca aatctcatca gcagcgtcta catcgtaaaa aacaattaga
     2401 gaatgaaatg atgcgggtaa aaccttttaa aatgtccatt tttatactta atcatctgtt
     2461 tgcttggtgt ttattttaaa atattgtgtc cagtattttt ctttcttttt atagctaaat
     2521 aaaatatatt atcagttatg gaatttaaaa gtgaataaat attaaagtac ttttgaaaaa
     2581 aaaaaaaaaa aaaaaaaaaa a
//