LOCUS BC005165 2255 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens small glutamine-rich tetratricopeptide repeat (TPR)-containing, alpha, mRNA (cDNA clone MGC:4672 IMAGE:3532317), complete cds. ACCESSION BC005165 VERSION BC005165.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2255) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2255) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (26-MAR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Nov 6, 2003 this sequence version replaced BC005165.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 11 Row: n Column: 13 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis. FEATURES Location/Qualifiers source 1..2255 /db_xref="H-InvDB:HIT000032260" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:4672 IMAGE:3532317" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2255 /gene="SGTA" /gene_synonym="hSGT" /db_xref="GeneID:6449" /db_xref="HGNC:HGNC:10819" /db_xref="MIM:603419" CDS 45..986 /gene="SGTA" /gene_synonym="hSGT" /codon_start=1 /product="small glutamine-rich tetratricopeptide repeat (TPR)-containing, alpha" /protein_id="AAH05165.1" /db_xref="GeneID:6449" /db_xref="HGNC:HGNC:10819" /db_xref="MIM:603419" /translation="MDNKKRLAYAIIQFLHDQLRHGGLSSDAQESLEVAIQCLETAFG VTVEDSDLALPQTLPEIFEAAATGKEMPQDLRSPARTPPSEEDSAEAERLKTEGNEQM KVENFEAAVHFYGKAIELNPANAVYFCNRAAAYSKLGNYAGAVQDCERAICIDPAYSK AYGRMGLALSSLNKHVEAVAYYKKALELDPDNETYKSNLKIAELKLREAPSPTGGVGS FDIAGLLNNPGFMSMASNLMNNPQIQQLMSGMISGGNNPLGTPGTSPSQNDLASLIQA GQQFAQQMQQQNPELIEQLRSQIRSRTPSASNDDQQE" BASE COUNT 482 a 708 c 657 g 408 t ORIGIN 1 cggggatcgg tcgcctgaga ggtatcacct cttctgggct caagatggac aacaagaagc 61 gcctggccta cgccatcatc cagttcctgc atgaccagct ccggcacggg ggcctctcgt 121 ccgatgctca ggagagcttg gaagtcgcca tccagtgcct ggagactgcg tttggggtga 181 cggtagaaga cagtgacctt gcgctccctc agactctgcc ggagatattt gaagcggctg 241 ccacgggcaa ggagatgccg caggacctga ggagccccgc gcgaaccccg ccttccgagg 301 aggactcagc agaggcagag cgcctcaaaa ccgaaggaaa cgagcagatg aaagtggaaa 361 actttgaagc tgccgtgcat ttctacggaa aagccatcga gctcaaccca gccaacgccg 421 tctatttctg caacagagcc gcagcctaca gcaaactcgg caactacgca ggcgcggtgc 481 aggactgtga gcgggccatc tgcattgacc cggcctacag caaggcctac ggcaggatgg 541 gcctggcgct ctccagcctc aacaagcacg tggaggccgt ggcttactac aagaaggctc 601 tggagctgga ccccgacaac gagacataca agtccaacct caagatagcg gagctgaagc 661 tgcgggaggc ccccagcccc acgggaggcg tgggcagctt cgacatcgcc ggcctgctga 721 acaaccctgg cttcatgagc atggcttcga acctaatgaa caatccccag attcagcagc 781 tcatgtccgg catgatttcg ggtggcaaca accccttggg aactcccggc accagcccct 841 cgcagaacga cctggccagc ctcatccagg cgggccagca gtttgcccag cagatgcagc 901 agcagaaccc agagttgata gagcagctca ggagccagat ccggagtcgg acgcccagcg 961 ccagcaacga cgaccagcag gagtgacgct gcctgctccc ggtgtgaccg cgtccttccc 1021 tggccgaccc gaaggaagcc ttctggttgt ctgccacttc ctcctgttgg actgcctgag 1081 agaggggaag agagagacct cggacctgca tgtcaagatg gattttcccc ttttatctct 1141 gccctcctcc actccctttt tgtaactccc ttacagcccc cagacccttc ttgaaacgag 1201 agccagcaag ctgagcacag accagcagcg acctcccttc cagcccccag aaagctcggt 1261 cacttgagtg ttttctagaa tcctggggtg ctcccgggcc gctctcagag aagtggcagg 1321 tttcacgttc agccgtgtgg cggatcgtgt ggcttccaaa gccttttaca gcccccgccc 1381 cccatcccgt ggtctgtctg caggaactct cccgtctgtg agaagcctct ttccgagtcg 1441 acctcccggc caccccggcc ctgtgcctgc tcggaagagc tcactgccag ctgcggcctg 1501 ggcaccgcgg gccatgtgtg tttgcatgag gaactcttta gtggcagaca cctaagagac 1561 ggctgcggtc accccacgcc tccgcggctc aggagccgtc ctgggtgcat aggaccagtt 1621 tctgtgactt ttctccagtt gggcatgttg acagacatgt ttcccctcct cccaccctca 1681 ttttctggtc ctcgcgactg agagccaggg gcgacatcat gaccttctgt cccggccgcc 1741 ttagccccgg gcacagggaa ggcagctggg ccgtttctgt ctgtgtccca tcctgctgtc 1801 cttctgtcct ggatgtttca tgggcccggg gccccccagg gaagcttacc cctcctgtgc 1861 tgggtggagg ccacgggaca cctcaggtgc cacccacctt ggccctaaaa cagccaccag 1921 gaaagcagcc ggagagccgg acagcaggca gcctgtctgg gttcctgagg cctgggggtg 1981 gcagacgagc ccacggcgcc gtggtcccag cagcagggtt gtcagtcgga gcatcctggg 2041 gctccctggc tcctggccgt ctgtgaggta ggcgcagtac cgtgtatcgt aggtagcagt 2101 aggaacgggg gccgccgcgg ccctgcagcc gctcatggcg gtgaggtgtg tgccaagccc 2161 acccggggtg cagggcgtga cgtgtgggga ataaataggc gttgtgaaaa aaaaaaaaaa 2221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa //