LOCUS BC005165 2255 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens small glutamine-rich tetratricopeptide repeat
(TPR)-containing, alpha, mRNA (cDNA clone MGC:4672 IMAGE:3532317),
complete cds.
ACCESSION BC005165
VERSION BC005165.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2255)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2255)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (26-MAR-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Nov 6, 2003 this sequence version replaced BC005165.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 11 Row: n Column: 13
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis.
FEATURES Location/Qualifiers
source 1..2255
/db_xref="H-InvDB:HIT000032260"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:4672 IMAGE:3532317"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2255
/gene="SGTA"
/gene_synonym="hSGT"
/db_xref="GeneID:6449"
/db_xref="HGNC:HGNC:10819"
/db_xref="MIM:603419"
CDS 45..986
/gene="SGTA"
/gene_synonym="hSGT"
/codon_start=1
/product="small glutamine-rich tetratricopeptide repeat
(TPR)-containing, alpha"
/protein_id="AAH05165.1"
/db_xref="GeneID:6449"
/db_xref="HGNC:HGNC:10819"
/db_xref="MIM:603419"
/translation="MDNKKRLAYAIIQFLHDQLRHGGLSSDAQESLEVAIQCLETAFG
VTVEDSDLALPQTLPEIFEAAATGKEMPQDLRSPARTPPSEEDSAEAERLKTEGNEQM
KVENFEAAVHFYGKAIELNPANAVYFCNRAAAYSKLGNYAGAVQDCERAICIDPAYSK
AYGRMGLALSSLNKHVEAVAYYKKALELDPDNETYKSNLKIAELKLREAPSPTGGVGS
FDIAGLLNNPGFMSMASNLMNNPQIQQLMSGMISGGNNPLGTPGTSPSQNDLASLIQA
GQQFAQQMQQQNPELIEQLRSQIRSRTPSASNDDQQE"
BASE COUNT 482 a 708 c 657 g 408 t
ORIGIN
1 cggggatcgg tcgcctgaga ggtatcacct cttctgggct caagatggac aacaagaagc
61 gcctggccta cgccatcatc cagttcctgc atgaccagct ccggcacggg ggcctctcgt
121 ccgatgctca ggagagcttg gaagtcgcca tccagtgcct ggagactgcg tttggggtga
181 cggtagaaga cagtgacctt gcgctccctc agactctgcc ggagatattt gaagcggctg
241 ccacgggcaa ggagatgccg caggacctga ggagccccgc gcgaaccccg ccttccgagg
301 aggactcagc agaggcagag cgcctcaaaa ccgaaggaaa cgagcagatg aaagtggaaa
361 actttgaagc tgccgtgcat ttctacggaa aagccatcga gctcaaccca gccaacgccg
421 tctatttctg caacagagcc gcagcctaca gcaaactcgg caactacgca ggcgcggtgc
481 aggactgtga gcgggccatc tgcattgacc cggcctacag caaggcctac ggcaggatgg
541 gcctggcgct ctccagcctc aacaagcacg tggaggccgt ggcttactac aagaaggctc
601 tggagctgga ccccgacaac gagacataca agtccaacct caagatagcg gagctgaagc
661 tgcgggaggc ccccagcccc acgggaggcg tgggcagctt cgacatcgcc ggcctgctga
721 acaaccctgg cttcatgagc atggcttcga acctaatgaa caatccccag attcagcagc
781 tcatgtccgg catgatttcg ggtggcaaca accccttggg aactcccggc accagcccct
841 cgcagaacga cctggccagc ctcatccagg cgggccagca gtttgcccag cagatgcagc
901 agcagaaccc agagttgata gagcagctca ggagccagat ccggagtcgg acgcccagcg
961 ccagcaacga cgaccagcag gagtgacgct gcctgctccc ggtgtgaccg cgtccttccc
1021 tggccgaccc gaaggaagcc ttctggttgt ctgccacttc ctcctgttgg actgcctgag
1081 agaggggaag agagagacct cggacctgca tgtcaagatg gattttcccc ttttatctct
1141 gccctcctcc actccctttt tgtaactccc ttacagcccc cagacccttc ttgaaacgag
1201 agccagcaag ctgagcacag accagcagcg acctcccttc cagcccccag aaagctcggt
1261 cacttgagtg ttttctagaa tcctggggtg ctcccgggcc gctctcagag aagtggcagg
1321 tttcacgttc agccgtgtgg cggatcgtgt ggcttccaaa gccttttaca gcccccgccc
1381 cccatcccgt ggtctgtctg caggaactct cccgtctgtg agaagcctct ttccgagtcg
1441 acctcccggc caccccggcc ctgtgcctgc tcggaagagc tcactgccag ctgcggcctg
1501 ggcaccgcgg gccatgtgtg tttgcatgag gaactcttta gtggcagaca cctaagagac
1561 ggctgcggtc accccacgcc tccgcggctc aggagccgtc ctgggtgcat aggaccagtt
1621 tctgtgactt ttctccagtt gggcatgttg acagacatgt ttcccctcct cccaccctca
1681 ttttctggtc ctcgcgactg agagccaggg gcgacatcat gaccttctgt cccggccgcc
1741 ttagccccgg gcacagggaa ggcagctggg ccgtttctgt ctgtgtccca tcctgctgtc
1801 cttctgtcct ggatgtttca tgggcccggg gccccccagg gaagcttacc cctcctgtgc
1861 tgggtggagg ccacgggaca cctcaggtgc cacccacctt ggccctaaaa cagccaccag
1921 gaaagcagcc ggagagccgg acagcaggca gcctgtctgg gttcctgagg cctgggggtg
1981 gcagacgagc ccacggcgcc gtggtcccag cagcagggtt gtcagtcgga gcatcctggg
2041 gctccctggc tcctggccgt ctgtgaggta ggcgcagtac cgtgtatcgt aggtagcagt
2101 aggaacgggg gccgccgcgg ccctgcagcc gctcatggcg gtgaggtgtg tgccaagccc
2161 acccggggtg cagggcgtga cgtgtgggga ataaataggc gttgtgaaaa aaaaaaaaaa
2221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa
//