LOCUS BC001138 1440 bp mRNA linear HUM 30-SEP-2003
DEFINITION Homo sapiens hexosaminidase A (alpha polypeptide), mRNA (cDNA clone
IMAGE:2989846), partial cds.
ACCESSION BC001138
VERSION BC001138.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1440)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1440)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (11-DEC-2000) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC001138.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 3 Row: k Column: 16.
FEATURES Location/Qualifiers
source 1..1440
/db_xref="H-InvDB:HIT000085962"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:2989846"
/tissue_type="Ovary, adenocarcinoma"
/clone_lib="NIH_MGC_9"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene <1..1440
/gene="HEXA"
/gene_synonym="TSD"
/db_xref="GeneID:3073"
/db_xref="MIM:606869"
CDS <1..1231
/gene="HEXA"
/gene_synonym="TSD"
/codon_start=2
/product="HEXA protein"
/protein_id="AAH01138.2"
/db_xref="GeneID:3073"
/db_xref="MIM:606869"
/translation="NDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFINKTEIEDF
PRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESFTFPELM
RKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTPCYSG
SEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQDF
MRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDI
PVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIG
GEACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGV
QAQPLNVGFCEQEFEQT"
misc_feature 140..1105
/gene="HEXA"
/gene_synonym="TSD"
/note="Glyco_hydro_20; Region: Glycosyl hydrolase family
20, catalytic domain. This domain has a TIM barrel fold"
/db_xref="CDD:pfam00728"
BASE COUNT 334 a 355 c 383 g 368 t
ORIGIN
1 aaatgatgac cagtgtttac tcctctctga gactgtctgg ggagctctcc gaggtctgga
61 gacttttagc cagcttgttt ggaaatctgc tgagggcaca ttctttatca acaagactga
121 gattgaggac tttccccgct ttcctcaccg gggcttgctg ttggatacat ctcgccatta
181 cctgccactc tctagcatcc tggacactct ggatgtcatg gcgtacaata aattgaacgt
241 gttccactgg catctggtag atgatccttc cttcccatat gagagcttca cttttccaga
301 gctcatgaga aaggggtcct acaaccctgt cacccacatc tacacagcac aggatgtgaa
361 ggaggtcatt gaatacgcac ggctccgggg tatccgtgtg cttgcagagt ttgacactcc
421 tggccacact ttgtcctggg gaccaggtat ccctggatta ctgactcctt gctactctgg
481 gtctgagccc tctggcacct ttggaccagt gaatcccagt ctcaataata cctatgagtt
541 catgagcaca ttcttcttag aagtcagctc tgtcttccca gatttttatc ttcatcttgg
601 aggagatgag gttgatttca cctgctggaa gtccaaccca gagatccagg actttatgag
661 gaagaaaggc ttcggtgagg acttcaagca gctggagtcc ttctacatcc agacgctgct
721 ggacatcgtc tcttcttatg gcaagggcta tgtggtgtgg caggaggtgt ttgataataa
781 agtaaagatt cagccagaca caatcataca ggtgtggcga gaggatattc cagtgaacta
841 tatgaaggag ctggaattgg tcaccaaggc cggcttccgg gcccttctct ctgccccctg
901 gtacctgaac cgtatatcct atggccctga ctggaaggat ttctacgtag tggaacccct
961 ggcatttgaa ggtacccctg agcagaaggc tctggtgatt ggtggagagg cttgtatgtg
1021 gggagaatat gtggacaaca caaacctggt ccccaggctc tggcccagag caggggctgt
1081 tgccgaaagg ctgtggagca acaagttgac atctgacctg acatttgcct atgaacgttt
1141 gtcacacttc cgctgtgagt tgctgaggcg aggtgtccag gcccaacccc tcaatgtagg
1201 cttctgtgag caggagtttg aacagacctg agccccaggc accgaggagg gtgctggctg
1261 taggtgaatg gtagtggagc caggcttcca ctgcatcctg gccaggggac ggagcccctt
1321 gccttcgtgc cccttgcctg cgtgcccctg tgcttggaga gaaaggggcc ggtgctggcg
1381 ctcgcattca ataaagagta atgtggcatt tttctataat aaaaaaaaaa aaaaaaaaaa
//