LOCUS BC021030 1186 bp mRNA linear SYN 02-SEP-2016
DEFINITION Synthetic construct Homo sapiens hexosaminidase A (alpha
polypeptide), mRNA (cDNA clone IMAGE:3626334), partial cds.
ACCESSION BC021030
VERSION BC021030.1
KEYWORDS .
SOURCE synthetic construct
ORGANISM synthetic construct
other sequences; artificial sequences.
REFERENCE 1 (bases 1 to 1186)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1186)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 24 Row: k Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis.
FEATURES Location/Qualifiers
source 1..1186
/db_xref="H-InvDB:HIT000258964"
/organism="synthetic construct"
/mol_type="mRNA"
/isolation_source="Homo sapiens; Uterus, endometrium
adenocarcinoma"
/db_xref="taxon:32630"
/clone="IMAGE:3626334"
/clone_lib="NIH_MGC_44"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene <1..1186
/gene="HEXA"
/gene_synonym="TSD"
/db_xref="GeneID:3073"
/db_xref="MIM:606869"
CDS <1..930
/gene="HEXA"
/gene_synonym="TSD"
/codon_start=1
/transl_table=11
/product="HEXA protein"
/protein_id="AAH21030.1"
/db_xref="GeneID:3073"
/db_xref="MIM:606869"
/translation="RERTGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTL
SWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGD
EVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNK
VKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVE
PLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFA
YERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT"
misc_feature 7..804
/gene="HEXA"
/gene_synonym="TSD"
/note="Glyco_hydro_20; Region: Glycosyl hydrolase family
20, catalytic domain. This domain has a TIM barrel fold"
/db_xref="CDD:pfam00728"
BASE COUNT 279 a 291 c 326 g 290 t
ORIGIN
1 agagagagaa cggggtccta caaccctgtc acccacatct acacagcaca ggatgtgaag
61 gaggtcattg aatacgcacg gctccggggt atccgtgtgc ttgcagagtt tgacactcct
121 ggccacactt tgtcctgggg accaggtatc cctggattac tgactccttg ctactctggg
181 tctgagccct ctggcacctt tggaccagtg aatcccagtc tcaataatac ctatgagttc
241 atgagcacat tcttcttaga agtcagctct gtcttcccag atttttatct tcatcttgga
301 ggagatgagg ttgatttcac ctgctggaag tccaacccag agatccagga ctttatgagg
361 aagaaaggct tcggtgagga cttcaagcag ctggagtcct tctacatcca gacgctgctg
421 gacatcgtct cttcttatgg caagggctat gtggtgtggc aggaggtgtt tgataataaa
481 gtaaagattc agccagacac aatcatacag gtgtggcgag aggatattcc agtgaactat
541 atgaaggagc tggaactggt caccaaggcc ggcttccggg cccttctctc tgccccctgg
601 tacctgaacc gtatatccta tggccctgac tggaaggatt tctacgtagt ggaacccctg
661 gcatttgaag gtacccctga gcagaaggct ctggtgattg gtggagaggc ttgtatgtgg
721 ggagaatatg tggacaacac aaacctggtc cccaggctct ggcccagagc aggggctgtt
781 gccgaaaggc tgtggagcaa caagttgaca tctgacctga catttgccta tgaacgtttg
841 tcacacttcc gctgtgagtt gctgaggcga ggtgtccagg cccaacccct caatgtaggc
901 ttctgtgagc aggagtttga acagacctga gccccaggca ccgaggaggg tgctggctgt
961 aggtgaatgg tagtggagcc aggcttccac tgcatcctgg ccaggggacg gagccccttg
1021 ccttcgtgcc ccttgcctgc gtgcccctgt gcttggagag aaaggggccg gtgctggcgc
1081 tcgcattcaa taaagagtaa tgtggcattt ttctataata aacatggatt acctgtgttt
1141 aaaaaaaaaa aaaaaaaaac tcgagactag ttctctccct cgtgcc
//