LOCUS       BC021030                1186 bp    mRNA    linear   SYN 02-SEP-2016
DEFINITION  Synthetic construct Homo sapiens hexosaminidase A (alpha
            polypeptide), mRNA (cDNA clone IMAGE:3626334), partial cds.
ACCESSION   BC021030
VERSION     BC021030.1
KEYWORDS    .
SOURCE      synthetic construct
  ORGANISM  synthetic construct
            other sequences; artificial sequences.
REFERENCE   1  (bases 1 to 1186)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1186)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 24 Row: k Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis.
FEATURES             Location/Qualifiers
     source          1..1186
                     /db_xref="H-InvDB:HIT000258964"
                     /organism="synthetic construct"
                     /mol_type="mRNA"
                     /isolation_source="Homo sapiens; Uterus, endometrium
                     adenocarcinoma"
                     /db_xref="taxon:32630"
                     /clone="IMAGE:3626334"
                     /clone_lib="NIH_MGC_44"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            <1..1186
                     /gene="HEXA"
                     /gene_synonym="TSD"
                     /db_xref="GeneID:3073"
                     /db_xref="MIM:606869"
     CDS             <1..930
                     /gene="HEXA"
                     /gene_synonym="TSD"
                     /codon_start=1
                     /transl_table=11
                     /product="HEXA protein"
                     /protein_id="AAH21030.1"
                     /db_xref="GeneID:3073"
                     /db_xref="MIM:606869"
                     /translation="RERTGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTL
                     SWGPGIPGLLTPCYSGSEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGD
                     EVDFTCWKSNPEIQDFMRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNK
                     VKIQPDTIIQVWREDIPVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVE
                     PLAFEGTPEQKALVIGGEACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFA
                     YERLSHFRCELLRRGVQAQPLNVGFCEQEFEQT"
     misc_feature    7..804
                     /gene="HEXA"
                     /gene_synonym="TSD"
                     /note="Glyco_hydro_20; Region: Glycosyl hydrolase family
                     20, catalytic domain. This domain has a TIM barrel fold"
                     /db_xref="CDD:pfam00728"
BASE COUNT          279 a          291 c          326 g          290 t
ORIGIN      
        1 agagagagaa cggggtccta caaccctgtc acccacatct acacagcaca ggatgtgaag
       61 gaggtcattg aatacgcacg gctccggggt atccgtgtgc ttgcagagtt tgacactcct
      121 ggccacactt tgtcctgggg accaggtatc cctggattac tgactccttg ctactctggg
      181 tctgagccct ctggcacctt tggaccagtg aatcccagtc tcaataatac ctatgagttc
      241 atgagcacat tcttcttaga agtcagctct gtcttcccag atttttatct tcatcttgga
      301 ggagatgagg ttgatttcac ctgctggaag tccaacccag agatccagga ctttatgagg
      361 aagaaaggct tcggtgagga cttcaagcag ctggagtcct tctacatcca gacgctgctg
      421 gacatcgtct cttcttatgg caagggctat gtggtgtggc aggaggtgtt tgataataaa
      481 gtaaagattc agccagacac aatcatacag gtgtggcgag aggatattcc agtgaactat
      541 atgaaggagc tggaactggt caccaaggcc ggcttccggg cccttctctc tgccccctgg
      601 tacctgaacc gtatatccta tggccctgac tggaaggatt tctacgtagt ggaacccctg
      661 gcatttgaag gtacccctga gcagaaggct ctggtgattg gtggagaggc ttgtatgtgg
      721 ggagaatatg tggacaacac aaacctggtc cccaggctct ggcccagagc aggggctgtt
      781 gccgaaaggc tgtggagcaa caagttgaca tctgacctga catttgccta tgaacgtttg
      841 tcacacttcc gctgtgagtt gctgaggcga ggtgtccagg cccaacccct caatgtaggc
      901 ttctgtgagc aggagtttga acagacctga gccccaggca ccgaggaggg tgctggctgt
      961 aggtgaatgg tagtggagcc aggcttccac tgcatcctgg ccaggggacg gagccccttg
     1021 ccttcgtgcc ccttgcctgc gtgcccctgt gcttggagag aaaggggccg gtgctggcgc
     1081 tcgcattcaa taaagagtaa tgtggcattt ttctataata aacatggatt acctgtgttt
     1141 aaaaaaaaaa aaaaaaaaac tcgagactag ttctctccct cgtgcc
//