LOCUS       BC012452                1248 bp    mRNA    linear   HUM 08-SEP-2006
DEFINITION  Homo sapiens heparan-alpha-glucosaminide N-acetyltransferase, mRNA
            (cDNA clone IMAGE:3880903), complete cds.
ACCESSION   BC012452
VERSION     BC012452.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1248)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1248)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 21 Row: n Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis, Similarity but not identity to protein
            This clone has the following problem: The cds is short compared to
            the longest cds in the locus.
FEATURES             Location/Qualifiers
     source          1..1248
                     /db_xref="H-InvDB:HIT000035821"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3880903"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_68"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1248
                     /gene="HGSNAT"
                     /gene_synonym="FLJ22242"
                     /gene_synonym="FLJ32731"
                     /gene_synonym="HGNAT"
                     /db_xref="GeneID:138050"
                     /db_xref="HGNC:HGNC:26527"
     CDS             201..821
                     /gene="HGSNAT"
                     /gene_synonym="FLJ22242"
                     /gene_synonym="FLJ32731"
                     /gene_synonym="HGNAT"
                     /codon_start=1
                     /product="HGSNAT protein"
                     /protein_id="AAH12452.1"
                     /db_xref="GeneID:138050"
                     /db_xref="HGNC:HGNC:26527"
                     /translation="MALGLCRCFHPRHSMAAFGLFPALPSALNSHPACTCLLDPSTWR
                     PAHVSGPALASSPQILSVFSLGFPGFVNGSCVSRYKPDIIFPPGLPPPDLPSSVSIFY
                     LQLLCSHGHCCITESGPLLSFSNWPPSLVPHFLKSPVHCHQIKLSPARSPLSEKPPLT
                     WKHHCLAHILTYSPSRLDPHTSFQPPLPLHSLLPPPPPHPLVSPPL"
BASE COUNT          251 a          401 c          220 g          376 t
ORIGIN      
        1 aatggaggca gcgttcctac ttgtcatcac acagctgaag acattgtttc ttaggtgtga
       61 aatcggggac aaaggacaaa cagagacaca cggcattgtt catgggaggc atcgtcaccc
      121 tcctgggtgt tctgtgggaa tttcctgtgt gaggaaaacg tggccacagg gttgtgctgt
      181 acccaccctt ccccggcgag atggccctcg gcctgtgccg ctgcttccac cctcgccact
      241 ccatggcagc ttttggtctg tttccggctc tgccctctgc cctgaactct catccggctt
      301 gtacctgcct gctggacccc tccacctgga ggccagccca tgtctcaggc ccagccctag
      361 cctcttctcc tcaaattcta agtgttttct ctttaggttt ccctggcttt gtgaatggat
      421 catgtgtctc taggtataaa cctgacatca tctttccacc cggcttacct ccaccagatc
      481 tccccagttc tgtctccatc ttctacctgc agctgctctg ttctcatggt cactgctgca
      541 tcactgagtc tggacccttg ttatcatttt caaactggcc tccttccctc gttccccact
      601 tcttaaagtc acctgtccat tgccaccaga ttaagctttc tccagccaga tcacctctct
      661 ctgagaaacc tccattgaca tggaaacacc attgtctggc acacatactc acatactcac
      721 cttcccgtct tgatccccac acatctttcc agcctcccct cccactccac tccctgctcc
      781 ctcctccacc tccccatcct cttgtctccc ctcccctctg aatccagccc agcggggctt
      841 ctcctgcctc catcacatca cagaagtacc tcctgcttct ggttttaatt agagccttcc
      901 ccgattacat tttcctctga attttttcct atctacattt gatctgtcat gtttaaaccc
      961 cctacttcta agggaacttc tctaatctct tatcctcatc cccaaatagt gttttcttcc
     1021 tctgggttct tataatgttg gtatcaatct cacagcattt agtgcttcct gcctggtgtg
     1081 acagttacct gtgtgcatgt gcaatttcta atttcccacg ctagactgtg agcttcctaa
     1141 ggcaagaatc atgccttgtt ggtttctgta ttcctcatgg tgccaaacac agtgccttct
     1201 acattgcagg cgctgaataa acatttttaa agcaaaaaaa aaaaaaaa
//