LOCUS       BC000095                1865 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens N-acetylgalactosaminidase, alpha-, mRNA (cDNA clone
            MGC:2280 IMAGE:3504221), complete cds.
ACCESSION   BC000095
VERSION     BC000095.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1865)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1865)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (03-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC000095.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: o Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4557780.
FEATURES             Location/Qualifiers
     source          1..1865
                     /db_xref="H-InvDB:HIT000029341"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2280 IMAGE:3504221"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1865
                     /gene="NAGA"
                     /gene_synonym="D22S674"
                     /gene_synonym="GALB"
                     /db_xref="GeneID:4668"
                     /db_xref="HGNC:HGNC:7631"
                     /db_xref="MIM:104170"
     CDS             82..1317
                     /gene="NAGA"
                     /gene_synonym="D22S674"
                     /gene_synonym="GALB"
                     /codon_start=1
                     /product="N-acetylgalactosaminidase, alpha-"
                     /protein_id="AAH00095.1"
                     /db_xref="GeneID:4668"
                     /db_xref="HGNC:HGNC:7631"
                     /db_xref="MIM:104170"
                     /translation="MLLKTVLLLGHVAQVLMLDNGLLQTPPMGWLAWERFRCNINCDE
                     DPKNCISEQLFMEMADRMAQDGWRDMGYTYLNIDDCWIGGRDASGRLMPDPKRFPHGI
                     PFLADYVHSLGLKLGIYADMGNFTCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFS
                     TPEERAQGYPKMAAALNATGRPIAFSCSWPAYEGGLPPRVNYSLLADICNLWRNYDDI
                     QDSWWSVLSILNWFVEHQDILQPVAGPGHWNDPDMLLIGNFGLSLEQSRAQMALWTVL
                     AAPLLMSTDLRTISAQNMDILQNPLMIKINQDPLGIQGRRIHKEKSLIEVYMRPLSNK
                     ASALVFFSCRTDMPYRYHSSLGQLNFTGSVIYEAQDVYSGDIISGLRDETNFTVIINP
                     SGVVMWYLYPIKNLEMSQQ"
BASE COUNT          434 a          543 c          485 g          403 t
ORIGIN      
        1 cttcgcttcc agagcccaac acatacagct gatacacgca gaccagatct ggtcaggtcc
       61 tcggaagctg agtccagagc gatgctgctg aagacagtgc tcttgctggg acatgtggcc
      121 caggtgctga tgctggacaa tgggctcctg cagacaccac ccatgggctg gctggcctgg
      181 gaacgcttcc gctgcaacat taactgtgat gaggacccaa agaactgcat aagtgaacag
      241 ctcttcatgg agatggctga ccggatggca caggatggat ggcgggacat gggctacaca
      301 tacctcaaca ttgatgactg ctggatcggt ggtcgcgatg ccagtggccg cctgatgcca
      361 gatcccaagc gcttccctca tggcattcct ttcctggctg actacgttca ctccctgggc
      421 ctgaagttgg gtatctacgc ggacatgggc aacttcacct gcatgggtta cccaggcacc
      481 acactggaca aggtggtcca ggatgctcag accttcgccg agtggaaggt agacatgctc
      541 aagctggatg gctgcttctc cacccccgag gagcgggccc aggggtaccc caagatggct
      601 gctgccctga atgccacagg ccgccccatc gccttctcct gcagctggcc agcctatgaa
      661 ggcggcctcc ccccaagggt gaactacagt ctgctggcgg acatctgcaa cctctggcgt
      721 aactatgatg acatccagga ctcctggtgg agcgtgctct ccatcctgaa ttggttcgtg
      781 gagcaccagg acatactgca gccagtggcc ggccctgggc actggaatga ccctgacatg
      841 ctgctcattg ggaactttgg tctcagctta gagcaatccc gggcccagat ggccctgtgg
      901 acggtgctgg cagcccccct cttgatgtcc acagacctgc gtaccatctc cgcccagaac
      961 atggacattc tgcagaatcc actcatgatc aaaatcaacc aggatccctt aggcatccag
     1021 ggacgcagga ttcacaagga aaaatctctc atcgaagtgt acatgcggcc tctgtccaac
     1081 aaggctagcg ccttagtctt cttcagctgc aggaccgata tgccttatcg ctaccactcc
     1141 tcccttggcc agctgaactt caccgggtct gtgatatatg aggcccagga cgtctactca
     1201 ggtgacatca tcagtggcct ccgagatgaa accaacttca cagtgatcat caacccttca
     1261 ggggtagtga tgtggtacct gtatcccatc aagaacctgg agatgtccca gcagtgagga
     1321 gctgggacat gtgacaggct gtggtggcac cactgagcct agaccatgga gccttggcat
     1381 gcccagggca agtggggagg ttctctgctc cccaggcctg ctcggtgact gaccccatca
     1441 tacccaaagt gcaatctcac ggccaggttc tatgccctgt ccaagcgtaa accctcttgg
     1501 aaacttcttt tggggcaatt ttcctgtggc cttcctggcc tctacttcca tgtgcgcagc
     1561 cccacagacg ttgctgagca actcgccagc ctcctgagct ccatgcccat caggactcta
     1621 gcctctgacc ttgctgttga ctctgaaatc aggatttgga agttttcgaa ttaggagtag
     1681 agagatctga cctcttgcca ggaatgccca tggatcatgt gattggcttt tctacccata
     1741 gagggccttg cagcctgata ccacctggga gtgagggtca caaaggagac cttggctccc
     1801 tcaggtcacc aataaacctg ttctttaatc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1861 aaaaa
//