LOCUS       BC041120                4464 bp    mRNA    linear   HUM 28-JUL-2005
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase 2 (GalNAc-T2), mRNA (cDNA clone
            MGC:47616 IMAGE:5553465), complete cds.
ACCESSION   BC041120
VERSION     BC041120.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4464)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4464)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Lou Staudt
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 82 Row: g Column: 15.
FEATURES             Location/Qualifiers
     source          1..4464
                     /db_xref="H-InvDB:HIT000052511"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:47616 IMAGE:5553465"
                     /tissue_type="Lymph, lymphoma"
                     /clone_lib="NIH_MGC_85"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4464
                     /gene="GALNT2"
                     /gene_synonym="GalNAc-T2"
                     /db_xref="GeneID:2590"
                     /db_xref="MIM:602274"
     CDS             73..1788
                     /gene="GALNT2"
                     /gene_synonym="GalNAc-T2"
                     /codon_start=1
                     /product="GALNT2 protein"
                     /protein_id="AAH41120.1"
                     /db_xref="GeneID:2590"
                     /db_xref="MIM:602274"
                     /translation="MRRRSRMLLCFAFLWVLGIAYYMYSGGGSALAGGAGGGAGRKED
                     WNEIDPIKKKDLHHSNGEEKAQSMETLPPGKVRWPDFNQEAYVGGTMVRSGQDPYARN
                     KFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVL
                     KKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAK
                     VLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDW
                     NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGE
                     NLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEY
                     KNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL
                     QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI
                     KLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKFT
                     LNLQQ"
BASE COUNT          975 a         1193 c         1291 g         1005 t
ORIGIN      
        1 ccaccgcgcc cgcggccggc ccaggcagca ctcgcgagca gcggcggccc cgccggcggc
       61 cgagttggga gaatgcggcg gcgctcgcgg atgctgctct gcttcgcctt cctgtgggtg
      121 ctgggcatcg cctactacat gtactcgggg ggcggctctg cgctggccgg gggcgcgggc
      181 ggcggcgccg gcaggaagga ggactggaat gaaattgacc ccattaaaaa gaaagacctt
      241 catcacagca atggagaaga gaaagcacaa agcatggaga ccctccctcc agggaaagta
      301 cggtggccag actttaacca ggaagcttat gttggaggga cgatggtccg ctccgggcag
      361 gacccttacg cccgcaacaa gttcaaccag gtggagagtg ataagcttcg aatggacaga
      421 gccatccctg acacccggca tgaccagtgt cagcggaagc agtggcgggt ggatctgccg
      481 gccaccagcg tggtgatcac gtttcacaat gaagccaggt cggccctact caggaccgtg
      541 gtcagcgtgc ttaagaaaag cccgccccat ctcataaaag aaatcatctt ggtggatgac
      601 tacagcaatg atcctgagga cggggctctc ttggggaaaa ttgagaaagt gcgagttctt
      661 agaaatgatc gacgagaagg cctcatgcgc tcacgggttc ggggggccga tgctgcccaa
      721 gccaaggtcc tgaccttcct ggacagtcac tgcgagtgta atgagcactg gctggagccc
      781 ctcctggaaa gggtggcgga ggacaggact cgggttgtgt cacccatcat cgatgtcatt
      841 aatatggaca actttcagta tgtgggggca tctgctgact tgaagggcgg ttttgattgg
      901 aacttggtat tcaagtggga ttacatgacg cctgagcaga gaaggtcccg gcaggggaac
      961 ccagtcgccc ctataaaaac ccccatgatt gctggtgggc tgtttgtgat ggataagttc
     1021 tattttgaag aactggggaa gtacgacatg atgatggatg tgtggggagg agagaaccta
     1081 gagatctcgt tccgcgtgtg gcagtgtggt ggcagcctgg agatcatccc gtgcagccgt
     1141 gtgggacacg tgttccggaa gcagcacccc tacacgttcc cgggtggcag tggcactgtc
     1201 tttgcccgaa acacccgccg ggcagcagag gtctggatgg atgaatacaa aaatttctat
     1261 tatgcagcag tgccttctgc tagaaacgtt ccttatggaa atattcagag cagattggag
     1321 cttaggaaga aactcagctg caagcctttc aaatggtacc ttgaaaatgt ctatccagag
     1381 ttaagggttc cagaccatca ggatatagct tttggggcct tgcagcaggg aactaactgc
     1441 ctcgacactt tgggacactt tgctgatggt gtggttggag tttatgaatg tcacaatgct
     1501 gggggaaacc aggaatgggc cttgacgaag gagaagtcag tgaagcacat ggatttgtgc
     1561 cttactgtgg tggaccgggc accgggctct cttataaagc tgcagggctg ccgagaaaat
     1621 gacagcagac agaaatggga acagatcgag ggcaactcca agctgaggca cgtgggcagc
     1681 aacctgtgcc tggacagtcg cacggccaag agcgggggcc taagcgtgga ggtgtgtggc
     1741 ccggcccttt cgcagcagtg gaagttcacg ctcaacctgc agcagtagga gggtccggga
     1801 ggccctgccg tcctgtctcc tgcaccattg ggtggagtct ggtgatcaca ttattgatta
     1861 tgtttcttaa actttccgcg aaactaatat acctcagtat tccatcatgg tctgaaagtc
     1921 aaacttcggc aaggcacgga cgactgtgca gacacagcag cggcaagaag cgagaactgc
     1981 cctccccctc ctctcggtgc agcccagccg ggcccccttc cccaggccgg agcgcccctc
     2041 ttccttccag ctttcacttc tgccggctcc gcaactgagt gacacccagc gacaaccgac
     2101 tggggagtgg tagaagcaac tgaacggatg cgtgcgagct gaggacaggg cgggaggagg
     2161 gggcacacat gccccagggg agcgaggaga actcttgaaa tctccatttt caatcccttc
     2221 gaaatcacgt atggtttcca caaagccgag tcgtgtcacg tggcaggttt acgtcaatag
     2281 tccctctctc tgctcctcca ttcgcaagtg tcttcctggg ccagactccc ctccacctca
     2341 tgtacttgct atattgagga tgaagttttc tatggtggga cactaaatat aaagctatat
     2401 agagaaagaa tgtacggtca gttccctatg gtttctgtag atcatcgtca tcttgtatat
     2461 tccccacaaa gccgttcgca gcttccggga gaaggggcca gagcccggtg gggccagttt
     2521 ctcacagagg gaggaggtgg cctttgtccc ctggagcccg atcagccagt tggtgctact
     2581 gctgtggcca gctggggggc ttcctccaga ccaccggcct cggccccggc atccctgttg
     2641 ggcgtcagcc tgagagtccc tactgtgcgt cagaatccac cttgcgtgct gtgcgtatct
     2701 gtgaacctgg agcggttact tattttgaca gatatcactt tgggtctttt tacattaaat
     2761 ttcttttctc taaggaatat aagacatacc ccatagctct gcgtgagcca gcaataccgc
     2821 tgccccctgg cgacagggca gaccaatgat gccaggcagc tgtcacacgc tagtattggc
     2881 ttcattgtga tctgagccct gcacgctggg ccttcagaat taatggccag cagtgtcagg
     2941 gatgagcccg tcagccaggg cacaggcctg gctcacagtc cttcacacct gctggcctgg
     3001 ggagctccag ccaggcagcg agtcctgccc cgcccgcagc tccctcccac accccgcctg
     3061 gccaagatga ctgcttcagg gggctttggg gaaagaatta ggaagggtca gaaccaaaca
     3121 atacctgctc atttacactg aggattcagg gcgggagaca ggagccttgg ggtcctgtta
     3181 aaccacagac agttatgaac tgaaagtcat aacggggaga ggtgcctggc ttctacctgg
     3241 gtgctcagga atgttcctcg tcacccctgc cactctgtgg tcggtgccct gcttcctcct
     3301 ccactcctgg ccgccttctc cagcgccgca cacacagatg ctcggtctca gagaggctgg
     3361 cacggcctgg cagtctgaga aaagcgtcag ttaggcacac ctgcaggccc ctcggtggga
     3421 cagcggcggc cttggagtta ggagccaccc tgggaggttg tgccggtgcc atgctcctcc
     3481 ctgtgtcttg tatgaaaggg gccactgtgt gtcttcctcc ccggcgggag ccccacgtgt
     3541 gtgcactgta ggacagcggc cccgaggtgg aagcctggct ggagggctgc cctataggtc
     3601 ttctcttccc gcctcccctg ccatgcaacc agatgtgttc tgagtgggca gcgtgccccc
     3661 acgctggagt aactccgcac gcttctgtct ttcacggtgg gcgctcgggg ggagcctgag
     3721 gaaaaccccc ttaggtacct gtgcgaggct gtggagtgca ggccagagca gggtgtgcgt
     3781 agcccccagc acccaggttc ttctgtcaga ccctgtgacc tgcgagctgc tactactgta
     3841 aggagggaaa tggatgaatc tggctcgttt taaaatcacg ttttctgacg aatcctttgc
     3901 cccttcacct ttaccccgcc cgcaccccta ggccctctca gccttcctat catcccacgt
     3961 gtctacccag acccttgcgc ggcccatgcc ctgggggcgg cgtcctgtcc ctgagctggg
     4021 aggcggcttt ggatggtccg ggcgtcaaga gcaggggtgg gccggggagg ggtcctttgc
     4081 ggtgagctat gtttacatga cacagtgtgc caaagtgact tactgcggtt gcgttagttt
     4141 ctagtcatca ggactatctc accctcccac tcctgttttt aaaactcaga attctttcct
     4201 aagagccctt cgagcaaagc gtgccgaagt tagttgtctt ctctgtggtg gtcctttctt
     4261 atgtcctcat aaaagctcag atgatggtat ctgtgagtat gttttgcaaa ttcaaaatat
     4321 agtttggtaa tttttttttc cagttgattt ttaaaaagaa ctgctgtaca gagcttgtac
     4381 tttgtccatt ttatagatgg aaaccatcct tgaaaattgt ttaacttaaa taaagagaag
     4441 atactttcta aaaaaaaaaa aaaa
//