LOCUS       BC047468                4297 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase 7 (GalNAc-T7), mRNA (cDNA clone
            MGC:50994 IMAGE:5271749), complete cds.
ACCESSION   BC047468
VERSION     BC047468.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4297)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4297)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 91 Row: d Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 8393408.
FEATURES             Location/Qualifiers
     source          1..4297
                     /db_xref="H-InvDB:HIT000053139"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:50994 IMAGE:5271749"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..4297
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
     CDS             59..2032
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /codon_start=1
                     /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
                     N-acetylgalactosaminyltransferase 7 (GalNAc-T7)"
                     /protein_id="AAH47468.1"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
                     /translation="MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRED
                     RDVNDPMPNRGGNGLAPGEDRFKPVVPWPHVEGVEVDLESIRRINKAKNEQEHHAGGD
                     SQKDIMQRQYLTFKPQTFTYHDPVLRPGILGNFEPKEPEPPGVVGGPGEKAKPLVLGP
                     EFKQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNE
                     GWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNER
                     REGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVIN
                     GNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLF
                     AIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYRLEGWQGN
                     PPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCK
                     SFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMG
                     GNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEWQYFKNLHRFTHIPSGK
                     CLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV"
BASE COUNT         1345 a          763 c          890 g         1299 t
ORIGIN      
        1 tggggagagc ggtggcggcg gctgcgccgg gctgtgagtc tctcgccgcc ggaggaagat
       61 gaggctgaag attggattca tcttacgcag tttgctggtg gtgggaagct tcctggggct
      121 agtggtcctc tggtcttccc tgaccccgcg gccggacgac ccaagcccgc tgagcaggat
      181 gagggaagac agagatgtca atgaccccat gcccaaccga ggcggcaatg gactagctcc
      241 tggggaggac agattcaaac ctgtggtacc atggcctcat gttgaaggag tagaagtgga
      301 cttagagtct attagaagaa taaacaaggc caaaaatgaa caagagcacc atgctggagg
      361 agattcccag aaagatatca tgcagaggca gtatctcaca tttaagcctc agacattcac
      421 ctaccatgat cctgtgcttc gcccagggat cctcggtaac tttgaaccca aagaacctga
      481 gcctcctgga gtggttggtg gccctggaga gaaagccaag ccattggttt tgggaccaga
      541 attcaaacaa gcaattcaag ccagcattaa agagtttgga tttaacatgg tggcaagtga
      601 catgatctca ctggaccgca gcgtcaatga cttacgccaa gaagaatgca agtattggca
      661 ttatgatgaa aacttgctca cttcgagcgt tgtcattgtc ttccataatg aaggatggtc
      721 aaccctcatg agaacagtcc acagtgtaat taaaaggact ccaaggaaat atttagcaga
      781 aattgtgtta attgacgatt tcagtaataa agaacactta aaagaaaaac tggatgaata
      841 tattaagctg tggaatggcc tagtgaaggt atttcgaaat gaaagaaggg aaggtttaat
      901 tcaagcacga agtattggtg ctcagaaggc taaacttgga caggttttga tataccttga
      961 tgcccactgt gaggtggcag ttaactggta tgcaccactt gtagctccca tatctaagga
     1021 cagaaccatt tgcactgtgc cgcttataga tgtcataaat ggcaacacat atgaaattat
     1081 accccaaggg ggtggtgatg aagatgggta tgcccgagga gcatgggatt ggagtatgct
     1141 ctggaaacgg gtgcctctga cccctcaaga gaagagactg agaaagacaa aaactgaacc
     1201 gtatcggtcc ccagccatgg ctgggggatt atttgccatt gaacgagagt tcttctttga
     1261 attgggtctc tatgatccag gtctccagat ttggggtggt gaaaactttg agatctcata
     1321 caagatatgg cagtgtggtg gcaaattatt atttgttcct tgttctcgtg ttggacatat
     1381 ctaccgtctt gagggctggc aaggaaatcc tccgcccatt tatgttgggt cttctccaac
     1441 tctgaagaat tatgttagag ttgtggaggt ttggtgggat gaatataaag actacttcta
     1501 tgctagtcgt cctgaatcgc aggcattacc atatggggat atatcggagc tgaaaaaatt
     1561 tcgagaagat cacaactgca aaagttttaa gtggttcatg gaagaaatag cttatgatat
     1621 cacctcacac taccctttgc cacccaaaaa tgttgactgg ggagaaatca gaggcttcga
     1681 aactgcttac tgcattgata gcatgggaaa aacaaatgga ggctttgttg aactaggacc
     1741 ctgccacagg atgggaggga atcagctttt cagaatcaat gaagcaaatc aactcatgca
     1801 gtatgaccag tgtttgacaa agggagctga tggatcaaaa gttatgatta cacactgtaa
     1861 tctaaatgaa tttaaggaat ggcagtactt caagaacctg cacagattta ctcatattcc
     1921 ttcaggaaag tgtttagatc gctcagaggt cctgcatcaa gtattcatct ccaattgtga
     1981 ctccagtaaa acgactcaaa aatgggaaat gaataacatc catagtgttt agagagaaaa
     2041 aataaaccaa taacctacct actgacaagt aaatttatac aggactgaaa accgcctgaa
     2101 acctgctgca actattgtta ttaactctgt atagctccaa acctggaacc tcctgatcag
     2161 tttgaaggac attgataaac tgtgatttta caataacatt atcatctgca gttactgttt
     2221 acaagactgc ttttacctta aaccttgtag atgtttacat ctttttgttg tgttttaaga
     2281 tgatgttggt aatttgtgcc tttagctctg ttttattaga cagagttaaa gcatgttgtc
     2341 ttctttggga ttacactcag gggtctgaaa ggcagtttga tttttatttt taacacactt
     2401 gaaaaaaggt tggagtagcc agactttcat atataacttg gtgattatca acctgttgtg
     2461 tctttattta attttacatc tttttgaagc actgccacag gttattagcc aaggtggcct
     2521 tccttcacag tcatgctgct tttttgaaag gtgaatttca acacatttag tgcctctttc
     2581 atttctcagt atatatttca agagcttgtg atgaaatcta taggatggta atgatggact
     2641 tgtcacctgt atggggaata cttttactac tcagaaatga atttatgtgc tgccatttgc
     2701 tataaagttg aactttgtat ggcttgaaaa agaaatgaca atatggaaca tcccaaggct
     2761 gtcccatagg gttggaagtt gtgtagcatt cactccctta cctactggca ttcccagtgc
     2821 cctctgtcca tacctacttc taggattgca aaggagtctt ccaactagag aaaaattgtc
     2881 cactgacatt tgggatttac ttttctccaa tacctgccaa tacagaaaac tattatcagt
     2941 tgttattgtt atcccttgaa agcgagggtg acaaaaacaa caaaacaccg ttataaacac
     3001 atcaaaggtt cattctgact gaggtaagac tttccaagcc cttgttagat taggccttat
     3061 aaaacttgtg tgcattataa cctaagctgt gcaacctgtg aagccaagag tgaactgatg
     3121 tttcatttat attttcatcc aaatgacatt atctgcacgt ttttaaaatt taaaaacaaa
     3181 ggactattta aaaatacagt ttattaacaa acgtgaacta ctttctgtta cattaggtgt
     3241 tccctagtgt ttcttaattt ctttttagaa agtgtatttt tattagtatt tttccggtga
     3301 acagaagatt tgtttggatt taaacattta ctaagacagt acctattagg aaaaccaaat
     3361 attgcaaatg gtcaattcga ttttaatttc tcaaaagata ctctgttatc cagaagatta
     3421 aaatgcctac attgagtgct taaaaaaaaa aaaaacaact gtgatgatgt gagcagaatg
     3481 gcaagtaagt taagcatttt tgatcctgta atcatggtat cattacaatg aaaggaattc
     3541 acaaactact gccagaggaa gtttgttttt taatttaaga gggaaatata acctataaat
     3601 ttgtttcttc caagcttagc tcttaaattt ggagactcaa agttaaacat cctcaacaga
     3661 gttttattta taattttgaa ttgtcaattt gtattttgct actgatctgt gatcaaccat
     3721 tttaactttc atctctaggg atgtttaaca tttataattg caaaataaac caactataaa
     3781 aaaagaaact aagagagaat tggtacttta attacttgtg tgtttgcaaa taggctccat
     3841 tttccatgtt gagtagatta taaccttatt aactatgcat aggcctaaga aaggtggcaa
     3901 tgaactgtgc atgtaaattt taaatgggta ctttgtgcaa ttcgttaaaa gaagatactc
     3961 tatgaatatg attctatata ttgaaatcag aaaacctacc aaacaaaaac atcagaagct
     4021 gctgccataa tgactatttt ctactgtagg ctgctttgga aataattccc atatccttgc
     4081 tttgtaagtt ggtaatatca ctatgcattt ctacacattt tataaatttg atttatgcag
     4141 attttgatac actgtatgtt tctgtagaaa ttgtataaat attcaaaatt ttattaggat
     4201 aaatttgaga aacttacgta tatcttaatt ctgggttgct tgttttttag gtgacaaaaa
     4261 taaaatattg tattttaatc caaaaaaaaa aaaaaaa
//