LOCUS       BC046129                2054 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase 7 (GalNAc-T7), mRNA (cDNA clone
            MGC:57606 IMAGE:4442080), complete cds.
ACCESSION   BC046129
VERSION     BC046129.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2054)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2054)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (31-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 107 Row: b Column: 16
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 8393408.
FEATURES             Location/Qualifiers
     source          1..2054
                     /db_xref="H-InvDB:HIT000052998"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:57606 IMAGE:4442080"
                     /tissue_type="Liver, adenocarcinoma"
                     /clone_lib="NIH_MGC_90"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2054
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
     CDS             62..2035
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /codon_start=1
                     /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
                     N-acetylgalactosaminyltransferase 7 (GalNAc-T7)"
                     /protein_id="AAH46129.1"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
                     /translation="MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRED
                     RDVNDPMPNRGGNGLAPGEDRFKPVVPWPHVEGVEVDLESIRRINKAKNEQEHHAGGD
                     SQKDIMQRQYLTFKPQTFTYHDPVLRPGILGNFEPKEPEPPGVVGGPGEKAKPLVLGP
                     EFKQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNE
                     GWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNER
                     REGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVIN
                     GNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLF
                     AIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYRLEGWQGN
                     PPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCK
                     SFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMG
                     GNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEWQYFKNLHRFTHIPSGK
                     CLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV"
BASE COUNT          621 a          401 c          524 g          508 t
ORIGIN      
        1 gggaggggag agcggtggcg gcggctgcgc cgggctgtga gtctctcgcc gccggaggaa
       61 gatgaggctg aagattggat tcatcttacg cagtttgctg gtggtgggaa gcttcctggg
      121 gctagtggtc ctctggtctt ccctgacccc gcggccggac gacccaagcc cgctgagcag
      181 gatgagggaa gacagagatg tcaatgaccc catgcccaac cgaggcggca atggactagc
      241 tcctggggag gacagattca aacctgtggt accatggcct catgttgaag gagtagaagt
      301 ggacttagag tctattagaa gaataaacaa ggccaaaaat gaacaagagc accatgctgg
      361 aggagattcc cagaaagata tcatgcagag gcagtatctc acatttaagc ctcagacatt
      421 cacctaccat gatcctgtgc ttcgcccagg gatcctcggt aactttgaac ccaaagaacc
      481 tgagcctcct ggagtggttg gtggccctgg agagaaagcc aagccattgg ttttgggacc
      541 agaattcaaa caagcaattc aagccagcat taaagagttt ggatttaaca tggtggcaag
      601 tgacatgatc tcactggacc gcagcgtcaa tgacttacgc caagaagaat gcaagtattg
      661 gcattatgat gaaaacttgc tcacttcgag cgttgtcatt gtcttccata atgaaggatg
      721 gtcaaccctc atgagaacag tccacagtgt aattaaaagg actccaagga aatatttagc
      781 agaaattgtg ttaattgacg atttcagtaa taaagaacac ttaaaagaaa aactggatga
      841 atatattaag ctgtggaatg gcctagtgaa ggtatttcga aatgaaagaa gggaaggttt
      901 aattcaagca cgaagtattg gtgctcagaa ggctaaactt ggacaggttt tgatatacct
      961 tgatgcccac tgtgaggtgg cagttaactg gtatgcacca cttgtagctc ccatatctaa
     1021 ggacagaacc atttgcactg tgccgcttat agatgtcata aatggcaaca catatgaaat
     1081 tataccccaa gggggtggtg atgaagatgg gtatgcccga ggagcatggg attggagtat
     1141 gctctggaaa cgggtgcctc tgacccctca agagaagaga ctgagaaaga caaaaactga
     1201 accgtatcgg tccccagcca tggctggggg attatttgcc attgaacgag agttcttctt
     1261 tgaattgggt ctctatgatc caggtctcca gatttggggt ggtgaaaact ttgagatctc
     1321 atacaagata tggcagtgtg gtggcaaatt attatttgtt ccttgttctc gtgttggaca
     1381 tatctaccgt cttgagggct ggcaaggaaa tcctccgccc atttatgttg ggtcttctcc
     1441 aactctgaag aattatgtta gagttgtgga ggtttggtgg gatgaatata aagactactt
     1501 ctatgctagt cgtcctgaat cgcaggcatt accatatggg gatatatcgg agctgaaaaa
     1561 atttcgagaa gatcacaact gcaaaagttt taagtggttc atggaagaaa tagcttatga
     1621 tatcacctca cactaccctt tgccacccaa aaatgttgac tggggagaaa tcagaggctt
     1681 cgaaactgct tactgcattg atagcatggg aaaaacaaat ggaggctttg ttgaactagg
     1741 accctgccac aggatgggag ggaatcagct tttcagaatc aatgaagcaa atcaactcat
     1801 gcagtatgac cagtgtttga caaagggagc tgatggatca aaagttatga ttacacactg
     1861 taatctaaat gaatttaagg aatggcagta cttcaagaac ctgcacagat ttactcatat
     1921 tccttcagga aagtgtttag atcgctcaga ggtcctgcat caagtattca tctccaattg
     1981 tgactccagt aaaacgactc aaaaatggga aatgaataac atccatagtg tttagagaga
     2041 aaaaaaaaaa aaaa
//