LOCUS       BC047468                4297 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase 7 (GalNAc-T7), mRNA (cDNA clone
            MGC:50994 IMAGE:5271749), complete cds.
ACCESSION   BC047468
VERSION     BC047468.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4297)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4297)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 91 Row: d Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 8393408.
FEATURES             Location/Qualifiers
     source          1..4297
                     /db_xref="H-InvDB:HIT000053139"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:50994 IMAGE:5271749"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..4297
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
     CDS             59..2032
                     /gene="GALNT7"
                     /gene_synonym="GALNAC-T7"
                     /gene_synonym="GalNAcT7"
                     /codon_start=1
                     /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
                     N-acetylgalactosaminyltransferase 7 (GalNAc-T7)"
                     /protein_id="AAH47468.1"
                     /db_xref="GeneID:51809"
                     /db_xref="HGNC:HGNC:4129"
                     /db_xref="MIM:605005"
                     /translation="MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRED
                     RDVNDPMPNRGGNGLAPGEDRFKPVVPWPHVEGVEVDLESIRRINKAKNEQEHHAGGD
                     SQKDIMQRQYLTFKPQTFTYHDPVLRPGILGNFEPKEPEPPGVVGGPGEKAKPLVLGP
                     EFKQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNE
                     GWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNER
                     REGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVIN
                     GNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLF
                     AIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYRLEGWQGN
                     PPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCK
                     SFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMG
                     GNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEWQYFKNLHRFTHIPSGK
                     CLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV"
BASE COUNT     1345 a    763 c    890 g   1299 t
ORIGIN
        1 tggggagagc ggtggcggcg gctgcgccgg gctgtgagtc tctcgccgcc ggaggaagat
       61 gaggctgaag attggattca tcttacgcag tttgctggtg gtgggaagct tcctggggct
      121 agtggtcctc tggtcttccc tgaccccgcg gccggacgac ccaagcccgc tgagcaggat
      181 gagggaagac agagatgtca atgaccccat gcccaaccga ggcggcaatg gactagctcc
      241 tggggaggac agattcaaac ctgtggtacc atggcctcat gttgaaggag tagaagtgga
      301 cttagagtct attagaagaa taaacaaggc caaaaatgaa caagagcacc atgctggagg
      361 agattcccag aaagatatca tgcagaggca gtatctcaca tttaagcctc agacattcac
      421 ctaccatgat cctgtgcttc gcccagggat cctcggtaac tttgaaccca aagaacctga
      481 gcctcctgga gtggttggtg gccctggaga gaaagccaag ccattggttt tgggaccaga
      541 attcaaacaa gcaattcaag ccagcattaa agagtttgga tttaacatgg tggcaagtga
      601 catgatctca ctggaccgca gcgtcaatga cttacgccaa gaagaatgca agtattggca
      661 ttatgatgaa aacttgctca cttcgagcgt tgtcattgtc ttccataatg aaggatggtc
      721 aaccctcatg agaacagtcc acagtgtaat taaaaggact ccaaggaaat atttagcaga
      781 aattgtgtta attgacgatt tcagtaataa agaacactta aaagaaaaac tggatgaata
      841 tattaagctg tggaatggcc tagtgaaggt atttcgaaat gaaagaaggg aaggtttaat
      901 tcaagcacga agtattggtg ctcagaaggc taaacttgga caggttttga tataccttga
      961 tgcccactgt gaggtggcag ttaactggta tgcaccactt gtagctccca tatctaagga
     1021 cagaaccatt tgcactgtgc cgcttataga tgtcataaat ggcaacacat atgaaattat
     1081 accccaaggg ggtggtgatg aagatgggta tgcccgagga gcatgggatt ggagtatgct
     1141 ctggaaacgg gtgcctctga cccctcaaga gaagagactg agaaagacaa aaactgaacc
     1201 gtatcggtcc ccagccatgg ctgggggatt atttgccatt gaacgagagt tcttctttga
     1261 attgggtctc tatgatccag gtctccagat ttggggtggt gaaaactttg agatctcata
     1321 caagatatgg cagtgtggtg gcaaattatt atttgttcct tgttctcgtg ttggacatat
     1381 ctaccgtctt gagggctggc aaggaaatcc tccgcccatt tatgttgggt cttctccaac
     1441 tctgaagaat tatgttagag ttgtggaggt ttggtgggat gaatataaag actacttcta
     1501 tgctagtcgt cctgaatcgc aggcattacc atatggggat atatcggagc tgaaaaaatt
     1561 tcgagaagat cacaactgca aaagttttaa gtggttcatg gaagaaatag cttatgatat
     1621 cacctcacac taccctttgc cacccaaaaa tgttgactgg ggagaaatca gaggcttcga
     1681 aactgcttac tgcattgata gcatgggaaa aacaaatgga ggctttgttg aactaggacc
     1741 ctgccacagg atgggaggga atcagctttt cagaatcaat gaagcaaatc aactcatgca
     1801 gtatgaccag tgtttgacaa agggagctga tggatcaaaa gttatgatta cacactgtaa
     1861 tctaaatgaa tttaaggaat ggcagtactt caagaacctg cacagattta ctcatattcc
     1921 ttcaggaaag tgtttagatc gctcagaggt cctgcatcaa gtattcatct ccaattgtga
     1981 ctccagtaaa acgactcaaa aatgggaaat gaataacatc catagtgttt agagagaaaa
     2041 aataaaccaa taacctacct actgacaagt aaatttatac aggactgaaa accgcctgaa
     2101 acctgctgca actattgtta ttaactctgt atagctccaa acctggaacc tcctgatcag
     2161 tttgaaggac attgataaac tgtgatttta caataacatt atcatctgca gttactgttt
     2221 acaagactgc ttttacctta aaccttgtag atgtttacat ctttttgttg tgttttaaga
     2281 tgatgttggt aatttgtgcc tttagctctg ttttattaga cagagttaaa gcatgttgtc
     2341 ttctttggga ttacactcag gggtctgaaa ggcagtttga tttttatttt taacacactt
     2401 gaaaaaaggt tggagtagcc agactttcat atataacttg gtgattatca acctgttgtg
     2461 tctttattta attttacatc tttttgaagc actgccacag gttattagcc aaggtggcct
     2521 tccttcacag tcatgctgct tttttgaaag gtgaatttca acacatttag tgcctctttc
     2581 atttctcagt atatatttca agagcttgtg atgaaatcta taggatggta atgatggact
     2641 tgtcacctgt atggggaata cttttactac tcagaaatga atttatgtgc tgccatttgc
     2701 tataaagttg aactttgtat ggcttgaaaa agaaatgaca atatggaaca tcccaaggct
     2761 gtcccatagg gttggaagtt gtgtagcatt cactccctta cctactggca ttcccagtgc
     2821 cctctgtcca tacctacttc taggattgca aaggagtctt ccaactagag aaaaattgtc
     2881 cactgacatt tgggatttac ttttctccaa tacctgccaa tacagaaaac tattatcagt
     2941 tgttattgtt atcccttgaa agcgagggtg acaaaaacaa caaaacaccg ttataaacac
     3001 atcaaaggtt cattctgact gaggtaagac tttccaagcc cttgttagat taggccttat
     3061 aaaacttgtg tgcattataa cctaagctgt gcaacctgtg aagccaagag tgaactgatg
     3121 tttcatttat attttcatcc aaatgacatt atctgcacgt ttttaaaatt taaaaacaaa
     3181 ggactattta aaaatacagt ttattaacaa acgtgaacta ctttctgtta cattaggtgt
     3241 tccctagtgt ttcttaattt ctttttagaa agtgtatttt tattagtatt tttccggtga
     3301 acagaagatt tgtttggatt taaacattta ctaagacagt acctattagg aaaaccaaat
     3361 attgcaaatg gtcaattcga ttttaatttc tcaaaagata ctctgttatc cagaagatta
     3421 aaatgcctac attgagtgct taaaaaaaaa aaaaacaact gtgatgatgt gagcagaatg
     3481 gcaagtaagt taagcatttt tgatcctgta atcatggtat cattacaatg aaaggaattc
     3541 acaaactact gccagaggaa gtttgttttt taatttaaga gggaaatata acctataaat
     3601 ttgtttcttc caagcttagc tcttaaattt ggagactcaa agttaaacat cctcaacaga
     3661 gttttattta taattttgaa ttgtcaattt gtattttgct actgatctgt gatcaaccat
     3721 tttaactttc atctctaggg atgtttaaca tttataattg caaaataaac caactataaa
     3781 aaaagaaact aagagagaat tggtacttta attacttgtg tgtttgcaaa taggctccat
     3841 tttccatgtt gagtagatta taaccttatt aactatgcat aggcctaaga aaggtggcaa
     3901 tgaactgtgc atgtaaattt taaatgggta ctttgtgcaa ttcgttaaaa gaagatactc
     3961 tatgaatatg attctatata ttgaaatcag aaaacctacc aaacaaaaac atcagaagct
     4021 gctgccataa tgactatttt ctactgtagg ctgctttgga aataattccc atatccttgc
     4081 tttgtaagtt ggtaatatca ctatgcattt ctacacattt tataaatttg atttatgcag
     4141 attttgatac actgtatgtt tctgtagaaa ttgtataaat attcaaaatt ttattaggat
     4201 aaatttgaga aacttacgta tatcttaatt ctgggttgct tgttttttag gtgacaaaaa
     4261 taaaatattg tattttaatc caaaaaaaaa aaaaaaa
//