LOCUS BC041120 4464 bp mRNA linear HUM 28-JUL-2005 DEFINITION Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2 (GalNAc-T2), mRNA (cDNA clone MGC:47616 IMAGE:5553465), complete cds. ACCESSION BC041120 VERSION BC041120.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4464) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4464) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (13-DEC-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Lou Staudt cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 82 Row: g Column: 15. FEATURES Location/Qualifiers source 1..4464 /db_xref="H-InvDB:HIT000052511" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:47616 IMAGE:5553465" /tissue_type="Lymph, lymphoma" /clone_lib="NIH_MGC_85" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..4464 /gene="GALNT2" /gene_synonym="GalNAc-T2" /db_xref="GeneID:2590" /db_xref="MIM:602274" CDS 73..1788 /gene="GALNT2" /gene_synonym="GalNAc-T2" /codon_start=1 /product="GALNT2 protein" /protein_id="AAH41120.1" /db_xref="GeneID:2590" /db_xref="MIM:602274" /translation="MRRRSRMLLCFAFLWVLGIAYYMYSGGGSALAGGAGGGAGRKED WNEIDPIKKKDLHHSNGEEKAQSMETLPPGKVRWPDFNQEAYVGGTMVRSGQDPYARN KFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVL KKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAK VLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDW NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGE NLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEY KNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI KLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKFT LNLQQ" BASE COUNT 975 a 1193 c 1291 g 1005 t ORIGIN 1 ccaccgcgcc cgcggccggc ccaggcagca ctcgcgagca gcggcggccc cgccggcggc 61 cgagttggga gaatgcggcg gcgctcgcgg atgctgctct gcttcgcctt cctgtgggtg 121 ctgggcatcg cctactacat gtactcgggg ggcggctctg cgctggccgg gggcgcgggc 181 ggcggcgccg gcaggaagga ggactggaat gaaattgacc ccattaaaaa gaaagacctt 241 catcacagca atggagaaga gaaagcacaa agcatggaga ccctccctcc agggaaagta 301 cggtggccag actttaacca ggaagcttat gttggaggga cgatggtccg ctccgggcag 361 gacccttacg cccgcaacaa gttcaaccag gtggagagtg ataagcttcg aatggacaga 421 gccatccctg acacccggca tgaccagtgt cagcggaagc agtggcgggt ggatctgccg 481 gccaccagcg tggtgatcac gtttcacaat gaagccaggt cggccctact caggaccgtg 541 gtcagcgtgc ttaagaaaag cccgccccat ctcataaaag aaatcatctt ggtggatgac 601 tacagcaatg atcctgagga cggggctctc ttggggaaaa ttgagaaagt gcgagttctt 661 agaaatgatc gacgagaagg cctcatgcgc tcacgggttc ggggggccga tgctgcccaa 721 gccaaggtcc tgaccttcct ggacagtcac tgcgagtgta atgagcactg gctggagccc 781 ctcctggaaa gggtggcgga ggacaggact cgggttgtgt cacccatcat cgatgtcatt 841 aatatggaca actttcagta tgtgggggca tctgctgact tgaagggcgg ttttgattgg 901 aacttggtat tcaagtggga ttacatgacg cctgagcaga gaaggtcccg gcaggggaac 961 ccagtcgccc ctataaaaac ccccatgatt gctggtgggc tgtttgtgat ggataagttc 1021 tattttgaag aactggggaa gtacgacatg atgatggatg tgtggggagg agagaaccta 1081 gagatctcgt tccgcgtgtg gcagtgtggt ggcagcctgg agatcatccc gtgcagccgt 1141 gtgggacacg tgttccggaa gcagcacccc tacacgttcc cgggtggcag tggcactgtc 1201 tttgcccgaa acacccgccg ggcagcagag gtctggatgg atgaatacaa aaatttctat 1261 tatgcagcag tgccttctgc tagaaacgtt ccttatggaa atattcagag cagattggag 1321 cttaggaaga aactcagctg caagcctttc aaatggtacc ttgaaaatgt ctatccagag 1381 ttaagggttc cagaccatca ggatatagct tttggggcct tgcagcaggg aactaactgc 1441 ctcgacactt tgggacactt tgctgatggt gtggttggag tttatgaatg tcacaatgct 1501 gggggaaacc aggaatgggc cttgacgaag gagaagtcag tgaagcacat ggatttgtgc 1561 cttactgtgg tggaccgggc accgggctct cttataaagc tgcagggctg ccgagaaaat 1621 gacagcagac agaaatggga acagatcgag ggcaactcca agctgaggca cgtgggcagc 1681 aacctgtgcc tggacagtcg cacggccaag agcgggggcc taagcgtgga ggtgtgtggc 1741 ccggcccttt cgcagcagtg gaagttcacg ctcaacctgc agcagtagga gggtccggga 1801 ggccctgccg tcctgtctcc tgcaccattg ggtggagtct ggtgatcaca ttattgatta 1861 tgtttcttaa actttccgcg aaactaatat acctcagtat tccatcatgg tctgaaagtc 1921 aaacttcggc aaggcacgga cgactgtgca gacacagcag cggcaagaag cgagaactgc 1981 cctccccctc ctctcggtgc agcccagccg ggcccccttc cccaggccgg agcgcccctc 2041 ttccttccag ctttcacttc tgccggctcc gcaactgagt gacacccagc gacaaccgac 2101 tggggagtgg tagaagcaac tgaacggatg cgtgcgagct gaggacaggg cgggaggagg 2161 gggcacacat gccccagggg agcgaggaga actcttgaaa tctccatttt caatcccttc 2221 gaaatcacgt atggtttcca caaagccgag tcgtgtcacg tggcaggttt acgtcaatag 2281 tccctctctc tgctcctcca ttcgcaagtg tcttcctggg ccagactccc ctccacctca 2341 tgtacttgct atattgagga tgaagttttc tatggtggga cactaaatat aaagctatat 2401 agagaaagaa tgtacggtca gttccctatg gtttctgtag atcatcgtca tcttgtatat 2461 tccccacaaa gccgttcgca gcttccggga gaaggggcca gagcccggtg gggccagttt 2521 ctcacagagg gaggaggtgg cctttgtccc ctggagcccg atcagccagt tggtgctact 2581 gctgtggcca gctggggggc ttcctccaga ccaccggcct cggccccggc atccctgttg 2641 ggcgtcagcc tgagagtccc tactgtgcgt cagaatccac cttgcgtgct gtgcgtatct 2701 gtgaacctgg agcggttact tattttgaca gatatcactt tgggtctttt tacattaaat 2761 ttcttttctc taaggaatat aagacatacc ccatagctct gcgtgagcca gcaataccgc 2821 tgccccctgg cgacagggca gaccaatgat gccaggcagc tgtcacacgc tagtattggc 2881 ttcattgtga tctgagccct gcacgctggg ccttcagaat taatggccag cagtgtcagg 2941 gatgagcccg tcagccaggg cacaggcctg gctcacagtc cttcacacct gctggcctgg 3001 ggagctccag ccaggcagcg agtcctgccc cgcccgcagc tccctcccac accccgcctg 3061 gccaagatga ctgcttcagg gggctttggg gaaagaatta ggaagggtca gaaccaaaca 3121 atacctgctc atttacactg aggattcagg gcgggagaca ggagccttgg ggtcctgtta 3181 aaccacagac agttatgaac tgaaagtcat aacggggaga ggtgcctggc ttctacctgg 3241 gtgctcagga atgttcctcg tcacccctgc cactctgtgg tcggtgccct gcttcctcct 3301 ccactcctgg ccgccttctc cagcgccgca cacacagatg ctcggtctca gagaggctgg 3361 cacggcctgg cagtctgaga aaagcgtcag ttaggcacac ctgcaggccc ctcggtggga 3421 cagcggcggc cttggagtta ggagccaccc tgggaggttg tgccggtgcc atgctcctcc 3481 ctgtgtcttg tatgaaaggg gccactgtgt gtcttcctcc ccggcgggag ccccacgtgt 3541 gtgcactgta ggacagcggc cccgaggtgg aagcctggct ggagggctgc cctataggtc 3601 ttctcttccc gcctcccctg ccatgcaacc agatgtgttc tgagtgggca gcgtgccccc 3661 acgctggagt aactccgcac gcttctgtct ttcacggtgg gcgctcgggg ggagcctgag 3721 gaaaaccccc ttaggtacct gtgcgaggct gtggagtgca ggccagagca gggtgtgcgt 3781 agcccccagc acccaggttc ttctgtcaga ccctgtgacc tgcgagctgc tactactgta 3841 aggagggaaa tggatgaatc tggctcgttt taaaatcacg ttttctgacg aatcctttgc 3901 cccttcacct ttaccccgcc cgcaccccta ggccctctca gccttcctat catcccacgt 3961 gtctacccag acccttgcgc ggcccatgcc ctgggggcgg cgtcctgtcc ctgagctggg 4021 aggcggcttt ggatggtccg ggcgtcaaga gcaggggtgg gccggggagg ggtcctttgc 4081 ggtgagctat gtttacatga cacagtgtgc caaagtgact tactgcggtt gcgttagttt 4141 ctagtcatca ggactatctc accctcccac tcctgttttt aaaactcaga attctttcct 4201 aagagccctt cgagcaaagc gtgccgaagt tagttgtctt ctctgtggtg gtcctttctt 4261 atgtcctcat aaaagctcag atgatggtat ctgtgagtat gttttgcaaa ttcaaaatat 4321 agtttggtaa tttttttttc cagttgattt ttaaaaagaa ctgctgtaca gagcttgtac 4381 tttgtccatt ttatagatgg aaaccatcct tgaaaattgt ttaacttaaa taaagagaag 4441 atactttcta aaaaaaaaaa aaaa //