LOCUS BC041120 4464 bp mRNA linear HUM 28-JUL-2005
DEFINITION Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), mRNA (cDNA clone
MGC:47616 IMAGE:5553465), complete cds.
ACCESSION BC041120
VERSION BC041120.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4464)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4464)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (13-DEC-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Lou Staudt
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 82 Row: g Column: 15.
FEATURES Location/Qualifiers
source 1..4464
/db_xref="H-InvDB:HIT000052511"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:47616 IMAGE:5553465"
/tissue_type="Lymph, lymphoma"
/clone_lib="NIH_MGC_85"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..4464
/gene="GALNT2"
/gene_synonym="GalNAc-T2"
/db_xref="GeneID:2590"
/db_xref="MIM:602274"
CDS 73..1788
/gene="GALNT2"
/gene_synonym="GalNAc-T2"
/codon_start=1
/product="GALNT2 protein"
/protein_id="AAH41120.1"
/db_xref="GeneID:2590"
/db_xref="MIM:602274"
/translation="MRRRSRMLLCFAFLWVLGIAYYMYSGGGSALAGGAGGGAGRKED
WNEIDPIKKKDLHHSNGEEKAQSMETLPPGKVRWPDFNQEAYVGGTMVRSGQDPYARN
KFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVL
KKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAK
VLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDW
NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGE
NLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEY
KNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL
QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI
KLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKFT
LNLQQ"
BASE COUNT 975 a 1193 c 1291 g 1005 t
ORIGIN
1 ccaccgcgcc cgcggccggc ccaggcagca ctcgcgagca gcggcggccc cgccggcggc
61 cgagttggga gaatgcggcg gcgctcgcgg atgctgctct gcttcgcctt cctgtgggtg
121 ctgggcatcg cctactacat gtactcgggg ggcggctctg cgctggccgg gggcgcgggc
181 ggcggcgccg gcaggaagga ggactggaat gaaattgacc ccattaaaaa gaaagacctt
241 catcacagca atggagaaga gaaagcacaa agcatggaga ccctccctcc agggaaagta
301 cggtggccag actttaacca ggaagcttat gttggaggga cgatggtccg ctccgggcag
361 gacccttacg cccgcaacaa gttcaaccag gtggagagtg ataagcttcg aatggacaga
421 gccatccctg acacccggca tgaccagtgt cagcggaagc agtggcgggt ggatctgccg
481 gccaccagcg tggtgatcac gtttcacaat gaagccaggt cggccctact caggaccgtg
541 gtcagcgtgc ttaagaaaag cccgccccat ctcataaaag aaatcatctt ggtggatgac
601 tacagcaatg atcctgagga cggggctctc ttggggaaaa ttgagaaagt gcgagttctt
661 agaaatgatc gacgagaagg cctcatgcgc tcacgggttc ggggggccga tgctgcccaa
721 gccaaggtcc tgaccttcct ggacagtcac tgcgagtgta atgagcactg gctggagccc
781 ctcctggaaa gggtggcgga ggacaggact cgggttgtgt cacccatcat cgatgtcatt
841 aatatggaca actttcagta tgtgggggca tctgctgact tgaagggcgg ttttgattgg
901 aacttggtat tcaagtggga ttacatgacg cctgagcaga gaaggtcccg gcaggggaac
961 ccagtcgccc ctataaaaac ccccatgatt gctggtgggc tgtttgtgat ggataagttc
1021 tattttgaag aactggggaa gtacgacatg atgatggatg tgtggggagg agagaaccta
1081 gagatctcgt tccgcgtgtg gcagtgtggt ggcagcctgg agatcatccc gtgcagccgt
1141 gtgggacacg tgttccggaa gcagcacccc tacacgttcc cgggtggcag tggcactgtc
1201 tttgcccgaa acacccgccg ggcagcagag gtctggatgg atgaatacaa aaatttctat
1261 tatgcagcag tgccttctgc tagaaacgtt ccttatggaa atattcagag cagattggag
1321 cttaggaaga aactcagctg caagcctttc aaatggtacc ttgaaaatgt ctatccagag
1381 ttaagggttc cagaccatca ggatatagct tttggggcct tgcagcaggg aactaactgc
1441 ctcgacactt tgggacactt tgctgatggt gtggttggag tttatgaatg tcacaatgct
1501 gggggaaacc aggaatgggc cttgacgaag gagaagtcag tgaagcacat ggatttgtgc
1561 cttactgtgg tggaccgggc accgggctct cttataaagc tgcagggctg ccgagaaaat
1621 gacagcagac agaaatggga acagatcgag ggcaactcca agctgaggca cgtgggcagc
1681 aacctgtgcc tggacagtcg cacggccaag agcgggggcc taagcgtgga ggtgtgtggc
1741 ccggcccttt cgcagcagtg gaagttcacg ctcaacctgc agcagtagga gggtccggga
1801 ggccctgccg tcctgtctcc tgcaccattg ggtggagtct ggtgatcaca ttattgatta
1861 tgtttcttaa actttccgcg aaactaatat acctcagtat tccatcatgg tctgaaagtc
1921 aaacttcggc aaggcacgga cgactgtgca gacacagcag cggcaagaag cgagaactgc
1981 cctccccctc ctctcggtgc agcccagccg ggcccccttc cccaggccgg agcgcccctc
2041 ttccttccag ctttcacttc tgccggctcc gcaactgagt gacacccagc gacaaccgac
2101 tggggagtgg tagaagcaac tgaacggatg cgtgcgagct gaggacaggg cgggaggagg
2161 gggcacacat gccccagggg agcgaggaga actcttgaaa tctccatttt caatcccttc
2221 gaaatcacgt atggtttcca caaagccgag tcgtgtcacg tggcaggttt acgtcaatag
2281 tccctctctc tgctcctcca ttcgcaagtg tcttcctggg ccagactccc ctccacctca
2341 tgtacttgct atattgagga tgaagttttc tatggtggga cactaaatat aaagctatat
2401 agagaaagaa tgtacggtca gttccctatg gtttctgtag atcatcgtca tcttgtatat
2461 tccccacaaa gccgttcgca gcttccggga gaaggggcca gagcccggtg gggccagttt
2521 ctcacagagg gaggaggtgg cctttgtccc ctggagcccg atcagccagt tggtgctact
2581 gctgtggcca gctggggggc ttcctccaga ccaccggcct cggccccggc atccctgttg
2641 ggcgtcagcc tgagagtccc tactgtgcgt cagaatccac cttgcgtgct gtgcgtatct
2701 gtgaacctgg agcggttact tattttgaca gatatcactt tgggtctttt tacattaaat
2761 ttcttttctc taaggaatat aagacatacc ccatagctct gcgtgagcca gcaataccgc
2821 tgccccctgg cgacagggca gaccaatgat gccaggcagc tgtcacacgc tagtattggc
2881 ttcattgtga tctgagccct gcacgctggg ccttcagaat taatggccag cagtgtcagg
2941 gatgagcccg tcagccaggg cacaggcctg gctcacagtc cttcacacct gctggcctgg
3001 ggagctccag ccaggcagcg agtcctgccc cgcccgcagc tccctcccac accccgcctg
3061 gccaagatga ctgcttcagg gggctttggg gaaagaatta ggaagggtca gaaccaaaca
3121 atacctgctc atttacactg aggattcagg gcgggagaca ggagccttgg ggtcctgtta
3181 aaccacagac agttatgaac tgaaagtcat aacggggaga ggtgcctggc ttctacctgg
3241 gtgctcagga atgttcctcg tcacccctgc cactctgtgg tcggtgccct gcttcctcct
3301 ccactcctgg ccgccttctc cagcgccgca cacacagatg ctcggtctca gagaggctgg
3361 cacggcctgg cagtctgaga aaagcgtcag ttaggcacac ctgcaggccc ctcggtggga
3421 cagcggcggc cttggagtta ggagccaccc tgggaggttg tgccggtgcc atgctcctcc
3481 ctgtgtcttg tatgaaaggg gccactgtgt gtcttcctcc ccggcgggag ccccacgtgt
3541 gtgcactgta ggacagcggc cccgaggtgg aagcctggct ggagggctgc cctataggtc
3601 ttctcttccc gcctcccctg ccatgcaacc agatgtgttc tgagtgggca gcgtgccccc
3661 acgctggagt aactccgcac gcttctgtct ttcacggtgg gcgctcgggg ggagcctgag
3721 gaaaaccccc ttaggtacct gtgcgaggct gtggagtgca ggccagagca gggtgtgcgt
3781 agcccccagc acccaggttc ttctgtcaga ccctgtgacc tgcgagctgc tactactgta
3841 aggagggaaa tggatgaatc tggctcgttt taaaatcacg ttttctgacg aatcctttgc
3901 cccttcacct ttaccccgcc cgcaccccta ggccctctca gccttcctat catcccacgt
3961 gtctacccag acccttgcgc ggcccatgcc ctgggggcgg cgtcctgtcc ctgagctggg
4021 aggcggcttt ggatggtccg ggcgtcaaga gcaggggtgg gccggggagg ggtcctttgc
4081 ggtgagctat gtttacatga cacagtgtgc caaagtgact tactgcggtt gcgttagttt
4141 ctagtcatca ggactatctc accctcccac tcctgttttt aaaactcaga attctttcct
4201 aagagccctt cgagcaaagc gtgccgaagt tagttgtctt ctctgtggtg gtcctttctt
4261 atgtcctcat aaaagctcag atgatggtat ctgtgagtat gttttgcaaa ttcaaaatat
4321 agtttggtaa tttttttttc cagttgattt ttaaaaagaa ctgctgtaca gagcttgtac
4381 tttgtccatt ttatagatgg aaaccatcct tgaaaattgt ttaacttaaa taaagagaag
4441 atactttcta aaaaaaaaaa aaaa
//