LOCUS       BC036390                5364 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
            N-acetylgalactosaminyltransferase 4 (GalNAc-T4), mRNA (cDNA clone
            MGC:41784 IMAGE:5261236), complete cds.
ACCESSION   BC036390
VERSION     BC036390.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5364)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 5364)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-AUG-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 25, 2003 this sequence version replaced BC036390.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 73 Row: d Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34452724.
FEATURES             Location/Qualifiers
     source          1..5364
                     /db_xref="H-InvDB:HIT000051690"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:41784 IMAGE:5261236"
                     /tissue_type="Brain, hippocampus"
                     /clone_lib="NIH_MGC_95"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..5364
                     /gene="GALNT4"
                     /gene_synonym="GALNAC-T4"
                     /gene_synonym="GalNAcT4"
                     /db_xref="GeneID:8693"
                     /db_xref="HGNC:HGNC:4126"
                     /db_xref="MIM:603565"
     CDS             214..1950
                     /gene="GALNT4"
                     /gene_synonym="GALNAC-T4"
                     /gene_synonym="GalNAcT4"
                     /codon_start=1
                     /product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
                     N-acetylgalactosaminyltransferase 4 (GalNAc-T4)"
                     /protein_id="AAH36390.1"
                     /db_xref="GeneID:8693"
                     /db_xref="HGNC:HGNC:4126"
                     /db_xref="MIM:603565"
                     /translation="MAVRWTWAGKSCLLLAFLTVAYIFVELLVSTFHASAGAGRAREL
                     GSRRLSGLQKNTEDLSRPLYKKPPADSRALGEWGKASKLQLNEDELKQQEELIERYAI
                     NIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLE
                     TSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDRVRLIRTNKREGLVRARLIGATFA
                     TGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIG
                     GFDWRLTFQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVW
                     GGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE
                     HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRS
                     RGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPE
                     QKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCD
                     ALDKNQIWSFEK"
BASE COUNT     1568 a    942 c   1165 g   1689 t
ORIGIN
        1 agttgggaga gaggagttgg gctgtgccgg aggccgagga ccgagagggc tcaggtgacc
       61 cctggaaagc ctgggtggct ggaaaggagc ctagcgcctg catgaaagga agaacctgct
      121 gggaagtacc tgagctcgag ctgtgggttc cgccgcactt cccctgcgtg gtggcttggt
      181 ggccgcgtct gcgcctcagc cctgagaatc cggatggcgg tgaggtggac ttgggcaggc
      241 aagagctgcc tgctgctggc gtttttaaca gtggcctata tcttcgtgga gctcttggtc
      301 tctacttttc atgcctccgc aggagccggc cgtgccaggg agctggggtc aagaaggctc
      361 tcaggcctcc agaaaaatac ggaggatttg tctcgaccgc tttataagaa gccccctgca
      421 gattcccgtg cacttgggga gtgggggaaa gccagcaaac tccagctcaa cgaggatgaa
      481 ctgaagcagc aagaagaact cattgagaga tacgccatca atatttacct cagtgacagg
      541 atttccctgc atcgacacat agaggataaa agaatgtatg agtgtaagtc ccagaagttc
      601 aactatagga cacttcctac cacctctgtt atcattgctt tctataacga agcctggtcg
      661 actttgctcc gtaccattca cagtgtttta gaaacttctc ctgcagttct tttgaaagag
      721 atcatcttgg tggatgactt gagtgacaga gtttatttga agacacaact tgaaacttac
      781 atcagcaatc ttgatagagt acgcttgatt aggaccaata agcgagaggg gctggttagg
      841 gcccgtctga ttggggccac tttcgccact ggggacgtcc tcactttcct ggattgtcac
      901 tgtgagtgta attccggttg gctggaaccg cttttggaaa ggattgggag agatgaaaca
      961 gcagttgtgt gtcctgttat agacacaatt gattggaata cttttgaatt ctatatgcag
     1021 ataggggagc ccatgattgg tgggtttgac tggcgtttaa catttcagtg gcattctgtc
     1081 cccaaacagg aaagggacag gcggatatca agaattgacc ccatcagatc acctaccatg
     1141 gctggaggac tgtttgctgt cagcaagaaa tattttcagt accttggaac gtatgacaca
     1201 ggaatggaag tgtggggagg tgaaaacctt gagctgtctt ttagggtgtg gcagtgtggt
     1261 ggcaaattgg agatccaccc gtgttcccac gtgggccatg tgttccccaa gcgggcacca
     1321 tatgctcgcc ccaatttcct acagaatact gctcgggcag cagaagtttg gatggatgaa
     1381 tacaaagagc acttctacaa tagaaaccct ccagcaagaa aagaagctta tggtgatatt
     1441 tctgaaagaa aattactacg agagcggttg agatgcaaga gctttgactg gtatttgaaa
     1501 aacgtttttc ctaatttaca tgttccagag gatagaccag gctggcatgg ggctattcgc
     1561 agtagaggga tctcgtctga atgtttagat tataattctc ctgacaacaa ccccacaggt
     1621 gctaaccttt cactgtttgg atgccatggt caaggaggca atcaattctt tgaatatact
     1681 tcaaacaaag aaataaggtt taattctgtg acagagttat gtgcagaggt acctgagcaa
     1741 aaaaattatg tgggaatgca aaattgtccc aaagatgggt tccctgtacc agcaaacatt
     1801 atttggcatt ttaaagaaga tggaactatt tttcacccac actcaggact gtgtcttagt
     1861 gcttatcgga caccggaggg ccgacctgat gtacaaatga gaacttgtga tgctctagat
     1921 aaaaatcaaa tttggagttt tgagaaatag agcacaacag cactttcgtc atgagctgac
     1981 agtagtgtca agaaagtcaa agagccttaa gagcctcagt gaagattgta ttttatttta
     2041 tcaaaagcca cctagcagtc atctgtggag cactggaaag ctggggttca ttttggtata
     2101 tcacactgaa actgggtacc cagagtgctg ctgtttaata tttcacaatg ccttacttat
     2161 tggttgtttt atataagagt tttgtcaata tggtctcttc ttaaaagaag ttgactatga
     2221 attgaaacac acaaaacatt taagtgccag acttaatatt aaagaatgta aaggtccaag
     2281 taaaatgagg tatgatttat gttgatgtgt aagttcaccg cacatcccac tttttaacaa
     2341 aactcatgaa tgtgcagttt gagccattgc tattttgatt acatagaatt tgtatttctt
     2401 ttttagccag cacattaaat tttagatttt attttttaat ctaatttttt tctaatcaaa
     2461 aagaaaattg agcttaaggc aaaaggcctg gttttagaga tatgtgtaat tggaagaggg
     2521 catttgtttg agtgtgagtt tggaggcctt tttaacatgc agacataccc atatttaaat
     2581 gaaatgggga gatatttaca ttccgtactt tgtaaacttg agctattgga cttcactgat
     2641 gtatatatta atacctcaga ttcctctgat tttgtaagct gtcttctctg tgaacgtgtt
     2701 tgtgtgtgta gggcattttc tgattgcact tccttaagtt atgaatgtac tagaaaggga
     2761 ctcatccaga atactatgcc tccctttgtt aatgcttaat catttaaagt aaacacaatt
     2821 gaagcctctc tgaagttaaa cccaactatg tttattaaaa tgtgtgaaac tgaaagtggg
     2881 ctaggttcta ccaaggctgt ggaactctcc tacgagttct gctgatcagg aaatttaaga
     2941 atttatctta aaaatgcaag gaaaaaagac tgccttggca attgtgaatg gtgctttcaa
     3001 tctcctagca ccgagcctgg cacttaggca gctttcagta agtgggtgaa tgaatgactg
     3061 aatgaatgaa tgaatggctc agctgaggaa tgtaactttg gtcaagttat tatgatgtgt
     3121 ttgggcttag ttttctcatt ggtaaaatgt gggtgctgga ttggatctta aagatccctt
     3181 ccagctctga aatgctgatt gtacagtata ttcttcccag attgactcac tgtgcaatct
     3241 ttacaatact ttttatcttt tcacttttga cataggtaat gttgttgagc agttgagcaa
     3301 tgttcagtcc agttgtgaag ctggagaaga gaaatgggtt ttaaaaatta agtgagggga
     3361 ggccgggtgc ggtggctcac gcatgtaatc ccagcacttt gggaggccaa ggcaggtgga
     3421 tcacgaggtc aggagatcca gaccatcctg gctaacatgg tgaaacctcg tctctactaa
     3481 aaatacaaaa aattagccag ctgtggtggc gggcgcctgt agtcccagct actcaggagg
     3541 ctgaggcagg agaatggcgt gaacctgtga ggcagagctt acagtgagcc gagatcgtgc
     3601 cactgcactc cagcctgggc gacagagcaa gactctgtct caaaaaaaaa aaataataat
     3661 aaaataagtg agctgaactc acctgaagtg gtttacttct gtgggttaag aagttctagt
     3721 cagtgttcat agtcgtttcg ttttgataat tgttgaacca attttgtttt taaaaccttt
     3781 agactctgaa agtaatattt tgactaagaa tgtaaatatt tccaaactaa attactcggg
     3841 aagtaaacgc tttttttaaa agtattttta ctggttttat accaatatta tatgcagaaa
     3901 tcacaggatg aatttagaat taaatctcaa ttagttcact ttggcctaga tttatgaaaa
     3961 atgcatgcct cgtaaagagt ccactgtatt cacgagtaaa gttgctttta gtgttcactt
     4021 gatgacttgg agagtaggaa ttttgcaaaa tctgaattta aggaaattct ttaggataac
     4081 catttcaaaa aataaaattg ctatgcaatc ttgaatattt tctcttttgc ctcgtaaaat
     4141 gaaaatgcat tcacagtttc tgtaaattat ttagcagcct taaagtttat caaaaaattg
     4201 tccagattcc acgtgcagca tgcttggccc tgcatttaat ttaagaagga ttaataataa
     4261 tgctctgaat ttttcgaaag ggattctcct aaacccaccc acttctcttg cccaggctgc
     4321 tttttaaaaa tattttttta ttttttactt atttttaaat tttctctttt tatttatttt
     4381 tggttttctt gttagccacc tgttatatgg gagaacgaaa attgttatat tttgaaagta
     4441 cttattacat tatttttatt ttagtatctt gatgctcctg tcaaaaggga aatgaggctt
     4501 ttaaaaataa agtaccttaa ttctttattg actttttgcc ataaattgct aggtgtgacc
     4561 cagcaatctt ttaggaagag attttacagt ggtgctttat ttatatcaat aatccagtat
     4621 agttaggctg ttcattcctc ataatagagt acataacaga aaagtgggac tttcacattt
     4681 tcatatttag gcacgttcca atttaattcc aaaaatactc tgtaattcta catctaaaaa
     4741 aaccgattcc ctaattcgaa tttattggta ccaaagctct ctttggctat agacaattaa
     4801 gagttgacct tttaagttaa tgtatatgct taaaaacagt tttaggaaaa tatttggtag
     4861 acaaagagtt tcaactttaa atgttcacta tgtcatttag tgtccaactt tacggatagg
     4921 ttgactatct aaataggcat ttttagtcat taaaaaaaat ctagtcacca ggaggatccc
     4981 tataactcaa aataacttgt ttgtaaaaga aaatttgttt acttacccat tagtaagttc
     5041 ctgcatattc attataagat ggcaaatcaa acttttctag gatgaagaca gcttattttt
     5101 aagttgtata gtcttagttg gtttagggtc tcaattttaa ttaataaaat acttggtttt
     5161 tatttgcttg tccttttgaa ttcctgtttt aataatttta aaatgagcac aaagaacgtt
     5221 gaagttcaga ttaatctctt ctgaatgatg tttttttcct ctgtgatgag ttgtttctga
     5281 cttttttcct tttgtatttg taatgttgat taagatgtaa aataaaaagt gtgcctgatt
     5341 atttttgcaa aaaaaaaaaa aaaa
//