LOCUS BC036390 5364 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4), mRNA (cDNA clone
MGC:41784 IMAGE:5261236), complete cds.
ACCESSION BC036390
VERSION BC036390.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5364)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5364)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-AUG-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 25, 2003 this sequence version replaced BC036390.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 73 Row: d Column: 4
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34452724.
FEATURES Location/Qualifiers
source 1..5364
/db_xref="H-InvDB:HIT000051690"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:41784 IMAGE:5261236"
/tissue_type="Brain, hippocampus"
/clone_lib="NIH_MGC_95"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..5364
/gene="GALNT4"
/gene_synonym="GALNAC-T4"
/gene_synonym="GalNAcT4"
/db_xref="GeneID:8693"
/db_xref="HGNC:HGNC:4126"
/db_xref="MIM:603565"
CDS 214..1950
/gene="GALNT4"
/gene_synonym="GALNAC-T4"
/gene_synonym="GalNAcT4"
/codon_start=1
/product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)"
/protein_id="AAH36390.1"
/db_xref="GeneID:8693"
/db_xref="HGNC:HGNC:4126"
/db_xref="MIM:603565"
/translation="MAVRWTWAGKSCLLLAFLTVAYIFVELLVSTFHASAGAGRAREL
GSRRLSGLQKNTEDLSRPLYKKPPADSRALGEWGKASKLQLNEDELKQQEELIERYAI
NIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLE
TSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDRVRLIRTNKREGLVRARLIGATFA
TGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIG
GFDWRLTFQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVW
GGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE
HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRS
RGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPE
QKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCD
ALDKNQIWSFEK"
BASE COUNT 1568 a 942 c 1165 g 1689 t
ORIGIN
1 agttgggaga gaggagttgg gctgtgccgg aggccgagga ccgagagggc tcaggtgacc
61 cctggaaagc ctgggtggct ggaaaggagc ctagcgcctg catgaaagga agaacctgct
121 gggaagtacc tgagctcgag ctgtgggttc cgccgcactt cccctgcgtg gtggcttggt
181 ggccgcgtct gcgcctcagc cctgagaatc cggatggcgg tgaggtggac ttgggcaggc
241 aagagctgcc tgctgctggc gtttttaaca gtggcctata tcttcgtgga gctcttggtc
301 tctacttttc atgcctccgc aggagccggc cgtgccaggg agctggggtc aagaaggctc
361 tcaggcctcc agaaaaatac ggaggatttg tctcgaccgc tttataagaa gccccctgca
421 gattcccgtg cacttgggga gtgggggaaa gccagcaaac tccagctcaa cgaggatgaa
481 ctgaagcagc aagaagaact cattgagaga tacgccatca atatttacct cagtgacagg
541 atttccctgc atcgacacat agaggataaa agaatgtatg agtgtaagtc ccagaagttc
601 aactatagga cacttcctac cacctctgtt atcattgctt tctataacga agcctggtcg
661 actttgctcc gtaccattca cagtgtttta gaaacttctc ctgcagttct tttgaaagag
721 atcatcttgg tggatgactt gagtgacaga gtttatttga agacacaact tgaaacttac
781 atcagcaatc ttgatagagt acgcttgatt aggaccaata agcgagaggg gctggttagg
841 gcccgtctga ttggggccac tttcgccact ggggacgtcc tcactttcct ggattgtcac
901 tgtgagtgta attccggttg gctggaaccg cttttggaaa ggattgggag agatgaaaca
961 gcagttgtgt gtcctgttat agacacaatt gattggaata cttttgaatt ctatatgcag
1021 ataggggagc ccatgattgg tgggtttgac tggcgtttaa catttcagtg gcattctgtc
1081 cccaaacagg aaagggacag gcggatatca agaattgacc ccatcagatc acctaccatg
1141 gctggaggac tgtttgctgt cagcaagaaa tattttcagt accttggaac gtatgacaca
1201 ggaatggaag tgtggggagg tgaaaacctt gagctgtctt ttagggtgtg gcagtgtggt
1261 ggcaaattgg agatccaccc gtgttcccac gtgggccatg tgttccccaa gcgggcacca
1321 tatgctcgcc ccaatttcct acagaatact gctcgggcag cagaagtttg gatggatgaa
1381 tacaaagagc acttctacaa tagaaaccct ccagcaagaa aagaagctta tggtgatatt
1441 tctgaaagaa aattactacg agagcggttg agatgcaaga gctttgactg gtatttgaaa
1501 aacgtttttc ctaatttaca tgttccagag gatagaccag gctggcatgg ggctattcgc
1561 agtagaggga tctcgtctga atgtttagat tataattctc ctgacaacaa ccccacaggt
1621 gctaaccttt cactgtttgg atgccatggt caaggaggca atcaattctt tgaatatact
1681 tcaaacaaag aaataaggtt taattctgtg acagagttat gtgcagaggt acctgagcaa
1741 aaaaattatg tgggaatgca aaattgtccc aaagatgggt tccctgtacc agcaaacatt
1801 atttggcatt ttaaagaaga tggaactatt tttcacccac actcaggact gtgtcttagt
1861 gcttatcgga caccggaggg ccgacctgat gtacaaatga gaacttgtga tgctctagat
1921 aaaaatcaaa tttggagttt tgagaaatag agcacaacag cactttcgtc atgagctgac
1981 agtagtgtca agaaagtcaa agagccttaa gagcctcagt gaagattgta ttttatttta
2041 tcaaaagcca cctagcagtc atctgtggag cactggaaag ctggggttca ttttggtata
2101 tcacactgaa actgggtacc cagagtgctg ctgtttaata tttcacaatg ccttacttat
2161 tggttgtttt atataagagt tttgtcaata tggtctcttc ttaaaagaag ttgactatga
2221 attgaaacac acaaaacatt taagtgccag acttaatatt aaagaatgta aaggtccaag
2281 taaaatgagg tatgatttat gttgatgtgt aagttcaccg cacatcccac tttttaacaa
2341 aactcatgaa tgtgcagttt gagccattgc tattttgatt acatagaatt tgtatttctt
2401 ttttagccag cacattaaat tttagatttt attttttaat ctaatttttt tctaatcaaa
2461 aagaaaattg agcttaaggc aaaaggcctg gttttagaga tatgtgtaat tggaagaggg
2521 catttgtttg agtgtgagtt tggaggcctt tttaacatgc agacataccc atatttaaat
2581 gaaatgggga gatatttaca ttccgtactt tgtaaacttg agctattgga cttcactgat
2641 gtatatatta atacctcaga ttcctctgat tttgtaagct gtcttctctg tgaacgtgtt
2701 tgtgtgtgta gggcattttc tgattgcact tccttaagtt atgaatgtac tagaaaggga
2761 ctcatccaga atactatgcc tccctttgtt aatgcttaat catttaaagt aaacacaatt
2821 gaagcctctc tgaagttaaa cccaactatg tttattaaaa tgtgtgaaac tgaaagtggg
2881 ctaggttcta ccaaggctgt ggaactctcc tacgagttct gctgatcagg aaatttaaga
2941 atttatctta aaaatgcaag gaaaaaagac tgccttggca attgtgaatg gtgctttcaa
3001 tctcctagca ccgagcctgg cacttaggca gctttcagta agtgggtgaa tgaatgactg
3061 aatgaatgaa tgaatggctc agctgaggaa tgtaactttg gtcaagttat tatgatgtgt
3121 ttgggcttag ttttctcatt ggtaaaatgt gggtgctgga ttggatctta aagatccctt
3181 ccagctctga aatgctgatt gtacagtata ttcttcccag attgactcac tgtgcaatct
3241 ttacaatact ttttatcttt tcacttttga cataggtaat gttgttgagc agttgagcaa
3301 tgttcagtcc agttgtgaag ctggagaaga gaaatgggtt ttaaaaatta agtgagggga
3361 ggccgggtgc ggtggctcac gcatgtaatc ccagcacttt gggaggccaa ggcaggtgga
3421 tcacgaggtc aggagatcca gaccatcctg gctaacatgg tgaaacctcg tctctactaa
3481 aaatacaaaa aattagccag ctgtggtggc gggcgcctgt agtcccagct actcaggagg
3541 ctgaggcagg agaatggcgt gaacctgtga ggcagagctt acagtgagcc gagatcgtgc
3601 cactgcactc cagcctgggc gacagagcaa gactctgtct caaaaaaaaa aaataataat
3661 aaaataagtg agctgaactc acctgaagtg gtttacttct gtgggttaag aagttctagt
3721 cagtgttcat agtcgtttcg ttttgataat tgttgaacca attttgtttt taaaaccttt
3781 agactctgaa agtaatattt tgactaagaa tgtaaatatt tccaaactaa attactcggg
3841 aagtaaacgc tttttttaaa agtattttta ctggttttat accaatatta tatgcagaaa
3901 tcacaggatg aatttagaat taaatctcaa ttagttcact ttggcctaga tttatgaaaa
3961 atgcatgcct cgtaaagagt ccactgtatt cacgagtaaa gttgctttta gtgttcactt
4021 gatgacttgg agagtaggaa ttttgcaaaa tctgaattta aggaaattct ttaggataac
4081 catttcaaaa aataaaattg ctatgcaatc ttgaatattt tctcttttgc ctcgtaaaat
4141 gaaaatgcat tcacagtttc tgtaaattat ttagcagcct taaagtttat caaaaaattg
4201 tccagattcc acgtgcagca tgcttggccc tgcatttaat ttaagaagga ttaataataa
4261 tgctctgaat ttttcgaaag ggattctcct aaacccaccc acttctcttg cccaggctgc
4321 tttttaaaaa tattttttta ttttttactt atttttaaat tttctctttt tatttatttt
4381 tggttttctt gttagccacc tgttatatgg gagaacgaaa attgttatat tttgaaagta
4441 cttattacat tatttttatt ttagtatctt gatgctcctg tcaaaaggga aatgaggctt
4501 ttaaaaataa agtaccttaa ttctttattg actttttgcc ataaattgct aggtgtgacc
4561 cagcaatctt ttaggaagag attttacagt ggtgctttat ttatatcaat aatccagtat
4621 agttaggctg ttcattcctc ataatagagt acataacaga aaagtgggac tttcacattt
4681 tcatatttag gcacgttcca atttaattcc aaaaatactc tgtaattcta catctaaaaa
4741 aaccgattcc ctaattcgaa tttattggta ccaaagctct ctttggctat agacaattaa
4801 gagttgacct tttaagttaa tgtatatgct taaaaacagt tttaggaaaa tatttggtag
4861 acaaagagtt tcaactttaa atgttcacta tgtcatttag tgtccaactt tacggatagg
4921 ttgactatct aaataggcat ttttagtcat taaaaaaaat ctagtcacca ggaggatccc
4981 tataactcaa aataacttgt ttgtaaaaga aaatttgttt acttacccat tagtaagttc
5041 ctgcatattc attataagat ggcaaatcaa acttttctag gatgaagaca gcttattttt
5101 aagttgtata gtcttagttg gtttagggtc tcaattttaa ttaataaaat acttggtttt
5161 tatttgcttg tccttttgaa ttcctgtttt aataatttta aaatgagcac aaagaacgtt
5221 gaagttcaga ttaatctctt ctgaatgatg tttttttcct ctgtgatgag ttgtttctga
5281 cttttttcct tttgtatttg taatgttgat taagatgtaa aataaaaagt gtgcctgatt
5341 atttttgcaa aaaaaaaaaa aaaa
//