LOCUS BC047468 4297 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7), mRNA (cDNA clone
MGC:50994 IMAGE:5271749), complete cds.
ACCESSION BC047468
VERSION BC047468.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4297)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4297)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (28-FEB-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 91 Row: d Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 8393408.
FEATURES Location/Qualifiers
source 1..4297
/db_xref="H-InvDB:HIT000053139"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:50994 IMAGE:5271749"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..4297
/gene="GALNT7"
/gene_synonym="GALNAC-T7"
/gene_synonym="GalNAcT7"
/db_xref="GeneID:51809"
/db_xref="HGNC:HGNC:4129"
/db_xref="MIM:605005"
CDS 59..2032
/gene="GALNT7"
/gene_synonym="GALNAC-T7"
/gene_synonym="GalNAcT7"
/codon_start=1
/product="UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7)"
/protein_id="AAH47468.1"
/db_xref="GeneID:51809"
/db_xref="HGNC:HGNC:4129"
/db_xref="MIM:605005"
/translation="MRLKIGFILRSLLVVGSFLGLVVLWSSLTPRPDDPSPLSRMRED
RDVNDPMPNRGGNGLAPGEDRFKPVVPWPHVEGVEVDLESIRRINKAKNEQEHHAGGD
SQKDIMQRQYLTFKPQTFTYHDPVLRPGILGNFEPKEPEPPGVVGGPGEKAKPLVLGP
EFKQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNE
GWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNER
REGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVIN
GNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLF
AIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYRLEGWQGN
PPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCK
SFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMG
GNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEWQYFKNLHRFTHIPSGK
CLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV"
BASE COUNT 1345 a 763 c 890 g 1299 t
ORIGIN
1 tggggagagc ggtggcggcg gctgcgccgg gctgtgagtc tctcgccgcc ggaggaagat
61 gaggctgaag attggattca tcttacgcag tttgctggtg gtgggaagct tcctggggct
121 agtggtcctc tggtcttccc tgaccccgcg gccggacgac ccaagcccgc tgagcaggat
181 gagggaagac agagatgtca atgaccccat gcccaaccga ggcggcaatg gactagctcc
241 tggggaggac agattcaaac ctgtggtacc atggcctcat gttgaaggag tagaagtgga
301 cttagagtct attagaagaa taaacaaggc caaaaatgaa caagagcacc atgctggagg
361 agattcccag aaagatatca tgcagaggca gtatctcaca tttaagcctc agacattcac
421 ctaccatgat cctgtgcttc gcccagggat cctcggtaac tttgaaccca aagaacctga
481 gcctcctgga gtggttggtg gccctggaga gaaagccaag ccattggttt tgggaccaga
541 attcaaacaa gcaattcaag ccagcattaa agagtttgga tttaacatgg tggcaagtga
601 catgatctca ctggaccgca gcgtcaatga cttacgccaa gaagaatgca agtattggca
661 ttatgatgaa aacttgctca cttcgagcgt tgtcattgtc ttccataatg aaggatggtc
721 aaccctcatg agaacagtcc acagtgtaat taaaaggact ccaaggaaat atttagcaga
781 aattgtgtta attgacgatt tcagtaataa agaacactta aaagaaaaac tggatgaata
841 tattaagctg tggaatggcc tagtgaaggt atttcgaaat gaaagaaggg aaggtttaat
901 tcaagcacga agtattggtg ctcagaaggc taaacttgga caggttttga tataccttga
961 tgcccactgt gaggtggcag ttaactggta tgcaccactt gtagctccca tatctaagga
1021 cagaaccatt tgcactgtgc cgcttataga tgtcataaat ggcaacacat atgaaattat
1081 accccaaggg ggtggtgatg aagatgggta tgcccgagga gcatgggatt ggagtatgct
1141 ctggaaacgg gtgcctctga cccctcaaga gaagagactg agaaagacaa aaactgaacc
1201 gtatcggtcc ccagccatgg ctgggggatt atttgccatt gaacgagagt tcttctttga
1261 attgggtctc tatgatccag gtctccagat ttggggtggt gaaaactttg agatctcata
1321 caagatatgg cagtgtggtg gcaaattatt atttgttcct tgttctcgtg ttggacatat
1381 ctaccgtctt gagggctggc aaggaaatcc tccgcccatt tatgttgggt cttctccaac
1441 tctgaagaat tatgttagag ttgtggaggt ttggtgggat gaatataaag actacttcta
1501 tgctagtcgt cctgaatcgc aggcattacc atatggggat atatcggagc tgaaaaaatt
1561 tcgagaagat cacaactgca aaagttttaa gtggttcatg gaagaaatag cttatgatat
1621 cacctcacac taccctttgc cacccaaaaa tgttgactgg ggagaaatca gaggcttcga
1681 aactgcttac tgcattgata gcatgggaaa aacaaatgga ggctttgttg aactaggacc
1741 ctgccacagg atgggaggga atcagctttt cagaatcaat gaagcaaatc aactcatgca
1801 gtatgaccag tgtttgacaa agggagctga tggatcaaaa gttatgatta cacactgtaa
1861 tctaaatgaa tttaaggaat ggcagtactt caagaacctg cacagattta ctcatattcc
1921 ttcaggaaag tgtttagatc gctcagaggt cctgcatcaa gtattcatct ccaattgtga
1981 ctccagtaaa acgactcaaa aatgggaaat gaataacatc catagtgttt agagagaaaa
2041 aataaaccaa taacctacct actgacaagt aaatttatac aggactgaaa accgcctgaa
2101 acctgctgca actattgtta ttaactctgt atagctccaa acctggaacc tcctgatcag
2161 tttgaaggac attgataaac tgtgatttta caataacatt atcatctgca gttactgttt
2221 acaagactgc ttttacctta aaccttgtag atgtttacat ctttttgttg tgttttaaga
2281 tgatgttggt aatttgtgcc tttagctctg ttttattaga cagagttaaa gcatgttgtc
2341 ttctttggga ttacactcag gggtctgaaa ggcagtttga tttttatttt taacacactt
2401 gaaaaaaggt tggagtagcc agactttcat atataacttg gtgattatca acctgttgtg
2461 tctttattta attttacatc tttttgaagc actgccacag gttattagcc aaggtggcct
2521 tccttcacag tcatgctgct tttttgaaag gtgaatttca acacatttag tgcctctttc
2581 atttctcagt atatatttca agagcttgtg atgaaatcta taggatggta atgatggact
2641 tgtcacctgt atggggaata cttttactac tcagaaatga atttatgtgc tgccatttgc
2701 tataaagttg aactttgtat ggcttgaaaa agaaatgaca atatggaaca tcccaaggct
2761 gtcccatagg gttggaagtt gtgtagcatt cactccctta cctactggca ttcccagtgc
2821 cctctgtcca tacctacttc taggattgca aaggagtctt ccaactagag aaaaattgtc
2881 cactgacatt tgggatttac ttttctccaa tacctgccaa tacagaaaac tattatcagt
2941 tgttattgtt atcccttgaa agcgagggtg acaaaaacaa caaaacaccg ttataaacac
3001 atcaaaggtt cattctgact gaggtaagac tttccaagcc cttgttagat taggccttat
3061 aaaacttgtg tgcattataa cctaagctgt gcaacctgtg aagccaagag tgaactgatg
3121 tttcatttat attttcatcc aaatgacatt atctgcacgt ttttaaaatt taaaaacaaa
3181 ggactattta aaaatacagt ttattaacaa acgtgaacta ctttctgtta cattaggtgt
3241 tccctagtgt ttcttaattt ctttttagaa agtgtatttt tattagtatt tttccggtga
3301 acagaagatt tgtttggatt taaacattta ctaagacagt acctattagg aaaaccaaat
3361 attgcaaatg gtcaattcga ttttaatttc tcaaaagata ctctgttatc cagaagatta
3421 aaatgcctac attgagtgct taaaaaaaaa aaaaacaact gtgatgatgt gagcagaatg
3481 gcaagtaagt taagcatttt tgatcctgta atcatggtat cattacaatg aaaggaattc
3541 acaaactact gccagaggaa gtttgttttt taatttaaga gggaaatata acctataaat
3601 ttgtttcttc caagcttagc tcttaaattt ggagactcaa agttaaacat cctcaacaga
3661 gttttattta taattttgaa ttgtcaattt gtattttgct actgatctgt gatcaaccat
3721 tttaactttc atctctaggg atgtttaaca tttataattg caaaataaac caactataaa
3781 aaaagaaact aagagagaat tggtacttta attacttgtg tgtttgcaaa taggctccat
3841 tttccatgtt gagtagatta taaccttatt aactatgcat aggcctaaga aaggtggcaa
3901 tgaactgtgc atgtaaattt taaatgggta ctttgtgcaa ttcgttaaaa gaagatactc
3961 tatgaatatg attctatata ttgaaatcag aaaacctacc aaacaaaaac atcagaagct
4021 gctgccataa tgactatttt ctactgtagg ctgctttgga aataattccc atatccttgc
4081 tttgtaagtt ggtaatatca ctatgcattt ctacacattt tataaatttg atttatgcag
4141 attttgatac actgtatgtt tctgtagaaa ttgtataaat attcaaaatt ttattaggat
4201 aaatttgaga aacttacgta tatcttaatt ctgggttgct tgttttttag gtgacaaaaa
4261 taaaatattg tattttaatc caaaaaaaaa aaaaaaa
//