LOCUS BC012194 2292 bp mRNA linear HUM 04-OCT-2003
DEFINITION Homo sapiens N-acetylglucosamine-1-phosphodiester
alpha-N-acetylglucosaminidase, mRNA (cDNA clone MGC:20529
IMAGE:4661335), complete cds.
ACCESSION BC012194
VERSION BC012194.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2292)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2292)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (02-AUG-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
George Yang, Scott Zuyderduyn, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 29 Row: p Column: 13
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 7705908.
FEATURES Location/Qualifiers
source 1..2292
/db_xref="H-InvDB:HIT000035727"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20529 IMAGE:4661335"
/tissue_type="Colon, adenocarcinoma"
/clone_lib="NIH_MGC_15"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2292
/gene="NAGPA"
/gene_synonym="APAA"
/gene_synonym="UCE"
/db_xref="GeneID:51172"
/db_xref="MIM:607985"
CDS 18..947
/gene="NAGPA"
/gene_synonym="APAA"
/gene_synonym="UCE"
/codon_start=1
/product="NAGPA protein"
/protein_id="AAH12194.1"
/db_xref="GeneID:51172"
/db_xref="MIM:607985"
/translation="MATSTGRWLLLRLALFGFLWEASGGLDSGASRDDDLLLPYPRAR
ARLPRDCTRVRAGNREHESWPPPPATPGAGGLAVRTFVSHFRDRAVAGHLTRAVEPLR
TFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFFRMNSGECLGNVVSDERRVS
SSGGLQNAQFGIRRDGTLVTGYLSEEEVLDTENPFVQLLSGVVWLIRNGSIYINESQA
TECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHADGQTEQRGINLWEMAEFLLK
QDVVNAINLDGGGSATFVLNGTLASYPSDHWQA"
misc_feature 417..932
/gene="NAGPA"
/gene_synonym="APAA"
/gene_synonym="UCE"
/note="EpsL; Region: COG4632, EpsL, Exopolysaccharide
biosynthesis protein related to
N-acetylglucosamine-1-phosphodiester
alpha-N-acetylglucosaminidase [Carbohydrate transport and
metabolism]"
/db_xref="CDD:COG4632"
BASE COUNT 450 a 703 c 713 g 426 t
ORIGIN
1 aattcggacg aggcaatatg gcgacctcca cgggtcgctg gcttctcctc cggcttgcac
61 tattcggctt cctctgggaa gcgtccggcg gcctcgactc gggggcctcc cgcgacgacg
121 acttgctact gccctatcca cgcgcgcgcg cgcgcctccc ccgggactgc acacgggtgc
181 gcgccggcaa ccgcgagcac gagagttggc ctccgcctcc cgcgactccc ggcgccggcg
241 gtctggccgt gcgcaccttc gtgtcgcact tcagggaccg cgcggtggcc ggccacctga
301 cgcgggccgt tgagcccctg cgcaccttct cggtgctgga gcccggtggg cccggcggct
361 gcgcggcgag acgacgcgcc accgtggagg agacggcgcg ggcggccgac tgccgtgtcg
421 cccagaacgg cggcttcttc cgcatgaact cgggcgagtg cctggggaac gtggtgagcg
481 acgagcggcg ggtgagcagc tccggggggc tgcagaacgc gcagttcggg atccgccgcg
541 acgggaccct ggtcaccggg tacctgtctg aggaggaggt gctggacact gagaacccat
601 ttgtgcagct gctgagtggg gtcgtgtggc tgattcgtaa tggaagcatc tacatcaacg
661 agagccaagc cacagagtgt gacgagacac aggagacagg ttcctttagc aaatttgtga
721 atgtgatatc agccaggacg gccattggcc acgaccggaa agggcagctg gtgctctttc
781 atgcagacgg ccaaacggag cagcgtggca tcaacctgtg ggaaatggcg gagttcctgc
841 tgaaacagga cgtggtcaac gccatcaacc tggatggggg tggctctgcc acctttgtgc
901 tcaacgggac cttggccagt tacccgtcag atcactggca ggcgtgatgg tgcacacctt
961 acggttacag ctgctcagga ggctcgggtg gaaggatcgc aatacctcat ctcaaaaata
1021 gatagacgcc atcttgtagc caggacaaca tgtggcgctg tccccgccaa gtgtccaccg
1081 tggtgtgtgt gcacgaaccc cgctgccagc cgcctgactg ccacggccac gggacctgcg
1141 tggacgggca ctgccaatgc accgggcact tctggcgggg tcccggctgt gatgagctgg
1201 actgtggccc ctctaactgc agccagcacg gactgtgcac ggagaccggc tgccgctgtg
1261 atgccggatg gaccgggtcc aactgcagtg aagagtgtcc ccttggctgg catgggccgg
1321 gctgccagag gccttgtaag tgtgagcacc attgtccctg tgaccccaag actggcaact
1381 gcagcgtctc cagagtaaag cagtgtctcc agccacctga agccaccctg agggcgggag
1441 aactctcctt tttcaccagg accgcctggc tagccctcac cctggcgctg gccttcctcc
1501 tgctgatcag cactgcagca aacctgtcct tgctcctgtc cagagcagag aggaaccggc
1561 gcctgcatgg ggactatgca taccacccgc tgcaggagat gaatggggag cctctggccg
1621 cagagaagga gcagccaggg ggcgcccaca accccttcaa ggactgaagc ctcaagctgc
1681 ccggggtggc acgtcgcgaa agcttgtttc cccacggtct ggcttctgca ggggaaattt
1741 caaggccact ggcgtggacc atctgggtgt cctcagcccc tgtggggcag ccaagttcct
1801 gatagcactt gtgcctcagc ccctcacctg gccacctgcc agggcacctg caaccctagc
1861 aataccatgc tcgctggaga ggctcagctg cctgcttttc gcctgcctgt gtctgctgct
1921 gagaagcccg tgcccccggg agggctgccg cactgccaaa gagtctccct cctcctgggg
1981 aaggggctgc caacgaacca gactcagtga ccacgtcatg acagaacagc acatcctggc
2041 cagcacccct ggctggagtg ggttaaaggg acgagtctgc cttcctggct gtgacacggg
2101 accccttttc tacagacctc atcactggat ttgccaacta gaattcgatt tcctgtcata
2161 ggaagctcct tggaagaagg gatgggggga tgagatcatg tttacagacc tgttttgtca
2221 tcctgctgcc aagaagtttt ttaatcactt gaataaattg atataataaa aggaaaaaaa
2281 aaaaaaaaaa aa
//