LOCUS BC012194 2292 bp mRNA linear HUM 04-OCT-2003 DEFINITION Homo sapiens N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase, mRNA (cDNA clone MGC:20529 IMAGE:4661335), complete cds. ACCESSION BC012194 VERSION BC012194.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2292) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2292) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (02-AUG-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, George Yang, Scott Zuyderduyn, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 29 Row: p Column: 13 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7705908. FEATURES Location/Qualifiers source 1..2292 /db_xref="H-InvDB:HIT000035727" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:20529 IMAGE:4661335" /tissue_type="Colon, adenocarcinoma" /clone_lib="NIH_MGC_15" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2292 /gene="NAGPA" /gene_synonym="APAA" /gene_synonym="UCE" /db_xref="GeneID:51172" /db_xref="MIM:607985" CDS 18..947 /gene="NAGPA" /gene_synonym="APAA" /gene_synonym="UCE" /codon_start=1 /product="NAGPA protein" /protein_id="AAH12194.1" /db_xref="GeneID:51172" /db_xref="MIM:607985" /translation="MATSTGRWLLLRLALFGFLWEASGGLDSGASRDDDLLLPYPRAR ARLPRDCTRVRAGNREHESWPPPPATPGAGGLAVRTFVSHFRDRAVAGHLTRAVEPLR TFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFFRMNSGECLGNVVSDERRVS SSGGLQNAQFGIRRDGTLVTGYLSEEEVLDTENPFVQLLSGVVWLIRNGSIYINESQA TECDETQETGSFSKFVNVISARTAIGHDRKGQLVLFHADGQTEQRGINLWEMAEFLLK QDVVNAINLDGGGSATFVLNGTLASYPSDHWQA" misc_feature 417..932 /gene="NAGPA" /gene_synonym="APAA" /gene_synonym="UCE" /note="EpsL; Region: COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]" /db_xref="CDD:COG4632" BASE COUNT 450 a 703 c 713 g 426 t ORIGIN 1 aattcggacg aggcaatatg gcgacctcca cgggtcgctg gcttctcctc cggcttgcac 61 tattcggctt cctctgggaa gcgtccggcg gcctcgactc gggggcctcc cgcgacgacg 121 acttgctact gccctatcca cgcgcgcgcg cgcgcctccc ccgggactgc acacgggtgc 181 gcgccggcaa ccgcgagcac gagagttggc ctccgcctcc cgcgactccc ggcgccggcg 241 gtctggccgt gcgcaccttc gtgtcgcact tcagggaccg cgcggtggcc ggccacctga 301 cgcgggccgt tgagcccctg cgcaccttct cggtgctgga gcccggtggg cccggcggct 361 gcgcggcgag acgacgcgcc accgtggagg agacggcgcg ggcggccgac tgccgtgtcg 421 cccagaacgg cggcttcttc cgcatgaact cgggcgagtg cctggggaac gtggtgagcg 481 acgagcggcg ggtgagcagc tccggggggc tgcagaacgc gcagttcggg atccgccgcg 541 acgggaccct ggtcaccggg tacctgtctg aggaggaggt gctggacact gagaacccat 601 ttgtgcagct gctgagtggg gtcgtgtggc tgattcgtaa tggaagcatc tacatcaacg 661 agagccaagc cacagagtgt gacgagacac aggagacagg ttcctttagc aaatttgtga 721 atgtgatatc agccaggacg gccattggcc acgaccggaa agggcagctg gtgctctttc 781 atgcagacgg ccaaacggag cagcgtggca tcaacctgtg ggaaatggcg gagttcctgc 841 tgaaacagga cgtggtcaac gccatcaacc tggatggggg tggctctgcc acctttgtgc 901 tcaacgggac cttggccagt tacccgtcag atcactggca ggcgtgatgg tgcacacctt 961 acggttacag ctgctcagga ggctcgggtg gaaggatcgc aatacctcat ctcaaaaata 1021 gatagacgcc atcttgtagc caggacaaca tgtggcgctg tccccgccaa gtgtccaccg 1081 tggtgtgtgt gcacgaaccc cgctgccagc cgcctgactg ccacggccac gggacctgcg 1141 tggacgggca ctgccaatgc accgggcact tctggcgggg tcccggctgt gatgagctgg 1201 actgtggccc ctctaactgc agccagcacg gactgtgcac ggagaccggc tgccgctgtg 1261 atgccggatg gaccgggtcc aactgcagtg aagagtgtcc ccttggctgg catgggccgg 1321 gctgccagag gccttgtaag tgtgagcacc attgtccctg tgaccccaag actggcaact 1381 gcagcgtctc cagagtaaag cagtgtctcc agccacctga agccaccctg agggcgggag 1441 aactctcctt tttcaccagg accgcctggc tagccctcac cctggcgctg gccttcctcc 1501 tgctgatcag cactgcagca aacctgtcct tgctcctgtc cagagcagag aggaaccggc 1561 gcctgcatgg ggactatgca taccacccgc tgcaggagat gaatggggag cctctggccg 1621 cagagaagga gcagccaggg ggcgcccaca accccttcaa ggactgaagc ctcaagctgc 1681 ccggggtggc acgtcgcgaa agcttgtttc cccacggtct ggcttctgca ggggaaattt 1741 caaggccact ggcgtggacc atctgggtgt cctcagcccc tgtggggcag ccaagttcct 1801 gatagcactt gtgcctcagc ccctcacctg gccacctgcc agggcacctg caaccctagc 1861 aataccatgc tcgctggaga ggctcagctg cctgcttttc gcctgcctgt gtctgctgct 1921 gagaagcccg tgcccccggg agggctgccg cactgccaaa gagtctccct cctcctgggg 1981 aaggggctgc caacgaacca gactcagtga ccacgtcatg acagaacagc acatcctggc 2041 cagcacccct ggctggagtg ggttaaaggg acgagtctgc cttcctggct gtgacacggg 2101 accccttttc tacagacctc atcactggat ttgccaacta gaattcgatt tcctgtcata 2161 ggaagctcct tggaagaagg gatgggggga tgagatcatg tttacagacc tgttttgtca 2221 tcctgctgcc aagaagtttt ttaatcactt gaataaattg atataataaa aggaaaaaaa 2281 aaaaaaaaaa aa //