LOCUS BC007493 2418 bp mRNA linear HUM 16-DEC-2006 DEFINITION Homo sapiens galactosidase, beta 1, mRNA (cDNA clone MGC:2315 IMAGE:2988086), complete cds. ACCESSION BC007493 VERSION BC007493.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2418) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2418) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (04-MAY-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 8, 2003 this sequence version replaced BC007493.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 3 Row: p Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 10834965. FEATURES Location/Qualifiers source 1..2418 /db_xref="H-InvDB:HIT000033247" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:2315 IMAGE:2988086" /tissue_type="Colon, adenocarcinoma" /clone_lib="NIH_MGC_15" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2418 /gene="GLB1" /db_xref="GeneID:2720" /db_xref="HGNC:HGNC:4298" /db_xref="MIM:230500" CDS 48..2081 /gene="GLB1" /codon_start=1 /product="galactosidase, beta 1" /protein_id="AAH07493.1" /db_xref="GeneID:2720" /db_xref="HGNC:HGNC:4298" /db_xref="MIM:230500" /translation="MPGFLVRILLLLLVLLLLGPTRGLRNATQRMFEIDYSRDSFLKD GQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDH DVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDK WLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLF TTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDH WGQPHSTIKTEAVASSLYDILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYD APLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDIL CPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGI PQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTD WTIFPLDTEDAVRSHLGGWGHRDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLP QDTFIQFPGWTKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWA PCSSDDPELCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV" BASE COUNT 587 a 629 c 607 g 595 t ORIGIN 1 aagcggccgg cctgggcgcc gactgcagag ccgggaggct ggtggtcatg ccggggttcc 61 tggttcgcat cctccttctg ctgctggttc tgctgcttct gggccctacg cgcggcttgc 121 gcaatgccac ccagaggatg tttgaaattg actatagccg ggactccttc ctcaaggatg 181 gccagccatt tcgctacatc tcaggaagca ttcactactc ccgtgtgccc cgcttctact 241 ggaaggaccg gctgctgaag atgaagatgg ctgggctgaa cgccatccag acgtatgtgc 301 cctggaactt tcatgagccc tggccaggac agtaccagtt ttctgaggac catgatgtgg 361 aatattttct tcggctggct catgagctgg gactgctggt tatcctgagg cccgggccct 421 acatctgtgc agagtgggaa atgggaggat tacctgcttg gctgctagag aaagagtcta 481 ttcttctccg ctcctccgac ccagattacc tggcagctgt ggacaagtgg ttgggagtcc 541 ttctgcccaa gatgaagcct ctcctctatc agaatggagg gccagttata acagtgcagg 601 ttgaaaatga atatggcagc tactttgcct gtgattttga ctacctgcgc ttcctgcaga 661 agcgctttcg ccaccatctg ggggatgatg tggttctgtt taccactgat ggagcacata 721 aaacattcct gaaatgtggg gccctgcagg gcctctacac cacggtggac tttggaacag 781 gcagcaacat cacagatgct ttcctaagcc agaggaagtg tgagcccaaa ggacccttga 841 tcaattctga attctatact ggctggctag atcactgggg ccaacctcac tccacaatca 901 agaccgaagc agtggcttcc tccctctatg atatacttgc ccgtggggcg agtgtgaact 961 tgtacatgtt tataggtggg accaattttg cctattggaa tggggccaac tcaccctatg 1021 cagcacagcc caccagctac gactatgatg ccccactgag tgaggctggg gacctcactg 1081 agaagtattt tgctctgcga aacatcatcc agaagtttga aaaagtacca gaaggtccta 1141 tccctccatc tacaccaaag tttgcatatg gaaaggtcac tttggaaaag ttaaagacag 1201 tgggagcagc tctggacatt ctgtgtccct ctgggcccat caaaagcctt tatcccttga 1261 catttatcca ggtgaaacag cattatgggt ttgtgctgta ccggacaaca cttcctcaag 1321 attgcagcaa cccagcacct ctctcttcac ccctcaatgg agtccacgat cgagcatatg 1381 ttgctgtgga tgggatcccc cagggagtcc ttgagcgaaa caatgtgatc actctgaaca 1441 taacagggaa agctggagcc actctggacc ttctggtaga gaacatggga cgtgtgaact 1501 atggtgcata tatcaacgat tttaagggtt tggtttctaa cctgactctc agttccaata 1561 tcctcacgga ctggacgatc tttccactgg acactgagga tgcagtgcgc agccacctgg 1621 ggggctgggg acaccgtgac agtggccacc atgatgaagc ctgggcccac aactcatcca 1681 actacacgct cccggccttt tatatgggga acttctccat tcccagtggg atcccagact 1741 tgccccagga cacctttatc cagtttcctg gatggaccaa gggccaggtc tggattaatg 1801 gctttaacct tggccgctat tggccagccc ggggccctca gttgaccttg tttgtgcccc 1861 agcacatcct gatgacctcg gccccaaaca ccatcaccgt gctggaactg gagtgggcac 1921 cctgcagcag tgatgatcca gaactatgtg ctgtgacgtt cgtggacagg ccagttattg 1981 gctcatctgt gacctacgat catccctcca aacctgttga aaaaagactc atgcccccac 2041 ccccgcaaaa aaacaaagat tcatggctgg accatgtatg atgatgaaag cctgtgtctt 2101 tgagggattc taccctgaac atacctcaca gatcctccct gttatgccac atttcactga 2161 ttggaatgtg gaaatggaaa aggaatttag gatgtgcatt ttcacctgag gtttccctgc 2221 atccctgcag tgccaaagcc ccaccttcag ggaccacctg gaatgtgtga ggggctgaca 2281 gcacagtaac gtgcatacat atctgcaggg ctggaatgga agctttaaag gtggtagtga 2341 tttttatttt ggaagaatca tgttaccttt ttgttaaata aaatttgtac tcaaatgaaa 2401 aaaaaaaaaa aaaaaaaa //