LOCUS BC007493 2418 bp mRNA linear HUM 16-DEC-2006
DEFINITION Homo sapiens galactosidase, beta 1, mRNA (cDNA clone MGC:2315
IMAGE:2988086), complete cds.
ACCESSION BC007493
VERSION BC007493.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2418)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2418)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (04-MAY-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Oct 8, 2003 this sequence version replaced BC007493.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 3 Row: p Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 10834965.
FEATURES Location/Qualifiers
source 1..2418
/db_xref="H-InvDB:HIT000033247"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:2315 IMAGE:2988086"
/tissue_type="Colon, adenocarcinoma"
/clone_lib="NIH_MGC_15"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2418
/gene="GLB1"
/db_xref="GeneID:2720"
/db_xref="HGNC:HGNC:4298"
/db_xref="MIM:230500"
CDS 48..2081
/gene="GLB1"
/codon_start=1
/product="galactosidase, beta 1"
/protein_id="AAH07493.1"
/db_xref="GeneID:2720"
/db_xref="HGNC:HGNC:4298"
/db_xref="MIM:230500"
/translation="MPGFLVRILLLLLVLLLLGPTRGLRNATQRMFEIDYSRDSFLKD
GQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDH
DVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDK
WLGVLLPKMKPLLYQNGGPVITVQVENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLF
TTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLINSEFYTGWLDH
WGQPHSTIKTEAVASSLYDILARGASVNLYMFIGGTNFAYWNGANSPYAAQPTSYDYD
APLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTPKFAYGKVTLEKLKTVGAALDIL
CPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGI
PQGVLERNNVITLNITGKAGATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTD
WTIFPLDTEDAVRSHLGGWGHRDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLP
QDTFIQFPGWTKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWA
PCSSDDPELCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV"
BASE COUNT 587 a 629 c 607 g 595 t
ORIGIN
1 aagcggccgg cctgggcgcc gactgcagag ccgggaggct ggtggtcatg ccggggttcc
61 tggttcgcat cctccttctg ctgctggttc tgctgcttct gggccctacg cgcggcttgc
121 gcaatgccac ccagaggatg tttgaaattg actatagccg ggactccttc ctcaaggatg
181 gccagccatt tcgctacatc tcaggaagca ttcactactc ccgtgtgccc cgcttctact
241 ggaaggaccg gctgctgaag atgaagatgg ctgggctgaa cgccatccag acgtatgtgc
301 cctggaactt tcatgagccc tggccaggac agtaccagtt ttctgaggac catgatgtgg
361 aatattttct tcggctggct catgagctgg gactgctggt tatcctgagg cccgggccct
421 acatctgtgc agagtgggaa atgggaggat tacctgcttg gctgctagag aaagagtcta
481 ttcttctccg ctcctccgac ccagattacc tggcagctgt ggacaagtgg ttgggagtcc
541 ttctgcccaa gatgaagcct ctcctctatc agaatggagg gccagttata acagtgcagg
601 ttgaaaatga atatggcagc tactttgcct gtgattttga ctacctgcgc ttcctgcaga
661 agcgctttcg ccaccatctg ggggatgatg tggttctgtt taccactgat ggagcacata
721 aaacattcct gaaatgtggg gccctgcagg gcctctacac cacggtggac tttggaacag
781 gcagcaacat cacagatgct ttcctaagcc agaggaagtg tgagcccaaa ggacccttga
841 tcaattctga attctatact ggctggctag atcactgggg ccaacctcac tccacaatca
901 agaccgaagc agtggcttcc tccctctatg atatacttgc ccgtggggcg agtgtgaact
961 tgtacatgtt tataggtggg accaattttg cctattggaa tggggccaac tcaccctatg
1021 cagcacagcc caccagctac gactatgatg ccccactgag tgaggctggg gacctcactg
1081 agaagtattt tgctctgcga aacatcatcc agaagtttga aaaagtacca gaaggtccta
1141 tccctccatc tacaccaaag tttgcatatg gaaaggtcac tttggaaaag ttaaagacag
1201 tgggagcagc tctggacatt ctgtgtccct ctgggcccat caaaagcctt tatcccttga
1261 catttatcca ggtgaaacag cattatgggt ttgtgctgta ccggacaaca cttcctcaag
1321 attgcagcaa cccagcacct ctctcttcac ccctcaatgg agtccacgat cgagcatatg
1381 ttgctgtgga tgggatcccc cagggagtcc ttgagcgaaa caatgtgatc actctgaaca
1441 taacagggaa agctggagcc actctggacc ttctggtaga gaacatggga cgtgtgaact
1501 atggtgcata tatcaacgat tttaagggtt tggtttctaa cctgactctc agttccaata
1561 tcctcacgga ctggacgatc tttccactgg acactgagga tgcagtgcgc agccacctgg
1621 ggggctgggg acaccgtgac agtggccacc atgatgaagc ctgggcccac aactcatcca
1681 actacacgct cccggccttt tatatgggga acttctccat tcccagtggg atcccagact
1741 tgccccagga cacctttatc cagtttcctg gatggaccaa gggccaggtc tggattaatg
1801 gctttaacct tggccgctat tggccagccc ggggccctca gttgaccttg tttgtgcccc
1861 agcacatcct gatgacctcg gccccaaaca ccatcaccgt gctggaactg gagtgggcac
1921 cctgcagcag tgatgatcca gaactatgtg ctgtgacgtt cgtggacagg ccagttattg
1981 gctcatctgt gacctacgat catccctcca aacctgttga aaaaagactc atgcccccac
2041 ccccgcaaaa aaacaaagat tcatggctgg accatgtatg atgatgaaag cctgtgtctt
2101 tgagggattc taccctgaac atacctcaca gatcctccct gttatgccac atttcactga
2161 ttggaatgtg gaaatggaaa aggaatttag gatgtgcatt ttcacctgag gtttccctgc
2221 atccctgcag tgccaaagcc ccaccttcag ggaccacctg gaatgtgtga ggggctgaca
2281 gcacagtaac gtgcatacat atctgcaggg ctggaatgga agctttaaag gtggtagtga
2341 tttttatttt ggaagaatca tgttaccttt ttgttaaata aaatttgtac tcaaatgaaa
2401 aaaaaaaaaa aaaaaaaa
//