LOCUS BC007257 2369 bp mRNA linear HUM 03-OCT-2003
DEFINITION Homo sapiens cystathionine-beta-synthase, mRNA (cDNA clone
MGC:15515 IMAGE:3028099), complete cds.
ACCESSION BC007257
VERSION BC007257.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2369)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2369)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (01-MAY-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
George Yang, Scott Zuyderduyn, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 22 Row: e Column: 24
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis, Similarity but not identity to protein.
FEATURES Location/Qualifiers
source 1..2369
/db_xref="H-InvDB:HIT000033101"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:15515 IMAGE:3028099"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2369
/gene="CBS"
/gene_synonym="HIP4"
/db_xref="GeneID:875"
/db_xref="MIM:236200"
CDS 223..1878
/gene="CBS"
/gene_synonym="HIP4"
/codon_start=1
/product="CBS protein"
/protein_id="AAH07257.1"
/db_xref="GeneID:875"
/db_xref="MIM:236200"
/translation="MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWI
RPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLK
CELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAV
RGYRCIIVMPEKMSSEKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSH
ILDQYRNASNPLAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCR
IIGVDPEGSILAEPEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTF
ARMLIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWML
QKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTITCGHTIEILREKGFDQAPVVD
EAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQIRLTDTLGRLSHILEMDHF
ALVVHEQIQYHSTGKSSQRQMVFGVVTAIDLLNFVAAQERDQK"
misc_feature 448..1374
/gene="CBS"
/gene_synonym="HIP4"
/note="CysK; Region: Cysteine synthase [Amino acid
transport and metabolism]"
/db_xref="CDD:COG0031"
misc_feature 1486..1629
/gene="CBS"
/gene_synonym="HIP4"
/note="Region: Domain in cystathionine beta-synthase and
other proteins. Domain present in all 3 forms of cellular
life. Present in two copies in inosine monophosphate
dehydrogenase, of which one is disordered in the crystal
structure. A number of disease states are associated with
CBS-containing proteins including homocystinuria, Becker's
and Thomsen disease"
/db_xref="CDD:smart00116"
BASE COUNT 554 a 644 c 737 g 434 t
ORIGIN
1 cacgccctcg gggtcggtcc tcgaggacgc gcagggcccc ccacccacca ggacgcacgt
61 ttcaagctca tcagtaaagg ttccttaaat tccctaaggg caagaagtta accaagtaaa
121 acagcatcgg aacaccagga tcccatgaca gattctgttg tcacgtctcc ttacagagtt
181 tgagcggtgc tgaactgtca gcaccatctg tccggtccca gcatgccttc tgagaccccc
241 caggcagaag tggggcccac aggctgcccc caccgctcag ggccacactc ggcgaagggg
301 agcctggaga aggggtcccc agaggataag gaagccaagg agcccctgtg gatccggccc
361 gatgctccga gcaggtgcac ctggcagctg ggccggcctg cctccgagtc cccacatcac
421 cacactgccc cggcaaaatc tccaaaaatc ttgccagata ttctgaagaa aatcggggac
481 acccctatgg tcagaatcaa caagattggg aagaagttcg gcctgaagtg tgagctcttg
541 gccaagtgtg agttcttcaa cgcgggcggg agcgtgaagg accgcatcag cctgcggatg
601 attgaggatg ctgagcgcga cgggacgctg aagcccgggg acacgattat cgagccgaca
661 tccgggaaca ccgggatcgg gctggccctg gctgcggcag tgaggggcta tcgctgcatc
721 atcgtgatgc cagagaagat gagctccgag aaggtggacg tgctgcgggc actgggggct
781 gagattgtga ggacgcccac caatgccagg ttcgactccc cggagtcaca cgtgggggtg
841 gcctggcggc tgaagaacga aatccccaat tctcacatcc tagaccagta ccgcaacgcc
901 agcaaccccc tggctcacta tgacaccacc gctgatgaga tcctgcagca gtgtgatggg
961 aagctggaca tgctggtggc ttcagtgggc acgggcggca ccatcacggg cattgccagg
1021 aagctgaagg agaagtgtcc tggatgcagg atcattgggg tggatcccga agggtccatc
1081 ctcgcagagc cggaggagct gaaccagacg gagcagacaa cctacgaggt ggaagggatc
1141 ggctacgact tcatccccac ggtgctggac aggacggtgg tggacaagtg gttcaagagc
1201 aacgatgagg aggcgttcac ctttgcccgc atgctgatcg cgcaagaggg gctgctgtgc
1261 ggtggcagtg ctggcagcac ggtggcggtg gccgtgaagg ccgcgcagga gctgcaggag
1321 ggccagcgct gcgtggtcat tctgcccgac tcagtgcgga actacatgac caagttcctg
1381 agcgacaggt ggatgctgca gaagggcttt ctgaaggagg aggacctcac ggagaagaag
1441 ccctggtggt ggcacctccg tgttcaggag ctgggcctgt cagccccgct gaccgtgctc
1501 ccgaccatca cctgtgggca caccatcgag atcctccggg agaagggctt cgaccaggcg
1561 cccgtggtgg atgaggcggg ggtaatcctg ggaatggtga cgcttgggaa catgctctcg
1621 tccctgcttg ccgggaaggt gcagccgtca gaccaagttg gcaaagtcat ctacaagcag
1681 ttcaaacaga tccgcctcac ggacacgctg ggcaggctct cgcacatcct ggagatggac
1741 cacttcgccc tggtggtgca cgagcagatc cagtaccaca gcaccgggaa gtccagtcag
1801 cggcagatgg tgttcggggt ggtcaccgcc attgacttgc tgaacttcgt ggccgcccag
1861 gagcgggacc agaagtgaag tccggagcgc tgggcggatg tttcaccaag gaaatattga
1921 gagagaagtc ggccaggtag gatgaacaca ggcaatgact gcgcagagtg gattaaaggc
1981 aaaagagaga agagtccagg aaggggcggg gagaagcctg ggtggctcag catcctccac
2041 gggctgcgcc gtctgctcgg ggctgagctg gcgggagcag tttgcgtgtt tgggtttttt
2101 aattgagatg aaattcaaat aacctaaaaa tcaatcactt gaaagtgaac aatcagcggc
2161 atttagtaca tccagaaagt tgtgtaggca ccacctctgt cacgttccgg aacattctgt
2221 catcaccctg tgaagcaatc atttcccctc ccgtcttcct cctcccctgg caactgctga
2281 tcgactttgt gtctctgttg tctaaaatag gttttccctg ttctggacat ttcatataaa
2341 tggaatcaca caaaaaaaaa aaaaaaaaa
//