LOCUS BC007257 2369 bp mRNA linear HUM 03-OCT-2003 DEFINITION Homo sapiens cystathionine-beta-synthase, mRNA (cDNA clone MGC:15515 IMAGE:3028099), complete cds. ACCESSION BC007257 VERSION BC007257.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2369) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2369) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (01-MAY-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, George Yang, Scott Zuyderduyn, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 22 Row: e Column: 24 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, Similarity but not identity to protein. FEATURES Location/Qualifiers source 1..2369 /db_xref="H-InvDB:HIT000033101" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:15515 IMAGE:3028099" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2369 /gene="CBS" /gene_synonym="HIP4" /db_xref="GeneID:875" /db_xref="MIM:236200" CDS 223..1878 /gene="CBS" /gene_synonym="HIP4" /codon_start=1 /product="CBS protein" /protein_id="AAH07257.1" /db_xref="GeneID:875" /db_xref="MIM:236200" /translation="MPSETPQAEVGPTGCPHRSGPHSAKGSLEKGSPEDKEAKEPLWI RPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLK CELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAV RGYRCIIVMPEKMSSEKVDVLRALGAEIVRTPTNARFDSPESHVGVAWRLKNEIPNSH ILDQYRNASNPLAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCR IIGVDPEGSILAEPEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTF ARMLIAQEGLLCGGSAGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWML QKGFLKEEDLTEKKPWWWHLRVQELGLSAPLTVLPTITCGHTIEILREKGFDQAPVVD EAGVILGMVTLGNMLSSLLAGKVQPSDQVGKVIYKQFKQIRLTDTLGRLSHILEMDHF ALVVHEQIQYHSTGKSSQRQMVFGVVTAIDLLNFVAAQERDQK" misc_feature 448..1374 /gene="CBS" /gene_synonym="HIP4" /note="CysK; Region: Cysteine synthase [Amino acid transport and metabolism]" /db_xref="CDD:COG0031" misc_feature 1486..1629 /gene="CBS" /gene_synonym="HIP4" /note="Region: Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease" /db_xref="CDD:smart00116" BASE COUNT 554 a 644 c 737 g 434 t ORIGIN 1 cacgccctcg gggtcggtcc tcgaggacgc gcagggcccc ccacccacca ggacgcacgt 61 ttcaagctca tcagtaaagg ttccttaaat tccctaaggg caagaagtta accaagtaaa 121 acagcatcgg aacaccagga tcccatgaca gattctgttg tcacgtctcc ttacagagtt 181 tgagcggtgc tgaactgtca gcaccatctg tccggtccca gcatgccttc tgagaccccc 241 caggcagaag tggggcccac aggctgcccc caccgctcag ggccacactc ggcgaagggg 301 agcctggaga aggggtcccc agaggataag gaagccaagg agcccctgtg gatccggccc 361 gatgctccga gcaggtgcac ctggcagctg ggccggcctg cctccgagtc cccacatcac 421 cacactgccc cggcaaaatc tccaaaaatc ttgccagata ttctgaagaa aatcggggac 481 acccctatgg tcagaatcaa caagattggg aagaagttcg gcctgaagtg tgagctcttg 541 gccaagtgtg agttcttcaa cgcgggcggg agcgtgaagg accgcatcag cctgcggatg 601 attgaggatg ctgagcgcga cgggacgctg aagcccgggg acacgattat cgagccgaca 661 tccgggaaca ccgggatcgg gctggccctg gctgcggcag tgaggggcta tcgctgcatc 721 atcgtgatgc cagagaagat gagctccgag aaggtggacg tgctgcgggc actgggggct 781 gagattgtga ggacgcccac caatgccagg ttcgactccc cggagtcaca cgtgggggtg 841 gcctggcggc tgaagaacga aatccccaat tctcacatcc tagaccagta ccgcaacgcc 901 agcaaccccc tggctcacta tgacaccacc gctgatgaga tcctgcagca gtgtgatggg 961 aagctggaca tgctggtggc ttcagtgggc acgggcggca ccatcacggg cattgccagg 1021 aagctgaagg agaagtgtcc tggatgcagg atcattgggg tggatcccga agggtccatc 1081 ctcgcagagc cggaggagct gaaccagacg gagcagacaa cctacgaggt ggaagggatc 1141 ggctacgact tcatccccac ggtgctggac aggacggtgg tggacaagtg gttcaagagc 1201 aacgatgagg aggcgttcac ctttgcccgc atgctgatcg cgcaagaggg gctgctgtgc 1261 ggtggcagtg ctggcagcac ggtggcggtg gccgtgaagg ccgcgcagga gctgcaggag 1321 ggccagcgct gcgtggtcat tctgcccgac tcagtgcgga actacatgac caagttcctg 1381 agcgacaggt ggatgctgca gaagggcttt ctgaaggagg aggacctcac ggagaagaag 1441 ccctggtggt ggcacctccg tgttcaggag ctgggcctgt cagccccgct gaccgtgctc 1501 ccgaccatca cctgtgggca caccatcgag atcctccggg agaagggctt cgaccaggcg 1561 cccgtggtgg atgaggcggg ggtaatcctg ggaatggtga cgcttgggaa catgctctcg 1621 tccctgcttg ccgggaaggt gcagccgtca gaccaagttg gcaaagtcat ctacaagcag 1681 ttcaaacaga tccgcctcac ggacacgctg ggcaggctct cgcacatcct ggagatggac 1741 cacttcgccc tggtggtgca cgagcagatc cagtaccaca gcaccgggaa gtccagtcag 1801 cggcagatgg tgttcggggt ggtcaccgcc attgacttgc tgaacttcgt ggccgcccag 1861 gagcgggacc agaagtgaag tccggagcgc tgggcggatg tttcaccaag gaaatattga 1921 gagagaagtc ggccaggtag gatgaacaca ggcaatgact gcgcagagtg gattaaaggc 1981 aaaagagaga agagtccagg aaggggcggg gagaagcctg ggtggctcag catcctccac 2041 gggctgcgcc gtctgctcgg ggctgagctg gcgggagcag tttgcgtgtt tgggtttttt 2101 aattgagatg aaattcaaat aacctaaaaa tcaatcactt gaaagtgaac aatcagcggc 2161 atttagtaca tccagaaagt tgtgtaggca ccacctctgt cacgttccgg aacattctgt 2221 catcaccctg tgaagcaatc atttcccctc ccgtcttcct cctcccctgg caactgctga 2281 tcgactttgt gtctctgttg tctaaaatag gttttccctg ttctggacat ttcatataaa 2341 tggaatcaca caaaaaaaaa aaaaaaaaa //