LOCUS BC031861 2008 bp mRNA linear HUM 24-JUL-2006 DEFINITION Homo sapiens spermatogenesis and oogenesis specific basic helix-loop-helix 1, mRNA (cDNA clone MGC:42684 IMAGE:4826967), complete cds. ACCESSION BC031861 VERSION BC031861.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2008) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2008) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (06-JUN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 71 Row: p Column: 2 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 60302675. FEATURES Location/Qualifiers source 1..2008 /db_xref="H-InvDB:HIT000092453" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:42684 IMAGE:4826967" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2008 /gene="SOHLH1" /gene_synonym="bA100C15.3" /gene_synonym="NOHLH" /gene_synonym="TEB2" /db_xref="GeneID:402381" /db_xref="HGNC:HGNC:27845" /db_xref="MIM:610224" CDS 73..1059 /gene="SOHLH1" /gene_synonym="bA100C15.3" /gene_synonym="NOHLH" /gene_synonym="TEB2" /codon_start=1 /product="spermatogenesis and oogenesis specific basic helix-loop-helix 1" /protein_id="AAH31861.2" /db_xref="GeneID:402381" /db_xref="HGNC:HGNC:27845" /db_xref="MIM:610224" /translation="MASRCSEPYPEVSRIPTVRGCNGSLSGALSCCEDSAQGSGPPKA PTVAEGPSSCLRRNVISERERRKRMSLSCERLRALLPQFDGRREDMASVLEMSVQFLR LASALGPSQEQHAILASSKEMWHSLQEDVLQLTLSSQIQAGVPDPGTGASSGTRTPDV KAFLESPWSLDPASASPEPVPHILASSRQWDPASCTSLGTDKCEALLGLCQVRGGLPP FSEPSSLVPWPPGRSLPKAVRPPLSWPPFSQQQTLPVMSGEALGWLGQAGSLAMGAAP LGEPAKEDPMLAQEAGSALGSDVDDGTSFLLTAGPSSWPGEWGPGFRAGPPA" BASE COUNT 311 a 566 c 707 g 424 t ORIGIN 1 gtcgagagcg ggggggcggg gctgcctgcg gaaggggccg tgcgcgtggg gtcgctccgc 61 agctgcgagt tcatggcgtc ccggtgctcc gagccctacc cggaggtctc cagaatcccc 121 accgtcaggg gatgcaacgg ctccctgtct ggtgccctct cctgctgcga ggactcggcc 181 cagggctcgg gcccgcccaa ggcccctacg gtggccgagg gtcccagctc ctgccttcgg 241 cggaacgtga tcagcgagag ggagcgcagg aagcggatgt cgttgagctg tgagcgtctg 301 cgggccctgc tgccccagtt cgatggccgg cgggaggaca tggcctcggt cctggagatg 361 tctgtgcagt tcctgcggct tgccagcgcc ctggggccca gtcaggagca gcacgctatt 421 cttgcttcct ccaaggaaat gtggcactcg ttgcaggagg atgttttaca gttgacgttg 481 tcgagtcaga ttcaagcagg tgtgccagac cctgggacgg gagcgtccag cgggactcga 541 accccagatg tgaaggcgtt tctggaaagt ccttggtccc tggatccagc gtcggccagc 601 ccagagcccg tgccgcacat ccttgcgtcc tccaggcagt gggaccccgc gagctgcacg 661 tccctgggca cggacaagtg tgaggcactg ttggggctgt gccaggtgcg gggtgggctg 721 ccccctttct cagaaccttc cagcctggtg ccgtggcccc caggccggag tcttcctaag 781 gctgtgaggc cacccctgtc ctggcctccg ttctcgcagc agcagacctt gcccgtgatg 841 agcggggagg cccttggctg gctgggccag gctggttccc tggccatggg ggctgcacct 901 ctgggggagc cagccaagga ggaccccatg ctggcgcagg aggccgggtc tgcgttgggg 961 tctgatgtgg acgatgggac gtccttcctg ctgactgctg gtcccagctc gtggccgggt 1021 gagtggggcc ctgggttcag ggctggtccc cctgcgtaag cacctgatat gagctgggga 1081 attgggactc aggcctagcg tgtgtcaggg ggccccaggg tctgtgtctg agcccccgtt 1141 ttctccaccg ttggtgctta gctgttgggc cgtgtctggt attgagcgtt ataggtgata 1201 tgtgagtgtg tgagtgaacg tgatgtggac gagcctgcat gagagctggc ggtgcttgga 1261 gccctgtatt cttccccttg accctgtgtc agcgaggccc ctggtccagc ccgtgctggc 1321 cggcaaaggt gtatggccct gtgacatggc gtgtgtccat tccatttagg atttagtgtt 1381 tgggagcccg ttcctggttc tggtccaaca ggtgtcctgg gtctggggcc accaggtatg 1441 agctcacaca cctcatcaca gcctctgtgc agcggggaca gggcagggca gacggactga 1501 ggtgggagct ctgttcctcc gaccccacct gtctgcctca agctcccaga gcacatgcct 1561 gccctgccga tgtgtcctgt ttgcttccag ggtctctgga gggtcgagga ggcagtggcc 1621 ctgcatgggc cccagctgag agcagtccac tggatgttgg agagccaggc ttcctagggg 1681 accctgagct tggctcccag gagctccagg acagccctct ggagccgtgg ggcctggatg 1741 tggactgtgc aggcctggcc ctgaaggacg aggtggagag catcttccct gacttctttg 1801 cctgctagca gccaggctgt ggagaggagg cgggcggggc cggtggatct ggagccgtgt 1861 gtgtccctgt cgcgtgcttg ggcattttat tttgggtttg ggttgcagct gcaccctaat 1921 ttgagcgagg cttacaggct gtgagaagag gatagtgaat attaaagagg gaaagcagtt 1981 ttcctggagg aagcaaaaaa aaaaaaaa //