LOCUS BC055416 1470 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens semenogelin I, mRNA (cDNA clone MGC:61979 IMAGE:6668871), complete cds. ACCESSION BC055416 VERSION BC055416.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1470) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1470) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-JUL-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: CLONTECH cDNA Library Preparation: CLONTECH Laboratories, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 48 Row: m Column: 11 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 38049012. FEATURES Location/Qualifiers source 1..1470 /db_xref="H-InvDB:HIT000054119" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:61979 IMAGE:6668871" /tissue_type="Prostate" /clone_lib="NIH_MGC_83" /lab_host="DH10B" /note="Vector: pDNR-LIB" gene 1..1470 /gene="SEMG1" /gene_synonym="MGC14719" /gene_synonym="SGI" /db_xref="GeneID:6406" /db_xref="HGNC:HGNC:10742" /db_xref="MIM:182140" CDS 22..1230 /gene="SEMG1" /gene_synonym="MGC14719" /gene_synonym="SGI" /codon_start=1 /product="semenogelin I" /protein_id="AAH55416.1" /db_xref="GeneID:6406" /db_xref="HGNC:HGNC:10742" /db_xref="MIM:182140" /translation="MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK GQHYSGQKGKQQTESKGSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYS NTEERLWVHGLSKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKG HYQNVVEVREEHSSKVQTSLCPAHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQD QQHGRKANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPK QEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQH LNNDQNPLFT" BASE COUNT 557 a 316 c 286 g 311 t ORIGIN 1 ggggagggtt ttccaagcaa gatgaagccc aacatcatct ttgtactttc cctgctcctc 61 atcttggaga agcaagcagc tgtgatggga caaaaaggtg gatcaaaagg ccgattacca 121 agtgaatttt cccaatttcc acacggacaa aagggccagc actattctgg acaaaaaggc 181 aagcaacaaa ctgaatccaa aggcagtttt tctattcaat acacatatca tgtagatgcc 241 aatgatcatg accagtcccg aaaaagtcag caatatgatt tgaatgccct acataagacg 301 acaaaatcac aacgacatct aggtggaagt caacaactgc tccataataa acaagaaggc 361 agagaccatg ataaatcaaa aggtcatttt cacagggtag ttatacacca taaaggaggc 421 aaagctcatc gtgggacaca aaatccttct caagatcagg ggaatagccc atctggaaag 481 ggaatatcca gtcaatattc aaacacagaa gaaaggctgt gggttcatgg actaagtaaa 541 gaacaaactt ccgtctctgg tgcacaaaaa ggtagaaaac aaggcggatc ccaaagcagt 601 tatgttctcc aaactgaaga gctagtagct aacaaacaac aacgtgagac taaaaattct 661 catcaaaata aagggcatta ccaaaatgtg gttgaagtga gagaggaaca ttcaagtaaa 721 gtacaaacct cactctgtcc tgcgcaccaa gacaaactcc aacatggatc caaagacatt 781 ttttctaccc aagatgagct cctagtatat aacaagaatc aacaccagac aaaaaatctc 841 aatcaagatc aacagcatgg ccgaaaggca aataaaatat cataccaatc ttcaagtaca 901 gaagaaagac gactccacta tggagaaaat ggtgtgcaga aagatgtatc ccaacgcagt 961 atttatagcc aaactgaaaa gctagtagca ggcaagtctc aaatccaggc accaaatcct 1021 aagcaagagc catggcatgg tgaaaacgca aaaggagagt ctggccaatc tacaaataga 1081 gaacaagacc tactcagtca tgaacaaaaa ggcagacacc aacatggatc tcatggggga 1141 ttggatattg taattataga gcaggaagat gacagtgatc gtcatttggc acaacatctt 1201 aacaacgacc aaaacccatt atttacataa acctaccatt cggtaaccat gtgaaaggat 1261 ggaccaatat caaggtgtca gttgacctca gtgaattctg tgatgtttct gagatgcaga 1321 ctcccgtgta gtttcagatt cttggtccat ggatgacacc acctgcccat gcttccttga 1381 attaggcttt cctaacctga agcgccttca aacttccaat aaagagatca ttttctgctc 1441 cgaaaaaaaa aaaaaaaaaa aaaaaaaaaa //