LOCUS BC055416 1470 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens semenogelin I, mRNA (cDNA clone MGC:61979
IMAGE:6668871), complete cds.
ACCESSION BC055416
VERSION BC055416.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1470)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1470)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (31-JUL-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CLONTECH
cDNA Library Preparation: CLONTECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 48 Row: m Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 38049012.
FEATURES Location/Qualifiers
source 1..1470
/db_xref="H-InvDB:HIT000054119"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:61979 IMAGE:6668871"
/tissue_type="Prostate"
/clone_lib="NIH_MGC_83"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..1470
/gene="SEMG1"
/gene_synonym="MGC14719"
/gene_synonym="SGI"
/db_xref="GeneID:6406"
/db_xref="HGNC:HGNC:10742"
/db_xref="MIM:182140"
CDS 22..1230
/gene="SEMG1"
/gene_synonym="MGC14719"
/gene_synonym="SGI"
/codon_start=1
/product="semenogelin I"
/protein_id="AAH55416.1"
/db_xref="GeneID:6406"
/db_xref="HGNC:HGNC:10742"
/db_xref="MIM:182140"
/translation="MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQK
GQHYSGQKGKQQTESKGSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGG
SQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYS
NTEERLWVHGLSKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKG
HYQNVVEVREEHSSKVQTSLCPAHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQD
QQHGRKANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPK
QEPWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQH
LNNDQNPLFT"
BASE COUNT 557 a 316 c 286 g 311 t
ORIGIN
1 ggggagggtt ttccaagcaa gatgaagccc aacatcatct ttgtactttc cctgctcctc
61 atcttggaga agcaagcagc tgtgatggga caaaaaggtg gatcaaaagg ccgattacca
121 agtgaatttt cccaatttcc acacggacaa aagggccagc actattctgg acaaaaaggc
181 aagcaacaaa ctgaatccaa aggcagtttt tctattcaat acacatatca tgtagatgcc
241 aatgatcatg accagtcccg aaaaagtcag caatatgatt tgaatgccct acataagacg
301 acaaaatcac aacgacatct aggtggaagt caacaactgc tccataataa acaagaaggc
361 agagaccatg ataaatcaaa aggtcatttt cacagggtag ttatacacca taaaggaggc
421 aaagctcatc gtgggacaca aaatccttct caagatcagg ggaatagccc atctggaaag
481 ggaatatcca gtcaatattc aaacacagaa gaaaggctgt gggttcatgg actaagtaaa
541 gaacaaactt ccgtctctgg tgcacaaaaa ggtagaaaac aaggcggatc ccaaagcagt
601 tatgttctcc aaactgaaga gctagtagct aacaaacaac aacgtgagac taaaaattct
661 catcaaaata aagggcatta ccaaaatgtg gttgaagtga gagaggaaca ttcaagtaaa
721 gtacaaacct cactctgtcc tgcgcaccaa gacaaactcc aacatggatc caaagacatt
781 ttttctaccc aagatgagct cctagtatat aacaagaatc aacaccagac aaaaaatctc
841 aatcaagatc aacagcatgg ccgaaaggca aataaaatat cataccaatc ttcaagtaca
901 gaagaaagac gactccacta tggagaaaat ggtgtgcaga aagatgtatc ccaacgcagt
961 atttatagcc aaactgaaaa gctagtagca ggcaagtctc aaatccaggc accaaatcct
1021 aagcaagagc catggcatgg tgaaaacgca aaaggagagt ctggccaatc tacaaataga
1081 gaacaagacc tactcagtca tgaacaaaaa ggcagacacc aacatggatc tcatggggga
1141 ttggatattg taattataga gcaggaagat gacagtgatc gtcatttggc acaacatctt
1201 aacaacgacc aaaacccatt atttacataa acctaccatt cggtaaccat gtgaaaggat
1261 ggaccaatat caaggtgtca gttgacctca gtgaattctg tgatgtttct gagatgcaga
1321 ctcccgtgta gtttcagatt cttggtccat ggatgacacc acctgcccat gcttccttga
1381 attaggcttt cctaacctga agcgccttca aacttccaat aaagagatca ttttctgctc
1441 cgaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//