LOCUS BC003061 1972 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens legumain, mRNA (cDNA clone MGC:1395 IMAGE:3504506),
complete cds.
ACCESSION BC003061
VERSION BC003061.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1972)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1972)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (12-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC003061.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 7 Row: o Column: 5
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 56682961.
FEATURES Location/Qualifiers
source 1..1972
/db_xref="H-InvDB:HIT000031396"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1395 IMAGE:3504506"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1972
/gene="LGMN"
/gene_synonym="AEP"
/gene_synonym="LGMN1"
/db_xref="GeneID:5641"
/db_xref="HGNC:HGNC:9472"
/db_xref="MIM:602620"
CDS 133..1434
/gene="LGMN"
/gene_synonym="AEP"
/gene_synonym="LGMN1"
/codon_start=1
/product="legumain"
/protein_id="AAH03061.1"
/db_xref="GeneID:5641"
/db_xref="HGNC:HGNC:9472"
/db_xref="MIM:602620"
/translation="MVWKVAVFLSVALGIGAIPIDDPEDGGKHWVVIVAGSNGWYNYR
HQADACHAYQIIHRNGIPDEQIVVMMYDDIAYSEDNPTPGIVINRPNGTDVYQGVPKD
YTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKSGPQDHVFIYFTDHGSTGILVFPNED
LHVKDLNETIHYMYKHKMYRKMVFYIEACESGSMMNHLPDNINVYATTAANPRESSYA
CYYDEKRSTYLGDWYSVNWMEDSDVEDLTKETLHKQYHLVKSHTNTSHVMQYGNKTIS
TMKVMQFQGMKRKASSPVPLPPVTHLDLTPSPDVPLTIMKRKLMNTNDLEESRQLTEE
IQRHLDARHLIEKSVRKIVSLLAASEAEVEQLLSERAPLTGHSCYPEALLHFRTHCFN
WHSPTYEYALRHLYVLVNLCEKPYPLHRIKLSMDHVCLGHY"
BASE COUNT 525 a 503 c 488 g 456 t
ORIGIN
1 gaggctgcga gccgccgcga gttctcacgg tcccgccggc gccaccaccg cggtcactca
61 ccgccgccgc cgccaccact gccaccacgg tcgcctgcca caggtgtctg caattgaact
121 ccaaggtgca gaatggtttg gaaagtagct gtattcctca gtgtggccct gggcattggt
181 gccattccta tagatgatcc tgaagatgga ggcaagcact gggtggtgat cgtggcaggt
241 tcaaatggct ggtataatta taggcaccag gcagacgcgt gccatgccta ccagatcatt
301 caccgcaatg ggattcctga cgaacagatc gttgtgatga tgtacgatga cattgcttac
361 tctgaagaca atcccactcc aggaattgtg atcaacaggc ccaatggcac agatgtctat
421 cagggagtcc cgaaggacta cactggagag gatgttaccc cacaaaattt ccttgctgtg
481 ttgagaggcg atgcagaagc agtgaagggc ataggatccg gcaaagtcct gaagagtggc
541 ccccaggatc acgtgttcat ttacttcact gaccatggat ctactggaat actggttttt
601 cccaatgaag atcttcatgt aaaggacctg aatgagacca tccattacat gtacaaacac
661 aaaatgtacc gaaagatggt gttctacatt gaagcctgtg agtctgggtc catgatgaac
721 cacctgccgg ataacatcaa tgtttatgca actactgctg ccaaccccag agagtcgtcc
781 tacgcctgtt actatgatga gaagaggtcc acgtacctgg gggactggta cagcgtcaac
841 tggatggaag actcggacgt ggaagatctg actaaagaga ccctgcacaa gcagtaccac
901 ctggtaaaat cgcacaccaa caccagccac gtcatgcagt atggaaacaa aacaatctcc
961 accatgaaag tgatgcagtt tcagggtatg aaacgcaaag ccagttctcc cgtcccccta
1021 cctccagtca cacaccttga cctcaccccc agccctgatg tgcctctcac catcatgaaa
1081 aggaaactga tgaacaccaa tgatctggag gagtccaggc agctcacgga ggagatccag
1141 cggcatctgg atgccaggca cctcattgag aagtcagtgc gtaagatcgt ctccttgctg
1201 gcagcgtccg aggctgaggt ggagcagctc ctgtccgaga gagccccgct cacggggcac
1261 agctgctacc cagaggccct gctgcacttc cggacccact gcttcaactg gcactccccc
1321 acgtacgagt atgcgttgag acatttgtac gtgctggtca acctttgtga gaagccgtat
1381 ccacttcaca ggataaaatt gtccatggac cacgtgtgcc ttggtcacta ctgaagagct
1441 gcctcctgga agcttttcca agtgtgagcg ccccaccgac tgtgtgctga tcagagactg
1501 gagaggtgga gtgagaagtc tccgctgctc gggccctcct ggggagcccc cgctccaggg
1561 ctcgctccag gaccttcttc acaagatgac ttgctcgctg ttacctgctt ccccagtctt
1621 ttctgaaaaa ctacaaatta gggtgggaaa agctctgtat tgagaagggt catatttgct
1681 ttctaggagg tttgttgttt tgcctgttag ttttgaggag caggaagctc atgggggctt
1741 ctgtagcccc tctcaaaagg agtctttatt ctgagaattt gaagctgaaa cctctttaaa
1801 tcttcagaat gattttattg aagagggccg caagccccaa atggaaaact gtttttagaa
1861 aatatgatga tttttgattg cttttgtatt taattctgca ggtgttcaag tcttaaaaaa
1921 taaagattta taacagaacc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//