LOCUS BC003061 1972 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens legumain, mRNA (cDNA clone MGC:1395 IMAGE:3504506), complete cds. ACCESSION BC003061 VERSION BC003061.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1972) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1972) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (12-FEB-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC003061.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 7 Row: o Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 56682961. FEATURES Location/Qualifiers source 1..1972 /db_xref="H-InvDB:HIT000031396" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:1395 IMAGE:3504506" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1972 /gene="LGMN" /gene_synonym="AEP" /gene_synonym="LGMN1" /db_xref="GeneID:5641" /db_xref="HGNC:HGNC:9472" /db_xref="MIM:602620" CDS 133..1434 /gene="LGMN" /gene_synonym="AEP" /gene_synonym="LGMN1" /codon_start=1 /product="legumain" /protein_id="AAH03061.1" /db_xref="GeneID:5641" /db_xref="HGNC:HGNC:9472" /db_xref="MIM:602620" /translation="MVWKVAVFLSVALGIGAIPIDDPEDGGKHWVVIVAGSNGWYNYR HQADACHAYQIIHRNGIPDEQIVVMMYDDIAYSEDNPTPGIVINRPNGTDVYQGVPKD YTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKSGPQDHVFIYFTDHGSTGILVFPNED LHVKDLNETIHYMYKHKMYRKMVFYIEACESGSMMNHLPDNINVYATTAANPRESSYA CYYDEKRSTYLGDWYSVNWMEDSDVEDLTKETLHKQYHLVKSHTNTSHVMQYGNKTIS TMKVMQFQGMKRKASSPVPLPPVTHLDLTPSPDVPLTIMKRKLMNTNDLEESRQLTEE IQRHLDARHLIEKSVRKIVSLLAASEAEVEQLLSERAPLTGHSCYPEALLHFRTHCFN WHSPTYEYALRHLYVLVNLCEKPYPLHRIKLSMDHVCLGHY" BASE COUNT 525 a 503 c 488 g 456 t ORIGIN 1 gaggctgcga gccgccgcga gttctcacgg tcccgccggc gccaccaccg cggtcactca 61 ccgccgccgc cgccaccact gccaccacgg tcgcctgcca caggtgtctg caattgaact 121 ccaaggtgca gaatggtttg gaaagtagct gtattcctca gtgtggccct gggcattggt 181 gccattccta tagatgatcc tgaagatgga ggcaagcact gggtggtgat cgtggcaggt 241 tcaaatggct ggtataatta taggcaccag gcagacgcgt gccatgccta ccagatcatt 301 caccgcaatg ggattcctga cgaacagatc gttgtgatga tgtacgatga cattgcttac 361 tctgaagaca atcccactcc aggaattgtg atcaacaggc ccaatggcac agatgtctat 421 cagggagtcc cgaaggacta cactggagag gatgttaccc cacaaaattt ccttgctgtg 481 ttgagaggcg atgcagaagc agtgaagggc ataggatccg gcaaagtcct gaagagtggc 541 ccccaggatc acgtgttcat ttacttcact gaccatggat ctactggaat actggttttt 601 cccaatgaag atcttcatgt aaaggacctg aatgagacca tccattacat gtacaaacac 661 aaaatgtacc gaaagatggt gttctacatt gaagcctgtg agtctgggtc catgatgaac 721 cacctgccgg ataacatcaa tgtttatgca actactgctg ccaaccccag agagtcgtcc 781 tacgcctgtt actatgatga gaagaggtcc acgtacctgg gggactggta cagcgtcaac 841 tggatggaag actcggacgt ggaagatctg actaaagaga ccctgcacaa gcagtaccac 901 ctggtaaaat cgcacaccaa caccagccac gtcatgcagt atggaaacaa aacaatctcc 961 accatgaaag tgatgcagtt tcagggtatg aaacgcaaag ccagttctcc cgtcccccta 1021 cctccagtca cacaccttga cctcaccccc agccctgatg tgcctctcac catcatgaaa 1081 aggaaactga tgaacaccaa tgatctggag gagtccaggc agctcacgga ggagatccag 1141 cggcatctgg atgccaggca cctcattgag aagtcagtgc gtaagatcgt ctccttgctg 1201 gcagcgtccg aggctgaggt ggagcagctc ctgtccgaga gagccccgct cacggggcac 1261 agctgctacc cagaggccct gctgcacttc cggacccact gcttcaactg gcactccccc 1321 acgtacgagt atgcgttgag acatttgtac gtgctggtca acctttgtga gaagccgtat 1381 ccacttcaca ggataaaatt gtccatggac cacgtgtgcc ttggtcacta ctgaagagct 1441 gcctcctgga agcttttcca agtgtgagcg ccccaccgac tgtgtgctga tcagagactg 1501 gagaggtgga gtgagaagtc tccgctgctc gggccctcct ggggagcccc cgctccaggg 1561 ctcgctccag gaccttcttc acaagatgac ttgctcgctg ttacctgctt ccccagtctt 1621 ttctgaaaaa ctacaaatta gggtgggaaa agctctgtat tgagaagggt catatttgct 1681 ttctaggagg tttgttgttt tgcctgttag ttttgaggag caggaagctc atgggggctt 1741 ctgtagcccc tctcaaaagg agtctttatt ctgagaattt gaagctgaaa cctctttaaa 1801 tcttcagaat gattttattg aagagggccg caagccccaa atggaaaact gtttttagaa 1861 aatatgatga tttttgattg cttttgtatt taattctgca ggtgttcaag tcttaaaaaa 1921 taaagattta taacagaacc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa //