LOCUS       BC003061                1972 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens legumain, mRNA (cDNA clone MGC:1395 IMAGE:3504506),
            complete cds.
ACCESSION   BC003061
VERSION     BC003061.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1972)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1972)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC003061.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: o Column: 5
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 56682961.
FEATURES             Location/Qualifiers
     source          1..1972
                     /db_xref="H-InvDB:HIT000031396"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:1395 IMAGE:3504506"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1972
                     /gene="LGMN"
                     /gene_synonym="AEP"
                     /gene_synonym="LGMN1"
                     /db_xref="GeneID:5641"
                     /db_xref="HGNC:HGNC:9472"
                     /db_xref="MIM:602620"
     CDS             133..1434
                     /gene="LGMN"
                     /gene_synonym="AEP"
                     /gene_synonym="LGMN1"
                     /codon_start=1
                     /product="legumain"
                     /protein_id="AAH03061.1"
                     /db_xref="GeneID:5641"
                     /db_xref="HGNC:HGNC:9472"
                     /db_xref="MIM:602620"
                     /translation="MVWKVAVFLSVALGIGAIPIDDPEDGGKHWVVIVAGSNGWYNYR
                     HQADACHAYQIIHRNGIPDEQIVVMMYDDIAYSEDNPTPGIVINRPNGTDVYQGVPKD
                     YTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKSGPQDHVFIYFTDHGSTGILVFPNED
                     LHVKDLNETIHYMYKHKMYRKMVFYIEACESGSMMNHLPDNINVYATTAANPRESSYA
                     CYYDEKRSTYLGDWYSVNWMEDSDVEDLTKETLHKQYHLVKSHTNTSHVMQYGNKTIS
                     TMKVMQFQGMKRKASSPVPLPPVTHLDLTPSPDVPLTIMKRKLMNTNDLEESRQLTEE
                     IQRHLDARHLIEKSVRKIVSLLAASEAEVEQLLSERAPLTGHSCYPEALLHFRTHCFN
                     WHSPTYEYALRHLYVLVNLCEKPYPLHRIKLSMDHVCLGHY"
BASE COUNT          525 a          503 c          488 g          456 t
ORIGIN      
        1 gaggctgcga gccgccgcga gttctcacgg tcccgccggc gccaccaccg cggtcactca
       61 ccgccgccgc cgccaccact gccaccacgg tcgcctgcca caggtgtctg caattgaact
      121 ccaaggtgca gaatggtttg gaaagtagct gtattcctca gtgtggccct gggcattggt
      181 gccattccta tagatgatcc tgaagatgga ggcaagcact gggtggtgat cgtggcaggt
      241 tcaaatggct ggtataatta taggcaccag gcagacgcgt gccatgccta ccagatcatt
      301 caccgcaatg ggattcctga cgaacagatc gttgtgatga tgtacgatga cattgcttac
      361 tctgaagaca atcccactcc aggaattgtg atcaacaggc ccaatggcac agatgtctat
      421 cagggagtcc cgaaggacta cactggagag gatgttaccc cacaaaattt ccttgctgtg
      481 ttgagaggcg atgcagaagc agtgaagggc ataggatccg gcaaagtcct gaagagtggc
      541 ccccaggatc acgtgttcat ttacttcact gaccatggat ctactggaat actggttttt
      601 cccaatgaag atcttcatgt aaaggacctg aatgagacca tccattacat gtacaaacac
      661 aaaatgtacc gaaagatggt gttctacatt gaagcctgtg agtctgggtc catgatgaac
      721 cacctgccgg ataacatcaa tgtttatgca actactgctg ccaaccccag agagtcgtcc
      781 tacgcctgtt actatgatga gaagaggtcc acgtacctgg gggactggta cagcgtcaac
      841 tggatggaag actcggacgt ggaagatctg actaaagaga ccctgcacaa gcagtaccac
      901 ctggtaaaat cgcacaccaa caccagccac gtcatgcagt atggaaacaa aacaatctcc
      961 accatgaaag tgatgcagtt tcagggtatg aaacgcaaag ccagttctcc cgtcccccta
     1021 cctccagtca cacaccttga cctcaccccc agccctgatg tgcctctcac catcatgaaa
     1081 aggaaactga tgaacaccaa tgatctggag gagtccaggc agctcacgga ggagatccag
     1141 cggcatctgg atgccaggca cctcattgag aagtcagtgc gtaagatcgt ctccttgctg
     1201 gcagcgtccg aggctgaggt ggagcagctc ctgtccgaga gagccccgct cacggggcac
     1261 agctgctacc cagaggccct gctgcacttc cggacccact gcttcaactg gcactccccc
     1321 acgtacgagt atgcgttgag acatttgtac gtgctggtca acctttgtga gaagccgtat
     1381 ccacttcaca ggataaaatt gtccatggac cacgtgtgcc ttggtcacta ctgaagagct
     1441 gcctcctgga agcttttcca agtgtgagcg ccccaccgac tgtgtgctga tcagagactg
     1501 gagaggtgga gtgagaagtc tccgctgctc gggccctcct ggggagcccc cgctccaggg
     1561 ctcgctccag gaccttcttc acaagatgac ttgctcgctg ttacctgctt ccccagtctt
     1621 ttctgaaaaa ctacaaatta gggtgggaaa agctctgtat tgagaagggt catatttgct
     1681 ttctaggagg tttgttgttt tgcctgttag ttttgaggag caggaagctc atgggggctt
     1741 ctgtagcccc tctcaaaagg agtctttatt ctgagaattt gaagctgaaa cctctttaaa
     1801 tcttcagaat gattttattg aagagggccg caagccccaa atggaaaact gtttttagaa
     1861 aatatgatga tttttgattg cttttgtatt taattctgca ggtgttcaag tcttaaaaaa
     1921 taaagattta taacagaacc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//