LOCUS       X03363                  4473 bp    mRNA    linear   HUM 30-MAR-1995
DEFINITION  Human c-erb-B-2 mRNA.
ACCESSION   X03363
VERSION     X03363.1
KEYWORDS    cell surface glycoprotein; cellular oncogene; erB-2 cellular;
            glycoprotein; growth factor receptor; kinase; neu cellular
            oncogene; transmembrane protein; tyrosine kinase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4473)
  AUTHORS   Yamamoto T., Ikawa S., Akiyama T., Semba K., Nomura N.,
            Miyajima N., Saito T., Toyoshima K.
  TITLE     Similarity of protein encoded by the human c-erb-B-2 gene to
            epidermal growth factor receptor
  JOURNAL   Nature 319(6050), 230-234(1986).
   PUBMED   3003577
REFERENCE   2
  AUTHORS   Papewalis J., Nikitin A.Y., Rajewsky M.F.
  TITLE     G to A polymorphism at amino acid codon 655 of the human
            erbB-2/HER2 gene
  JOURNAL   Nucleic Acids Res. 19(19), 5452-5452(1991).
   PUBMED   1681519
COMMENT     The c-erb-B-2 protein shows similarity to the epidermal growth
            factor receptor.
FEATURES             Location/Qualifiers
     source          1..4473
                     /db_xref="H-InvDB:HIT000320977"
                     /organism="Homo sapiens"
                     /strain="cell line MKN-7"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             175..3942
                     /note="c-erb-B-2 precursor"
                     /db_xref="GOA:P04626"
                     /db_xref="H-InvDB:HIT000320977.15"
                     /db_xref="HGNC:HGNC:3430"
                     /db_xref="InterPro:IPR000494"
                     /db_xref="InterPro:IPR000719"
                     /db_xref="InterPro:IPR001245"
                     /db_xref="InterPro:IPR006211"
                     /db_xref="InterPro:IPR006212"
                     /db_xref="InterPro:IPR008266"
                     /db_xref="InterPro:IPR009030"
                     /db_xref="InterPro:IPR011009"
                     /db_xref="InterPro:IPR016245"
                     /db_xref="InterPro:IPR017441"
                     /db_xref="InterPro:IPR020635"
                     /db_xref="InterPro:IPR032675"
                     /db_xref="InterPro:IPR032778"
                     /db_xref="PDB:1MFG"
                     /db_xref="PDB:1MFL"
                     /db_xref="PDB:1MW4"
                     /db_xref="PDB:1N8Z"
                     /db_xref="PDB:1OVC"
                     /db_xref="PDB:1QR1"
                     /db_xref="PDB:1S78"
                     /db_xref="PDB:2A91"
                     /db_xref="PDB:2JWA"
                     /db_xref="PDB:2KS1"
                     /db_xref="PDB:2L4K"
                     /db_xref="PDB:2N2A"
                     /db_xref="PDB:3BE1"
                     /db_xref="PDB:3H3B"
                     /db_xref="PDB:3MZW"
                     /db_xref="PDB:3N85"
                     /db_xref="PDB:3PP0"
                     /db_xref="PDB:3RCD"
                     /db_xref="PDB:3WLW"
                     /db_xref="PDB:3WSQ"
                     /db_xref="PDB:4GFU"
                     /db_xref="PDB:4HRL"
                     /db_xref="PDB:4HRM"
                     /db_xref="PDB:4HRN"
                     /db_xref="UniProtKB/Swiss-Prot:P04626"
                     /protein_id="CAA27060.1"
                     /translation="MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLD
                     MLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV
                     RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ
                     LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT
                     VCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT
                     DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC
                     SKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL
                     QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI
                     SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG
                     LACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPE
                     CQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQ
                     PCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKI
                     RKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWI
                     PDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVT
                     QLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSP
                     NHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWE
                     LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFREL
                     VSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF
                     FCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDG
                     DLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR
                     PQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAA
                     PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV"
     sig_peptide     175..237
                     /note="put. signal peptide (aa -21 to -1)"
     mat_peptide     238..3939
                     /note="put. c-erb-B-2 protein (aa 1-1234)"
     misc_feature    376..384
                     /note="pot. glycosylation site"
     misc_feature    544..558
                     /note="pot. glycosylation site"
     misc_feature    733..741
                     /note="pot. glycosylation site"
     misc_feature    949..957
                     /note="pot. glycosylation site"
     misc_feature    1762..1770
                     /note="pot. glycosylation site"
     misc_feature    1885..1893
                     /note="pot. glycosylation site"
     misc_feature    2059..2067
                     /note="pot. glycosylation site"
     misc_feature    2353..3132
                     /note="aa 727-986, seq. homologous to EGF receptor kinase
                     domain"
     misc_feature    2446..2454
                     /note="pot. glycosylation site"
     misc_feature    4455..4460
                     /note="put. polyA signal"
     polyA_site      4473..4473
                     /note="polyA site"
BASE COUNT          902 a         1383 c         1329 g          859 t
ORIGIN      
        1 aaggggaggt aaccctggcc cctttggtcg gggccccggg cagccgcgcg ccccttccca
       61 cggggccctt tactgcgccg cgcgcccggc ccccacccct cgcagcaccc cgcgccccgc
      121 gccctcccag ccgggtccag ccggagccat ggggccggag ccgcagtgag caccatggag
      181 ctggcggcct tgtgccgctg ggggctcctc ctcgccctct tgccccccgg agccgcgagc
      241 acccaagtgt gcaccggcac agacatgaag ctgcggctcc ctgccagtcc cgagacccac
      301 ctggacatgc tccgccacct ctaccagggc tgccaggtgg tgcagggaaa cctggaactc
      361 acctacctgc ccaccaatgc cagcctgtcc ttcctgcagg atatccagga ggtgcagggc
      421 tacgtgctca tcgctcacaa ccaagtgagg caggtcccac tgcagaggct gcggattgtg
      481 cgaggcaccc agctctttga ggacaactat gccctggccg tgctagacaa tggagacccg
      541 ctgaacaata ccacccctgt cacaggggcc tccccaggag gcctgcggga gctgcagctt
      601 cgaagcctca cagagatctt gaaaggaggg gtcttgatcc agcggaaccc ccagctctgc
      661 taccaggaca cgattttgtg gaaggacatc ttccacaaga acaaccagct ggctctcaca
      721 ctgatagaca ccaaccgctc tcgggcctgc cacccctgtt ctccgatgtg taagggctcc
      781 cgctgctggg gagagagttc tgaggattgt cagagcctga cgcgcactgt ctgtgccggt
      841 ggctgtgccc gctgcaaggg gccactgccc actgactgct gccatgagca gtgtgctgcc
      901 ggctgcacgg gccccaagca ctctgactgc ctggcctgcc tccacttcaa ccacagtggc
      961 atctgtgagc tgcactgccc agccctggtc acctacaaca cagacacgtt tgagtccatg
     1021 cccaatcccg agggccggta tacattcggc gccagctgtg tgactgcctg tccctacaac
     1081 tacctttcta cggacgtggg atcctgcacc ctcgtctgcc ccctgcacaa ccaagaggtg
     1141 acagcagagg atggaacaca gcggtgtgag aagtgcagca agccctgtgc ccgagtgtgc
     1201 tatggtctgg gcatggagca cttgcgagag gtgagggcag ttaccagtgc caatatccag
     1261 gagtttgctg gctgcaagaa gatctttggg agcctggcat ttctgccgga gagctttgat
     1321 ggggacccag cctccaacac tgccccgctc cagccagagc agctccaagt gtttgagact
     1381 ctggaagaga tcacaggtta cctatacatc tcagcatggc cggacagcct gcctgacctc
     1441 agcgtcttcc agaacctgca agtaatccgg ggacgaattc tgcacaatgg cgcctactcg
     1501 ctgaccctgc aagggctggg catcagctgg ctggggctgc gctcactgag ggaactgggc
     1561 agtggactgg ccctcatcca ccataacacc cacctctgct tcgtgcacac ggtgccctgg
     1621 gaccagctct ttcggaaccc gcaccaagct ctgctccaca ctgccaaccg gccagaggac
     1681 gagtgtgtgg gcgagggcct ggcctgccac cagctgtgcg cccgagggca ctgctggggt
     1741 ccagggccca cccagtgtgt caactgcagc cagttccttc ggggccagga gtgcgtggag
     1801 gaatgccgag tactgcaggg gctccccagg gagtatgtga atgccaggca ctgtttgccg
     1861 tgccaccctg agtgtcagcc ccagaatggc tcagtgacct gttttggacc ggaggctgac
     1921 cagtgtgtgg cctgtgccca ctataaggac cctcccttct gcgtggcccg ctgccccagc
     1981 ggtgtgaaac ctgacctctc ctacatgccc atctggaagt ttccagatga ggagggcgca
     2041 tgccagcctt gccccatcaa ctgcacccac tcctgtgtgg acctggatga caagggctgc
     2101 cccgccgagc agagagccag ccctctgacg tccatcatct ctgcggtggt tggcattctg
     2161 ctggtcgtgg tcttgggggt ggtctttggg atcctcatca agcgacggca gcagaagatc
     2221 cggaagtaca cgatgcggag actgctgcag gaaacggagc tggtggagcc gctgacacct
     2281 agcggagcga tgcccaacca ggcgcagatg cggatcctga aagagacgga gctgaggaag
     2341 gtgaaggtgc ttggatctgg cgcttttggc acagtctaca agggcatctg gatccctgat
     2401 ggggagaatg tgaaaattcc agtggccatc aaagtgttga gggaaaacac atcccccaaa
     2461 gccaacaaag aaatcttaga cgaagcatac gtgatggctg gtgtgggctc cccatatgtc
     2521 tcccgccttc tgggcatctg cctgacatcc acggtgcagc tggtgacaca gcttatgccc
     2581 tatggctgcc tcttagacca tgtccgggaa aaccgcggac gcctgggctc ccaggacctg
     2641 ctgaactggt gtatgcagat tgccaagggg atgagctacc tggaggatgt gcggctcgta
     2701 cacagggact tggccgctcg gaacgtgctg gtcaagagtc ccaaccatgt caaaattaca
     2761 gacttcgggc tggctcggct gctggacatt gacgagacag agtaccatgc agatgggggc
     2821 aaggtgccca tcaagtggat ggcgctggag tccattctcc gccggcggtt cacccaccag
     2881 agtgatgtgt ggagttatgg tgtgactgtg tgggagctga tgacttttgg ggccaaacct
     2941 tacgatggga tcccagcccg ggagatccct gacctgctgg aaaaggggga gcggctgccc
     3001 cagcccccca tctgcaccat tgatgtctac atgatcatgg tcaaatgttg gatgattgac
     3061 tctgaatgtc ggccaagatt ccgggagttg gtgtctgaat tctcccgcat ggccagggac
     3121 ccccagcgct ttgtggtcat ccagaatgag gacttgggcc cagccagtcc cttggacagc
     3181 accttctacc gctcactgct ggaggacgat gacatggggg acctggtgga tgctgaggag
     3241 tatctggtac cccagcaggg cttcttctgt ccagaccctg ccccgggcgc tgggggcatg
     3301 gtccaccaca ggcaccgcag ctcatctacc aggagtggcg gtggggacct gacactaggg
     3361 ctggagccct ctgaagagga ggcccccagg tctccactgg caccctccga aggggctggc
     3421 tccgatgtat ttgatggtga cctgggaatg ggggcagcca aggggctgca aagcctcccc
     3481 acacatgacc ccagccctct acagcggtac agtgaggacc ccacagtacc cctgccctct
     3541 gagactgatg gctacgttgc ccccctgacc tgcagccccc agcctgaata tgtgaaccag
     3601 ccagatgttc ggccccagcc cccttcgccc cgagagggcc ctctgcctgc tgcccgacct
     3661 gctggtgcca ctctggaaag gcccaagact ctctccccag ggaagaatgg ggtcgtcaaa
     3721 gacgtttttg cctttggggg tgccgtggag aaccccgagt acttgacacc ccagggagga
     3781 gctgcccctc agccccaccc tcctcctgcc ttcagcccag ccttcgacaa cctctattac
     3841 tgggaccagg acccaccaga gcggggggct ccacccagca ccttcaaagg gacacctacg
     3901 gcagagaacc cagagtacct gggtctggac gtgccagtgt gaaccagaag gccaagtccg
     3961 cagaagccct gatgtgtcct cagggagcag ggaaggcctg acttctgctg gcatcaagag
     4021 gtgggagggc cctccgacca cttccagggg aacctgccat gccaggaacc tgtcctaagg
     4081 aaccttcctt cctgcttgag ttcccagatg gctggaaggg gtccagcctc gttggaagag
     4141 gaacagcact ggggagtctt tgtggattct gaggccctgc ccaatgagac tctagggtcc
     4201 agtggatgcc acagcccagc ttggcccttt ccttccagat cctgggtact gaaagcctta
     4261 gggaagctgg cctgagaggg gaagcggccc taagggagtg tctaagaaca aaagcgaccc
     4321 attcagagac tgtccctgaa acctagtact gccccccatg aggaaggaac agcaatggtg
     4381 tcagtatcca ggctttgtac agagtgcttt tctgtttagt ttttactttt tttgttttgt
     4441 ttttttaaag atgaaataaa gacccagggg gag
//