LOCUS X03363 4473 bp mRNA linear HUM 30-MAR-1995 DEFINITION Human c-erb-B-2 mRNA. ACCESSION X03363 VERSION X03363.1 KEYWORDS cell surface glycoprotein; cellular oncogene; erB-2 cellular; glycoprotein; growth factor receptor; kinase; neu cellular oncogene; transmembrane protein; tyrosine kinase. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4473) AUTHORS Yamamoto T., Ikawa S., Akiyama T., Semba K., Nomura N., Miyajima N., Saito T., Toyoshima K. TITLE Similarity of protein encoded by the human c-erb-B-2 gene to epidermal growth factor receptor JOURNAL Nature 319(6050), 230-234(1986). PUBMED 3003577 REFERENCE 2 AUTHORS Papewalis J., Nikitin A.Y., Rajewsky M.F. TITLE G to A polymorphism at amino acid codon 655 of the human erbB-2/HER2 gene JOURNAL Nucleic Acids Res. 19(19), 5452-5452(1991). PUBMED 1681519 COMMENT The c-erb-B-2 protein shows similarity to the epidermal growth factor receptor. FEATURES Location/Qualifiers source 1..4473 /db_xref="H-InvDB:HIT000320977" /organism="Homo sapiens" /strain="cell line MKN-7" /mol_type="mRNA" /db_xref="taxon:9606" CDS 175..3942 /note="c-erb-B-2 precursor" /db_xref="GOA:P04626" /db_xref="H-InvDB:HIT000320977.15" /db_xref="HGNC:HGNC:3430" /db_xref="InterPro:IPR000494" /db_xref="InterPro:IPR000719" /db_xref="InterPro:IPR001245" /db_xref="InterPro:IPR006211" /db_xref="InterPro:IPR006212" /db_xref="InterPro:IPR008266" /db_xref="InterPro:IPR009030" /db_xref="InterPro:IPR011009" /db_xref="InterPro:IPR016245" /db_xref="InterPro:IPR017441" /db_xref="InterPro:IPR020635" /db_xref="InterPro:IPR032778" /db_xref="InterPro:IPR036941" /db_xref="PDB:1MFG" /db_xref="PDB:1MFL" /db_xref="PDB:1MW4" /db_xref="PDB:1N8Z" /db_xref="PDB:1OVC" /db_xref="PDB:1QR1" /db_xref="PDB:1S78" /db_xref="PDB:2A91" /db_xref="PDB:2JWA" /db_xref="PDB:2KS1" /db_xref="PDB:2L4K" /db_xref="PDB:2N2A" /db_xref="PDB:3BE1" /db_xref="PDB:3H3B" /db_xref="PDB:3MZW" /db_xref="PDB:3N85" /db_xref="PDB:3PP0" /db_xref="PDB:3RCD" /db_xref="PDB:3WLW" /db_xref="PDB:3WSQ" /db_xref="PDB:4GFU" /db_xref="PDB:4HRL" /db_xref="PDB:4HRM" /db_xref="PDB:4HRN" /db_xref="PDB:5K33" /db_xref="PDB:5KWG" /db_xref="PDB:5MY6" /db_xref="PDB:5O4G" /db_xref="PDB:5OB4" /db_xref="PDB:5TQS" /db_xref="PDB:6ATT" /db_xref="PDB:6BGT" /db_xref="PDB:6J71" /db_xref="PDB:6OGE" /db_xref="UniProtKB/Swiss-Prot:P04626" /protein_id="CAA27060.1" /translation="MELAALCRWGLLLALLPPGAASTQVCTGTDMKLRLPASPETHLD MLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQGYVLIAHNQVRQVPLQRLRIV RGTQLFEDNYALAVLDNGDPLNNTTPVTGASPGGLRELQLRSLTEILKGGVLIQRNPQ LCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCWGESSEDCQSLTRT VCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALVTYNT DTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAEDGTQRCEKC SKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDPASNTAPL QPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTLQGLGI SWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECVGEG LACHQLCARGHCWGPGPTQCVNCSQFLRGQECVEECRVLQGLPREYVNARHCLPCHPE CQPQNGSVTCFGPEADQCVACAHYKDPPFCVARCPSGVKPDLSYMPIWKFPDEEGACQ PCPINCTHSCVDLDDKGCPAEQRASPLTSIISAVVGILLVVVLGVVFGILIKRRQQKI RKYTMRRLLQETELVEPLTPSGAMPNQAQMRILKETELRKVKVLGSGAFGTVYKGIWI PDGENVKIPVAIKVLRENTSPKANKEILDEAYVMAGVGSPYVSRLLGICLTSTVQLVT QLMPYGCLLDHVRENRGRLGSQDLLNWCMQIAKGMSYLEDVRLVHRDLAARNVLVKSP NHVKITDFGLARLLDIDETEYHADGGKVPIKWMALESILRRRFTHQSDVWSYGVTVWE LMTFGAKPYDGIPAREIPDLLEKGERLPQPPICTIDVYMIMVKCWMIDSECRPRFREL VSEFSRMARDPQRFVVIQNEDLGPASPLDSTFYRSLLEDDDMGDLVDAEEYLVPQQGF FCPDPAPGAGGMVHHRHRSSSTRSGGGDLTLGLEPSEEEAPRSPLAPSEGAGSDVFDG DLGMGAAKGLQSLPTHDPSPLQRYSEDPTVPLPSETDGYVAPLTCSPQPEYVNQPDVR PQPPSPREGPLPAARPAGATLERPKTLSPGKNGVVKDVFAFGGAVENPEYLTPQGGAA PQPHPPPAFSPAFDNLYYWDQDPPERGAPPSTFKGTPTAENPEYLGLDVPV" sig_peptide 175..237 /note="put. signal peptide (aa -21 to -1)" mat_peptide 238..3939 /note="put. c-erb-B-2 protein (aa 1-1234)" misc_feature 376..384 /note="pot. glycosylation site" misc_feature 544..558 /note="pot. glycosylation site" misc_feature 733..741 /note="pot. glycosylation site" misc_feature 949..957 /note="pot. glycosylation site" misc_feature 1762..1770 /note="pot. glycosylation site" misc_feature 1885..1893 /note="pot. glycosylation site" misc_feature 2059..2067 /note="pot. glycosylation site" misc_feature 2353..3132 /note="aa 727-986, seq. homologous to EGF receptor kinase domain" misc_feature 2446..2454 /note="pot. glycosylation site" misc_feature 4455..4460 /note="put. polyA signal" polyA_site 4473..4473 /note="polyA site" BASE COUNT 902 a 1383 c 1329 g 859 t ORIGIN 1 aaggggaggt aaccctggcc cctttggtcg gggccccggg cagccgcgcg ccccttccca 61 cggggccctt tactgcgccg cgcgcccggc ccccacccct cgcagcaccc cgcgccccgc 121 gccctcccag ccgggtccag ccggagccat ggggccggag ccgcagtgag caccatggag 181 ctggcggcct tgtgccgctg ggggctcctc ctcgccctct tgccccccgg agccgcgagc 241 acccaagtgt gcaccggcac agacatgaag ctgcggctcc ctgccagtcc cgagacccac 301 ctggacatgc tccgccacct ctaccagggc tgccaggtgg tgcagggaaa cctggaactc 361 acctacctgc ccaccaatgc cagcctgtcc ttcctgcagg atatccagga ggtgcagggc 421 tacgtgctca tcgctcacaa ccaagtgagg caggtcccac tgcagaggct gcggattgtg 481 cgaggcaccc agctctttga ggacaactat gccctggccg tgctagacaa tggagacccg 541 ctgaacaata ccacccctgt cacaggggcc tccccaggag gcctgcggga gctgcagctt 601 cgaagcctca cagagatctt gaaaggaggg gtcttgatcc agcggaaccc ccagctctgc 661 taccaggaca cgattttgtg gaaggacatc ttccacaaga acaaccagct ggctctcaca 721 ctgatagaca ccaaccgctc tcgggcctgc cacccctgtt ctccgatgtg taagggctcc 781 cgctgctggg gagagagttc tgaggattgt cagagcctga cgcgcactgt ctgtgccggt 841 ggctgtgccc gctgcaaggg gccactgccc actgactgct gccatgagca gtgtgctgcc 901 ggctgcacgg gccccaagca ctctgactgc ctggcctgcc tccacttcaa ccacagtggc 961 atctgtgagc tgcactgccc agccctggtc acctacaaca cagacacgtt tgagtccatg 1021 cccaatcccg agggccggta tacattcggc gccagctgtg tgactgcctg tccctacaac 1081 tacctttcta cggacgtggg atcctgcacc ctcgtctgcc ccctgcacaa ccaagaggtg 1141 acagcagagg atggaacaca gcggtgtgag aagtgcagca agccctgtgc ccgagtgtgc 1201 tatggtctgg gcatggagca cttgcgagag gtgagggcag ttaccagtgc caatatccag 1261 gagtttgctg gctgcaagaa gatctttggg agcctggcat ttctgccgga gagctttgat 1321 ggggacccag cctccaacac tgccccgctc cagccagagc agctccaagt gtttgagact 1381 ctggaagaga tcacaggtta cctatacatc tcagcatggc cggacagcct gcctgacctc 1441 agcgtcttcc agaacctgca agtaatccgg ggacgaattc tgcacaatgg cgcctactcg 1501 ctgaccctgc aagggctggg catcagctgg ctggggctgc gctcactgag ggaactgggc 1561 agtggactgg ccctcatcca ccataacacc cacctctgct tcgtgcacac ggtgccctgg 1621 gaccagctct ttcggaaccc gcaccaagct ctgctccaca ctgccaaccg gccagaggac 1681 gagtgtgtgg gcgagggcct ggcctgccac cagctgtgcg cccgagggca ctgctggggt 1741 ccagggccca cccagtgtgt caactgcagc cagttccttc ggggccagga gtgcgtggag 1801 gaatgccgag tactgcaggg gctccccagg gagtatgtga atgccaggca ctgtttgccg 1861 tgccaccctg agtgtcagcc ccagaatggc tcagtgacct gttttggacc ggaggctgac 1921 cagtgtgtgg cctgtgccca ctataaggac cctcccttct gcgtggcccg ctgccccagc 1981 ggtgtgaaac ctgacctctc ctacatgccc atctggaagt ttccagatga ggagggcgca 2041 tgccagcctt gccccatcaa ctgcacccac tcctgtgtgg acctggatga caagggctgc 2101 cccgccgagc agagagccag ccctctgacg tccatcatct ctgcggtggt tggcattctg 2161 ctggtcgtgg tcttgggggt ggtctttggg atcctcatca agcgacggca gcagaagatc 2221 cggaagtaca cgatgcggag actgctgcag gaaacggagc tggtggagcc gctgacacct 2281 agcggagcga tgcccaacca ggcgcagatg cggatcctga aagagacgga gctgaggaag 2341 gtgaaggtgc ttggatctgg cgcttttggc acagtctaca agggcatctg gatccctgat 2401 ggggagaatg tgaaaattcc agtggccatc aaagtgttga gggaaaacac atcccccaaa 2461 gccaacaaag aaatcttaga cgaagcatac gtgatggctg gtgtgggctc cccatatgtc 2521 tcccgccttc tgggcatctg cctgacatcc acggtgcagc tggtgacaca gcttatgccc 2581 tatggctgcc tcttagacca tgtccgggaa aaccgcggac gcctgggctc ccaggacctg 2641 ctgaactggt gtatgcagat tgccaagggg atgagctacc tggaggatgt gcggctcgta 2701 cacagggact tggccgctcg gaacgtgctg gtcaagagtc ccaaccatgt caaaattaca 2761 gacttcgggc tggctcggct gctggacatt gacgagacag agtaccatgc agatgggggc 2821 aaggtgccca tcaagtggat ggcgctggag tccattctcc gccggcggtt cacccaccag 2881 agtgatgtgt ggagttatgg tgtgactgtg tgggagctga tgacttttgg ggccaaacct 2941 tacgatggga tcccagcccg ggagatccct gacctgctgg aaaaggggga gcggctgccc 3001 cagcccccca tctgcaccat tgatgtctac atgatcatgg tcaaatgttg gatgattgac 3061 tctgaatgtc ggccaagatt ccgggagttg gtgtctgaat tctcccgcat ggccagggac 3121 ccccagcgct ttgtggtcat ccagaatgag gacttgggcc cagccagtcc cttggacagc 3181 accttctacc gctcactgct ggaggacgat gacatggggg acctggtgga tgctgaggag 3241 tatctggtac cccagcaggg cttcttctgt ccagaccctg ccccgggcgc tgggggcatg 3301 gtccaccaca ggcaccgcag ctcatctacc aggagtggcg gtggggacct gacactaggg 3361 ctggagccct ctgaagagga ggcccccagg tctccactgg caccctccga aggggctggc 3421 tccgatgtat ttgatggtga cctgggaatg ggggcagcca aggggctgca aagcctcccc 3481 acacatgacc ccagccctct acagcggtac agtgaggacc ccacagtacc cctgccctct 3541 gagactgatg gctacgttgc ccccctgacc tgcagccccc agcctgaata tgtgaaccag 3601 ccagatgttc ggccccagcc cccttcgccc cgagagggcc ctctgcctgc tgcccgacct 3661 gctggtgcca ctctggaaag gcccaagact ctctccccag ggaagaatgg ggtcgtcaaa 3721 gacgtttttg cctttggggg tgccgtggag aaccccgagt acttgacacc ccagggagga 3781 gctgcccctc agccccaccc tcctcctgcc ttcagcccag ccttcgacaa cctctattac 3841 tgggaccagg acccaccaga gcggggggct ccacccagca ccttcaaagg gacacctacg 3901 gcagagaacc cagagtacct gggtctggac gtgccagtgt gaaccagaag gccaagtccg 3961 cagaagccct gatgtgtcct cagggagcag ggaaggcctg acttctgctg gcatcaagag 4021 gtgggagggc cctccgacca cttccagggg aacctgccat gccaggaacc tgtcctaagg 4081 aaccttcctt cctgcttgag ttcccagatg gctggaaggg gtccagcctc gttggaagag 4141 gaacagcact ggggagtctt tgtggattct gaggccctgc ccaatgagac tctagggtcc 4201 agtggatgcc acagcccagc ttggcccttt ccttccagat cctgggtact gaaagcctta 4261 gggaagctgg cctgagaggg gaagcggccc taagggagtg tctaagaaca aaagcgaccc 4321 attcagagac tgtccctgaa acctagtact gccccccatg aggaaggaac agcaatggtg 4381 tcagtatcca ggctttgtac agagtgcttt tctgtttagt ttttactttt tttgttttgt 4441 ttttttaaag atgaaataaa gacccagggg gag //