LOCUS HUMFOS 6210 bp DNA linear HUM 08-NOV-1994 DEFINITION Human fos proto-oncogene (c-fos), complete cds. ACCESSION K00650 M16287 VERSION K00650.1 KEYWORDS c-myc proto-oncogene; fos oncogene; proto-oncogene. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4165) AUTHORS van Straaten,F., Muller,R., Curran,T., Van Beveren,C. and Verma,I.M. TITLE Complete nucleotide sequence of a human c-onc gene: deduced amino acid sequence of the human c-fos protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (11), 3183-3187 (1983) PUBMED 6574479 REFERENCE 2 (sites) AUTHORS Treisman,R. TITLE Transient accumulation of c-fos RNA following serum stimulation requires a conserved 5' element and c-fos 3' sequences JOURNAL Cell 42 (3), 889-902 (1985) PUBMED 2414012 REFERENCE 3 (bases 4166 to 6210) AUTHORS Verma,I.M., Deschamps,J., Van Beveren,C. and Sassone-Corsi,P. TITLE Human fos gene JOURNAL Cold Spring Harb. Symp. Quant. Biol. 51 (Pt 2), 949-958 (1986) PUBMED 3555978 COMMENT Original source text: Human lymphoblastic cell line (CCRF-CM) DNA, clone pc-fos(human)-1. [2] sites; promoter region. C-fos is the human cellular homolog of the v-fos oncogene of Finkel-Biskis-Jinkins murine osteosarcoma virus (FBJ-MuSV). [2] It was found that both human and murine c-fos genes contained an enhancer-like element in their 5' noncoding regions that was necessary for increased transcription following serum activation. The FBJ-MuSV v-fos oncogene contains a deletion relative to murine and human c-fos proto-oncogenes that causes complete divergence of the COOH terminal protein sequences encoded. That deletion corresponds to positions 3182-3285 inclusive of this sequence. The FBJ-MuSV v-fos sequence is more closely related to murine than human c-fos sequences. The FBJ-MuSV v-fos coding sequence ends at a 'tag' stop codon coresponding to positions 3434-2436 of this sequence [1]. [1] notes two alu repeats beginning aproximately 500 and 1700 nucleotides downstream of the last base in this sequence. A TATA box is located at positions 701-707. Two potential polyadenylation signals are present in the 3' untranslated region. FEATURES Location/Qualifiers source 1..6210 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /map="14q24.3" misc_feature 402..453 /note="transcriptional activator region [2]" prim_transcript 734..>3329 /note="c-fos mRNA [1]" CDS join(889..1029,1783..2034,2466..2573,2688..3329) /note="c-fos protein" /codon_start=1 /protein_id="AAA52471.1" /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPV NAQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPS AGAYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRE LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSV ASLDLTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPA SSRPSGSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSC TAYTSSFVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL" gene 889..1029 /gene="FOS" exon <889..1029 /gene="FOS" /note="c-fos protein; G00-119-917" /number=1 intron 1030..1782 /note="c-fos intron A" exon 1783..2034 /number=2 intron 2035..2465 /note="c-fos intron B" exon 2466..2573 /number=3 intron 2574..2687 /note="c-fos intron C" exon 2688..>3329 /note="c-fos protein" /number=4 BASE COUNT 1497 a 1571 c 1619 g 1523 t ORIGIN Chromosome 14q24.3-q31. 1 gcaggaacag tgctagtatt gctcgagccc gagggctgga ggttagggga tgaaggtctg 61 cttccacgct ttgcactgaa ttagggctag aattggggat gggggtaggg gcgcattcct 121 tcgggagccg aggcttaagt cctcggggtc ctgtactcga tgccgtttct cctatctctg 181 agcctcagaa ctgtcttcag tttccgtaca agggtaaaaa ggcgctctct gccccatccc 241 ccccgacctc gggaacaagg gtccgcattg aaccaggtgc gaatgttctc tctcattctg 301 cgccgttccc gcctcccctc ccccagccgc ggcccccgcc tccccccgca ctgcaccctc 361 ggtgttggct gcagcccgcg agcagttccc gtcaatccct ccccccttac acaggatgtc 421 catattagga catctgcgtc agcaggtttc cacggccttt ccctgtagcc ctggggggag 481 ccatccccga aacccctcat cttggggggc ccacgagacc tctgagacag gaactgcgaa 541 atgctcacga gattaggaca cgcgccaagg cgggggcagg gagctgcgag cgctggggac 601 gcagccgggc ggccgcagaa gcgcccaggc ccgcgcgcca cccctctggc gccaccgtgg 661 ttgagcccgt gacgtttaca ctcattcata aaacgcttgt tataaaagca gtggctgcgg 721 cgcctcgtac tccaaccgca tctgcagcga gcaactgaga agccaagact gagccggcgg 781 ccgcggcgca gcgaacgagc agtgaccgtg ctcctaccca gctctgcttc acagcgccca 841 cctgtctccg cccctcggcc cctcgcccgg ctttgcctaa ccgccacgat gatgttctcg 901 ggcttcaacg cagactacga ggcgtcatcc tcccgctgca gcagcgcgtc cccggccggg 961 gatagcctct cttactacca ctcacccgca gactccttct ccagcatggg ctcgcctgtc 1021 aacgcgcagg taaggctggc ttcccgtcgc cgcggggccg ggggcttggg gtcgcggagg 1081 aggagacacc gggcgggacg ctccagtaga tgagtagggg gctcccttgt gcctggaggg 1141 aggctgccgt ggccggagcg gtgccggctc gggggctcgg gacttgctct gagcgcacgc 1201 acgcttgcca tagtaagaat tggttccccc ttcgggaggc aggttcgttc tgagcaacct 1261 ctggtctgca ctccaggacg gatctctgac attagctgga gcagacgtgt cccaagcaca 1321 aactcgctaa ctagagcctg gcttcttcgg ggaggtggca gaaagcggca atcccccctc 1381 ccccggcagc ctggagcacg gaggagggat gagggaggag ggtgcagcgg gcgggtgtgt 1441 aaggcagttt cattgataaa aagcgagttc attctggaga ctccggagcg gcgcctgcgt 1501 cagcgcagac gtcagggata tttataacaa accccctttc aagcaagtga tgctgaaggg 1561 ataacgggaa cgcagcggca ggatggaaga gacaggcact gcgctgcgga atgcctggga 1621 ggaaaagggg gagacctttc atccaggatg agggacattt aagatgaaat gtccgtggca 1681 ggatcgtttc tcttcactgc tgcatgcggc actgggaact cgccccacct gtgtccggaa 1741 cctgctcgct cacgtcggct ttccccttct gttttgttct aggacttctg cacggacctg 1801 gccgtctcca gtgccaactt cattcccacg gtcactgcca tctcgaccag tccggacctg 1861 cagtggctgg tgcagcccgc cctcgtctcc tctgtggccc catcgcagac cagagcccct 1921 caccctttcg gagtccccgc cccctccgct ggggcttact ccagggctgg cgttgtgaag 1981 accatgacag gaggccgagc gcagagcatt ggcaggaggg gcaaggtgga acaggtgagg 2041 aactctagcg tactcttcct gggaatgtgg gggctgggtg ggaagcagcc ccggagatgc 2101 aggagcccag tacagaggat gaagccactg atggggctgg ctgcacatcc gtaactggga 2161 gccctggctc caagcccatt ccatcccaac tcagactctg agtctcaccc taagaagtac 2221 tctcatagtt tcttccctaa gtttcttacc gcatgctttc agactgggct cttctttgtt 2281 ctcttgctga ggatcttatt ttaaatgcaa gtcacaccta ttctgcaact gcaggtcaga 2341 aatggtttca cagtggggtg ccaggaagca gggaagctgc aggagccagt tctactgggg 2401 tgggtgaatg gaggtgatgg cagacacttt tactgaatgt cggtcttttt ttgtgattat 2461 tctagttatc tccagaagaa gaagagaaaa ggagaatccg aagggaaagg aataagatgg 2521 ctgcagccaa atgccgcaac cggaggaggg agctgactga tacactccaa gcggtaggta 2581 ctctgtgggt tgctcctttt taaaacttaa gggaaagttg gagattgagc ataagggccc 2641 ttgagtaaga ctgtgtctta tgctttcctt tatccctctg tatacaggag acagaccaac 2701 tagaagatga gaagtctgct ttgcagaccg agattgccaa cctgctgaag gagaaggaaa 2761 aactagagtt catcctggca gctcaccgac ctgcctgcaa gatccctgat gacctgggct 2821 tcccagaaga gatgtctgtg gcttcccttg atctgactgg gggcctgcca gaggttgcca 2881 ccccggagtc tgaggaggcc ttcaccctgc ctctcctcaa tgaccctgag cccaagccct 2941 cagtggaacc tgtcaagagc atcagcagca tggagctgaa gaccgagccc tttgatgact 3001 tcctgttccc agcatcatcc aggcccagtg gctctgagac agcccgctcc gtgccagaca 3061 tggacctatc tgggtccttc tatgcagcag actgggagcc tctgcacagt ggctccctgg 3121 ggatggggcc catggccaca gagctggagc ccctgtgcac tccggtggtc acctgtactc 3181 ccagctgcac tgcttacacg tcttccttcg tcttcaccta ccccgaggct gactccttcc 3241 ccagctgtgc agctgcccac cgcaagggca gcagcagcaa tgagccttcc tctgactcgc 3301 tcagctcacc cacgctgctg gccctgtgag ggggcaggga aggggaggca gccggcaccc 3361 acaagtgcca ctgcccgagc tggtgcatta cagagaggag aaacacatct tccctagagg 3421 gttcctgtag acctagggag gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc 3481 aaggacttga aagcatccat gtgtggactc aagtccttac ctcttccgga gatgtagcaa 3541 aacgcatgga gtgtgtattg ttcccagtga cacttcagag agctggtagt tagtagcatg 3601 ttgagccagg cctgggtctg tgtctctttt ctctttctcc ttagtcttct catagcatta 3661 actaatctat tgggttcatt attggaatta acctggtgct ggatattttc aaattgtatc 3721 tagtgcagct gattttaaca ataactactg tgttcctggc aatagtgtgt tctgattaga 3781 aatgaccaat attatactaa gaaaagatac gactttattt tctggtagat agaaataaat 3841 agctatatcc atgtactgta gtttttcttc aacatcaatg ttcattgtaa tgttactgat 3901 catgcattgt tgaggtggtc tgaatgttct gacattaaca gttttccatg aaaacgtttt 3961 attgtgtttt taatttattt attaagatgg attctcagat atttatattt ttattttatt 4021 tttttctacc ttgaggtctt ttgacatgtg gaaagtgaat ttgaatgaaa aatttaagca 4081 ttgtttgctt attgttccaa gacattgtca ataaaagcat ttaagttgaa tgcgaccaac 4141 cttgtgctct tttcattctg gaagtcttgt aagtttctga aaggtattat tggagaccag 4201 tttgtcaaga agggtagctg ctggaggggg acacaccctc tgtctgatcc cttatcaaag 4261 aggacaagga aactatagag ctgattttag aatattttac aaatacatgc cttccattgg 4321 aatgctaaga ttttctactg cttctgggga cgggaaaccg ctgtgtaaca gcttttgtgg 4381 gaatacattt tttctgtttc agtactcgca gggggaaata tttaaatttt gttgtgctaa 4441 tattaaattc agatgttttg atcttaaagg aaccctttaa gcaaacagaa cctagctttg 4501 tacagactat tttaactttt tattctcaca aaatcacgtg gagggttatt ctacttcaaa 4561 gatgagcaaa ttgaagaatg gttagaataa acaactttct tgatattccg ttatcggcat 4621 tagaatcttc ctgctcgtta tcgtatccag caggctgaac tgcctcttga tacttggtta 4681 aaaaaaattt tcaggccggg cgcggtggcc catgcctgta atcctagcac tttgggaggc 4741 cgaggcaggc ggatcacctg aggtcgggag ttcgagacca gcctgaccaa catggagaaa 4801 ccccgtcttt actaaaaata caaaattagc ctggtgtggt ggtgcatgcc tgtaatccta 4861 gctacttgag aggctgagac aggaaaatca cttgaactcg ggaggcggat gttgcagcga 4921 actgagattg cgccattgca ctccagcctg ggcaacaaga ttgaaactct gtttaaaaaa 4981 aaaagttttc actaatgtgt acattttttt gtactctttt attctcgaaa gggaaggagg 5041 gctattgccc tatcccttat taataaatgc attgtggttt ctggtttctc taataccata 5101 tgcccttcat tcagtttata gtgggcggaa gtgggggaga aaaagttgct cagaaatcaa 5161 aagatatctc aaacagcaca aataatggct gatcgttctg caaacaaaaa gttacataat 5221 agctcaagaa ggagaagtca acatgactct gaacaagctt taacttagaa actttatcat 5281 cttaaggaag aacgtgacct ttgtccagga cgtctctggt aatggggcac ttacacacac 5341 atgcacacgt acaaaccaca gggaaaggag accgcccttc tgcctctgct cgcgagtatc 5401 acgcaggcac catgcactat gttttcacac acactgggtg gaagaagagc ttcagcgcca 5461 gtcttctaat gctttggtga taatgaaaat cactgggtgc ttatggggtg tcatattcaa 5521 tcgagttaaa agttttaatt caaaatgaca gttttactga ggttgatgtt ctcgtctatg 5581 atatctctgc ccctcccata aaaatggaca tttaaaagca acttaccgct ctttagatca 5641 ctcctatatc acacaccact tggggtgctg tttctgctag acttgtgatg acagtggcct 5701 taggatccct gtttgctgtt caaagggcaa atattttata gcctttaaat atacctaaac 5761 taaatacaga attaatataa ctaacaaaca cctggtctga aataacaagg tgatctaccc 5821 tggaaggaac ccagctggtg ggccaggagc ggtggctcac acctgtaatt ccagcacttt 5881 gggaggctga gacaggagga tcactggagt ccaggagttt gagaccagcc tgggcaacat 5941 ggcaaaaccc agtgtgcttc tgttgtccca gctacactac tcaggaggct gaggcaggag 6001 tatgacttga gcctgggagg gggaggttgc agagaactga tattgcacca ccactgcact 6061 ccagcctggg tgacacagca aaaccctatc tcaaaaaaaa aaaaaaaaaa aaggaaccca 6121 gctggttcct gtaggtgtgc aataataaca accagaggaa gaaaaggaag acgatttccc 6181 agatgaagaa gggcagctgg accttcggac //