LOCUS       HUMANTCE                3036 bp    DNA     linear   HUM 08-AUG-1995
DEFINITION  Human carcinoembryonic antigen gene, complete cds.
ACCESSION   M17303
VERSION     M17303.1
KEYWORDS    carcinoembryonic antigen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3036)
  AUTHORS   Beauchemin,N.
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2541)
  AUTHORS   Beauchemin,N., Benchimol,S., Cournoyer,D., Fuks,A. and
            Stanners,C.P.
  TITLE     Isolation and characterization of full-length functional cDNA
            clones for human carcinoembryonic antigen
  JOURNAL   Mol. Cell. Biol. 7 (9), 3221-3230 (1987)
   PUBMED   3670312
COMMENT     Original source text: Human colonic adenocarcinoma cell line Ls180,
            cDNA to mRNA, and DNA.
            Draft entry and computer-readable sequence for [2],[1] kindly
            provided by N.Beauchemin, 23-NOV-1987.
            [1]  revises [2].
FEATURES             Location/Qualifiers
     source          1..3036
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="19q13.2"
     mRNA            1..2541
                     /product="CEA mRNA"
     variation       58
                     /note="a in one clone; t in another"
                     /replace="t"
     gene            97..2205
                     /gene="CEA"
     CDS             97..2205
                     /gene="CEA"
                     /note="carcinoembryonic antigen precursor"
                     /codon_start=1
                     /protein_id="AAB59513.1"
                     /db_xref="GDB:G00-119-054"
                     /translation="MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFN
                     VAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIY
                     PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDK
                     DAVAFTCEPETQDATYLWWVNNQSLPVSPRLQLSNGNRTLTLFNVTRNDTASYKCETQ
                     NPVSARRSDSVILNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT
                     FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFITSNNSNP
                     VEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYE
                     CGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWL
                     IDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSN
                     NSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDA
                     RAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQ
                     YSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPG
                     LSAGATVGIMIGVLVGVALI"
     sig_peptide     97..198
                     /gene="CEA"
                     /note="carcinoembryonic antigen signal peptide"
     mat_peptide     199..2202
                     /gene="CEA"
                     /product="carcinoembryonic antigen"
     repeat_region   2330..2560
                     /note="Alu repeat"
BASE COUNT          860 a          853 c          614 g          709 t
ORIGIN      477 bp upstream of BglII site; chromosome 19.
        1 cgaccagcag accagacagt cacagcagcc ttgacaaaac gttcctggaa ctcaagcact
       61 tctccacaga ggaggacaga gcagacagca gagaccatgg agtctccctc ggcccctccc
      121 cacagatggt gcatcccctg gcagaggctc ctgctcacag cctcacttct aaccttctgg
      181 aacccgccca ccactgccaa gctcactatt gaatccacgc cgttcaatgt cgcagagggg
      241 aaggaggtgc ttctacttgt ccacaatctg ccccagcatc tttttggcta cagctggtac
      301 aaaggtgaaa gagtggatgg caaccgtcaa attataggat atgtaatagg aactcaacaa
      361 gctaccccag ggcccgcata cagtggtcga gagataatat accccaatgc atccctgctg
      421 atccagaaca tcatccagaa tgacacagga ttctacaccc tacacgtcat aaagtcagat
      481 cttgtgaatg aagaagcaac tggccagttc cgggtatacc cggagctgcc caagccctcc
      541 atctccagca acaactccaa acccgtggag gacaaggatg ctgtggcctt cacctgtgaa
      601 cctgagactc aggacgcaac ctacctgtgg tgggtaaaca atcagagcct cccggtcagt
      661 cccaggctgc agctgtccaa tggcaacagg accctcactc tattcaatgt cacaagaaat
      721 gacacagcaa gctacaaatg tgaaacccag aacccagtga gtgccaggcg cagtgattca
      781 gtcatcctga atgtcctcta tggcccggat gcccccacca tttcccctct aaacacatct
      841 tacagatcag gggaaaatct gaacctctcc tgccatgcag cctctaaccc acctgcacag
      901 tactcttggt ttgtcaatgg gactttccag caatccaccc aagagctctt tatccccaac
      961 atcactgtga ataatagtgg atcctatacg tgccaagccc ataactcaga cactggcctc
     1021 aataggacca cagtcacgac gatcacagtc tatgcagagc cacccaaacc cttcatcacc
     1081 agcaacaact ccaaccccgt ggaggatgag gatgctgtag ccttaacctg tgaacctgag
     1141 attcagaaca caacctacct gtggtgggta aataatcaga gcctcccggt cagtcccagg
     1201 ctgcagctgt ccaatgacaa caggaccctc actctactca gtgtcacaag gaatgatgta
     1261 ggaccctatg agtgtggaat ccagaacgaa ttaagtgttg accacagcga cccagtcatc
     1321 ctgaatgtcc tctatggccc agacgacccc accatttccc cctcatacac ctattaccgt
     1381 ccaggggtga acctcagcct ctcctgccat gcagcctcta acccacctgc acagtattct
     1441 tggctgattg atgggaacat ccagcaacac acacaagagc tctttatctc caacatcact
     1501 gagaagaaca gcggactcta tacctgccag gccaataact cagccagtgg ccacagcagg
     1561 actacagtca agacaatcac agtctctgcg gagctgccca agccctccat ctccagcaac
     1621 aactccaaac ccgtggagga caaggatgct gtggccttca cctgtgaacc tgaggctcag
     1681 aacacaacct acctgtggtg ggtaaatggt cagagcctcc cagtcagtcc caggctgcag
     1741 ctgtccaatg gcaacaggac cctcactcta ttcaatgtca caagaaatga cgcaagagcc
     1801 tatgtatgtg gaatccagaa ctcagtgagt gcaaaccgca gtgacccagt caccctggat
     1861 gtcctctatg ggccggacac ccccatcatt tcccccccag actcgtctta cctttcggga
     1921 gcgaacctca acctctcctg ccactcggcc tctaacccat ccccgcagta ttcttggcgt
     1981 atcaatggga taccgcagca acacacacaa gttctcttta tcgccaaaat cacgccaaat
     2041 aataacggga cctatgcctg ttttgtctct aacttggcta ctggccgcaa taattccata
     2101 gtcaagagca tcacagtctc tgcatctgga acttctcctg gtctctcagc tggggccact
     2161 gtcggcatca tgattggagt gctggttggg gttgctctga tatagcagcc ctggtgtagt
     2221 ttcttcattt caggaagact gacagttgtt ttgcttcttc cttaaagcat ttgcaacagc
     2281 tacagtctaa aattgcttct ttaccaagga tatttacaga aaagactctg accagagatc
     2341 gagaccatcc tagccaacat cgtgaaaccc catctctact aaaaatacaa aaatgagctg
     2401 ggcttggtgg cgcgcacctg tagtcccagt tactcgggag gctgaggcag gagaatcgct
     2461 tgaacccggg aggtggagat tgcagtgagc ccagatcgca ccactgcact ccagtctggc
     2521 aacagagcaa gactccatct caaaaagaaa agaaaagaag actctgacct gtactcttga
     2581 atacaagttt ctgataccac tgcactgtct gagaatttcc aaaactttaa tgaactaact
     2641 gacagcttca tgaaactgtc caccaagatc aagcagagaa aataattaat ttcatgggac
     2701 taaatgaact aatgaggata atattttcat aattttttat ttgaaatttt gctgattctt
     2761 taaatgtctt gtttcccaga tttcaggaaa ctttttttct tttaagctat ccacagctta
     2821 cagcaatttg ataaaatata cttttgtgaa caaaaattga gacatttaca ttttctccct
     2881 atgtggtcgc tccagacttg ggaaactatt catgaatatt tatattgtat ggtaatatag
     2941 ttattgcaca agttcaataa aaatctgctc tttgtatgac agaatacatt tgaaaacatt
     3001 ggttatatta ccaagacttt gactagaatg tcgtat
//