LOCUS       HUMANTCE                3036 bp    DNA     linear   HUM 08-AUG-1995
DEFINITION  Human carcinoembryonic antigen gene, complete cds.
ACCESSION   M17303
VERSION     M17303.1
KEYWORDS    carcinoembryonic antigen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3036)
  AUTHORS   Beauchemin,N.
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2541)
  AUTHORS   Beauchemin,N., Benchimol,S., Cournoyer,D., Fuks,A. and
            Stanners,C.P.
  TITLE     Isolation and characterization of full-length functional cDNA
            clones for human carcinoembryonic antigen
  JOURNAL   Mol. Cell. Biol. 7 (9), 3221-3230 (1987)
   PUBMED   3670312
COMMENT     Original source text: Human colonic adenocarcinoma cell line
            Ls180, cDNA to mRNA, and DNA.
            Draft entry and computer-readable sequence for [2],[1] kindly
            provided by N.Beauchemin, 23-NOV-1987.
            [1] revises [2].
FEATURES             Location/Qualifiers
     source          1..3036
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="19q13.2"
     mRNA            1..2541
                     /product="CEA mRNA"
     variation       58
                     /note="a in one clone; t in another"
                     /replace="t"
     gene            97..2205
                     /gene="CEA"
     CDS             97..2205
                     /gene="CEA"
                     /note="carcinoembryonic antigen precursor"
                     /codon_start=1
                     /protein_id="AAB59513.1"
                     /db_xref="GDB:G00-119-054"
                     /translation="MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPFN
                     VAEGKEVLLLVHNLPQHLFGYSWYKGERVDGNRQIIGYVIGTQQATPGPAYSGREIIY
                     PNASLLIQNIIQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDK
                     DAVAFTCEPETQDATYLWWVNNQSLPVSPRLQLSNGNRTLTLFNVTRNDTASYKCETQ
                     NPVSARRSDSVILNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT
                     FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFITSNNSNP
                     VEDEDAVALTCEPEIQNTTYLWWVNNQSLPVSPRLQLSNDNRTLTLLSVTRNDVGPYE
                     CGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNPPAQYSWL
                     IDGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTTVKTITVSAELPKPSISSN
                     NSKPVEDKDAVAFTCEPEAQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLFNVTRNDA
                     RAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISPPDSSYLSGANLNLSCHSASNPSPQ
                     YSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGRNNSIVKSITVSASGTSPG
                     LSAGATVGIMIGVLVGVALI"
     sig_peptide     97..198
                     /gene="CEA"
                     /note="carcinoembryonic antigen signal peptide"
     mat_peptide     199..2202
                     /gene="CEA"
                     /product="carcinoembryonic antigen"
     repeat_region   2330..2560
                     /note="Alu repeat"
BASE COUNT      860 a    853 c    614 g    709 t
ORIGIN      477 bp upstream of BglII site; chromosome 19.
        1 cgaccagcag accagacagt cacagcagcc ttgacaaaac gttcctggaa ctcaagcact
       61 tctccacaga ggaggacaga gcagacagca gagaccatgg agtctccctc ggcccctccc
      121 cacagatggt gcatcccctg gcagaggctc ctgctcacag cctcacttct aaccttctgg
      181 aacccgccca ccactgccaa gctcactatt gaatccacgc cgttcaatgt cgcagagggg
      241 aaggaggtgc ttctacttgt ccacaatctg ccccagcatc tttttggcta cagctggtac
      301 aaaggtgaaa gagtggatgg caaccgtcaa attataggat atgtaatagg aactcaacaa
      361 gctaccccag ggcccgcata cagtggtcga gagataatat accccaatgc atccctgctg
      421 atccagaaca tcatccagaa tgacacagga ttctacaccc tacacgtcat aaagtcagat
      481 cttgtgaatg aagaagcaac tggccagttc cgggtatacc cggagctgcc caagccctcc
      541 atctccagca acaactccaa acccgtggag gacaaggatg ctgtggcctt cacctgtgaa
      601 cctgagactc aggacgcaac ctacctgtgg tgggtaaaca atcagagcct cccggtcagt
      661 cccaggctgc agctgtccaa tggcaacagg accctcactc tattcaatgt cacaagaaat
      721 gacacagcaa gctacaaatg tgaaacccag aacccagtga gtgccaggcg cagtgattca
      781 gtcatcctga atgtcctcta tggcccggat gcccccacca tttcccctct aaacacatct
      841 tacagatcag gggaaaatct gaacctctcc tgccatgcag cctctaaccc acctgcacag
      901 tactcttggt ttgtcaatgg gactttccag caatccaccc aagagctctt tatccccaac
      961 atcactgtga ataatagtgg atcctatacg tgccaagccc ataactcaga cactggcctc
     1021 aataggacca cagtcacgac gatcacagtc tatgcagagc cacccaaacc cttcatcacc
     1081 agcaacaact ccaaccccgt ggaggatgag gatgctgtag ccttaacctg tgaacctgag
     1141 attcagaaca caacctacct gtggtgggta aataatcaga gcctcccggt cagtcccagg
     1201 ctgcagctgt ccaatgacaa caggaccctc actctactca gtgtcacaag gaatgatgta
     1261 ggaccctatg agtgtggaat ccagaacgaa ttaagtgttg accacagcga cccagtcatc
     1321 ctgaatgtcc tctatggccc agacgacccc accatttccc cctcatacac ctattaccgt
     1381 ccaggggtga acctcagcct ctcctgccat gcagcctcta acccacctgc acagtattct
     1441 tggctgattg atgggaacat ccagcaacac acacaagagc tctttatctc caacatcact
     1501 gagaagaaca gcggactcta tacctgccag gccaataact cagccagtgg ccacagcagg
     1561 actacagtca agacaatcac agtctctgcg gagctgccca agccctccat ctccagcaac
     1621 aactccaaac ccgtggagga caaggatgct gtggccttca cctgtgaacc tgaggctcag
     1681 aacacaacct acctgtggtg ggtaaatggt cagagcctcc cagtcagtcc caggctgcag
     1741 ctgtccaatg gcaacaggac cctcactcta ttcaatgtca caagaaatga cgcaagagcc
     1801 tatgtatgtg gaatccagaa ctcagtgagt gcaaaccgca gtgacccagt caccctggat
     1861 gtcctctatg ggccggacac ccccatcatt tcccccccag actcgtctta cctttcggga
     1921 gcgaacctca acctctcctg ccactcggcc tctaacccat ccccgcagta ttcttggcgt
     1981 atcaatggga taccgcagca acacacacaa gttctcttta tcgccaaaat cacgccaaat
     2041 aataacggga cctatgcctg ttttgtctct aacttggcta ctggccgcaa taattccata
     2101 gtcaagagca tcacagtctc tgcatctgga acttctcctg gtctctcagc tggggccact
     2161 gtcggcatca tgattggagt gctggttggg gttgctctga tatagcagcc ctggtgtagt
     2221 ttcttcattt caggaagact gacagttgtt ttgcttcttc cttaaagcat ttgcaacagc
     2281 tacagtctaa aattgcttct ttaccaagga tatttacaga aaagactctg accagagatc
     2341 gagaccatcc tagccaacat cgtgaaaccc catctctact aaaaatacaa aaatgagctg
     2401 ggcttggtgg cgcgcacctg tagtcccagt tactcgggag gctgaggcag gagaatcgct
     2461 tgaacccggg aggtggagat tgcagtgagc ccagatcgca ccactgcact ccagtctggc
     2521 aacagagcaa gactccatct caaaaagaaa agaaaagaag actctgacct gtactcttga
     2581 atacaagttt ctgataccac tgcactgtct gagaatttcc aaaactttaa tgaactaact
     2641 gacagcttca tgaaactgtc caccaagatc aagcagagaa aataattaat ttcatgggac
     2701 taaatgaact aatgaggata atattttcat aattttttat ttgaaatttt gctgattctt
     2761 taaatgtctt gtttcccaga tttcaggaaa ctttttttct tttaagctat ccacagctta
     2821 cagcaatttg ataaaatata cttttgtgaa caaaaattga gacatttaca ttttctccct
     2881 atgtggtcgc tccagacttg ggaaactatt catgaatatt tatattgtat ggtaatatag
     2941 ttattgcaca agttcaataa aaatctgctc tttgtatgac agaatacatt tgaaaacatt
     3001 ggttatatta ccaagacttt gactagaatg tcgtat
//