LOCUS       HUMENDO                 2620 bp    mRNA    linear   HUM 27-APR-1993
DEFINITION  Human endoglin mRNA, 3' end.
ACCESSION   J05481
VERSION     J05481.1
KEYWORDS    endoglin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2620)
  AUTHORS   Gougos,A. and Letarte,M.
  TITLE     Primary structure of endoglin, an RGD-containing glycoprotein of
            human endothelial cells
  JOURNAL   J. Biol. Chem. 265 (15), 8361-8364 (1990)
   PUBMED   1692830
COMMENT     Original source text: Human umbilical cord vein endothelial cell,
            cDNA to mRNA, clone 18A/11A.
            Draft entry and computer-readable sequence for [1] kindly submitted
            by M.Letarte, 05-APR-1990.
FEATURES             Location/Qualifiers
     source          1..2620
                     /db_xref="H-InvDB:HIT000191336"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             <1..1938
                     /note="endoglin precursor"
                     /codon_start=1
                     /protein_id="AAA35800.1"
                     /translation="GASCSLSPTSLAETVHCDLQPVGPERGEVTYTTSQVSKGCVAQA
                     PNAILEVHVLFLEFPTGPSQLELTLQASKQNGTWPREVLLVLSVNSSVFLHLQALGIP
                     LHLAYNSSLVTFQEPPGVNTTELPSFPKTQILEWAAERGPITSAAELNDPQSILLRLG
                     QAQGSLSFCMLEASQDMGRTLEWRPRTPALVRGCHLEGVAGHKEAHILRVLPGHSAGP
                     RTVTVKVELSCAPGDLDAVLILQGPPYVSWLIDANHNMQIWTTGEYSFKIFPEKNIRG
                     FKLPDTPQGLLGEARMLNASIVASFVELPLASIVSLHASSCGGRLQTSPAPIQTTPPK
                     DTCSPELLMSLIQTKCADDAMTLVLKKELVAHLKCTITGLTFWDPSCEAEDRGDKFVL
                     RSAYSSCGMQVSASMISNEAVVNILSSSSPQRKKVHCLNMDSLSFQLGLYLSPHFLQA
                     SNTIEPGQQSFVQVRVSPSVSEFLLQLDSCHLDLGPEGGTVELIQGRAAKGNCVSLLS
                     PSPEGDPRFSFLLHFYTVPIPKTGTLSCTVALRPKTGSQDQEVHRTVFMRLNIISPDL
                     SGCTSKGLVLPAVLGITFGAFLIGALLTAALWYIYSHTRSPSKREPVVAVAAPASSES
                     SSTNHSIGSTQSTPCSTSSMA"
     sig_peptide     <1..36
                     /note="endoglin signal peptide"
     mat_peptide     37..1935
                     /product="endoglin"
     misc_feature    880..888
                     /note="RGD-tripeptide"
BASE COUNT          545 a          890 c          717 g          468 t
ORIGIN      
        1 ggggccagct gcagcctcag ccccacaagt cttgcagaaa cagtccattg tgaccttcag
       61 cctgtgggcc ccgagagggg cgaggtgaca tataccacta gccaggtctc gaagggctgc
      121 gtggctcagg cccccaatgc catccttgaa gtccatgtcc tcttcctgga gttcccaacg
      181 ggcccgtcac agctggagct gactctccag gcatccaagc aaaatggcac ctggccccga
      241 gaggtgcttc tggtcctcag tgtaaacagc agtgtcttcc tgcatctcca ggccctggga
      301 atcccactgc acttggccta caattccagc ctggtcacct tccaagagcc cccgggggtc
      361 aacaccacag agctgccatc cttccccaag acccagatcc ttgagtgggc agctgagagg
      421 ggccccatca cctctgctgc tgagctgaat gacccccaga gcatcctcct ccgactgggc
      481 caagcccagg ggtcactgtc cttctgcatg ctggaagcca gccaggacat gggccgcacg
      541 ctcgagtggc ggccgcgtac tccagccttg gtccggggct gccacttgga aggcgtggcc
      601 ggccacaagg aggcgcacat cctgagggtc ctgccgggcc actcggccgg gccccggacg
      661 gtgacggtga aggtggaact gagctgcgca cccggggatc tcgatgccgt cctcatcctg
      721 cagggtcccc cctacgtgtc ctggctcatc gacgccaacc acaacatgca gatctggacc
      781 actggagaat actccttcaa gatctttcca gagaaaaaca ttcgtggctt caagctccca
      841 gacacacctc aaggcctcct gggggaggcc cggatgctca atgccagcat tgtggcatcc
      901 ttcgtggagc taccgctggc cagcattgtc tcacttcatg cctccagctg cggtggtagg
      961 ctgcagacct cacccgcacc gatccagacc actcctccca aggacacttg tagcccggag
     1021 ctgctcatgt ccttgatcca gacaaagtgt gccgacgacg ccatgaccct ggtactaaag
     1081 aaagagcttg ttgcgcattt gaagtgcacc atcacgggcc tgaccttctg ggaccccagc
     1141 tgtgaggcag aggacagggg tgacaagttt gtcttgcgca gtgcttactc cagctgtggc
     1201 atgcaggtgt cagcaagtat gatcagcaat gaggcggtgg tcaatatcct gtcgagctca
     1261 tcaccacagc ggaaaaaggt gcactgcctc aacatggaca gcctctcttt ccagctgggc
     1321 ctctacctca gcccacactt cctccaggcc tccaacacca tcgagccggg gcagcagagc
     1381 tttgtgcagg tcagagtgtc cccatccgtc tccgagttcc tgctccagtt agacagctgc
     1441 cacctggact tggggcctga gggaggcacc gtggaactca tccagggccg ggcggccaag
     1501 ggcaactgtg tgagcctgct gtccccaagc cccgagggtg acccgcgctt cagcttcctc
     1561 ctccacttct acacagtacc catacccaaa accggcaccc tcagctgcac ggtagccctg
     1621 cgtcccaaga ccgggtctca agaccaggaa gtccatagga ctgtcttcat gcgcttgaac
     1681 atcatcagcc ctgacctgtc tggttgcaca agcaaaggcc tcgtcctgcc cgccgtgctg
     1741 ggcatcacct ttggtgcctt cctcatcggg gccctgctca ctgctgcact ctggtacatc
     1801 tactcgcaca cgcgttcccc cagcaagcgg gagcccgtgg tggcggtggc tgccccggcc
     1861 tcctcggaga gcagcagcac caaccacagc atcgggagca cccagagcac cccctgctcc
     1921 accagcagca tggcatagcc ccggcccccc gcgctcgccc agcaggagag actgagcagc
     1981 cgccagctgg gagcactggt gtgaactcac cctgggagcc agtcctccac tcgacccaga
     2041 atggagcctg ctctccgcgc ctacccttcc cgcctccctc tcagaggcct gctgccagtg
     2101 cagccactgg cttggaacac cttggggtcc ctccacccca cagaaccttc aacccagtgg
     2161 gtctgggata tggctgccca ggagacagac cacttgccac gctgttgtaa aaacccaagt
     2221 ccctgtcatt tgaacctgga tccagcactg gtgaactgag ctgggcagga agggagaact
     2281 tgaaacagat tcaggccagc ccagccaggc caacagcacc tccccgctgg gaagagaaga
     2341 gggcccagcc cagagccacc tggatctatc cctgcggcct ccacacctga acttgcctaa
     2401 ctaactggca ggggagacag gagcctagcg gagcccagcc tgggagccca gagggtggca
     2461 agaacagtgg gcgttgggag cctagctcct gccacatgga gccccctctg ccggtcgggc
     2521 agccagcaga gggggagtag ccaagctgct tgtcctgggc ctgcccctgt gtattcacca
     2581 ccaataaatc agaccatgaa accagtgaaa aaaaaaaaaa
//