LOCUS HUMLSPRO 2622 bp mRNA linear HUM 27-APR-1993 DEFINITION Human lymphocyte surface protein exons 1-5, complete cds. ACCESSION M99578 VERSION M99578.1 KEYWORDS lymphocyte surface protein. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2622) AUTHORS Voland,J.R., Wyzykowski,R.J., Huang,M. and Dutton,R.W. TITLE Cloning and sequencing of a trophoblast-endothelial-activated lymphocyte surface protein: cDNA sequence and genomic structure JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89 (21), 10425-10429 (1992) PUBMED 1438229 COMMENT Original source text: Homo sapiens (library: lambda gt11) neonate placenta cDNA to mRNA. FEATURES Location/Qualifiers source 1..2622 /db_xref="H-InvDB:HIT000197044_03" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /cell_type="trophoblast" /tissue_type="placenta" /dev_stage="neonate" /tissue_lib="lambda gt11" mRNA join(1..153,153..933,934..1082,1083..1323,1324..2622) 5'UTR 1..171 /note="84% GC in 5'UTR" exon 1..153 /note="codes the 5' UTR; intron between exon 1 and exon 2 is 2.7Kb" /number=1 exon 153..933 /note="codes the extracellular domain; intron between exon 2 and exon 3 is 0.7Kb" /number=2 CDS 170..1822 /note="550 amino acids MW=61kDa, glycosylated=75 kDa; expressed on endothelium, activated lymphocytes and syncytiotrophoblast, contains leucine zipper and basic region homologous to myc; 721P" /codon_start=1 /protein_id="AAA36187.1" /translation="MAAATIVHDTSEAVELCPAYGLYLKPITKMTISVALPQLKQPGK SISNWEVMERLKGMVQNHQFSTLRISKSTMDFIRFEGEVENKSLVKSFLACLDGKTIK LSGFSDILKVRAAEFKIDFPTRHDWDSFFRDAKDMNETLPGERPDTIHLEGLPCKWFA LKESGSEKPSEDVLVKVFEKFGEIRNVDIPMLDPYREEMTGRNFHTFSFGGHLNFEAY VQYREYMGFIQAMSALRGMKLMYKGEDGKAVACNIKVSFDSTKHLSDASIKKRQLERQ KLQELEQQREEQKRREKEAEERQRAEERKQKELEELERERKREEKLRKREQKQRDREL RRNQKKLEKLQAEEQKQLQEKIKLEERKLLLAQRNLQSIRLIAELLSRAKAVKLREQE QKEEKLRLQQQEERRRLQEAELRRVEEEKERALGLQRKERELRERLLSILQSKKPDDS HTHDELGVAHGPAAARPGHPADRVVRLCERHHAAPPRGPAPGRCPQGEPGPPRGRRRS QKRERERGRGGPMQGGSELLSCGPRGWLSREEVPGRRPLLHS" misc_feature 172..232 /note="codes for protein leader sequence" exon 934..1082 /note="codes the putative basic region; intron between exon 3 and exon 4 is 3.5Kb" /number=3 exon 1083..1323 /note="codes the leucine zipper region; the intron between exon 4 and exon 5 is 1.3Kb" /number=4 misc_feature 1199..1263 /note="codes for leucine zipper, homologous to myc" exon 1324..2622 /note="codes the serine phosphorylation site and the 3'UTR" /number=5 BASE COUNT 589 a 792 c 898 g 343 t ORIGIN 1 ggcgacggcg gtggcggcgt cggaggcgcc tccgggggac ggtggcggct cccggcggtg 61 aggccgcgcc tgtccgggga tcgtcgaggg acggcgggag cttgggccag cggcggcggc 121 ggcctgggac gcaggcggag ccccgcgcag gcccaaggtc ccggaggcta tggcagcggc 181 taccatcgtg cacgacacgt ctgaggccgt ggagctctgc cctgcttacg gcttgtacct 241 gaagcccatc accaagatga ccatcagcgt ggcactcccg cagctgaagc agccggggaa 301 gtccatctcc aactgggagg tgatggagag gctgaagggc atggtgcaga accaccagtt 361 ctccacgctg cgtatttcca agagcaccat ggacttcatc cgcttcgagg gggaggtgga 421 gaacaagagc ctggtcaagt cttttctggc ctgcctggac ggcaagacca tcaagctcag 481 cggcttctcc gacatcctga aggtgcgcgc ggccgagttc aagatcgact tccccacccg 541 ccacgactgg gactccttct tccgcgacgc caaggacatg aacgagaccc tgccggggga 601 gcggccggac accatccacc tggaggggct gccctgcaag tggttcgccc tgaaggagtc 661 gggctccgag aagcccagcg aggacgtcct ggtcaaggtg tttgagaagt tcggggagat 721 ccggaatgtg gacatcccca tgctggaccc ctaccgggag gagatgacgg gccgcaactt 781 ccacaccttc agtttcgggg ggcacttgaa cttcgaggcc tatgtgcagt accgtgagta 841 catgggcttc atccaggcca tgagcgccct gcgcgggatg aaactcatgt acaagggcga 901 ggacggcaag gccgtggcct gcaacatcaa ggtttctttt gattcgacca aacacctgag 961 tgatgcctca attaagaagc ggcagctgga gaggcagaag cttcaggaac tggagcagca 1021 aagagaagaa caaaagcgca gagagaagga agcggaggag aggcagcgag cggaggaaag 1081 gaaacaaaag gagctggaag agctggagcg agagaggaaa agagaagaga agcttcgcaa 1141 gagggagcag aagcagaggg accgtgagct gcgccggaat cagaagaagc tggagaagct 1201 gcaggcggag gagcagaagc agctgcagga gaagatcaag ctggaggagc gcaagctgct 1261 gctggcccag aggaacctgc agtccatccg gctcatcgcc gagctgctca gcagagccaa 1321 ggctgtgaag ctacgggaac aggagcagaa ggaggagaag ctgaggctcc agcagcagga 1381 ggagcggcgg cggctgcagg aggccgagct gcggcgcgtg gaggaggaga aggagcgcgc 1441 gctgggcctg cagcggaaag agcgggagct gcgcgagcgg ctgctgagca tcctgcagag 1501 caagaagccg gacgacagcc acacacacga cgagctgggc gtggcacacg gacctgctgc 1561 agcccgtcct ggacatcctg cagaccgtgt cgtccggctg tgtgagcgcc accacgctgc 1621 accccctcgg gggccagccc ccggccggtg cccccaagga gagcccggcc cccccagagg 1681 ccgacggcgc tcccaaaagc gtgaacggga gcgtggccga ggaggcccca tgcaaggagg 1741 ttcagagctc ctgtcgtgtg gtccccgagg atggctctcc agagaagagg tgcccgggcg 1801 gcgtcctctc ctgcattcct gacaacaacc aacagcccaa gggcatccct gcctgcgagc 1861 agaatgtctc cagaaaggac acccggtcag aacaggacaa gtgcaaccgg gagcccagca 1921 agggccgggg ccgggccacc ggagacgggc ttgctgaccg gcacaagcgg gagaggagcc 1981 gggccaggcg ggccagcagc agggaggacg ggaggccacg caaggagcgg cggccccaca 2041 agaagcacgc ctacaaggat gacagccccc gccggcgcag cacgagcccg gaccacaccc 2101 ggtcccggag gtcccacagc aaagacaggc accggaggga gcggagccgg gagcggaggg 2161 gcagcgccag caggaagcac agccgccacc gccgccgaag cgagcggtcg cgctcccggt 2221 ccccgagcag gcaccgcagt acctggaaca ggtaatgacg ggcacggcct ccccacggcc 2281 tgtccgggaa agaccaggac ctgctcgagc ctcctggccg ctccttggcc gctctccgtc 2341 cacccctgca aagccaagac ccttctgcag ccacgaatgt ccacggagcc cgccggcagg 2401 aaggaagaca ccatgcttta gagatccatc tttctccact caccgcagcg tacttggcac 2461 ttcagtttca aacacgtagt cctttaaaac ttgatccgat agctttaatg cggccggtcc 2521 tctctcagtc aggaaaattg cacagaccga cagtcgtgag gatggcagag ctgctgcatt 2581 cccccacacg gggatttctg tgtctgcttg gcgacctcct ac //