LOCUS       BC029886                2406 bp    mRNA    linear   HUM 06-OCT-2003
DEFINITION  Homo sapiens occludin, mRNA (cDNA clone MGC:34277 IMAGE:5179203),
            complete cds.
ACCESSION   BC029886
VERSION     BC029886.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2406)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2406)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 51 Row: p Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 9257230.
FEATURES             Location/Qualifiers
     source          1..2406
                     /db_xref="H-InvDB:HIT000040910"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:34277 IMAGE:5179203"
                     /tissue_type="Brain, Lung, Testis, adult, pooled whole"
                     /clone_lib="NIH_MGC_115"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2406
                     /gene="OCLN"
                     /db_xref="GeneID:4950"
                     /db_xref="MIM:602876"
     CDS             179..1747
                     /gene="OCLN"
                     /codon_start=1
                     /product="OCLN protein"
                     /protein_id="AAH29886.1"
                     /db_xref="GeneID:4950"
                     /db_xref="MIM:602876"
                     /translation="MSSRPLESPPPYRPDEFKPNHYAPSNDIYGGEMHVRPMLSQPAY
                     SFYPEDEILHFYKWTSPPGVIRILSMLIIVMCIAIFACVASTLAWDRGYGTSLLGGSV
                     GYPYGGSGFGSYGSGYGYGYGYGYGYGGYTDPRAAKGFMLAMAAFCFIAALVIFVTSV
                     IRSEMSRTRRYYLSVIIVSAILGIMVFIATIVYIMGVNPTAQSSGSLYGSQIYALCNQ
                     FYTPAATGLYVDQYSYHYCVVDPQEAIAIVLGFMIIVAFALIIFFAVKTRRKMDRYDK
                     SNILWDKEHIYDEQPPNVEEWVKNVSAGTQDVPSPPSDYVERVDSPMAYSSNGKVNDK
                     RFYPESSYKSTPVPEVVQELPLTSPVDDFRQPRYSSGGNFETPSKRAPAKGRAGRSKR
                     TEQDHYETDYTTGGESCDELEEDWIREYPPITSDQQRQLYKRNFDTGLQEYKSLQSEL
                     DEINKELSRLDKELDDYREESEEYMAAADEYNRLKQVKGSADYKSKKNHCKQLKSKLS
                     HIKKMVGDYDRQKT"
     misc_feature    341..1741
                     /gene="OCLN"
                     /note="Occludin; Region: Occludin/ELL family"
                     /db_xref="CDD:pfam02168"
BASE COUNT          694 a          496 c          542 g          674 t
ORIGIN      
        1 gtccccggct gagcgctggc ggtcggtgcg gcgtcaggtg cgcccgccag gtgagcgcgc
       61 tccctggcac cgttggcccc cggagggtcg ggcccagttg cggcgagcgg attggtttat
      121 cttggaagct aaagggcatt gctcatcctg aagatcagct gaccattgac aatcagccat
      181 gtcatccagg cctcttgaaa gtccacctcc ttacaggcct gatgaattca aaccgaatca
      241 ttatgcacca agcaatgaca tatatggtgg agagatgcat gttcgaccaa tgctctctca
      301 gccagcctac tctttttacc cagaagatga aattcttcac ttctacaaat ggacctctcc
      361 tccaggagtg attcggatcc tgtctatgct cattattgtg atgtgcattg ccatctttgc
      421 ctgtgtggcc tccacgcttg cctgggacag aggctatgga acttcccttt taggaggtag
      481 tgtaggctac ccttatggag gaagtggctt tggtagctac ggaagtggct atggctatgg
      541 ctatggttat ggctatggct acggaggcta tacagaccca agagcagcaa agggcttcat
      601 gttggccatg gctgcctttt gtttcattgc cgcgttggtg atctttgtta ccagtgttat
      661 aagatctgaa atgtccagaa caagaagata ctacttaagt gtgataatag tgagtgctat
      721 cctgggcatc atggtgttta ttgccacaat tgtctatata atgggagtga acccaactgc
      781 tcagtcttct ggatctctat atggttcaca aatatatgcc ctctgcaacc aattttatac
      841 acctgcagct actggactct acgtggatca gtattcgtat cactactgtg ttgtggatcc
      901 ccaggaggcc attgccattg tactggggtt catgattatt gtggcttttg ctttaataat
      961 tttctttgct gtgaaaactc gaagaaagat ggacaggtat gacaagtcca atattttgtg
     1021 ggacaaggaa cacatttatg atgagcagcc ccccaatgtc gaggagtggg ttaaaaatgt
     1081 gtctgcaggc acacaggacg tgccttcacc cccatctgac tatgtggaaa gagttgacag
     1141 tcccatggca tactcttcca atggcaaagt gaatgacaag cggttttatc cagagtcttc
     1201 ctataaatcc acgccggttc ctgaagtggt tcaggagctt ccattaactt cgcctgtgga
     1261 tgacttcagg cagcctcgtt acagcagcgg tggtaacttt gagacacctt caaaaagagc
     1321 acctgcaaag ggaagagcag gaaggtcaaa gagaacagag caagatcact atgagacaga
     1381 ctacacaact ggcggcgagt cctgtgatga gctggaggag gactggatca gggaatatcc
     1441 acctatcact tcagatcaac aaagacaact gtacaagagg aattttgaca ctggcctaca
     1501 ggaatacaag agcttacaat cagaacttga tgagatcaat aaagaactct cccgtttgga
     1561 taaagaattg gatgactata gagaagaaag tgaagagtac atggctgctg ctgatgaata
     1621 caatagactg aagcaagtga agggatctgc agattacaaa agtaagaaga atcattgcaa
     1681 gcagttaaag agcaaattgt cacacatcaa gaagatggtt ggagactatg atagacagaa
     1741 aacatagaag gctgatgcca agttgtttga gaaattaagt atctgacatc tctgcaatct
     1801 tctcagaagg caaatgactt tggaccataa ccccggaagc caaacctctg tgagcatcac
     1861 aaagttttgg ttgctttaac atcatcagta ttgaagcatt ttataaatcg cttttgataa
     1921 tcaactgggc tgaacactcc aattaaggat tttatgcttt aaacattggt tcttgtatta
     1981 agaatgaaat actgtttgag gtttttaagc cttaaaggaa ggttctggtg tgaactaaac
     2041 tttcacaccc cagacgatgt cttcatacct acatgtattt gtttgcatag gtgatctcat
     2101 ttaatcctct caaccacctt tcagataact gttatttata atcacttttt tccacataag
     2161 gaaactgggt tcctgcaatg aagtctctga agtgaaactg cttgtttcct agcacacact
     2221 tttggttaag tctgttttat gacttcatta ataataaatt ccctggcctt tcatatttta
     2281 gctactatat atgtgatgat ctaccagcct ccctattttt tttctgttat ataaatggtt
     2341 aaaagaggtt tttcttaaat aataaagatc atgtaaaagt aaaaaaaaaa aaaaaaaaaa
     2401 aaaaaa
//