LOCUS       BC029886                2406 bp    mRNA    linear   HUM 06-OCT-2003
DEFINITION  Homo sapiens occludin, mRNA (cDNA clone MGC:34277 IMAGE:5179203),
            complete cds.
ACCESSION   BC029886
VERSION     BC029886.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2406)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2406)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 51 Row: p Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 9257230.
FEATURES             Location/Qualifiers
     source          1..2406
                     /db_xref="H-InvDB:HIT000040910"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:34277 IMAGE:5179203"
                     /tissue_type="Brain, Lung, Testis, adult, pooled whole"
                     /clone_lib="NIH_MGC_115"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2406
                     /gene="OCLN"
                     /db_xref="GeneID:4950"
                     /db_xref="MIM:602876"
     CDS             179..1747
                     /gene="OCLN"
                     /codon_start=1
                     /product="OCLN protein"
                     /protein_id="AAH29886.1"
                     /db_xref="GeneID:4950"
                     /db_xref="MIM:602876"
                     /translation="MSSRPLESPPPYRPDEFKPNHYAPSNDIYGGEMHVRPMLSQPAY
                     SFYPEDEILHFYKWTSPPGVIRILSMLIIVMCIAIFACVASTLAWDRGYGTSLLGGSV
                     GYPYGGSGFGSYGSGYGYGYGYGYGYGGYTDPRAAKGFMLAMAAFCFIAALVIFVTSV
                     IRSEMSRTRRYYLSVIIVSAILGIMVFIATIVYIMGVNPTAQSSGSLYGSQIYALCNQ
                     FYTPAATGLYVDQYSYHYCVVDPQEAIAIVLGFMIIVAFALIIFFAVKTRRKMDRYDK
                     SNILWDKEHIYDEQPPNVEEWVKNVSAGTQDVPSPPSDYVERVDSPMAYSSNGKVNDK
                     RFYPESSYKSTPVPEVVQELPLTSPVDDFRQPRYSSGGNFETPSKRAPAKGRAGRSKR
                     TEQDHYETDYTTGGESCDELEEDWIREYPPITSDQQRQLYKRNFDTGLQEYKSLQSEL
                     DEINKELSRLDKELDDYREESEEYMAAADEYNRLKQVKGSADYKSKKNHCKQLKSKLS
                     HIKKMVGDYDRQKT"
     misc_feature    341..1741
                     /gene="OCLN"
                     /note="Occludin; Region: Occludin/ELL family"
                     /db_xref="CDD:pfam02168"
BASE COUNT      694 a    496 c    542 g    674 t
ORIGIN
        1 gtccccggct gagcgctggc ggtcggtgcg gcgtcaggtg cgcccgccag gtgagcgcgc
       61 tccctggcac cgttggcccc cggagggtcg ggcccagttg cggcgagcgg attggtttat
      121 cttggaagct aaagggcatt gctcatcctg aagatcagct gaccattgac aatcagccat
      181 gtcatccagg cctcttgaaa gtccacctcc ttacaggcct gatgaattca aaccgaatca
      241 ttatgcacca agcaatgaca tatatggtgg agagatgcat gttcgaccaa tgctctctca
      301 gccagcctac tctttttacc cagaagatga aattcttcac ttctacaaat ggacctctcc
      361 tccaggagtg attcggatcc tgtctatgct cattattgtg atgtgcattg ccatctttgc
      421 ctgtgtggcc tccacgcttg cctgggacag aggctatgga acttcccttt taggaggtag
      481 tgtaggctac ccttatggag gaagtggctt tggtagctac ggaagtggct atggctatgg
      541 ctatggttat ggctatggct acggaggcta tacagaccca agagcagcaa agggcttcat
      601 gttggccatg gctgcctttt gtttcattgc cgcgttggtg atctttgtta ccagtgttat
      661 aagatctgaa atgtccagaa caagaagata ctacttaagt gtgataatag tgagtgctat
      721 cctgggcatc atggtgttta ttgccacaat tgtctatata atgggagtga acccaactgc
      781 tcagtcttct ggatctctat atggttcaca aatatatgcc ctctgcaacc aattttatac
      841 acctgcagct actggactct acgtggatca gtattcgtat cactactgtg ttgtggatcc
      901 ccaggaggcc attgccattg tactggggtt catgattatt gtggcttttg ctttaataat
      961 tttctttgct gtgaaaactc gaagaaagat ggacaggtat gacaagtcca atattttgtg
     1021 ggacaaggaa cacatttatg atgagcagcc ccccaatgtc gaggagtggg ttaaaaatgt
     1081 gtctgcaggc acacaggacg tgccttcacc cccatctgac tatgtggaaa gagttgacag
     1141 tcccatggca tactcttcca atggcaaagt gaatgacaag cggttttatc cagagtcttc
     1201 ctataaatcc acgccggttc ctgaagtggt tcaggagctt ccattaactt cgcctgtgga
     1261 tgacttcagg cagcctcgtt acagcagcgg tggtaacttt gagacacctt caaaaagagc
     1321 acctgcaaag ggaagagcag gaaggtcaaa gagaacagag caagatcact atgagacaga
     1381 ctacacaact ggcggcgagt cctgtgatga gctggaggag gactggatca gggaatatcc
     1441 acctatcact tcagatcaac aaagacaact gtacaagagg aattttgaca ctggcctaca
     1501 ggaatacaag agcttacaat cagaacttga tgagatcaat aaagaactct cccgtttgga
     1561 taaagaattg gatgactata gagaagaaag tgaagagtac atggctgctg ctgatgaata
     1621 caatagactg aagcaagtga agggatctgc agattacaaa agtaagaaga atcattgcaa
     1681 gcagttaaag agcaaattgt cacacatcaa gaagatggtt ggagactatg atagacagaa
     1741 aacatagaag gctgatgcca agttgtttga gaaattaagt atctgacatc tctgcaatct
     1801 tctcagaagg caaatgactt tggaccataa ccccggaagc caaacctctg tgagcatcac
     1861 aaagttttgg ttgctttaac atcatcagta ttgaagcatt ttataaatcg cttttgataa
     1921 tcaactgggc tgaacactcc aattaaggat tttatgcttt aaacattggt tcttgtatta
     1981 agaatgaaat actgtttgag gtttttaagc cttaaaggaa ggttctggtg tgaactaaac
     2041 tttcacaccc cagacgatgt cttcatacct acatgtattt gtttgcatag gtgatctcat
     2101 ttaatcctct caaccacctt tcagataact gttatttata atcacttttt tccacataag
     2161 gaaactgggt tcctgcaatg aagtctctga agtgaaactg cttgtttcct agcacacact
     2221 tttggttaag tctgttttat gacttcatta ataataaatt ccctggcctt tcatatttta
     2281 gctactatat atgtgatgat ctaccagcct ccctattttt tttctgttat ataaatggtt
     2341 aaaagaggtt tttcttaaat aataaagatc atgtaaaagt aaaaaaaaaa aaaaaaaaaa
     2401 aaaaaa
//