LOCUS BC044244 2073 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens tetraspanin 33, mRNA (cDNA clone MGC:50844 IMAGE:5759132), complete cds. ACCESSION BC044244 VERSION BC044244.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2073) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2073) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (13-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 89 Row: i Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31341698. FEATURES Location/Qualifiers source 1..2073 /db_xref="H-InvDB:HIT000052858" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:50844 IMAGE:5759132" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2073 /gene="TSPAN33" /gene_synonym="MGC50844" /gene_synonym="PEN" /db_xref="GeneID:340348" /db_xref="HGNC:HGNC:28743" /db_xref="MIM:610120" CDS 230..1081 /gene="TSPAN33" /gene_synonym="MGC50844" /gene_synonym="PEN" /codon_start=1 /product="tetraspanin 33" /protein_id="AAH44244.1" /db_xref="GeneID:340348" /db_xref="HGNC:HGNC:28743" /db_xref="MIM:610120" /translation="MARRPRAPAASGEEFSFVSPLVKYLLFFFNMLFWVISMVMVAVG VYARLMKHAEAALACLAVDPAILLIVVGVLMFLLTFCGCIGSLRENICLLQTFSLCLT AVFLLQLAAGILGFVFSDKARGKVSEIINNAIVHYRDDLDLQNLIDFGQKKFSCCGGI SYKDWSQNMYFNCSEDNPSRERCSVPYSCCLPTPDQAVINTMCGQGMQAFDYLEASKV IYTNGCIDKLVNWIHSNLFLLGGVALGLAIPQLVGILLSQILVNQIKDQIKLQLYNQQ HRADPWY" BASE COUNT 450 a 602 c 580 g 441 t ORIGIN 1 ctcgtccgct cgcgctgccc accgcggctc cagcagctcc aggcgcggtt ccccggcccg 61 cgccgctccc ggccccccgg ctcgggcgcc tcccgccgca gcgcaggctc ccccgccggc 121 cgggctcctg cgcggcgcgg ctcggctcat gcccccgggc gcggggcaca caggccggcc 181 ggcagccgct gggaaatagg cccccggggg cggtggcggc ggcggggcca tggcgcggag 241 accccgggcg ccggccgcct ccggggagga gttctccttc gtcagcccgc tggtgaaata 301 cctgctcttc ttcttcaaca tgctcttctg ggtgatttcc atggtgatgg tggctgtggg 361 tgtctacgct cggctaatga agcatgcaga agcagcccta gcctgcctgg cagtggaccc 421 tgccatcctg ctgatcgtgg tgggtgtcct catgttcctg ctcaccttct gtggctgcat 481 tgggtccctc cgcgagaaca tctgcctcct gcagacgttc tccctctgcc tcaccgctgt 541 gttcctgctg cagctggccg ctgggatcct gggcttcgtc ttctcagaca aggctcgagg 601 gaaagtgagt gagatcatca acaatgccat tgtgcactac cgagatgact tggatctgca 661 gaacctcatt gattttggcc agaaaaagtt tagctgctgt ggagggattt cctacaagga 721 ctggtctcag aacatgtatt tcaactgctc agaagacaac cccagtcgag agcgctgctc 781 tgtgccttac tcctgttgct tgcctactcc tgaccaggca gtgatcaaca ctatgtgtgg 841 ccaaggtatg caggcctttg actacttgga agctagcaaa gtcatctaca ccaatggctg 901 tattgacaag ttggtcaact ggatacacag caacctattc ttacttggtg gtgtggctct 961 aggcctggcc atcccccagc tggtgggaat tctgctgtcc cagatcctag tgaatcagat 1021 caaagatcag atcaagctac agctctacaa ccagcagcac cgggctgacc catggtactg 1081 agaatccatc ctgcacctcc tcaccatgga aactggcaag cctcataaac gaacagcagt 1141 gggtgctgaa agcagcacca aatggagatt tggattccag ccccccagtg acagcccagt 1201 gggaagaagc aaactccaga tgggcagaag gcagggtgca caggtggctc cagtctcagg 1261 aggatgcgcc tcctctcccc catcccagtc ctcagcattg tgccagagtg atacccttaa 1321 gtgtttgggt ttatgttttc agttttgttt gggaaacagc agttgcacag agagttgggg 1381 gtactgctgc tgccttttca ccgaggcact gccaccacca gctctagcag ggatgctcct 1441 gagcttggcg gacatactta gatcctaacg tgccagtgag acctggctgt ggagagtagc 1501 actggcagcc ctgcctggac tccacttggc atgataccag ctccagaagg gaagggagtg 1561 gagcaggcag tgaggagaga gcctgggggt cggctgggga cagccgtatg tgctaggtag 1621 gagtggaggg agatatgttt accaaatgcc tgtcctgcca tcctcccagg tagtcagagt 1681 gagctacatc ctgccccgcc ttcatttcca tggaaacatg gcagctagga cacggggtac 1741 aacagcagcc aaattcttcc ccacctccct tacttcgaaa aaaagtttgg aaccctggtc 1801 cctatactct gcagtcagaa gtgggactga gccatacatg cccttgaatt cctccctgtc 1861 tggccctccc tctccagcaa gcagggtttt ctttaacttg gcagtgtgca gaggagaagt 1921 ggtaacaccc ccaccccatt cccctgcatc ggagctcagt attcctacag ggtaagaggt 1981 aggaatcttg ctgggacgag gggagccaga agtggcaata aaagcgtgtt gacctggaaa 2041 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa //