LOCUS BC044244 2073 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens tetraspanin 33, mRNA (cDNA clone MGC:50844
IMAGE:5759132), complete cds.
ACCESSION BC044244
VERSION BC044244.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2073)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2073)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (13-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 89 Row: i Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31341698.
FEATURES Location/Qualifiers
source 1..2073
/db_xref="H-InvDB:HIT000052858"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:50844 IMAGE:5759132"
/tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
/clone_lib="NIH_MGC_116"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2073
/gene="TSPAN33"
/gene_synonym="MGC50844"
/gene_synonym="PEN"
/db_xref="GeneID:340348"
/db_xref="HGNC:HGNC:28743"
/db_xref="MIM:610120"
CDS 230..1081
/gene="TSPAN33"
/gene_synonym="MGC50844"
/gene_synonym="PEN"
/codon_start=1
/product="tetraspanin 33"
/protein_id="AAH44244.1"
/db_xref="GeneID:340348"
/db_xref="HGNC:HGNC:28743"
/db_xref="MIM:610120"
/translation="MARRPRAPAASGEEFSFVSPLVKYLLFFFNMLFWVISMVMVAVG
VYARLMKHAEAALACLAVDPAILLIVVGVLMFLLTFCGCIGSLRENICLLQTFSLCLT
AVFLLQLAAGILGFVFSDKARGKVSEIINNAIVHYRDDLDLQNLIDFGQKKFSCCGGI
SYKDWSQNMYFNCSEDNPSRERCSVPYSCCLPTPDQAVINTMCGQGMQAFDYLEASKV
IYTNGCIDKLVNWIHSNLFLLGGVALGLAIPQLVGILLSQILVNQIKDQIKLQLYNQQ
HRADPWY"
BASE COUNT 450 a 602 c 580 g 441 t
ORIGIN
1 ctcgtccgct cgcgctgccc accgcggctc cagcagctcc aggcgcggtt ccccggcccg
61 cgccgctccc ggccccccgg ctcgggcgcc tcccgccgca gcgcaggctc ccccgccggc
121 cgggctcctg cgcggcgcgg ctcggctcat gcccccgggc gcggggcaca caggccggcc
181 ggcagccgct gggaaatagg cccccggggg cggtggcggc ggcggggcca tggcgcggag
241 accccgggcg ccggccgcct ccggggagga gttctccttc gtcagcccgc tggtgaaata
301 cctgctcttc ttcttcaaca tgctcttctg ggtgatttcc atggtgatgg tggctgtggg
361 tgtctacgct cggctaatga agcatgcaga agcagcccta gcctgcctgg cagtggaccc
421 tgccatcctg ctgatcgtgg tgggtgtcct catgttcctg ctcaccttct gtggctgcat
481 tgggtccctc cgcgagaaca tctgcctcct gcagacgttc tccctctgcc tcaccgctgt
541 gttcctgctg cagctggccg ctgggatcct gggcttcgtc ttctcagaca aggctcgagg
601 gaaagtgagt gagatcatca acaatgccat tgtgcactac cgagatgact tggatctgca
661 gaacctcatt gattttggcc agaaaaagtt tagctgctgt ggagggattt cctacaagga
721 ctggtctcag aacatgtatt tcaactgctc agaagacaac cccagtcgag agcgctgctc
781 tgtgccttac tcctgttgct tgcctactcc tgaccaggca gtgatcaaca ctatgtgtgg
841 ccaaggtatg caggcctttg actacttgga agctagcaaa gtcatctaca ccaatggctg
901 tattgacaag ttggtcaact ggatacacag caacctattc ttacttggtg gtgtggctct
961 aggcctggcc atcccccagc tggtgggaat tctgctgtcc cagatcctag tgaatcagat
1021 caaagatcag atcaagctac agctctacaa ccagcagcac cgggctgacc catggtactg
1081 agaatccatc ctgcacctcc tcaccatgga aactggcaag cctcataaac gaacagcagt
1141 gggtgctgaa agcagcacca aatggagatt tggattccag ccccccagtg acagcccagt
1201 gggaagaagc aaactccaga tgggcagaag gcagggtgca caggtggctc cagtctcagg
1261 aggatgcgcc tcctctcccc catcccagtc ctcagcattg tgccagagtg atacccttaa
1321 gtgtttgggt ttatgttttc agttttgttt gggaaacagc agttgcacag agagttgggg
1381 gtactgctgc tgccttttca ccgaggcact gccaccacca gctctagcag ggatgctcct
1441 gagcttggcg gacatactta gatcctaacg tgccagtgag acctggctgt ggagagtagc
1501 actggcagcc ctgcctggac tccacttggc atgataccag ctccagaagg gaagggagtg
1561 gagcaggcag tgaggagaga gcctgggggt cggctgggga cagccgtatg tgctaggtag
1621 gagtggaggg agatatgttt accaaatgcc tgtcctgcca tcctcccagg tagtcagagt
1681 gagctacatc ctgccccgcc ttcatttcca tggaaacatg gcagctagga cacggggtac
1741 aacagcagcc aaattcttcc ccacctccct tacttcgaaa aaaagtttgg aaccctggtc
1801 cctatactct gcagtcagaa gtgggactga gccatacatg cccttgaatt cctccctgtc
1861 tggccctccc tctccagcaa gcagggtttt ctttaacttg gcagtgtgca gaggagaagt
1921 ggtaacaccc ccaccccatt cccctgcatc ggagctcagt attcctacag ggtaagaggt
1981 aggaatcttg ctgggacgag gggagccaga agtggcaata aaagcgtgtt gacctggaaa
2041 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//