LOCUS BC008931 1329 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens SET domain and mariner transposase fusion gene, mRNA (cDNA clone IMAGE:2819917), partial cds. ACCESSION BC008931 VERSION BC008931.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1329) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1329) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (29-MAY-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC008931.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 2 Row: k Column: 7 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 5730038. FEATURES Location/Qualifiers source 1..1329 /db_xref="H-InvDB:HIT000034086" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:2819917" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..1329 /gene="SETMAR" /db_xref="GeneID:6419" CDS <1..1292 /gene="SETMAR" /codon_start=3 /product="SETMAR protein" /protein_id="AAH08931.2" /db_xref="GeneID:6419" /translation="QRRHGLVGWRSLRRSLRPRLSSWMSRAARKTCRWARGPRGPRRR PSSSLYCPVEKSNISCGNEKEPSMCGSAPSVFPSCKRLTLETMKMMLDKKQIRAIFLF EFKMGRKAAETTRNINNAFGPGTANERTVQWWFKKFCKGDESLEDEERSGRPSEVDND QLRAIIEADPLTTTREVAEELNVNHSTVVRHLKQIGKVKKLDKWVPHELTENQKNRRF EVSSSLILRNHNEPFLDRIVTCDEKWILYDNRRRSAQWLDQEEAPKHFPKPILHPKKV MVTIWWSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNRKGPILLHDN ARPHVAQPTLQKLNELGYEVLPHPPYSPDLLPTNYHVFKHLNNFLQGKRFHNQQDAEN AFQEFVESQSTDFYATGINQLISRWQKCVDCNGSYFD" misc_feature 738..980 /gene="SETMAR" /note="Transposase_1; Region: Transposase. This family includes the mariner transposase" /db_xref="CDD:pfam01359" BASE COUNT 401 a 311 c 309 g 308 t ORIGIN 1 ggcaaagacg acacggcctt gtgggatggc ggagtttaag gagaagcctg aggccccgac 61 tgagcagctg gatgtcgcgt gcggccagga aaacttgccg gtgggcgcgt ggcccccggg 121 ggccgcgccg gcgcccttcc agttctctgt actgccccgt agaaaagtcg aacatcagtt 181 gtggaaatga gaaggaaccc agcatgtgtg gctcagcccc ttctgtgttc ccctcctgca 241 agcgattgac ccttgagact atgaaaatga tgttagacaa aaagcaaatt cgagcaattt 301 tcttattcga gttcaaaatg ggtcgtaaag cagcagaaac aactcgcaac atcaacaatg 361 catttggccc aggaactgct aacgaacgta cagtgcagtg gtggttcaag aagttttgca 421 aaggagatga gagccttgaa gatgaggagc gtagtggccg gccatcagaa gttgacaacg 481 accagttgag agcaatcatc gaagctgatc cccttacaac tacacgagaa gttgctgaag 541 aactcaatgt caaccattct acggtcgttc gacatttgaa gcaaattgga aaggtgaaaa 601 agctcgataa gtgggtgcct catgagctga ctgaaaatca aaaaaatcgt cgttttgaag 661 tgtcatcttc tcttattcta cgcaaccaca acgaaccatt tctcgatcgg attgtgacgt 721 gtgatgaaaa gtggatttta tatgacaacc ggcgacgatc agctcagtgg ttggatcaag 781 aagaagctcc aaagcacttc ccaaagccaa tcttgcaccc aaaaaaggtc atggtcacta 841 tttggtggtc tgctgctggt ctgatccact acagctttct gaatcccggt gaaaccatta 901 catctgagaa gtatgctcag gaaatcgatg agatgaacca aaaactgcaa cgcctgcagc 961 tggcattggt caacagaaag ggcccaattc ttctccacga caatgcccga ccgcatgttg 1021 cacaacccac acttcaaaag ttgaatgaat tgggctatga agttttgcct catccaccgt 1081 attcacctga cctcttgcca accaactacc acgtctttaa gcatctcaac aactttttgc 1141 agggaaaacg cttccacaac cagcaggatg cagaaaatgc tttccaagag ttcgtcgaat 1201 cccaaagcac ggatttttac gctacaggaa taaaccaact tatttctcgt tggcaaaaat 1261 gtgttgattg taatggttcc tattttgatt aataaaaatg cgttgagcct aaaaaaaaaa 1321 aaaaaaaaa //