LOCUS       BC003611                2026 bp    mRNA    linear   HUM 09-NOV-2006
DEFINITION  Homo sapiens SUMO1 activating enzyme subunit 1, mRNA (cDNA clone
            MGC:1437 IMAGE:2988252), complete cds.
ACCESSION   BC003611
VERSION     BC003611.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2026)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2026)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (26-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC003611.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 3 Row: f Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4885584.
FEATURES             Location/Qualifiers
     source          1..2026
                     /db_xref="H-InvDB:HIT000031640"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:1437 IMAGE:2988252"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2026
                     /gene="SAE1"
                     /gene_synonym="AOS1"
                     /gene_synonym="FLJ3091"
                     /gene_synonym="HSPC140"
                     /gene_synonym="SUA1"
                     /db_xref="GeneID:10055"
                     /db_xref="HGNC:HGNC:30660"
     CDS             30..1070
                     /gene="SAE1"
                     /gene_synonym="AOS1"
                     /gene_synonym="FLJ3091"
                     /gene_synonym="HSPC140"
                     /gene_synonym="SUA1"
                     /codon_start=1
                     /product="SUMO1 activating enzyme subunit 1"
                     /protein_id="AAH03611.1"
                     /db_xref="GeneID:10055"
                     /db_xref="HGNC:HGNC:30660"
                     /translation="MVEKEEAGGGISEEEAAQYDRQIRLWGLEAQKRLRASRVLLVGL
                     KGLGAEIAKNLILAGVKGLTMLDHEQVTPEDPGAQFLIRTGSVGRNRAEASLERAQNL
                     NPMVDVKVDTEDIEKKPESFFTQFDAVCLTCCSRDVIVKVDQICHKNSIKFFTGDVFG
                     YHGYTFANLGEHEFVEEKTKVAKVSQGVEDGPDTKRAKLDSSETTMVKKKVVFCPVKE
                     ALEVDWSSEKAKAALKRTTSDYFLLQVLLKFRTDKGRDPSSDTYEEDSELLLQIRNDV
                     LDSLGISPDLLPEDFVRYCFSEMAPVCAVVGGILAQEIVKALSQRDPPHNNFFFFDGM
                     KGNGIVECLGPK"
BASE COUNT      499 a    468 c    560 g    499 t
ORIGIN
        1 ccggagctga ggcaggaaga gccggcgcca tggtggagaa ggaggaggct ggcggcggca
       61 ttagcgagga ggaggcggca cagtatgacc ggcagatccg cctgtgggga ctggaggccc
      121 agaaacggct gcgggcctct cgggtgcttc ttgtcggctt gaaaggactt ggggctgaaa
      181 ttgccaagaa tctcatcttg gcaggagtga aaggactgac catgctggat cacgaacagg
      241 taactccaga agatcccgga gctcagttct tgattcgtac tgggtctgtt ggccgaaata
      301 gggctgaagc ctctttggag cgagctcaga atctcaaccc catggtggat gtgaaggtgg
      361 acactgagga tatagagaag aaaccagagt catttttcac tcaattcgat gctgtgtgtc
      421 tgacttgctg ctccagggat gtcatagtta aagttgacca gatctgtcac aaaaatagca
      481 tcaagttctt tacaggagat gtttttggct accatggata cacatttgcc aatctaggag
      541 agcatgagtt tgtagaggag aaaactaaag ttgccaaagt tagccaagga gtagaagatg
      601 ggcccgacac caagagagca aaacttgatt cttctgagac aacgatggtc aaaaagaagg
      661 tggtcttctg ccctgttaaa gaagccctgg aggtggactg gagcagtgag aaagcaaagg
      721 ctgctctgaa gcgcacgacc tccgactact ttctccttca agtgctctta aagttccgta
      781 cagataaagg aagagatccc agttctgata catatgagga agattctgag ttgttgctcc
      841 agatacgaaa tgatgtgctt gactcactgg gtattagtcc tgacctgctt cctgaggact
      901 ttgtcaggta ctgcttctcc gagatggccc cagtgtgtgc ggtggttgga gggattttgg
      961 cacaggaaat tgtgaaggcc ctgtctcagc gggaccctcc tcacaacaac ttcttcttct
     1021 tcgatggcat gaaggggaat gggattgtgg agtgccttgg ccccaagtga actcaagatt
     1081 tggcagcccc agagatgcca actgcagcat gcccacctgt attccctgtc cccttccttc
     1141 atgaaggcat ctccaggcaa ggaaaactga agtcattggc ccgatacaaa acatttcctg
     1201 caacgaagga ggtggtgccg acgtgctgct tcccatcacc agcagctgct cgacaagggg
     1261 cgcagggtgg ctgtctttgt tccagcactg ttcaggctgc ctgtcatccc gggcctgcca
     1321 gctcccctga gtgatgagca cttccaagca cccctctgcc ctttctctgt ccttatgctg
     1381 tcccggcctc gccagccctc tggggcattg tgggagatgc ctgccaggaa tgagcaagct
     1441 ctgttgctcg ggagcctctt gtcaccttct tggacttatt ccccacctga taccttatag
     1501 agaaaagtgt caattcaggt ggagagtagg cccaggcccc atgaggcacc agtggaagca
     1561 cagctccaag ttcagacagg tgcccttaga gaggaaaacc atgacaggca aatgcatttc
     1621 ctctggagtt tgagaccctg acaaacaaca ggtggcatct ggtgtgctgt tcttgagttt
     1681 tcgtttagga ttagttgagt tccagctggg ttttgggaga aaggagatgc taccaagtct
     1741 tggatgttag ggcgagaccc tgcaagttga gtattagaga gcttgtcttt caaggcaggt
     1801 tcctggggct tcagggctag gagggaggag cctgcccttt taacagaacc ccagtcacat
     1861 gcggctcaag tcactcagag gctgttgcat ttcagggcta tgttggtcct ttgtttacct
     1921 cctaaaccac agctgtttgt gtttcacata tgttgtgaat tttccttggt tctttttaaa
     1981 ggaatgataa taaagttact tgctttaaaa aaaaaaaaaa aaaaaa
//