LOCUS BC003611 2026 bp mRNA linear HUM 09-NOV-2006
DEFINITION Homo sapiens SUMO1 activating enzyme subunit 1, mRNA (cDNA clone
MGC:1437 IMAGE:2988252), complete cds.
ACCESSION BC003611
VERSION BC003611.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2026)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2026)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (26-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC003611.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 3 Row: f Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4885584.
FEATURES Location/Qualifiers
source 1..2026
/db_xref="H-InvDB:HIT000031640"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1437 IMAGE:2988252"
/tissue_type="Colon, adenocarcinoma"
/clone_lib="NIH_MGC_15"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2026
/gene="SAE1"
/gene_synonym="AOS1"
/gene_synonym="FLJ3091"
/gene_synonym="HSPC140"
/gene_synonym="SUA1"
/db_xref="GeneID:10055"
/db_xref="HGNC:HGNC:30660"
CDS 30..1070
/gene="SAE1"
/gene_synonym="AOS1"
/gene_synonym="FLJ3091"
/gene_synonym="HSPC140"
/gene_synonym="SUA1"
/codon_start=1
/product="SUMO1 activating enzyme subunit 1"
/protein_id="AAH03611.1"
/db_xref="GeneID:10055"
/db_xref="HGNC:HGNC:30660"
/translation="MVEKEEAGGGISEEEAAQYDRQIRLWGLEAQKRLRASRVLLVGL
KGLGAEIAKNLILAGVKGLTMLDHEQVTPEDPGAQFLIRTGSVGRNRAEASLERAQNL
NPMVDVKVDTEDIEKKPESFFTQFDAVCLTCCSRDVIVKVDQICHKNSIKFFTGDVFG
YHGYTFANLGEHEFVEEKTKVAKVSQGVEDGPDTKRAKLDSSETTMVKKKVVFCPVKE
ALEVDWSSEKAKAALKRTTSDYFLLQVLLKFRTDKGRDPSSDTYEEDSELLLQIRNDV
LDSLGISPDLLPEDFVRYCFSEMAPVCAVVGGILAQEIVKALSQRDPPHNNFFFFDGM
KGNGIVECLGPK"
BASE COUNT 499 a 468 c 560 g 499 t
ORIGIN
1 ccggagctga ggcaggaaga gccggcgcca tggtggagaa ggaggaggct ggcggcggca
61 ttagcgagga ggaggcggca cagtatgacc ggcagatccg cctgtgggga ctggaggccc
121 agaaacggct gcgggcctct cgggtgcttc ttgtcggctt gaaaggactt ggggctgaaa
181 ttgccaagaa tctcatcttg gcaggagtga aaggactgac catgctggat cacgaacagg
241 taactccaga agatcccgga gctcagttct tgattcgtac tgggtctgtt ggccgaaata
301 gggctgaagc ctctttggag cgagctcaga atctcaaccc catggtggat gtgaaggtgg
361 acactgagga tatagagaag aaaccagagt catttttcac tcaattcgat gctgtgtgtc
421 tgacttgctg ctccagggat gtcatagtta aagttgacca gatctgtcac aaaaatagca
481 tcaagttctt tacaggagat gtttttggct accatggata cacatttgcc aatctaggag
541 agcatgagtt tgtagaggag aaaactaaag ttgccaaagt tagccaagga gtagaagatg
601 ggcccgacac caagagagca aaacttgatt cttctgagac aacgatggtc aaaaagaagg
661 tggtcttctg ccctgttaaa gaagccctgg aggtggactg gagcagtgag aaagcaaagg
721 ctgctctgaa gcgcacgacc tccgactact ttctccttca agtgctctta aagttccgta
781 cagataaagg aagagatccc agttctgata catatgagga agattctgag ttgttgctcc
841 agatacgaaa tgatgtgctt gactcactgg gtattagtcc tgacctgctt cctgaggact
901 ttgtcaggta ctgcttctcc gagatggccc cagtgtgtgc ggtggttgga gggattttgg
961 cacaggaaat tgtgaaggcc ctgtctcagc gggaccctcc tcacaacaac ttcttcttct
1021 tcgatggcat gaaggggaat gggattgtgg agtgccttgg ccccaagtga actcaagatt
1081 tggcagcccc agagatgcca actgcagcat gcccacctgt attccctgtc cccttccttc
1141 atgaaggcat ctccaggcaa ggaaaactga agtcattggc ccgatacaaa acatttcctg
1201 caacgaagga ggtggtgccg acgtgctgct tcccatcacc agcagctgct cgacaagggg
1261 cgcagggtgg ctgtctttgt tccagcactg ttcaggctgc ctgtcatccc gggcctgcca
1321 gctcccctga gtgatgagca cttccaagca cccctctgcc ctttctctgt ccttatgctg
1381 tcccggcctc gccagccctc tggggcattg tgggagatgc ctgccaggaa tgagcaagct
1441 ctgttgctcg ggagcctctt gtcaccttct tggacttatt ccccacctga taccttatag
1501 agaaaagtgt caattcaggt ggagagtagg cccaggcccc atgaggcacc agtggaagca
1561 cagctccaag ttcagacagg tgcccttaga gaggaaaacc atgacaggca aatgcatttc
1621 ctctggagtt tgagaccctg acaaacaaca ggtggcatct ggtgtgctgt tcttgagttt
1681 tcgtttagga ttagttgagt tccagctggg ttttgggaga aaggagatgc taccaagtct
1741 tggatgttag ggcgagaccc tgcaagttga gtattagaga gcttgtcttt caaggcaggt
1801 tcctggggct tcagggctag gagggaggag cctgcccttt taacagaacc ccagtcacat
1861 gcggctcaag tcactcagag gctgttgcat ttcagggcta tgttggtcct ttgtttacct
1921 cctaaaccac agctgtttgt gtttcacata tgttgtgaat tttccttggt tctttttaaa
1981 ggaatgataa taaagttact tgctttaaaa aaaaaaaaaa aaaaaa
//