LOCUS       BC012375                2028 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens arylsulfatase G, mRNA (cDNA clone MGC:8996
            IMAGE:3882163), complete cds.
ACCESSION   BC012375
VERSION     BC012375.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2028)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2028)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged,
            H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M.,
            Nanavati, A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 21 Row: o Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            45430056.
FEATURES             Location/Qualifiers
     source          1..2028
                     /db_xref="H-InvDB:HIT000035788"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8996 IMAGE:3882163"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_68"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2028
                     /gene="ARSG"
                     /gene_synonym="KIAA1001"
                     /db_xref="GeneID:22901"
                     /db_xref="HGNC:HGNC:24102"
                     /db_xref="MIM:610008"
     CDS             40..1617
                     /gene="ARSG"
                     /gene_synonym="KIAA1001"
                     /codon_start=1
                     /product="arylsulfatase G"
                     /protein_id="AAH12375.1"
                     /db_xref="GeneID:22901"
                     /db_xref="HGNC:HGNC:24102"
                     /db_xref="MIM:610008"
                     /translation="MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILAD
                     DMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGV
                     TRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIP
                     YSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSS
                     LAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPRGRSLYGAGLWEM
                     DSLVGQIKDKVDHTVKENTFLWFTGDNGPWAQKCELAGSVGPFTGFWQTRQGGSPAKQ
                     TTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQASLPQGRRFDGVDVS
                     EVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGGARACDGSTGPELQH
                     KFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVLQDIANDNISSADYTQDPS
                     VTPCCNPYQIACRCQAA"
BASE COUNT      472 a    532 c    542 g    482 t
ORIGIN
        1 gccgtcgctc cagacaatcg gaatcctgcc ttcaccacca tgggctggct ttttctaaag
       61 gttttgttgg cgggagtgag tttctcagga tttctttatc ctcttgtgga tttttgcatc
      121 agtgggaaaa caagaggaca gaagccaaac tttgtgatta ttttggccga tgacatgggg
      181 tggggtgacc tgggagcaaa ctgggcagaa acaaaggaca ctgccaacct tgataagatg
      241 gcttcggagg gaatgaggtt tgtggatttc catgcagctg cctccacctg ctcaccctcc
      301 cgggcttcct tgctcaccgg ccggcttggc cttcgcaatg gagtcacacg caactttgca
      361 gtcacttctg tgggaggcct tccgctcaac gagaccacct tggcagaggt gctgcagcag
      421 gcgggttacg tcactgggat aataggcaaa tggcatcttg gacaccacgg ctcttatcac
      481 cccaacttcc gtggttttga ttactacttt ggaatcccat atagccatga tatgggctgt
      541 actgatactc caggctacaa ccaccctcct tgtccagcgt gtccacaggg tgatggacca
      601 tcaaggaacc ttcaaagaga ctgttacact gacgtggccc tccctcttta tgaaaacctc
      661 aacattgtgg agcagccggt gaacttgagc agccttgccc agaagtatgc tgagaaagca
      721 acccagttca tccagcgtgc aagcaccagc gggaggccct tcctgctcta tgtggctctg
      781 gcccacatgc acgtgccctt acctgtgact cagctaccag cagcgccacg gggcagaagc
      841 ctgtatggtg cagggctctg ggagatggac agtctggtgg gccagatcaa ggacaaagtt
      901 gaccacacag tgaaggaaaa cacattcctc tggtttacag gagacaatgg cccgtgggct
      961 cagaagtgtg agctagcggg cagtgtgggt cccttcactg gattttggca aactcgtcaa
     1021 gggggaagtc cagccaagca gacgacctgg gaaggagggc accgggtccc agcactggct
     1081 tactggcctg gcagagttcc agttaatgtc accagcactg ccttgttaag cgtgctggac
     1141 atttttccaa ctgtggtagc cctggcccag gccagcttac ctcaaggacg gcgctttgat
     1201 ggtgtggacg tctccgaggt gctctttggc cggtcacagc ctgggcacag ggtgctgttc
     1261 caccccaaca gcggggcagc tggagagttt ggagccctgc agactgtccg cctggagcgt
     1321 tacaaggcct tctacattac cggtggagcc agggcgtgtg atgggagcac ggggcctgag
     1381 ctgcagcata agtttcctct gattttcaac ctggaagacg ataccgcaga agctgtgccc
     1441 ctagaaagag gtggtgcgga gtaccaggct gtgctgcccg aggtcagaaa ggttcttgca
     1501 gacgtcctcc aagacattgc caacgacaac atctccagcg cagattacac tcaggaccct
     1561 tcagtaactc cctgctgtaa tccctaccaa attgcctgcc gctgtcaagc cgcataacag
     1621 accaattttt attccacgag gaggagtacc tggaaattag gcaagtttgc ttccaaattt
     1681 catttttacc ctctttacaa acacacgctt tagtttagtc ttggagttta gttttggagt
     1741 tagccttgca tatcccttct gtatcctgtc cctcctccac gccgacccga gagcagctga
     1801 gctgcgctgg ctctgggcag ggagtgtgcc ttaatgggaa gcacacgggc tttggagtca
     1861 ggcacaggtg ccagctccag cttttgaact tgggcaattg tttaacctaa cctgcaagtt
     1921 gattttgagg gttaaataaa ggcatacatg aaaatgcctg gcaaattacc tgacacagag
     1981 cagacattca atacatttta gtttccttgt ttcaaaaaaa aaaaaaaa
//