LOCUS       BC021959                3898 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens high-mobility group 20A, mRNA (cDNA clone MGC:8813
            IMAGE:3908842), complete cds.
ACCESSION   BC021959
VERSION     BC021959.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3898)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3898)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (22-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald,  Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 15 Row: d Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 21359925.
FEATURES             Location/Qualifiers
     source          1..3898
                     /db_xref="H-InvDB:HIT000039314"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8813 IMAGE:3908842"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3898
                     /gene="HMG20A"
                     /gene_synonym="FLJ10739"
                     /gene_synonym="HMGX1"
                     /db_xref="GeneID:10363"
                     /db_xref="HGNC:HGNC:5001"
                     /db_xref="MIM:605534"
     CDS             202..1245
                     /gene="HMG20A"
                     /gene_synonym="FLJ10739"
                     /gene_synonym="HMGX1"
                     /codon_start=1
                     /product="high-mobility group 20A"
                     /protein_id="AAH21959.1"
                     /db_xref="GeneID:10363"
                     /db_xref="HGNC:HGNC:5001"
                     /db_xref="MIM:605534"
                     /translation="MENLMTSSTLPPLFADEDGSKESNDLATTGLNHPEVPYSSGATS
                     STNNPEFVEDLSQGQLLQSESSNAAEGNEQRHEDEQRSKRGGWSKGRKRKKPLRDSNA
                     PKSPLTGYVRFMNERREQLRAKRPEVPFPEITRMLGNEWSKLPPEEKQRYLDEADRDK
                     ERYMKELEQYQKTEAYKVFSRKTQDRQKGKSHRQDAARQATHDHEKETEVKERSVFDI
                     PIFTEEFLNHSKAREAELRQLRKSNMEFEERNAALQKHVESMRTAVEKLEVDVIQERS
                     RNTVLQQHLETLRQVLTSSFASMPLPGSGETPTVDTIDSYMNRLHSIILANPQDNENF
                     IATVREVVNRLDR"
BASE COUNT         1062 a          859 c          866 g         1111 t
ORIGIN      
        1 cgagtgcgtg aagtgaaggc gattgagagg ggctgaggga attgtcctct gtggaaggga
       61 ctttcttttg gccctaggcc ccttcctgcc cctgtcgtca gcagagtctc tacaaggaag
      121 ataacggact gtaaaattct ataaagcaaa gctacacatc acttgacacc atacaccatc
      181 ttggttacat aatgaagaga gatggaaaac ttgatgacta gctccaccct accgcccctt
      241 tttgcagatg aagacggttc caaggagagt aatgatctgg ctaccactgg gttaaatcac
      301 ccagaggttc catacagtag tggcgccaca tcatccacca acaatccaga atttgtggag
      361 gatctctctc aaggtcagtt gcttcagagt gagtcttcaa atgcagcaga aggcaatgaa
      421 cagaggcatg aagatgagca acgaagtaaa cgaggaggtt ggtccaaagg aagaaagagg
      481 aagaaacctc ttcgagacag caatgcaccc aaatcccccc ttacaggata tgttcggttc
      541 atgaatgagc gtcgagaaca acttcgagca aagagaccag aagtcccatt tccagaaatc
      601 acaaggatgt taggcaatga atggagtaaa ctgcctcctg aggaaaaaca gcgctacctt
      661 gatgaagcag acagagataa ggagcgttac atgaaggaac tggaacagta tcagaaaaca
      721 gaggcctaca aggtcttcag taggaaaacc caggaccgtc agaaaggcaa atctcatagg
      781 caagatgcag cccggcaggc cactcatgat catgagaaag aaacagaggt aaaggaacgg
      841 tctgtttttg acatccctat atttacagag gaattcttga accatagcaa agctcgggaa
      901 gcagagctcc gccagcttcg caaatccaac atggagtttg aggagaggaa tgcagccctg
      961 caaaagcacg tggagagcat gcgcacagca gtggagaagc tggaggtgga tgtgatccag
     1021 gagcggagcc gcaacacagt cttacagcag cacctggaga ccctgcggca ggtgctgacc
     1081 agcagctttg ccagcatgcc cttgcctgga agtggagaga cacctacagt ggacaccatt
     1141 gactcatata tgaacagact gcacagtatt attttagcta atccccaaga caatgaaaac
     1201 ttcatagcta cagttcgaga agttgtgaac agactcgatc gttagggaat ggtcttagaa
     1261 ctccaagatg ttccataagt gtttttactt gtgaggaatg agaagccatc catggaaatt
     1321 tgaactgagt gggggcagag aaagagtgca gatccctttg cttgtgaaag aattatcagt
     1381 gagtgaaagg ccatcacccc aggaagccaa atgagggagc agcaacatgt atatgagctt
     1441 cctatggaat tgtccttatg tgaagctttg aaggtgtaca gccactctcc cgggtcttca
     1501 ggttcctacc atttccattt ctgttaaagt ggatctgcat atcttcagct tactaggtga
     1561 cccggatgct gacatctgct gctgcagaaa ggaagacttt tcattgtaat ttcgcttaga
     1621 cccttttatc agtggagctc cagttttctt acctagctgt cactttttta aatgcctctg
     1681 ggggttattt ttgctttcct tggcccccac caatttatac atctccattt tctgacctct
     1741 ggactaactg gttgctcagc aaggttctga aggagagttt cttgcattgg acaggcccag
     1801 tcttctccca tcattgccct gctgtgactc caaagaaagg agcttcttgc tgacagtgcc
     1861 ctgtggagca aggctgtgtt tcctacccca cacggtgctc agtgggtgcc agccctcagt
     1921 gtggctttgt gattgctgcc ctaaaggaga atgctctttc cttcctcact ggtactgcct
     1981 gctgttttct aagcattgct cctgcacaga catggagtcc cagccccagc aaggctcttc
     2041 tgttcccatc tgttgacaat gtcttgtgga gcatttttgc tgaggaaaag gtcacttgta
     2101 aacagaggag aaagggaaag agtacaaagc cctaagttta ttgtaagtga aaactgaggg
     2161 aattcctgtc ttctttagga gtaatgattc atagatctag ataggtggaa atatcattca
     2221 aaatagtcac ttgagctcac aaaaaaagca aggaagaatt ctcatgtcct ttgtcttcct
     2281 tctgtagcca ttaactgctg aatccatgtg aggaagacag gcttcccttc cttccccctc
     2341 cttagtgatt ttttctttaa cagcataagt aaagaggact ttctggttca tttttgtttg
     2401 ttttgttttg ttttgttttg tttacagatg aggtcttgct gtgttgccca ggctggagtg
     2461 cggtggctat tcacagatgc tatcatagca cactacagcc tccaactctt gggctcaagc
     2521 atcacgccta gcagtttctg gttcctttaa cagcaaaagg aaagagaggt tctgattctt
     2581 acctcagggt tttttggttg ttcattgttt ttgtttttgt ttttgttttg acactgcaga
     2641 gcacaaggct aaaggttaca gctgagatct ttggaaccaa aggcagagca agcagagccc
     2701 gttgtctggg ccccacacca ctgcaggcag gtggatagaa gtgcggcccc tctcatagta
     2761 tgcccataag tcagggcata gggcagaact acctgtcatg ttgctacacc atcctgtctt
     2821 ctcagcatct ccttgcctgt tttctttatt agtccaaagg aaaacaacag caacaaaatc
     2881 tgtttttaaa atgtcttata tgaacatata tcaaatatcc atgcgctgaa acccacatac
     2941 catcacttgg caatttttta gaataagacc ccattattat ctattgctat aaacctagcc
     3001 agttctcttg ctcttctgta ttttcctatt tccctgccat catctgctat ttctgccact
     3061 tctcttagac tccttgtctg caaagcccaa gctagaactc actgtctatg gcagaaggac
     3121 atccagagcc cattctggag ttttgttttt tccttctgcc agatgctttg tgtcctgtct
     3181 tccttcctcc tcatatttct gtttctcatt tgtgttcagt tttgtgcagc attgctagca
     3241 ctgcttttgt gaccagaaaa ggccataaca tggtccagga tcatcattct tctgactcta
     3301 gatgggacac ttgacagtga cttgaaacat ttgcatattc aggaatgcat gagatttcaa
     3361 gagagcctac agtatgaaat cattttcaca aaataagcag cttgcttctg aaatgctgtc
     3421 tttcccagta gctactcacc tgcctctggt ggctgggatt cagatgccac aaaactgtca
     3481 gtatctatag accaggtctg tgccacctcc tctctcctct gtgctcagtg aggaggcagt
     3541 aaatgaagtt acaggctagc acaataccta attcatgttt cccagtacac ctgtagatat
     3601 tactgtactt ttatgttctc aagaaataag ttgttgccta ttcagtgtta cagatttctt
     3661 tgtttctttt taattaaaat acaagaagca gctgaggaaa gggagacaag gtattttatt
     3721 tctgactgat tttagaaaaa acttgtgtac atgtgtttgg aactgttgaa atgccaagtt
     3781 ttctgtataa gtgtttttgt aattaaactt tcagattttc tttgtttttt aagaagttga
     3841 tgtgcttgtt tgacatttgt ctcattaaaa cttttctacg ttgaaaaaaa aaaaaaaa
//