LOCUS BC021959 3898 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens high-mobility group 20A, mRNA (cDNA clone MGC:8813
IMAGE:3908842), complete cds.
ACCESSION BC021959
VERSION BC021959.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3898)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3898)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (22-JAN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 15 Row: d Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 21359925.
FEATURES Location/Qualifiers
source 1..3898
/db_xref="H-InvDB:HIT000039314"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:8813 IMAGE:3908842"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3898
/gene="HMG20A"
/gene_synonym="FLJ10739"
/gene_synonym="HMGX1"
/db_xref="GeneID:10363"
/db_xref="HGNC:HGNC:5001"
/db_xref="MIM:605534"
CDS 202..1245
/gene="HMG20A"
/gene_synonym="FLJ10739"
/gene_synonym="HMGX1"
/codon_start=1
/product="high-mobility group 20A"
/protein_id="AAH21959.1"
/db_xref="GeneID:10363"
/db_xref="HGNC:HGNC:5001"
/db_xref="MIM:605534"
/translation="MENLMTSSTLPPLFADEDGSKESNDLATTGLNHPEVPYSSGATS
STNNPEFVEDLSQGQLLQSESSNAAEGNEQRHEDEQRSKRGGWSKGRKRKKPLRDSNA
PKSPLTGYVRFMNERREQLRAKRPEVPFPEITRMLGNEWSKLPPEEKQRYLDEADRDK
ERYMKELEQYQKTEAYKVFSRKTQDRQKGKSHRQDAARQATHDHEKETEVKERSVFDI
PIFTEEFLNHSKAREAELRQLRKSNMEFEERNAALQKHVESMRTAVEKLEVDVIQERS
RNTVLQQHLETLRQVLTSSFASMPLPGSGETPTVDTIDSYMNRLHSIILANPQDNENF
IATVREVVNRLDR"
BASE COUNT 1062 a 859 c 866 g 1111 t
ORIGIN
1 cgagtgcgtg aagtgaaggc gattgagagg ggctgaggga attgtcctct gtggaaggga
61 ctttcttttg gccctaggcc ccttcctgcc cctgtcgtca gcagagtctc tacaaggaag
121 ataacggact gtaaaattct ataaagcaaa gctacacatc acttgacacc atacaccatc
181 ttggttacat aatgaagaga gatggaaaac ttgatgacta gctccaccct accgcccctt
241 tttgcagatg aagacggttc caaggagagt aatgatctgg ctaccactgg gttaaatcac
301 ccagaggttc catacagtag tggcgccaca tcatccacca acaatccaga atttgtggag
361 gatctctctc aaggtcagtt gcttcagagt gagtcttcaa atgcagcaga aggcaatgaa
421 cagaggcatg aagatgagca acgaagtaaa cgaggaggtt ggtccaaagg aagaaagagg
481 aagaaacctc ttcgagacag caatgcaccc aaatcccccc ttacaggata tgttcggttc
541 atgaatgagc gtcgagaaca acttcgagca aagagaccag aagtcccatt tccagaaatc
601 acaaggatgt taggcaatga atggagtaaa ctgcctcctg aggaaaaaca gcgctacctt
661 gatgaagcag acagagataa ggagcgttac atgaaggaac tggaacagta tcagaaaaca
721 gaggcctaca aggtcttcag taggaaaacc caggaccgtc agaaaggcaa atctcatagg
781 caagatgcag cccggcaggc cactcatgat catgagaaag aaacagaggt aaaggaacgg
841 tctgtttttg acatccctat atttacagag gaattcttga accatagcaa agctcgggaa
901 gcagagctcc gccagcttcg caaatccaac atggagtttg aggagaggaa tgcagccctg
961 caaaagcacg tggagagcat gcgcacagca gtggagaagc tggaggtgga tgtgatccag
1021 gagcggagcc gcaacacagt cttacagcag cacctggaga ccctgcggca ggtgctgacc
1081 agcagctttg ccagcatgcc cttgcctgga agtggagaga cacctacagt ggacaccatt
1141 gactcatata tgaacagact gcacagtatt attttagcta atccccaaga caatgaaaac
1201 ttcatagcta cagttcgaga agttgtgaac agactcgatc gttagggaat ggtcttagaa
1261 ctccaagatg ttccataagt gtttttactt gtgaggaatg agaagccatc catggaaatt
1321 tgaactgagt gggggcagag aaagagtgca gatccctttg cttgtgaaag aattatcagt
1381 gagtgaaagg ccatcacccc aggaagccaa atgagggagc agcaacatgt atatgagctt
1441 cctatggaat tgtccttatg tgaagctttg aaggtgtaca gccactctcc cgggtcttca
1501 ggttcctacc atttccattt ctgttaaagt ggatctgcat atcttcagct tactaggtga
1561 cccggatgct gacatctgct gctgcagaaa ggaagacttt tcattgtaat ttcgcttaga
1621 cccttttatc agtggagctc cagttttctt acctagctgt cactttttta aatgcctctg
1681 ggggttattt ttgctttcct tggcccccac caatttatac atctccattt tctgacctct
1741 ggactaactg gttgctcagc aaggttctga aggagagttt cttgcattgg acaggcccag
1801 tcttctccca tcattgccct gctgtgactc caaagaaagg agcttcttgc tgacagtgcc
1861 ctgtggagca aggctgtgtt tcctacccca cacggtgctc agtgggtgcc agccctcagt
1921 gtggctttgt gattgctgcc ctaaaggaga atgctctttc cttcctcact ggtactgcct
1981 gctgttttct aagcattgct cctgcacaga catggagtcc cagccccagc aaggctcttc
2041 tgttcccatc tgttgacaat gtcttgtgga gcatttttgc tgaggaaaag gtcacttgta
2101 aacagaggag aaagggaaag agtacaaagc cctaagttta ttgtaagtga aaactgaggg
2161 aattcctgtc ttctttagga gtaatgattc atagatctag ataggtggaa atatcattca
2221 aaatagtcac ttgagctcac aaaaaaagca aggaagaatt ctcatgtcct ttgtcttcct
2281 tctgtagcca ttaactgctg aatccatgtg aggaagacag gcttcccttc cttccccctc
2341 cttagtgatt ttttctttaa cagcataagt aaagaggact ttctggttca tttttgtttg
2401 ttttgttttg ttttgttttg tttacagatg aggtcttgct gtgttgccca ggctggagtg
2461 cggtggctat tcacagatgc tatcatagca cactacagcc tccaactctt gggctcaagc
2521 atcacgccta gcagtttctg gttcctttaa cagcaaaagg aaagagaggt tctgattctt
2581 acctcagggt tttttggttg ttcattgttt ttgtttttgt ttttgttttg acactgcaga
2641 gcacaaggct aaaggttaca gctgagatct ttggaaccaa aggcagagca agcagagccc
2701 gttgtctggg ccccacacca ctgcaggcag gtggatagaa gtgcggcccc tctcatagta
2761 tgcccataag tcagggcata gggcagaact acctgtcatg ttgctacacc atcctgtctt
2821 ctcagcatct ccttgcctgt tttctttatt agtccaaagg aaaacaacag caacaaaatc
2881 tgtttttaaa atgtcttata tgaacatata tcaaatatcc atgcgctgaa acccacatac
2941 catcacttgg caatttttta gaataagacc ccattattat ctattgctat aaacctagcc
3001 agttctcttg ctcttctgta ttttcctatt tccctgccat catctgctat ttctgccact
3061 tctcttagac tccttgtctg caaagcccaa gctagaactc actgtctatg gcagaaggac
3121 atccagagcc cattctggag ttttgttttt tccttctgcc agatgctttg tgtcctgtct
3181 tccttcctcc tcatatttct gtttctcatt tgtgttcagt tttgtgcagc attgctagca
3241 ctgcttttgt gaccagaaaa ggccataaca tggtccagga tcatcattct tctgactcta
3301 gatgggacac ttgacagtga cttgaaacat ttgcatattc aggaatgcat gagatttcaa
3361 gagagcctac agtatgaaat cattttcaca aaataagcag cttgcttctg aaatgctgtc
3421 tttcccagta gctactcacc tgcctctggt ggctgggatt cagatgccac aaaactgtca
3481 gtatctatag accaggtctg tgccacctcc tctctcctct gtgctcagtg aggaggcagt
3541 aaatgaagtt acaggctagc acaataccta attcatgttt cccagtacac ctgtagatat
3601 tactgtactt ttatgttctc aagaaataag ttgttgccta ttcagtgtta cagatttctt
3661 tgtttctttt taattaaaat acaagaagca gctgaggaaa gggagacaag gtattttatt
3721 tctgactgat tttagaaaaa acttgtgtac atgtgtttgg aactgttgaa atgccaagtt
3781 ttctgtataa gtgtttttgt aattaaactt tcagattttc tttgtttttt aagaagttga
3841 tgtgcttgtt tgacatttgt ctcattaaaa cttttctacg ttgaaaaaaa aaaaaaaa
//