LOCUS BC051025 5272 bp mRNA linear HUM 18-MAR-2009
DEFINITION Homo sapiens HMG box domain containing 3, mRNA (cDNA clone
IMAGE:4914524), partial cds.
ACCESSION BC051025
VERSION BC051025.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5272)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5272)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (11-APR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: David N. Louis, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 93 Row: e Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: GenomeScan gene
prediction.
FEATURES Location/Qualifiers
source 1..5272
/db_xref="H-InvDB:HIT000099282"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4914524"
/tissue_type="Brain, anaplastic oligodendroglioma with
1p/19q loss"
/clone_lib="NCI_CGAP_Brn67"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..5272
/gene="HMGXB3"
/gene_synonym="SMF"
/db_xref="GeneID:22993"
/db_xref="HGNC:HGNC:28982"
CDS <1..4321
/gene="HMGXB3"
/gene_synonym="SMF"
/codon_start=2
/product="HMGXB3 protein"
/protein_id="AAH51025.1"
/db_xref="GeneID:22993"
/db_xref="HGNC:HGNC:28982"
/translation="SVVCRYCRCSRQTGASVTTVSLPSSSSSPGLDPRGPRQASVRSL
RSEPVLLFLPFRTPYRDSEEGKREGLSRLRAVCRRAGPRGRGSFSPRDARASPRLHFL
VAAVTTGAASRRQRGARVRQPSPSSSRRAKRLRECERRSLHAPPAMDASYDGTEVTVV
MEEIEEAYCYTSPGPPKKKKKYKIHGEKTKKPRSAYLLYYYDIYLKVQQELPHLPQSE
INKKISESWRLLSVAERSYYLEKAKLEKEGLDPNSKLSALTAVVPDIPGFRKILPRSD
YIIIPKSSLQEDRSCPQLELCVAQNQMSPKGPPLVSNTAPETVPSHAGMAEQCLAVEA
LAEEVGALTQSGAVQEIATSEILSQDVLLEDASLEVGESHQPYQTSLVIEETLVNGSP
DLPTGSLAVPHPQVGESVSVVTVMRDSSESSSSAPATQFIMLPLPAYSVVENPTSIKL
TTTYTRRGHGTCTSPGCSFTYVTRHKPPKCPTCGNFLGGKWIPKEKPAKVKVELASGV
SSKGSVVKRNQQPVTTEQNSSKENASKLTLENSEAVSQLLNVAPPREVGEESEWEEVI
ISDAHVLVKEAPGNCGTAVTKTPVVKSGVQPEVTLGTTDNDSPGADVPTPSEGTSTSS
PLPAPKKPTGVDLLTPGSRAPELKGRARGKPSLLAAARPMRAILPAPVNVGRGSSMGL
PRARQAFSLSDKTPSVRTCGLKPSTLKQLGQPIQQPSGPGEVKLPSGPSNRTSQVKVV
EVKPDMFPPYKYSCTVTLDLGLATSRGRGKCKNPSCSYVYTNRHKPRICPSCGVNLAK
DRTEKTTKAIEVSSPLPDVLNATEPLSTAQREIQRQSTLQLLRKVLQIPENESELAEV
FALIHELNSSRLILSNVSEETVTIEQTSWSNYYESPSTQCLLCSSPLFKGGQNSLAGP
QECWLLTASRLQTVTAQVKMCLNPHCLALHSFIDIYTGLFNVGNKLLVSLDLLFAIRN
QIKLGEDPRVSINVVLKSVQEQTEKTLTSEELSQLQELLCNGYWAFECLTVRDYNDMI
CGICGVAPKVEMAQRSEENVLALKSVEFTWPEFLGSNEVNVEDFWATMETEVIEQVAF
PASIPITKFDASVIAPFFPPLMRGAVVVNTEKDKNLDVQPVPGSGSALVRLLQEGTCK
LDEIGSYSEEKLQHLLRQCGIPFGAEDSKDQLCFSLLALYESVQNGARAIRPPRHFTG
GKIYKVCPHQVVCGSKYLVRGESARDHVDLLASSRHWPPVYVVDMATSVALCADLCYP
ELTNQMWGRNQGCFSSPTEPPVSVSCPELLDQHYTVDMTETEHSIQHPVTKTATRRIV
HAGLQPNPGDPSAGHHSLALCPELAPYATILASIVDSKPNGVRQRPIAFDNATHYYLY
NRLMDFLTSREIVNRQIHDIVQSCQPGEVVIRDTLYRLGVAQIKTETEEEGEEEEVAA
VAE"
BASE COUNT 1213 a 1480 c 1469 g 1110 t
ORIGIN
1 atccgtggtg tgccgctact gccggtgcag ccgccaaacc ggtgcctcgg tgacgaccgt
61 gtccctgccg tcttcctcga gctccccggg gcttgacccc cggggccctc ggcaggcatc
121 ggtgaggagc ctgcggagcg aacctgtgct cctattcttg cccttcagga ccccatatcg
181 cgactccgag gaggggaagc gagaggggct gtcgcgactc cgcgccgtgt gtcgccgggc
241 ggggccgcgg ggccggggct ccttcagccc ccgggatgcg cgcgcgagcc ctcgcctcca
301 cttccttgtt gctgctgtca cgactggagc cgcctctcgc cgacagcggg gagcgcgagt
361 gcgccagcca tccccctcgt ccagccgccg ggccaagcgc ctccgggaat gtgagcggcg
421 cagcttgcac gctcctccgg ccatggacgc atcatatgat ggtactgagg taactgtcgt
481 gatggaggaa attgaggaag cctattgtta cacctctcct gggccaccca agaagaagaa
541 aaagtataaa atacatggag aaaagacaaa gaaacccagg tctgcttacc ttctgtacta
601 ttacgacatc tacctgaaag tgcagcagga gctcccccac ctccctcagt ctgagatcaa
661 taagaagatt agtgagagtt ggaggcttct cagcgtggcc gagaggagtt actacttgga
721 gaaagccaaa ctagagaagg aaggtttgga tcctaactct aagctctctg cactgactgc
781 tgtggttccg gacatcccag gtttccgcaa gatcctccca cgctcagatt atatcatcat
841 ccccaagagc agcctgcagg aggaccggag ctgccctcag ctagagctat gtgtggctca
901 gaaccagatg tccccgaaag gacctcctct tgtgtccaac actgccccgg agacagtgcc
961 cagccatgca ggcatggcag agcagtgcct ggctgtggag gccctggctg aggaggtggg
1021 agcccttacc cagtcaggtg ctgtacagga gattgccacc tcagagatcc tcagccagga
1081 tgtgctccta gaggacgctt ccctagaagt aggggagagc caccaacctt accagacaag
1141 cctggtaatt gaagagacct tggtgaatgg ctcaccagac ctccccactg gaagcctggc
1201 tgtgccccac ccccaggttg gggagagtgt atcagtggta acagtcatga gggattccag
1261 tgagagtagc tcctctgcac cagccacaca gttcatcatg ttgcctctgc ctgcctactc
1321 ggttgtggag aaccccacct ccatcaaact gaccactaca tatacccgcc ggggccatgg
1381 gacatgcacc agcccagggt gctcctttac atatgtcacc aggcacaagc cacctaagtg
1441 ccctacctgt ggtaacttcc taggagggaa gtggatccca aaggaaaagc cagccaaagt
1501 aaaagtggaa ttggcttctg gcgtctcttc caaaggctct gtggtgaaaa gaaatcagca
1561 acctgtcacc actgagcaaa attcctctaa ggaaaatgcc tccaaactga ctctggagaa
1621 ttcggaagct gtaagccagc tcctgaacgt agctcctccc agagaagtag gtgaggagag
1681 tgagtgggag gaagtgatca tctccgatgc ccatgttttg gttaaggaag ctcccgggaa
1741 ttgtggtaca gcagtcacta agacgccagt cgtcaaaagt ggtgtgcagc ctgaggtcac
1801 tctggggaca actgacaatg acagtcctgg agcagacgta ccaacaccat ccgaggggac
1861 aagtacctcc agtccactcc ctgctcctaa aaaacctaca ggagttgacc tgcttacccc
1921 tgggtccaga gctccagagc ttaaaggcag agcacggggc aagccctcat tactggctgc
1981 agcaagaccc atgagagcaa ttttgccagc cccagttaac gtggggcgag gcagcagcat
2041 gggactgccc agggccaggc aggccttttc cctgagtgat aagactccct ctgtgaggac
2101 ttgtggtctg aagccaagca cactgaagca gctgggccag cccattcaac agccatctgg
2161 ccctggtgag gtgaagctac caagtggccc atccaacagg acttctcagg tgaaagttgt
2221 ggaggtcaag cccgatatgt tccctccata taagtacagc tgcactgtca cattggattt
2281 gggcctggct acatcaagag gccggggaaa gtgcaagaat ccctcttgta gctatgtcta
2341 caccaacagg cacaaacctc gaatttgtcc cagctgtggt gttaaccttg ccaaagaccg
2401 gactgagaaa accaccaagg ctatcgaggt gagctcacca ctcccagatg tactgaatgc
2461 cacagagccc ctgagcacag cccagaggga gatccagcgc cagtccacac tgcagctgct
2521 gcgcaaagtc ctgcagattc ctgagaatga gtcagagctg gctgaggtct tcgccttgat
2581 tcatgaactc aacagctctc gacttatctt gtccaacgtg agtgaggaga cagtcaccat
2641 cgagcaaacc tcttggtcga attattatga gtctccgtcc acgcagtgcc ttctctgtag
2701 cagcccatta ttcaaagggg gacaaaactc cctggctggg ccccaggagt gctggctgct
2761 gacagccagc cgtctgcaga cagtgactgc ccaggtgaag atgtgtctga acccccattg
2821 tctggccctg cacagcttca tagacatcta cacaggtctc tttaatgtgg ggaacaagct
2881 gctggtaagc ctggacttgc tttttgcaat cagaaatcag atcaagctcg gagaggaccc
2941 cagagtgtcc atcaatgttg ttctgaagtc ggtgcaggag cagacagaga agactctgac
3001 ctcggaggag ctgagccagc tgcaggagct gctgtgcaat ggctattggg cctttgagtg
3061 cctcactgtc cgagactaca atgacatgat ctgtggcatc tgtggtgtgg cccccaaagt
3121 ggaaatggct cagaggagtg aagagaatgt gctagcactg aagagcgtgg agttcacctg
3181 gcctgaattc ctgggctcta atgaggtaaa tgtggaggac ttttgggcca cgatggagac
3241 agaggtgatt gagcaggtgg catttcctgc cagcatccct atcaccaaat ttgatgcgtc
3301 tgttattgcc cccttcttcc caccactcat gagaggagct gtggtcgtca acactgagaa
3361 agacaaaaac ctggatgtgc agccagtacc tggcagtggc agtgccttgg tgaggctgct
3421 ccaggagggc acctgcaagc ttgatgagat tggctcctac agtgaagaga agctgcagca
3481 cctgctaagg cagtgtggaa tcccctttgg ggcagaagac tccaaggacc agctctgctt
3541 ctccttgttg gccctctacg aatctgtaca gaatggagct agagctatac ggcccccacg
3601 tcacttcaca ggtggtaaaa tctacaaggt gtgcccccat caggtggtct gcggctccaa
3661 gtatcttgtg cgaggtgaga gtgcccgtga ccatgtggac ctgcttgcct cttcccgcca
3721 ctggccgcct gtctatgtgg tagatatggc cacgtcagtg gccctgtgtg ctgacctctg
3781 ctacccagag ctgactaacc agatgtgggg gaggaaccag ggctgtttct ctagccccac
3841 agagccacct gtgagtgtgt cctgcccaga gctcttggac cagcattata ctgtggacat
3901 gacagaaact gagcactcta tccagcaccc agtcaccaag actgccacgc ggcgcatcgt
3961 ccatgcaggc ctacagccca atcctggtga ccccagtgct gggcaccact ccttggccct
4021 gtgccctgaa ttggcacctt acgcaaccat cctggcctcc atcgtggaca gcaaaccaaa
4081 cggtgtccgc cagcggccca ttgccttcga caatgccact cactattacc tctacaaccg
4141 cctcatggac ttcctcacca gccgcgaaat tgtcaatcgt cagatccatg acattgtaca
4201 gagctgccag cctggtgagg tggtcattcg tgacaccctc taccgccttg gggttgctca
4261 gatcaagaca gagacagagg aggagggtga ggaagaggag gtggccgcag tggcagaata
4321 agccaggctg ttgtacaggg actacaccat ctctcaagcc atagtaaggc ccttgcctga
4381 ggcagagcta tccaggggac ctgcagaagt ggtctcctgt ggggagggcc tctgactgct
4441 gggactgacc aaagagcttc cattccctga gcatggtggg acccagggtc ctcagttctc
4501 aaccctccag gggtcaggag tggtaccagg aaacctcttc tggccccgag agagcacttg
4561 ggggacacgg tatgtttaat ggaggggagg ctgagggaaa ggctggtagc tggtgggttc
4621 cgtggggcct gctgtgtggg tcagggtgga ggtcctgggt gggctaaggc gtgagcccca
4681 gcactaggtg ggaaggctgc tgaggtctct cccacccctg aggagccctg gtttcagccc
4741 cctcagtctg atgaattgct tagcctgttg cctttgacta ggggcctggg tggcctcatt
4801 aactctaggg gtccctttgg gctcttgatt ctccctgaag gagggatgca tttctctctt
4861 gctcttcctg tacccacatt tgggggaagc tgaggaggga ggaacagtca gccacagctc
4921 tcttccagca ctgtcctctc caccccaagc tttgaggaag agcatcccct tcctcctttc
4981 cctggccact gctgctgcag ccaatatcct ctctgggcct gggaccctct ccacagaggg
5041 gatgtggtcc ctggtcatga cataacctag cagcagtagg aaaaactccc ttctatgaag
5101 gggaagcaga ctgggccata aggaaacagc aggactggct caagtgccca aggtttgttt
5161 agggcctggg aattggccat gtgttaattt attgagtgga gtaggtggct ttttttccct
5221 ccctcttccc ccaacaagaa taaagtttat taaattaaaa aaaaaaaaaa aa
//