LOCUS BC022329 2679 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens HMG-box transcription factor 1, mRNA (cDNA clone
MGC:22757 IMAGE:4278438), complete cds.
ACCESSION BC022329
VERSION BC022329.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2679)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2679)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC022329.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: CLONTECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 35 Row: k Column: 9
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis, Similarity but not identity to protein.
FEATURES Location/Qualifiers
source 1..2679
/db_xref="H-InvDB:HIT000039476"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:22757 IMAGE:4278438"
/tissue_type="Brain, primitive neuroectodermal"
/clone_lib="NIH_MGC_56"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..2679
/gene="HBP1"
/db_xref="GeneID:26959"
/db_xref="HGNC:HGNC:23200"
CDS 16..1560
/gene="HBP1"
/codon_start=1
/product="HMG-box transcription factor 1"
/protein_id="AAH22329.1"
/db_xref="GeneID:26959"
/db_xref="HGNC:HGNC:23200"
/translation="MVWEVKTNQMPNAVQKLLLVMDKRASGMNDSLELLQCNENLPSS
PGYNSCDEHMELDDLPELQAVQSDPTQSGMYQLSSDVSHQEYPRSSWNQNTSDIPETT
YRENEVDWLTELANIATSPQSPLMLCSFYNRSSPVHIIATSKSLHSYARPPPVSSSSK
SEPAFPHHHWKEETPVRHERANSESESGIFCMSSLSDDDDLGWCNSWPSTVWHCFLKG
TRLCFHKGSNKEWQDVEDFARAEGCDNEEDLQMGIHKGYGSDGLKLLSHEESVSFGES
VLKLTFDPGTVEDGLLTVECKLDHPFYVKNKGWSSFYPSLTVVQHGIPCCEVHIGDVC
LPPGHPDAINFDDPGVFDTFKSYDFTPMDSSAVYVLSSMARQRRASLSCGGPGGQDFA
RSGFSKNCGSPGSSQLSSNSLYAKAVKNHSSGTVSATSPNKCKRPMNAFMLFAKKYRV
EYTQMYPGKDNRAISVILGDRWKKMKNEERRMYTLEAKALAEEQKRLNPDCWKRKRTN
SGSQQH"
BASE COUNT 818 a 523 c 560 g 778 t
ORIGIN
1 tcagagcacc ataacatggt gtgggaagtg aagacaaatc agatgcctaa tgcagtacag
61 aaactcctgt tggtgatgga caagagagcc tcaggaatga atgactcatt ggagttgctg
121 cagtgtaatg agaatttgcc atcttcacct ggatataact cctgtgatga acacatggag
181 cttgatgacc ttcctgaact tcaggcagtt caaagtgatc ctacccaatc tggcatgtac
241 cagctgagtt cagatgtttc acatcaagaa tacccaagat catcttggaa ccaaaatacc
301 tcagacatac cagaaactac ttaccgtgaa aatgaggtgg actggctaac agaattggca
361 aatatcgcga ccagtccaca aagtccactg atgctgtgct cattttacaa tagatcatct
421 cctgtacaca tcatagccac tagcaaaagt ttacattcct atgcacgccc tccaccagtg
481 tcctcttctt cgaagagtga accagccttc cctcatcacc attggaagga ggaaacacca
541 gtaagacacg aaagggcaaa tagtgagtca gaatctggca ttttctgcat gtcctccctg
601 tcagatgatg atgatttggg atggtgcaat tcctggcctt caactgtctg gcactgtttt
661 ttgaaaggca cacgactgtg ctttcataag ggaagcaata aggaatggca agatgttgaa
721 gattttgcta gagctgaagg ctgtgataat gaggaagatc ttcaaatggg cattcacaag
781 ggctatggtt ctgatggtct aaagttgtta tcacatgaag aaagtgtatc atttggcgag
841 tctgtactga agttgacttt tgatcctggt acagtagaag atggtttact taccgtagag
901 tgtaagctgg accacccttt ctatgttaaa aataaaggtt ggtcatcatt ttatccaagc
961 ttgactgtgg tacagcatgg cattccatgt tgtgaagttc atattggcga tgtatgtcta
1021 cctcctggac accccgatgc cattaatttt gatgatccag gtgtttttga tacatttaaa
1081 agctatgact tcacacctat ggattcttct gcagtttatg tgttaagtag tatggctcgc
1141 cagcgtcgtg catctttgtc ttgtggagga cctggtggtc aagactttgc aagatctgga
1201 ttcagtaaaa actgtggctc acctggatca tcacagctct cttccaattc tttgtatgct
1261 aaagctgtca aaaaccacag ctcagggact gtgagtgcca cttctcctaa taagtgcaaa
1321 agaccaatga atgccttcat gctttttgcc aaaaaataca gagttgaata tactcagatg
1381 tatccaggga aagataacag agccataagt gtgatccttg gtgacaggtg gaagaaaatg
1441 aagaatgaag agagaagaat gtacacatta gaagcaaagg ctttggctga agaacagaaa
1501 cgtttaaatc ctgactgttg gaagaggaaa agaaccaatt caggctcaca acaacattaa
1561 accaggatgc ttatgttctt aagtctatat ttgcatatac attgactctt gatggaaaga
1621 cttaagaaga tcaaggtctc accatttgtc ctcaattcgt gtgaccataa gatactgata
1681 gcattgagtc ttgaaatgat ttaataatat gagtgaggat ttgctttctc cattagagca
1741 ttaagctaaa actatcaaca ttttaaacca aattgcctta tttttcttcc aaacttcata
1801 tatgtctatc aggtaataat aggcttgaaa attgatatcc tgtggtgcta aagtacagta
1861 gaaagagagg agaagtgtat acatgtttta ttttaaattg tacgaaaggg gaatttaaaa
1921 aatatgtaac tgctgtttat acattggctc cttactgctt attaatccgt attgtacaca
1981 tgatgaaatg aagcagaagc tgggagtcgg cctttcctct agtaaccacc acatggctca
2041 gcatctgtgc caaacatagg cgctcctagt ctggtcagtg ccaagaggct accagaacat
2101 ggggcaggtg gctggtgttg gtgtcccagc ctaagagcca cctgctgcag ttaccatggc
2161 atgctgagtt gatgcaccag gtggcagcag ccatccgtta ttatttccaa tggagaccta
2221 gcccaggcca aggtaaagtt agttaatagc attgggatat agtcactgta atggtgctat
2281 taacaaacag tcaacaccat tgtatttttt aacttcgtgt tctgtatctc ctcagccatg
2341 tatcttaaat atattttgtc atcataatct ttatggtggg ggcagacttt gcacttactg
2401 cagtgcaaca cttgcacttt aattttcctc caactgtcta aaattagagc aaatacattg
2461 gcaatacagc tgcttttgct ctgagctaca atcatggctt ttcatgttac ttaccaagtg
2521 gtgtttctgg ttaggaatca cagttgtaaa attgatttca gttcatcaca cttcttcatg
2581 atgttgcccc taaattttgc acactatatt cttgtatatt atttcaaata aatgggaaaa
2641 aaagttacca aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//