LOCUS BC022329 2679 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens HMG-box transcription factor 1, mRNA (cDNA clone MGC:22757 IMAGE:4278438), complete cds. ACCESSION BC022329 VERSION BC022329.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2679) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2679) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-FEB-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Sep 16, 2003 this sequence version replaced BC022329.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: CLONTECH Laboratories, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 35 Row: k Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, Similarity but not identity to protein. FEATURES Location/Qualifiers source 1..2679 /db_xref="H-InvDB:HIT000039476" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:22757 IMAGE:4278438" /tissue_type="Brain, primitive neuroectodermal" /clone_lib="NIH_MGC_56" /lab_host="DH10B" /note="Vector: pDNR-LIB" gene 1..2679 /gene="HBP1" /db_xref="GeneID:26959" /db_xref="HGNC:HGNC:23200" CDS 16..1560 /gene="HBP1" /codon_start=1 /product="HMG-box transcription factor 1" /protein_id="AAH22329.1" /db_xref="GeneID:26959" /db_xref="HGNC:HGNC:23200" /translation="MVWEVKTNQMPNAVQKLLLVMDKRASGMNDSLELLQCNENLPSS PGYNSCDEHMELDDLPELQAVQSDPTQSGMYQLSSDVSHQEYPRSSWNQNTSDIPETT YRENEVDWLTELANIATSPQSPLMLCSFYNRSSPVHIIATSKSLHSYARPPPVSSSSK SEPAFPHHHWKEETPVRHERANSESESGIFCMSSLSDDDDLGWCNSWPSTVWHCFLKG TRLCFHKGSNKEWQDVEDFARAEGCDNEEDLQMGIHKGYGSDGLKLLSHEESVSFGES VLKLTFDPGTVEDGLLTVECKLDHPFYVKNKGWSSFYPSLTVVQHGIPCCEVHIGDVC LPPGHPDAINFDDPGVFDTFKSYDFTPMDSSAVYVLSSMARQRRASLSCGGPGGQDFA RSGFSKNCGSPGSSQLSSNSLYAKAVKNHSSGTVSATSPNKCKRPMNAFMLFAKKYRV EYTQMYPGKDNRAISVILGDRWKKMKNEERRMYTLEAKALAEEQKRLNPDCWKRKRTN SGSQQH" BASE COUNT 818 a 523 c 560 g 778 t ORIGIN 1 tcagagcacc ataacatggt gtgggaagtg aagacaaatc agatgcctaa tgcagtacag 61 aaactcctgt tggtgatgga caagagagcc tcaggaatga atgactcatt ggagttgctg 121 cagtgtaatg agaatttgcc atcttcacct ggatataact cctgtgatga acacatggag 181 cttgatgacc ttcctgaact tcaggcagtt caaagtgatc ctacccaatc tggcatgtac 241 cagctgagtt cagatgtttc acatcaagaa tacccaagat catcttggaa ccaaaatacc 301 tcagacatac cagaaactac ttaccgtgaa aatgaggtgg actggctaac agaattggca 361 aatatcgcga ccagtccaca aagtccactg atgctgtgct cattttacaa tagatcatct 421 cctgtacaca tcatagccac tagcaaaagt ttacattcct atgcacgccc tccaccagtg 481 tcctcttctt cgaagagtga accagccttc cctcatcacc attggaagga ggaaacacca 541 gtaagacacg aaagggcaaa tagtgagtca gaatctggca ttttctgcat gtcctccctg 601 tcagatgatg atgatttggg atggtgcaat tcctggcctt caactgtctg gcactgtttt 661 ttgaaaggca cacgactgtg ctttcataag ggaagcaata aggaatggca agatgttgaa 721 gattttgcta gagctgaagg ctgtgataat gaggaagatc ttcaaatggg cattcacaag 781 ggctatggtt ctgatggtct aaagttgtta tcacatgaag aaagtgtatc atttggcgag 841 tctgtactga agttgacttt tgatcctggt acagtagaag atggtttact taccgtagag 901 tgtaagctgg accacccttt ctatgttaaa aataaaggtt ggtcatcatt ttatccaagc 961 ttgactgtgg tacagcatgg cattccatgt tgtgaagttc atattggcga tgtatgtcta 1021 cctcctggac accccgatgc cattaatttt gatgatccag gtgtttttga tacatttaaa 1081 agctatgact tcacacctat ggattcttct gcagtttatg tgttaagtag tatggctcgc 1141 cagcgtcgtg catctttgtc ttgtggagga cctggtggtc aagactttgc aagatctgga 1201 ttcagtaaaa actgtggctc acctggatca tcacagctct cttccaattc tttgtatgct 1261 aaagctgtca aaaaccacag ctcagggact gtgagtgcca cttctcctaa taagtgcaaa 1321 agaccaatga atgccttcat gctttttgcc aaaaaataca gagttgaata tactcagatg 1381 tatccaggga aagataacag agccataagt gtgatccttg gtgacaggtg gaagaaaatg 1441 aagaatgaag agagaagaat gtacacatta gaagcaaagg ctttggctga agaacagaaa 1501 cgtttaaatc ctgactgttg gaagaggaaa agaaccaatt caggctcaca acaacattaa 1561 accaggatgc ttatgttctt aagtctatat ttgcatatac attgactctt gatggaaaga 1621 cttaagaaga tcaaggtctc accatttgtc ctcaattcgt gtgaccataa gatactgata 1681 gcattgagtc ttgaaatgat ttaataatat gagtgaggat ttgctttctc cattagagca 1741 ttaagctaaa actatcaaca ttttaaacca aattgcctta tttttcttcc aaacttcata 1801 tatgtctatc aggtaataat aggcttgaaa attgatatcc tgtggtgcta aagtacagta 1861 gaaagagagg agaagtgtat acatgtttta ttttaaattg tacgaaaggg gaatttaaaa 1921 aatatgtaac tgctgtttat acattggctc cttactgctt attaatccgt attgtacaca 1981 tgatgaaatg aagcagaagc tgggagtcgg cctttcctct agtaaccacc acatggctca 2041 gcatctgtgc caaacatagg cgctcctagt ctggtcagtg ccaagaggct accagaacat 2101 ggggcaggtg gctggtgttg gtgtcccagc ctaagagcca cctgctgcag ttaccatggc 2161 atgctgagtt gatgcaccag gtggcagcag ccatccgtta ttatttccaa tggagaccta 2221 gcccaggcca aggtaaagtt agttaatagc attgggatat agtcactgta atggtgctat 2281 taacaaacag tcaacaccat tgtatttttt aacttcgtgt tctgtatctc ctcagccatg 2341 tatcttaaat atattttgtc atcataatct ttatggtggg ggcagacttt gcacttactg 2401 cagtgcaaca cttgcacttt aattttcctc caactgtcta aaattagagc aaatacattg 2461 gcaatacagc tgcttttgct ctgagctaca atcatggctt ttcatgttac ttaccaagtg 2521 gtgtttctgg ttaggaatca cagttgtaaa attgatttca gttcatcaca cttcttcatg 2581 atgttgcccc taaattttgc acactatatt cttgtatatt atttcaaata aatgggaaaa 2641 aaagttacca aaaaaaaaaa aaaaaaaaaa aaaaaaaaa //