LOCUS       BC121051                2294 bp    mRNA    linear   HUM 04-OCT-2006
DEFINITION  Homo sapiens heat shock transcription factor 2, mRNA (cDNA clone
            MGC:149748 IMAGE:40117979), complete cds.
ACCESSION   BC121051
VERSION     BC121051.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2294)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2294)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-AUG-2006) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Baylor Human Genome Sequencing Center
            cDNA Library Preparation: Baylor Human Genome Sequencing Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAM Plate: 27 Row: k Column: 10.
FEATURES             Location/Qualifiers
     source          1..2294
                     /db_xref="H-InvDB:HIT000388248"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:149748 IMAGE:40117979"
                     /tissue_type="PCR rescued clones"
                     /clone_lib="NIH_MGC_282"
                     /note="Vector: pCR-Blunt II-TOPO; Clone identification
                     sequence tag: TGCATTTA sequenced from the reverse primer"
     gene            1..2294
                     /gene="HSF2"
                     /db_xref="GeneID:3298"
                     /db_xref="HGNC:HGNC:5225"
                     /db_xref="MIM:140581"
     CDS             45..1655
                     /gene="HSF2"
                     /codon_start=1
                     /product="heat shock transcription factor 2"
                     /protein_id="AAI21052.1"
                     /db_xref="GeneID:3298"
                     /db_xref="HGNC:HGNC:5225"
                     /db_xref="MIM:140581"
                     /translation="MKQSSNVPAFLSKLWTLVEETHTNEFITWSQNGQSFLVLDEQRF
                     AKEILPKYFKHNNMASFVRQLNMYGFRKVVHIDSGIVKQERDGPVEFQHPYFKQGQDD
                     LLENIKRKVSSSKPEENKIRQEDLTKIISSAQKVQIKQETIESRLSELKSENESLWKE
                     VSELRAKHAQQQQVIRKIVQFIVTLVQNNQLVSLKRKRPLLLNTNGAQKKNLFQHIVK
                     EPTDNHHHKVPHSRTEGLKPRERISDDIIIYDVTDDNADEENIPVIPETNEDVISDPS
                     NCSQYPDIVIVEDDNEDEYAPVIQSGEQNEPARESLSSGSDGSSPLMSSAVQLNGSSS
                     LTSEDPVTMMDSILNDNINLLGKVELLDYLDSIDCSLEDFQAMLSGRQFSIDPDLLVD
                     LFTSSVQMNPTDYINNTKSENKGLETTKNNVVQPVSEEGRKSKSKPDKQLIQYTAFPL
                     LAFLDGNPASSVEQASTTASSEVLSSVDKPIEVDELLDSSLDPEPTQSKLVRLEPLTE
                     AEASEATLFYLCELAPAPLDSDMPLLDS"
BASE COUNT      713 a    429 c    468 g    684 t
ORIGIN
        1 ccgcgttcgg gtgtagaatt tggaatccct gcgccgcgtt aacaatgaag cagagttcga
       61 acgtgccggc tttcctcagc aagctgtgga cgcttgtgga ggaaacccac actaacgagt
      121 tcatcacctg gagccagaat ggccaaagtt ttctggtctt ggatgagcaa cgatttgcaa
      181 aagaaattct tcccaaatat ttcaagcaca ataatatggc aagctttgtg aggcaactga
      241 atatgtatgg tttccgtaaa gtagtacata tcgactctgg aattgtaaag caagaaagag
      301 atggtcctgt agaatttcag catccttact tcaaacaagg acaggatgac ttgttggaga
      361 acattaaaag gaaggtttca tcttcaaaac cagaagaaaa taaaattcgt caggaagatt
      421 taacaaaaat tataagtagt gctcagaagg ttcagataaa acaggaaact attgagtcca
      481 ggctttctga attaaaaagt gagaatgagt ccctttggaa ggaggtgtca gaattacgag
      541 caaagcatgc acaacagcaa caagttattc gaaagattgt ccagtttatt gttacattgg
      601 ttcaaaataa ccaacttgtg agtttaaaac gtaaaaggcc tctacttcta aacactaatg
      661 gagcccaaaa gaagaacctg tttcagcaca tagtcaaaga accaactgat aatcatcatc
      721 ataaagttcc acacagtagg actgaaggtt taaagccaag ggagaggatt tcagatgaca
      781 tcattattta tgatgttact gatgataatg cagatgaaga aaatatccca gttattccag
      841 aaactaatga ggatgttata tctgatccct ccaactgtag ccagtaccct gatattgtca
      901 tcgttgaaga tgacaatgaa gatgagtatg cacctgtcat tcagagtgga gagcagaatg
      961 aaccagccag agaatcccta agttcaggca gtgatggcag cagccctctc atgtctagtg
     1021 ctgtccagct aaatggctca tccagtctga cctcagaaga tccagtgacc atgatggatt
     1081 ccattttgaa tgataacatc aatcttttgg gaaaggttga gctgttggat tatcttgaca
     1141 gtattgactg cagtttagag gacttccagg ccatgctatc aggaagacaa tttagcatag
     1201 acccagatct cctggttgat cttttcacta gttctgtgca gatgaatccc acagattaca
     1261 tcaataatac aaaatctgag aataaaggat tagaaactac caagaacaat gtagttcagc
     1321 cagtttcgga agagggaaga aaatctaaat ccaaaccaga taagcagctt atccagtata
     1381 ccgcctttcc acttcttgca ttcctcgatg ggaaccctgc ttcttctgtt gaacaggcga
     1441 gtacaacagc atcatcagaa gttttgtcct ctgtagataa acccatagaa gttgatgagc
     1501 ttctggatag cagcctagac ccagaaccaa cccaaagtaa gcttgttcgc ctggagccat
     1561 tgactgaagc tgaagctagt gaagctacac tgttttattt atgtgaactt gctcctgcac
     1621 ctctggatag tgatatgcca cttttagata gctaaatccc caggaagtgg actttacatg
     1681 tatatattca tcaaaatgat gaactattta ttttaaagta tcatttggta ctttttttgt
     1741 aaattgcttt gttttgttta atcagatact gtggaataaa agcacctttt gcttttctca
     1801 ctaaccacac actcttgcag agctttcagg tgttactcag ctgcatagtt acgcagatgt
     1861 aatgcacatt attggcgtat ctttaagttg gattcaaatg gccatttttc tccaattttg
     1921 gtaaattgga tatctttttt ttacaaatac gaccattaac ctcagttaaa tttttgtttg
     1981 ttttcctgtt tgatgctgtc tatttgcatt gagtgtaagt catttgaact aatggtataa
     2041 ctcctaaagc tttctctgct ccagttattt ttattaaata tttttcactt ggcttatttt
     2101 taaaactggg aacataaagt gcctgtatct tgtaaaactt catttgtttc ttttggttca
     2161 gagaagttca tttatgttca aagacgttta ttcatgttca acaggaaaga caaagtgtac
     2221 gtgaatgctc gctgtctgat agggttccag ctccatatat atagaaagat cgggggtggg
     2281 atgggatgga gtga
//