LOCUS BC011665 4211 bp mRNA linear HUM 23-DEC-2003 DEFINITION Homo sapiens transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47), mRNA (cDNA clone IMAGE:4110737), partial cds. ACCESSION BC011665 VERSION BC011665.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4211) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4211) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 19, 2003 this sequence version replaced BC011665.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 26 Row: b Column: 8. FEATURES Location/Qualifiers source 1..4211 /db_xref="H-InvDB:HIT000087686" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4110737" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..4211 /gene="TCF3" /gene_synonym="E2A" /gene_synonym="ITF1" /db_xref="GeneID:6929" /db_xref="MIM:147141" CDS <1..1778 /gene="TCF3" /gene_synonym="E2A" /gene_synonym="ITF1" /codon_start=3 /product="TCF3 protein" /protein_id="AAH11665.1" /db_xref="GeneID:6929" /db_xref="MIM:147141" /translation="QSSSSFDPSRTFSEGTHFTESHSSLSSSTFLGPGLGGKSGERGA YASFGRDAGVGGLTQAGFLSGELALNSPGPLSPSGMKGTSQYYPSYSGSSRRRAADGS LDTQPKKVRKVPPGLPSSVYPPSSGEDYGRDATAYPSAKTPSSTYPAPFYVADGSLHP SAELWSPPGQAGFGPMLGGGSSPLPLPPGSGPVGSSGSSSTFGGLHQHERMGYQLHGA EVNGGLPSASSFSSAPGATYGGVSSHTPPVSGADSLLGSRGTTAGSSGDALGKALASI YSPDHSSNNFSSSPSTPVGSPQGLAGTSQWPRAGAPGALSPSYDGGLHGLQSKIEDHL DEAIHVLRSHAVGTAGDMHTLLPGHGALASGFTGPMSLGGRHAGLVGGSHPEDGLAGS TSLMHNHAALPSQPGTLPDLSRPPDSYSGLGRAGATAAASEIKREEKEDEENTSAADH SEEEKKELKAPRARTSPDEDEDDLLPPEQKAEREKERRVANNARERLRVRDINEAFKE LGRMCQLHLNSEKPQTKLLILHQAVSVILNLEQQVRERNLNPKAACLKRREEEKVSGV VGDPQMVLSAPHPGLSEAHNPAGHM" misc_feature 1461..1622 /gene="TCF3" /gene_synonym="E2A" /gene_synonym="ITF1" /note="HLH; Region: Helix-loop-helix DNA-binding domain" /db_xref="CDD:pfam00010" BASE COUNT 853 a 1359 c 1180 g 819 t ORIGIN 1 accagagcag ctcctccttt gaccccagcc ggaccttcag cgagggcacc cacttcactg 61 agtcgcacag cagcctctct tcatccacat tcctgggacc gggactcgga ggcaagagcg 121 gtgagcgggg cgcctatgcc tccttcggga gagacgcagg cgtgggcggc ctgactcagg 181 ctggcttcct gtcaggcgag ctggccctca acagccccgg gcccctgtcc ccttcgggca 241 tgaaggggac ctcccagtac tacccctcct actccggcag ctcccggcgg agagcggcag 301 acggcagcct agacacgcag cccaagaagg tccggaaggt cccgccgggt cttccatcct 361 cggtgtaccc acccagctca ggtgaggact acggcaggga tgccaccgcc tacccgtccg 421 ccaagacccc cagcagcacc tatcccgccc ccttctacgt ggcagatggc agcctgcacc 481 cctcagccga gctctggagt cccccgggcc aggcgggctt cgggcccatg ctgggtgggg 541 gctcatcccc gctgcccctc ccgcccggta gcggcccggt gggcagcagt ggaagcagca 601 gcacgtttgg tggcctgcac cagcacgagc gtatgggcta ccagctgcat ggagcagagg 661 tgaacggtgg gctcccatct gcatcctcct tctcctcagc ccccggagcc acgtacggcg 721 gcgtctccag ccacacgccg cctgtcagcg gggccgacag cctcctgggc tcccgaggga 781 ccacagctgg cagctccggg gatgccctcg gcaaagcact ggcctcgatc tactccccgg 841 atcactcaag caataacttc tcgtccagcc cttctacccc cgtgggctcc ccccagggcc 901 tggcaggaac gtcacagtgg cctcgagcag gagcccccgg tgccttatcg cccagctacg 961 acgggggtct ccacggcctg cagagtaaga tagaagacca cctggacgag gccatccacg 1021 tgctccgcag ccacgccgtg ggcacagccg gcgacatgca cacgctgctg cctggccacg 1081 gggcgctggc ctcaggtttc accggcccca tgtcgctggg cgggcggcac gcaggcctgg 1141 ttggaggcag ccaccccgag gacggcctcg caggcagcac cagcctcatg cacaaccacg 1201 cggccctccc cagccagcca ggcaccctcc ctgacctgtc tcggcctccc gactcctaca 1261 gtgggctagg gcgagcaggt gccacggcgg ccgccagcga gatcaagcgg gaggagaagg 1321 aggacgagga gaacacgtca gcggctgacc actcggagga ggagaagaag gagctgaagg 1381 ccccccgggc ccggaccagc ccagacgagg acgaggacga ccttctcccc ccagagcaga 1441 aggccgagcg ggagaaggag cgccgggtgg ccaataacgc ccgggagcgg ctgcgggtcc 1501 gtgacatcaa cgaggccttt aaggagctgg ggcgcatgtg ccaactgcac ctcaacagcg 1561 agaagcccca gaccaaactg ctcatcctgc accaggctgt ctcggtcatc ctgaacttgg 1621 agcagcaagt gcgagagcgg aacctgaatc ccaaagcagc ctgtttgaaa cggcgagaag 1681 aggaaaaggt gtcaggtgtg gttggagacc cccagatggt gctttcagct ccccacccag 1741 gcctgagcga agcccacaac cccgccgggc acatgtgaaa ggtatgcctc cgtgggacga 1801 gccacccgct ttcagccctg tgctctggcc ccagaacggc cactcgagac cccgggcttc 1861 atccacatcc acacctcaca cacctgttgt cagcatcgag ccaacaccaa cctgacaagg 1921 ttcggagtga tgggggcggc caaggtgaca ctgggtccag gagctccctg gggccctggc 1981 ctaccactca ctggcctcgc tccccctgtc cccgaatctc agccaccgtg tcactctgtg 2041 acctgtccca tggatcctga aactgcatct tggccctgtt gcctgggctg acaggagcat 2101 tttttttttt tccagtaaac aaaacctgaa agcaagcaac aaaacataca ctttgtcaga 2161 gaagaaaaaa atgccttaac tataaaaagc ggagaaatgg aaacatatca ctcaaggggg 2221 atgctgtgga aacctggctt atttttttaa agccaccagc aaattgtgcc taagcgaaat 2281 atttttttta aggaaaataa aaacattagt tacaagattt tttttttttt aatgtagatg 2341 aaaattagca aggatgctgc ctttggtctc tggttttttt aagctttttt tgcatatgtt 2401 ttgtaagcaa caaatttttt tgtataaaag tcccgtgtct ctcgctattt ctgctgctgt 2461 tcctagactg agcattgcat ttcttgatca accagatgat taaacgttgt attaaaaaga 2521 ccccgtgtaa acctgagccc ccccgtcccc cccccccccc cggaagccac tgcacacaga 2581 cagaacgggg acaggcggcg ggtcttttgt ttttttgatg ttgggggttc tcttggtttt 2641 gtcatgtgga aagtgatgcg tgggcgttcc ctgatgaagg caccttgggg cttccctgcc 2701 gcatcctctc ccctcaggaa ggggactgac ctgggcttgg gggaagggac gtcagcaagg 2761 tggctctgac cctcccaggt gactctgcca agcagctgtg gcccccaggg ctaccctaca 2821 caacgccctc cccaggcccc cctaagctgc tctcccttgg aacctgcaca gctctctgaa 2881 atggggcatt ttgttgggac cagtgacccc tggcatgggg accacaccct ggagcccggt 2941 gctggggacc tcctggacac cctgtccttc actcctttgc cccagggacc caggctcatg 3001 ctctgaactc tggctgagag gatgctgctc aggagccagc acaggacacc ccccacccca 3061 ccccaccatg tccccattac accagagggc catcgtgacg tagacaggat gccaggggcc 3121 tggccagcct cccccaatgc tggggagcat ccctgggcct ggggccacac ctgctgccct 3181 ccctctgtgt ggtccaaggg caagagtggc tggagccggg ggactgtgct ggtctgagcc 3241 ccacgaaggc cttgggctgt gcgtccgacc ctgctgcaga accagcaggg tgtcccctcg 3301 ggcccatctg tgtcccatgt cccagcaccc aggcctctct ccaggtctcc ttttctggtc 3361 ttttgccatg agggtaacca gctcttccca gctggctggg gactgtcttg ggtttaaaac 3421 tgcaagtctc ctaccctggg atcccatcca gttccacacg aactagggca gtggtcactg 3481 tggcacccag gtgtgggcct ggctagctgg gggccttcat gtgcccttca tgcccctccc 3541 tgcattgagg ccttgtggac ccctgggctg gctgtgttca tccccgctgc aggtcgggcg 3601 tctccccccg tgccactcct gagactccca ccgttacccc caggagatcc tggactgcct 3661 gactcccctc cccagactgg cttgggagcc tgggccccat ggtagatgca agggaaacct 3721 caaggccagc tcaatgcctg gtatctgccc ccagtccagg ccaggcggag gggaggggct 3781 gtccggctgc ctctcccttc tcggtggctt cccctacgcc ctgggagttt gatctcttaa 3841 gggaacttgc ctctccctct tgttttgctc ctggccctgc ccctaggtct gggtgggcag 3901 tggccccata gcctctggaa ctgtgcgttc tgcatagaat tcaaacgaga ttcacccagc 3961 gcgaggagga agaaacagca gttcctggga accacaatta tggggggtgg ggggtgtgat 4021 ctgagtgcct caagatggtt ttcaaaaaaa tttttttaaa gaaaataatt gtatacgtgt 4081 caacacagct ggctggatga ttgggacttt aaaacgaccc tctttcaggt ggattcagag 4141 acctgtcctg tatataacag cactgtagca ataaacgtga cattttataa cgaaaaaaaa 4201 aaaaaaaaaa a //