LOCUS BC011665 4211 bp mRNA linear HUM 23-DEC-2003
DEFINITION Homo sapiens transcription factor 3 (E2A immunoglobulin enhancer
binding factors E12/E47), mRNA (cDNA clone IMAGE:4110737), partial
cds.
ACCESSION BC011665
VERSION BC011665.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4211)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4211)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Dec 19, 2003 this sequence version replaced BC011665.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 26 Row: b Column: 8.
FEATURES Location/Qualifiers
source 1..4211
/db_xref="H-InvDB:HIT000087686"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4110737"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene <1..4211
/gene="TCF3"
/gene_synonym="E2A"
/gene_synonym="ITF1"
/db_xref="GeneID:6929"
/db_xref="MIM:147141"
CDS <1..1778
/gene="TCF3"
/gene_synonym="E2A"
/gene_synonym="ITF1"
/codon_start=3
/product="TCF3 protein"
/protein_id="AAH11665.1"
/db_xref="GeneID:6929"
/db_xref="MIM:147141"
/translation="QSSSSFDPSRTFSEGTHFTESHSSLSSSTFLGPGLGGKSGERGA
YASFGRDAGVGGLTQAGFLSGELALNSPGPLSPSGMKGTSQYYPSYSGSSRRRAADGS
LDTQPKKVRKVPPGLPSSVYPPSSGEDYGRDATAYPSAKTPSSTYPAPFYVADGSLHP
SAELWSPPGQAGFGPMLGGGSSPLPLPPGSGPVGSSGSSSTFGGLHQHERMGYQLHGA
EVNGGLPSASSFSSAPGATYGGVSSHTPPVSGADSLLGSRGTTAGSSGDALGKALASI
YSPDHSSNNFSSSPSTPVGSPQGLAGTSQWPRAGAPGALSPSYDGGLHGLQSKIEDHL
DEAIHVLRSHAVGTAGDMHTLLPGHGALASGFTGPMSLGGRHAGLVGGSHPEDGLAGS
TSLMHNHAALPSQPGTLPDLSRPPDSYSGLGRAGATAAASEIKREEKEDEENTSAADH
SEEEKKELKAPRARTSPDEDEDDLLPPEQKAEREKERRVANNARERLRVRDINEAFKE
LGRMCQLHLNSEKPQTKLLILHQAVSVILNLEQQVRERNLNPKAACLKRREEEKVSGV
VGDPQMVLSAPHPGLSEAHNPAGHM"
misc_feature 1461..1622
/gene="TCF3"
/gene_synonym="E2A"
/gene_synonym="ITF1"
/note="HLH; Region: Helix-loop-helix DNA-binding domain"
/db_xref="CDD:pfam00010"
BASE COUNT 853 a 1359 c 1180 g 819 t
ORIGIN
1 accagagcag ctcctccttt gaccccagcc ggaccttcag cgagggcacc cacttcactg
61 agtcgcacag cagcctctct tcatccacat tcctgggacc gggactcgga ggcaagagcg
121 gtgagcgggg cgcctatgcc tccttcggga gagacgcagg cgtgggcggc ctgactcagg
181 ctggcttcct gtcaggcgag ctggccctca acagccccgg gcccctgtcc ccttcgggca
241 tgaaggggac ctcccagtac tacccctcct actccggcag ctcccggcgg agagcggcag
301 acggcagcct agacacgcag cccaagaagg tccggaaggt cccgccgggt cttccatcct
361 cggtgtaccc acccagctca ggtgaggact acggcaggga tgccaccgcc tacccgtccg
421 ccaagacccc cagcagcacc tatcccgccc ccttctacgt ggcagatggc agcctgcacc
481 cctcagccga gctctggagt cccccgggcc aggcgggctt cgggcccatg ctgggtgggg
541 gctcatcccc gctgcccctc ccgcccggta gcggcccggt gggcagcagt ggaagcagca
601 gcacgtttgg tggcctgcac cagcacgagc gtatgggcta ccagctgcat ggagcagagg
661 tgaacggtgg gctcccatct gcatcctcct tctcctcagc ccccggagcc acgtacggcg
721 gcgtctccag ccacacgccg cctgtcagcg gggccgacag cctcctgggc tcccgaggga
781 ccacagctgg cagctccggg gatgccctcg gcaaagcact ggcctcgatc tactccccgg
841 atcactcaag caataacttc tcgtccagcc cttctacccc cgtgggctcc ccccagggcc
901 tggcaggaac gtcacagtgg cctcgagcag gagcccccgg tgccttatcg cccagctacg
961 acgggggtct ccacggcctg cagagtaaga tagaagacca cctggacgag gccatccacg
1021 tgctccgcag ccacgccgtg ggcacagccg gcgacatgca cacgctgctg cctggccacg
1081 gggcgctggc ctcaggtttc accggcccca tgtcgctggg cgggcggcac gcaggcctgg
1141 ttggaggcag ccaccccgag gacggcctcg caggcagcac cagcctcatg cacaaccacg
1201 cggccctccc cagccagcca ggcaccctcc ctgacctgtc tcggcctccc gactcctaca
1261 gtgggctagg gcgagcaggt gccacggcgg ccgccagcga gatcaagcgg gaggagaagg
1321 aggacgagga gaacacgtca gcggctgacc actcggagga ggagaagaag gagctgaagg
1381 ccccccgggc ccggaccagc ccagacgagg acgaggacga ccttctcccc ccagagcaga
1441 aggccgagcg ggagaaggag cgccgggtgg ccaataacgc ccgggagcgg ctgcgggtcc
1501 gtgacatcaa cgaggccttt aaggagctgg ggcgcatgtg ccaactgcac ctcaacagcg
1561 agaagcccca gaccaaactg ctcatcctgc accaggctgt ctcggtcatc ctgaacttgg
1621 agcagcaagt gcgagagcgg aacctgaatc ccaaagcagc ctgtttgaaa cggcgagaag
1681 aggaaaaggt gtcaggtgtg gttggagacc cccagatggt gctttcagct ccccacccag
1741 gcctgagcga agcccacaac cccgccgggc acatgtgaaa ggtatgcctc cgtgggacga
1801 gccacccgct ttcagccctg tgctctggcc ccagaacggc cactcgagac cccgggcttc
1861 atccacatcc acacctcaca cacctgttgt cagcatcgag ccaacaccaa cctgacaagg
1921 ttcggagtga tgggggcggc caaggtgaca ctgggtccag gagctccctg gggccctggc
1981 ctaccactca ctggcctcgc tccccctgtc cccgaatctc agccaccgtg tcactctgtg
2041 acctgtccca tggatcctga aactgcatct tggccctgtt gcctgggctg acaggagcat
2101 tttttttttt tccagtaaac aaaacctgaa agcaagcaac aaaacataca ctttgtcaga
2161 gaagaaaaaa atgccttaac tataaaaagc ggagaaatgg aaacatatca ctcaaggggg
2221 atgctgtgga aacctggctt atttttttaa agccaccagc aaattgtgcc taagcgaaat
2281 atttttttta aggaaaataa aaacattagt tacaagattt tttttttttt aatgtagatg
2341 aaaattagca aggatgctgc ctttggtctc tggttttttt aagctttttt tgcatatgtt
2401 ttgtaagcaa caaatttttt tgtataaaag tcccgtgtct ctcgctattt ctgctgctgt
2461 tcctagactg agcattgcat ttcttgatca accagatgat taaacgttgt attaaaaaga
2521 ccccgtgtaa acctgagccc ccccgtcccc cccccccccc cggaagccac tgcacacaga
2581 cagaacgggg acaggcggcg ggtcttttgt ttttttgatg ttgggggttc tcttggtttt
2641 gtcatgtgga aagtgatgcg tgggcgttcc ctgatgaagg caccttgggg cttccctgcc
2701 gcatcctctc ccctcaggaa ggggactgac ctgggcttgg gggaagggac gtcagcaagg
2761 tggctctgac cctcccaggt gactctgcca agcagctgtg gcccccaggg ctaccctaca
2821 caacgccctc cccaggcccc cctaagctgc tctcccttgg aacctgcaca gctctctgaa
2881 atggggcatt ttgttgggac cagtgacccc tggcatgggg accacaccct ggagcccggt
2941 gctggggacc tcctggacac cctgtccttc actcctttgc cccagggacc caggctcatg
3001 ctctgaactc tggctgagag gatgctgctc aggagccagc acaggacacc ccccacccca
3061 ccccaccatg tccccattac accagagggc catcgtgacg tagacaggat gccaggggcc
3121 tggccagcct cccccaatgc tggggagcat ccctgggcct ggggccacac ctgctgccct
3181 ccctctgtgt ggtccaaggg caagagtggc tggagccggg ggactgtgct ggtctgagcc
3241 ccacgaaggc cttgggctgt gcgtccgacc ctgctgcaga accagcaggg tgtcccctcg
3301 ggcccatctg tgtcccatgt cccagcaccc aggcctctct ccaggtctcc ttttctggtc
3361 ttttgccatg agggtaacca gctcttccca gctggctggg gactgtcttg ggtttaaaac
3421 tgcaagtctc ctaccctggg atcccatcca gttccacacg aactagggca gtggtcactg
3481 tggcacccag gtgtgggcct ggctagctgg gggccttcat gtgcccttca tgcccctccc
3541 tgcattgagg ccttgtggac ccctgggctg gctgtgttca tccccgctgc aggtcgggcg
3601 tctccccccg tgccactcct gagactccca ccgttacccc caggagatcc tggactgcct
3661 gactcccctc cccagactgg cttgggagcc tgggccccat ggtagatgca agggaaacct
3721 caaggccagc tcaatgcctg gtatctgccc ccagtccagg ccaggcggag gggaggggct
3781 gtccggctgc ctctcccttc tcggtggctt cccctacgcc ctgggagttt gatctcttaa
3841 gggaacttgc ctctccctct tgttttgctc ctggccctgc ccctaggtct gggtgggcag
3901 tggccccata gcctctggaa ctgtgcgttc tgcatagaat tcaaacgaga ttcacccagc
3961 gcgaggagga agaaacagca gttcctggga accacaatta tggggggtgg ggggtgtgat
4021 ctgagtgcct caagatggtt ttcaaaaaaa tttttttaaa gaaaataatt gtatacgtgt
4081 caacacagct ggctggatga ttgggacttt aaaacgaccc tctttcaggt ggattcagag
4141 acctgtcctg tatataacag cactgtagca ataaacgtga cattttataa cgaaaaaaaa
4201 aaaaaaaaaa a
//