LOCUS BC033890 2874 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens forkhead box A1, mRNA (cDNA clone MGC:33105 IMAGE:5269380), complete cds. ACCESSION BC033890 VERSION BC033890.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2874) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2874) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-JUL-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 47 Row: o Column: 19 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24497500. FEATURES Location/Qualifiers source 1..2874 /db_xref="H-InvDB:HIT000042018" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:33105 IMAGE:5269380" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2874 /gene="FOXA1" /gene_synonym="MGC33105" /gene_synonym="TCF3A" /db_xref="GeneID:3169" /db_xref="HGNC:HGNC:5021" /db_xref="MIM:602294" CDS 63..1481 /gene="FOXA1" /gene_synonym="MGC33105" /gene_synonym="TCF3A" /codon_start=1 /product="forkhead box A1" /protein_id="AAH33890.1" /db_xref="GeneID:3169" /db_xref="HGNC:HGNC:5021" /db_xref="MIM:602294" /translation="MLGTVKMEGHETSDWNSYYADTQEAYSSVPVSNMNSGLGSMNSM NTYMTMNTMTTSGNMTPASFNMSYANPGLGAGLSPGAVAGMPGGSAGAMNSMTAAGVT AMGTALSPSGMGAMGAQQAASMNGLGPYAAAMNPCMSPMAYAPSNLGRSRAGGGGDAK TFKRSYPHAKPPYSYISLITMAIQQAPSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSI RHSLSFNDCFVKVARSPDKPGKGSYWTLHPDSGNMFENGCYLRRQKRFKCEKQPGAGG GGGSGSGGSGAKGGPESRKDPSGASNPSADSPLHRGVHGKTGQLEGAPAPGPAASPQT LDHSGATATGGASELKTPASSTAPPISSGPGALASVPASHPAHGLAPHESQLHLKGDP HYSFNHPFSINNLMSSSEQQHKLDFKAYEQALQYSPYGSTLPASLPLGSASVTTRSPI EPSALEPAYYQGVYSRPVLNTS" BASE COUNT 739 a 793 c 675 g 667 t ORIGIN 1 agcggggccg cccgtcgctt cgcacagggc tggatggttg tattgggcag ggtggctcca 61 ggatgttagg aactgtgaag atggaagggc atgaaaccag cgactggaac agctactacg 121 cagacacgca ggaggcctac tcctccgtcc cggtcagcaa catgaactca ggcctgggct 181 ccatgaactc catgaacacc tacatgacca tgaacaccat gactacgagc ggcaacatga 241 ccccggcgtc cttcaacatg tcctatgcca acccgggcct aggggccggc ctgagtcccg 301 gcgcagtagc cggcatgccg gggggctcgg cgggcgccat gaacagcatg actgcggccg 361 gcgtgacggc catgggtacg gcgctgagcc cgagcggcat gggcgccatg ggtgcgcagc 421 aggcggcctc catgaatggc ctgggcccct acgcggccgc catgaacccg tgcatgagcc 481 ccatggcgta cgcgccgtcc aacctgggcc gcagccgcgc gggcggcggc ggcgacgcca 541 agacgttcaa gcgcagctac ccgcacgcca agccgcccta ctcgtacatc tcgctcatca 601 ccatggccat ccagcaggcg cccagcaaga tgctcacgct gagcgagatc taccagtgga 661 tcatggacct cttcccctat taccggcaga accagcagcg ctggcagaac tccatccgcc 721 actcgctgtc cttcaatgac tgcttcgtca aggtggcacg ctccccggac aagccgggca 781 agggctccta ctggacgctg cacccggact ccggcaacat gttcgagaac ggctgctact 841 tgcgccgcca gaagcgcttc aagtgcgaga agcagccggg ggccggcggc gggggcggga 901 gcggaagcgg gggcagcggc gccaagggcg gccctgagag ccgcaaggac ccctctggcg 961 cctctaaccc cagcgccgac tcgcccctcc atcggggtgt gcacgggaag accggccagc 1021 tagagggcgc gccggccccc gggcccgccg ccagccccca gactctggac cacagtgggg 1081 cgacggcgac agggggcgcc tcggagttga agactccagc ctcctcaact gcgcccccca 1141 taagctccgg gcccggggcg ctggcctctg tgcccgcctc tcacccggca cacggcttgg 1201 caccccacga gtcccagctg cacctgaaag gggaccccca ctactccttc aaccacccgt 1261 tctccatcaa caacctcatg tcctcctcgg agcagcagca taagctggac ttcaaggcat 1321 acgaacaggc actgcaatac tcgccttacg gctctacgtt gcccgccagc ctgcctctag 1381 gcagcgcctc ggtgaccacc aggagcccca tcgagccctc agccctggag ccggcgtact 1441 accaaggtgt gtattccaga cccgtcctaa acacttccta gctcccggga ctggggggtt 1501 tgtctggcat agccatgctg gtagcaagag agaaaaaatc aacagcaaac aaaaccacac 1561 aaaccaaacc gtcaacagca taataaaatc ccaacaacta tttttatttc atttttcatg 1621 cacaaccttt cccccagtgc aaaagactgt tactttatta ttgtattcaa aattcattgt 1681 gtatattact acaaagacaa ccccaaacca atttttttcc tgcgaagttt aatgatccac 1741 aagtgtatat atgaaattct cctccttcct tgcccccctc tctttcttcc ctctttcccc 1801 tccagacatt ctagtttgtg gagggttatt taaaaaaaca aaaaaggaag atggtcaagt 1861 ttgtaaaata tttgtttgtg ctttttcccc ctccttacct gaccccctac gagtttacag 1921 gtctgtggca atactcttaa ccataagaat tgaaatggtg aagaaacaag tatacactag 1981 aggctcttaa aagtattgaa agacaatact gctgttatat agcaagacat aaacagatta 2041 taaacatcag agccatttgc ttctcagttt acatttctga tacatgcaga tagcagatgt 2101 ctttaaatga aatacatgta tattgtgtat ggacttaatt atgcacatgc tcagatgtgt 2161 agacatcctc cgtatattta cataacatat agaggtaata gataggtgat atacatgata 2221 cattctcaag agttgcttga ccgaaagtta caaggacccc aacccctttg tcctctctac 2281 ccacagatgg ccctgggaat caattcctca ggaattgccc tcaagaactc tgcttcttgc 2341 tttgcagagt gccatggtca tgtcattctg aggtcacata acacataaaa ttagtttcta 2401 tgagtgtata ccatttaaag aatttttttt tcagtaaaag ggaatagtac aatgttggag 2461 gagagataag ttatagggag ctggatttca aaacgtggtc caagattcaa aaatcctatt 2521 gatagtggcc attttaatca ttgccatcgt gtgcttgttt catccagtgt tatgcacttt 2581 ccacagttgg acatggtgtt agtatagcca gacgggtttc attattattt ctctttgctt 2641 tctcaatgtt aatttattgc atggtttatt ctttttcttt acagctgaaa ttgctttaaa 2701 tgatggttaa aattacaaat taaattgtta atttttatca atgtgattgt aattaaaaat 2761 attttgattt aaataacaaa aataatacca gattttaagc cgtggaaaat gttcttgatc 2821 atttgcagtt aaggacttta aataaatcaa atgttaacaa aaaaaaaaaa aaaa //