LOCUS       BC033890                2874 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens forkhead box A1, mRNA (cDNA clone MGC:33105
            IMAGE:5269380), complete cds.
ACCESSION   BC033890
VERSION     BC033890.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2874)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2874)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 47 Row: o Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 24497500.
FEATURES             Location/Qualifiers
     source          1..2874
                     /db_xref="H-InvDB:HIT000042018"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33105 IMAGE:5269380"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2874
                     /gene="FOXA1"
                     /gene_synonym="MGC33105"
                     /gene_synonym="TCF3A"
                     /db_xref="GeneID:3169"
                     /db_xref="HGNC:HGNC:5021"
                     /db_xref="MIM:602294"
     CDS             63..1481
                     /gene="FOXA1"
                     /gene_synonym="MGC33105"
                     /gene_synonym="TCF3A"
                     /codon_start=1
                     /product="forkhead box A1"
                     /protein_id="AAH33890.1"
                     /db_xref="GeneID:3169"
                     /db_xref="HGNC:HGNC:5021"
                     /db_xref="MIM:602294"
                     /translation="MLGTVKMEGHETSDWNSYYADTQEAYSSVPVSNMNSGLGSMNSM
                     NTYMTMNTMTTSGNMTPASFNMSYANPGLGAGLSPGAVAGMPGGSAGAMNSMTAAGVT
                     AMGTALSPSGMGAMGAQQAASMNGLGPYAAAMNPCMSPMAYAPSNLGRSRAGGGGDAK
                     TFKRSYPHAKPPYSYISLITMAIQQAPSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSI
                     RHSLSFNDCFVKVARSPDKPGKGSYWTLHPDSGNMFENGCYLRRQKRFKCEKQPGAGG
                     GGGSGSGGSGAKGGPESRKDPSGASNPSADSPLHRGVHGKTGQLEGAPAPGPAASPQT
                     LDHSGATATGGASELKTPASSTAPPISSGPGALASVPASHPAHGLAPHESQLHLKGDP
                     HYSFNHPFSINNLMSSSEQQHKLDFKAYEQALQYSPYGSTLPASLPLGSASVTTRSPI
                     EPSALEPAYYQGVYSRPVLNTS"
BASE COUNT          739 a          793 c          675 g          667 t
ORIGIN      
        1 agcggggccg cccgtcgctt cgcacagggc tggatggttg tattgggcag ggtggctcca
       61 ggatgttagg aactgtgaag atggaagggc atgaaaccag cgactggaac agctactacg
      121 cagacacgca ggaggcctac tcctccgtcc cggtcagcaa catgaactca ggcctgggct
      181 ccatgaactc catgaacacc tacatgacca tgaacaccat gactacgagc ggcaacatga
      241 ccccggcgtc cttcaacatg tcctatgcca acccgggcct aggggccggc ctgagtcccg
      301 gcgcagtagc cggcatgccg gggggctcgg cgggcgccat gaacagcatg actgcggccg
      361 gcgtgacggc catgggtacg gcgctgagcc cgagcggcat gggcgccatg ggtgcgcagc
      421 aggcggcctc catgaatggc ctgggcccct acgcggccgc catgaacccg tgcatgagcc
      481 ccatggcgta cgcgccgtcc aacctgggcc gcagccgcgc gggcggcggc ggcgacgcca
      541 agacgttcaa gcgcagctac ccgcacgcca agccgcccta ctcgtacatc tcgctcatca
      601 ccatggccat ccagcaggcg cccagcaaga tgctcacgct gagcgagatc taccagtgga
      661 tcatggacct cttcccctat taccggcaga accagcagcg ctggcagaac tccatccgcc
      721 actcgctgtc cttcaatgac tgcttcgtca aggtggcacg ctccccggac aagccgggca
      781 agggctccta ctggacgctg cacccggact ccggcaacat gttcgagaac ggctgctact
      841 tgcgccgcca gaagcgcttc aagtgcgaga agcagccggg ggccggcggc gggggcggga
      901 gcggaagcgg gggcagcggc gccaagggcg gccctgagag ccgcaaggac ccctctggcg
      961 cctctaaccc cagcgccgac tcgcccctcc atcggggtgt gcacgggaag accggccagc
     1021 tagagggcgc gccggccccc gggcccgccg ccagccccca gactctggac cacagtgggg
     1081 cgacggcgac agggggcgcc tcggagttga agactccagc ctcctcaact gcgcccccca
     1141 taagctccgg gcccggggcg ctggcctctg tgcccgcctc tcacccggca cacggcttgg
     1201 caccccacga gtcccagctg cacctgaaag gggaccccca ctactccttc aaccacccgt
     1261 tctccatcaa caacctcatg tcctcctcgg agcagcagca taagctggac ttcaaggcat
     1321 acgaacaggc actgcaatac tcgccttacg gctctacgtt gcccgccagc ctgcctctag
     1381 gcagcgcctc ggtgaccacc aggagcccca tcgagccctc agccctggag ccggcgtact
     1441 accaaggtgt gtattccaga cccgtcctaa acacttccta gctcccggga ctggggggtt
     1501 tgtctggcat agccatgctg gtagcaagag agaaaaaatc aacagcaaac aaaaccacac
     1561 aaaccaaacc gtcaacagca taataaaatc ccaacaacta tttttatttc atttttcatg
     1621 cacaaccttt cccccagtgc aaaagactgt tactttatta ttgtattcaa aattcattgt
     1681 gtatattact acaaagacaa ccccaaacca atttttttcc tgcgaagttt aatgatccac
     1741 aagtgtatat atgaaattct cctccttcct tgcccccctc tctttcttcc ctctttcccc
     1801 tccagacatt ctagtttgtg gagggttatt taaaaaaaca aaaaaggaag atggtcaagt
     1861 ttgtaaaata tttgtttgtg ctttttcccc ctccttacct gaccccctac gagtttacag
     1921 gtctgtggca atactcttaa ccataagaat tgaaatggtg aagaaacaag tatacactag
     1981 aggctcttaa aagtattgaa agacaatact gctgttatat agcaagacat aaacagatta
     2041 taaacatcag agccatttgc ttctcagttt acatttctga tacatgcaga tagcagatgt
     2101 ctttaaatga aatacatgta tattgtgtat ggacttaatt atgcacatgc tcagatgtgt
     2161 agacatcctc cgtatattta cataacatat agaggtaata gataggtgat atacatgata
     2221 cattctcaag agttgcttga ccgaaagtta caaggacccc aacccctttg tcctctctac
     2281 ccacagatgg ccctgggaat caattcctca ggaattgccc tcaagaactc tgcttcttgc
     2341 tttgcagagt gccatggtca tgtcattctg aggtcacata acacataaaa ttagtttcta
     2401 tgagtgtata ccatttaaag aatttttttt tcagtaaaag ggaatagtac aatgttggag
     2461 gagagataag ttatagggag ctggatttca aaacgtggtc caagattcaa aaatcctatt
     2521 gatagtggcc attttaatca ttgccatcgt gtgcttgttt catccagtgt tatgcacttt
     2581 ccacagttgg acatggtgtt agtatagcca gacgggtttc attattattt ctctttgctt
     2641 tctcaatgtt aatttattgc atggtttatt ctttttcttt acagctgaaa ttgctttaaa
     2701 tgatggttaa aattacaaat taaattgtta atttttatca atgtgattgt aattaaaaat
     2761 attttgattt aaataacaaa aataatacca gattttaagc cgtggaaaat gttcttgatc
     2821 atttgcagtt aaggacttta aataaatcaa atgttaacaa aaaaaaaaaa aaaa
//