LOCUS BC011780 1937 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens forkhead box A2, mRNA (cDNA clone MGC:19807 IMAGE:3940139), complete cds. ACCESSION BC011780 VERSION BC011780.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1937) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1937) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 19, 2003 this sequence version replaced BC011780.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 27 Row: j Column: 6 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24497502. FEATURES Location/Qualifiers source 1..1937 /db_xref="H-InvDB:HIT000035448" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:19807 IMAGE:3940139" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1937 /gene="FOXA2" /gene_synonym="MGC19807" /gene_synonym="TCF3B" /db_xref="GeneID:3170" /db_xref="HGNC:HGNC:5022" /db_xref="MIM:600288" CDS 43..1416 /gene="FOXA2" /gene_synonym="MGC19807" /gene_synonym="TCF3B" /codon_start=1 /product="forkhead box A2" /protein_id="AAH11780.1" /db_xref="GeneID:3170" /db_xref="HGNC:HGNC:5022" /db_xref="MIM:600288" /translation="MLGAVKMEGHEPSDWSSYYAEPEGYSSVSNMNAGLGMNGMNTYM SMSAAAMGSGSGNMSAGSMNMSSYVGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGM GPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKTYRRSYTHAKP PYSYISLITMAIQQSPNKMLTLSEIYQWIMDLFPFYRQNQQRWQNSIRHSLSFNDCFL KVPRSPDKPGKGSFWTLHPDSGNMFENGCYLRRQKRFKCEKQLALKEAAGAAGSGKKA AAGAQASQAQLGEAAGPASETPAGTESPHSSASPCQEHKRGGLGELKGTPAAALSPPE PAPSPGQQQQAAAHLLGPPHHPGLPPEAHLKPEHHYAFNHPFSINNLMSSEQQHHHSH HHHQPHKMDLKAYEQVMHYPGYGSPMPGSLAMGPVTNKTGLDASPLAADTSYYQGVYS RPIMNSS" BASE COUNT 442 a 628 c 556 g 311 t ORIGIN 1 gtggctgtta aattttaaac tgccatgcac tcggcttcca gtatgctggg agcggtgaag 61 atggaagggc acgagccgtc cgactggagc agctactatg cagagcccga gggctactcc 121 tccgtgagca acatgaacgc cggcctgggg atgaacggca tgaacacgta catgagcatg 181 tcggcggccg ccatgggcag cggctcgggc aacatgagcg cgggctccat gaacatgtcg 241 tcgtacgtgg gcgctggcat gagcccgtcc ctggcgggga tgtcccccgg cgcgggcgcc 301 atggcgggca tgggcggctc ggccggggcg gctggcgtgg cgggcatggg gccgcacttg 361 agtcccagcc tgagcccgct cggggggcag gcggccgggg ccatgggcgg cctggccccc 421 tacgccaaca tgaactccat gagccccatg tacgggcagg cgggcctgag ccgcgcccgc 481 gaccccaaga cctacaggcg cagctacacg cacgcaaagc cgccctactc gtacatctcg 541 ctcatcacca tggccatcca gcagagcccc aacaagatgc tgacgctgag cgagatctac 601 cagtggatca tggacctctt ccccttctac cggcagaacc agcagcgctg gcagaactcc 661 atccgccact cgctctcctt caacgactgt ttcctgaagg tgccccgctc gcccgacaag 721 cccggcaagg gctccttctg gaccctgcac cctgactcgg gcaacatgtt cgagaacggc 781 tgctacctgc gccgccagaa gcgcttcaag tgcgagaagc agctggcgct gaaggaggcc 841 gcaggcgccg ccggcagcgg caagaaggcg gccgccgggg cccaggcctc acaggctcaa 901 ctcggggagg ccgccgggcc ggcctccgag actccggcgg gcaccgagtc gcctcactcg 961 agcgcctccc cgtgccagga gcacaagcga gggggcctgg gagagctgaa ggggacgccg 1021 gctgcggcgc tgagcccccc agagccggcg ccctctcccg ggcagcagca gcaggccgcg 1081 gcccacctgc tgggcccgcc ccaccacccg ggcctgccgc ctgaggccca cctgaagccg 1141 gaacaccact acgccttcaa ccacccgttc tccatcaaca acctcatgtc ctcggagcag 1201 cagcaccacc acagccacca ccaccaccag ccccacaaaa tggacctcaa ggcctacgaa 1261 caggtgatgc actaccccgg ctacggttcc cccatgcctg gcagcttggc catgggcccg 1321 gtcacgaaca aaacgggcct ggacgcctcg cccctggccg cagatacctc ctactaccag 1381 ggggtgtact cccggcccat tatgaactcc tcttaagaag acgacggctt caggcccggc 1441 taactctggc accccggatc gaggataagt gagagagcaa gtgggggtcg agactttggg 1501 gagacggtgt tgcagagacg caagggagaa gaaatccata acacccccac cccaacaccc 1561 ccaagacagc agtcttcctt cacccgctgc agctgttccg tcccaaacag agggccacac 1621 agatacccca cgttctatat aaggaggaaa acgggaaaga atataaagtt aaaaaaaagc 1681 ctccggtttc cactactgtg tagactcctg cttcttcaag cacctgcaga ttctgatttt 1741 tttgttgttg ttgttctcct ccattgctgt tgttgcaggg aagtcttact taaaaaaaaa 1801 aaaaaacttt tgtgagtgac tcggtgtaaa accatgtagt tttaacagaa ccagagggtt 1861 gtactattgt ttaaaaacag gaaaaaaaat aatgtaaggg tctgttgtaa atgaccaaga 1921 aaaaaaaaaa aaaaaaa //