LOCUS BC006221 2131 bp mRNA linear HUM 30-JAN-2008 DEFINITION Homo sapiens NK2 homeobox 1, mRNA (cDNA clone MGC:10523 IMAGE:3941576), complete cds. ACCESSION BC006221 VERSION BC006221.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2131) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2131) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (09-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC006221.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 15 Row: e Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31881814. FEATURES Location/Qualifiers source 1..2131 /db_xref="H-InvDB:HIT000032680" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:10523 IMAGE:3941576" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2131 /gene="NKX2-1" /gene_synonym="BHC" /gene_synonym="NK-2" /gene_synonym="NKX2.1" /gene_synonym="TEBP" /gene_synonym="TTF-1" /gene_synonym="TTF1" /db_xref="GeneID:7080" /db_xref="HGNC:HGNC:11825" /db_xref="MIM:600635" CDS 121..1236 /gene="NKX2-1" /gene_synonym="BHC" /gene_synonym="NK-2" /gene_synonym="NKX2.1" /gene_synonym="TEBP" /gene_synonym="TTF-1" /gene_synonym="TTF1" /codon_start=1 /product="NK2 homeobox 1" /protein_id="AAH06221.2" /db_xref="GeneID:7080" /db_xref="HGNC:HGNC:11825" /db_xref="MIM:600635" /translation="MSMSPKHTTPFSVSDILSPLEESYKKVGMEGGGLGAPLAAYRQG QAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNLGNMSELPPYQD TMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGDVSKNMAPLPSAP RRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKR QAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVLVKDGKPCQAGAPA PGAASLQGHAQQQAQHQAQAAQAAAAAISVGSGGAGLGAHPGHQPGSAGQSPDLAHHA ASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW" BASE COUNT 478 a 643 c 629 g 381 t ORIGIN 1 ggcgactggg gctcagcgca gcgaagcccg atgtggtccg gaggcagtgg gaaggcgcgg 61 ggctgggagg ccgcggcggg agggaggagc agccccggca ggctcagccg ccgccgaatc 121 atgtcgatga gtccaaagca cacgactccg ttctcagtgt ctgacatctt gagtcccctg 181 gaggaaagct acaagaaagt gggcatggag ggcggcggcc tcggggctcc gctggcggcg 241 tacaggcagg gccaggcggc accgccaaca gcggccatgc agcagcacgc cgtggggcac 301 cacggcgccg tcaccgccgc ctaccacatg acggcggcgg gggtgcccca gctctcgcac 361 tccgccgtgg ggggctactg caacggcaac ctgggcaaca tgagcgagct gccgccgtac 421 caggacacca tgaggaacag cgcctctggc cccggatggt acggcgccaa cccagacccg 481 cgcttccccg ccatctcccg cttcatgggc ccggcgagcg gcatgaacat gagcggcatg 541 ggcggcctgg gctcgctggg ggacgtgagc aagaacatgg ccccgctgcc aagcgcgccg 601 cgcaggaagc gccgggtgct cttctcgcag gcgcaggtgt acgagctgga gcgacgcttc 661 aagcaacaga agtacctgtc ggcgccggag cgcgagcacc tggccagcat gatccacctg 721 acgcccacgc aggtcaagat ctggttccag aaccaccgct acaaaatgaa gcgccaggcc 781 aaggacaagg cggcgcagca gcaactgcag caggacagcg gcggcggcgg gggcggcggg 841 ggcaccgggt gcccgcagca gcaacaggct cagcagcagt cgccgcgacg cgtggcggtg 901 ccggtcctgg tgaaagacgg caaaccgtgc caggcgggtg cccccgcgcc gggcgccgcc 961 agcctacaag gccacgcgca gcagcaggcg cagcaccagg cgcaggccgc gcaggcggcg 1021 gcagcggcca tctccgtggg cagcggtggc gccggccttg gcgcacaccc gggccaccag 1081 ccaggcagcg caggccagtc tccggacctg gcgcaccacg ccgccagccc cgcggcgctg 1141 cagggccagg tatccagcct gtcccacctg aactcctcgg gctcggacta cggcaccatg 1201 tcctgctcca ccttgctata cggtcggacc tggtgagagg acgccgggcc ggccctagcc 1261 cagcgctctg cctcaccgct tccctcctgc ccgccacaca gaccaccatc caccgctgct 1321 ccacgcgctt cgacttttct taacaacctg gccgcgttta gaccaaggaa caaaaaaacc 1381 acaaaggcca aactgctgga cgtctttctt tttttccccc cctaaaattt gtgggttttt 1441 tttttaaaaa aaagaaaatg aaaaacaacc aagcgcatcc aatctcaagg aatctttaag 1501 cagagaaggg cataaaacag ctttggggtg tctttttttg gtgattcaaa tgggttttcc 1561 acgctagggc ggggcacaga ttggagaggg ctctgtgctg acatggctct ggactctaaa 1621 gaccaaactt cactctgggc acactctgcc agcaaagagg actcgcttgt aaataccagg 1681 attttttttt ttttttgaag ggaggacggg agctggggag aggaaagagt cttcaacata 1741 acccacttgt cactgacaca aaggaagtgc cccctccccg gcaccctctg gccgcctagg 1801 ctcagcggcg accgccctcc gcgaaaatag tttgtttaat gtgaacttgt agctgtaaaa 1861 cgctgtcaaa agttggacta aatgcctagt ttttagtaat ctgtacattt tgttgtaaaa 1921 agaaaaacca ctcccagtcc ccagcccttc acatttttta tgggcattga caaatctgtg 1981 tatattattt ggcagtttgg tatttgcggc gtcagtcttt ttctgttgta acttatgtag 2041 atatttggct taaatatagt tcctaagaag cttctaataa attatacaaa ttaaaaagat 2101 tctttttctg attaaaaaaa aaaaaaaaaa a //