LOCUS       BC006221                2131 bp    mRNA    linear   HUM 30-JAN-2008
DEFINITION  Homo sapiens NK2 homeobox 1, mRNA (cDNA clone MGC:10523
            IMAGE:3941576), complete cds.
ACCESSION   BC006221
VERSION     BC006221.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2131)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC006221.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 15 Row: e Column: 9
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31881814.
FEATURES             Location/Qualifiers
     source          1..2131
                     /db_xref="H-InvDB:HIT000032680"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:10523 IMAGE:3941576"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2131
                     /gene="NKX2-1"
                     /gene_synonym="BHC"
                     /gene_synonym="NK-2"
                     /gene_synonym="NKX2.1"
                     /gene_synonym="TEBP"
                     /gene_synonym="TTF-1"
                     /gene_synonym="TTF1"
                     /db_xref="GeneID:7080"
                     /db_xref="HGNC:HGNC:11825"
                     /db_xref="MIM:600635"
     CDS             121..1236
                     /gene="NKX2-1"
                     /gene_synonym="BHC"
                     /gene_synonym="NK-2"
                     /gene_synonym="NKX2.1"
                     /gene_synonym="TEBP"
                     /gene_synonym="TTF-1"
                     /gene_synonym="TTF1"
                     /codon_start=1
                     /product="NK2 homeobox 1"
                     /protein_id="AAH06221.2"
                     /db_xref="GeneID:7080"
                     /db_xref="HGNC:HGNC:11825"
                     /db_xref="MIM:600635"
                     /translation="MSMSPKHTTPFSVSDILSPLEESYKKVGMEGGGLGAPLAAYRQG
                     QAAPPTAAMQQHAVGHHGAVTAAYHMTAAGVPQLSHSAVGGYCNGNLGNMSELPPYQD
                     TMRNSASGPGWYGANPDPRFPAISRFMGPASGMNMSGMGGLGSLGDVSKNMAPLPSAP
                     RRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKR
                     QAKDKAAQQQLQQDSGGGGGGGGTGCPQQQQAQQQSPRRVAVPVLVKDGKPCQAGAPA
                     PGAASLQGHAQQQAQHQAQAAQAAAAAISVGSGGAGLGAHPGHQPGSAGQSPDLAHHA
                     ASPAALQGQVSSLSHLNSSGSDYGTMSCSTLLYGRTW"
BASE COUNT          478 a          643 c          629 g          381 t
ORIGIN      
        1 ggcgactggg gctcagcgca gcgaagcccg atgtggtccg gaggcagtgg gaaggcgcgg
       61 ggctgggagg ccgcggcggg agggaggagc agccccggca ggctcagccg ccgccgaatc
      121 atgtcgatga gtccaaagca cacgactccg ttctcagtgt ctgacatctt gagtcccctg
      181 gaggaaagct acaagaaagt gggcatggag ggcggcggcc tcggggctcc gctggcggcg
      241 tacaggcagg gccaggcggc accgccaaca gcggccatgc agcagcacgc cgtggggcac
      301 cacggcgccg tcaccgccgc ctaccacatg acggcggcgg gggtgcccca gctctcgcac
      361 tccgccgtgg ggggctactg caacggcaac ctgggcaaca tgagcgagct gccgccgtac
      421 caggacacca tgaggaacag cgcctctggc cccggatggt acggcgccaa cccagacccg
      481 cgcttccccg ccatctcccg cttcatgggc ccggcgagcg gcatgaacat gagcggcatg
      541 ggcggcctgg gctcgctggg ggacgtgagc aagaacatgg ccccgctgcc aagcgcgccg
      601 cgcaggaagc gccgggtgct cttctcgcag gcgcaggtgt acgagctgga gcgacgcttc
      661 aagcaacaga agtacctgtc ggcgccggag cgcgagcacc tggccagcat gatccacctg
      721 acgcccacgc aggtcaagat ctggttccag aaccaccgct acaaaatgaa gcgccaggcc
      781 aaggacaagg cggcgcagca gcaactgcag caggacagcg gcggcggcgg gggcggcggg
      841 ggcaccgggt gcccgcagca gcaacaggct cagcagcagt cgccgcgacg cgtggcggtg
      901 ccggtcctgg tgaaagacgg caaaccgtgc caggcgggtg cccccgcgcc gggcgccgcc
      961 agcctacaag gccacgcgca gcagcaggcg cagcaccagg cgcaggccgc gcaggcggcg
     1021 gcagcggcca tctccgtggg cagcggtggc gccggccttg gcgcacaccc gggccaccag
     1081 ccaggcagcg caggccagtc tccggacctg gcgcaccacg ccgccagccc cgcggcgctg
     1141 cagggccagg tatccagcct gtcccacctg aactcctcgg gctcggacta cggcaccatg
     1201 tcctgctcca ccttgctata cggtcggacc tggtgagagg acgccgggcc ggccctagcc
     1261 cagcgctctg cctcaccgct tccctcctgc ccgccacaca gaccaccatc caccgctgct
     1321 ccacgcgctt cgacttttct taacaacctg gccgcgttta gaccaaggaa caaaaaaacc
     1381 acaaaggcca aactgctgga cgtctttctt tttttccccc cctaaaattt gtgggttttt
     1441 tttttaaaaa aaagaaaatg aaaaacaacc aagcgcatcc aatctcaagg aatctttaag
     1501 cagagaaggg cataaaacag ctttggggtg tctttttttg gtgattcaaa tgggttttcc
     1561 acgctagggc ggggcacaga ttggagaggg ctctgtgctg acatggctct ggactctaaa
     1621 gaccaaactt cactctgggc acactctgcc agcaaagagg actcgcttgt aaataccagg
     1681 attttttttt ttttttgaag ggaggacggg agctggggag aggaaagagt cttcaacata
     1741 acccacttgt cactgacaca aaggaagtgc cccctccccg gcaccctctg gccgcctagg
     1801 ctcagcggcg accgccctcc gcgaaaatag tttgtttaat gtgaacttgt agctgtaaaa
     1861 cgctgtcaaa agttggacta aatgcctagt ttttagtaat ctgtacattt tgttgtaaaa
     1921 agaaaaacca ctcccagtcc ccagcccttc acatttttta tgggcattga caaatctgtg
     1981 tatattattt ggcagtttgg tatttgcggc gtcagtcttt ttctgttgta acttatgtag
     2041 atatttggct taaatatagt tcctaagaag cttctaataa attatacaaa ttaaaaagat
     2101 tctttttctg attaaaaaaa aaaaaaaaaa a
//