LOCUS       BC101720                2169 bp    mRNA    linear   HUM 17-JUL-2007
DEFINITION  Homo sapiens highly divergent homeobox, mRNA (cDNA clone MGC:126769
            IMAGE:8069226), complete cds.
ACCESSION   BC101720
VERSION     BC101720.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2169)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2169)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-AUG-2005) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Mike Brownstein, NIMH
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 2 Row: K Column: 16.
FEATURES             Location/Qualifiers
     source          1..2169
                     /db_xref="H-InvDB:HIT000337072"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:126769 IMAGE:8069226"
                     /tissue_type="Lung and heart, PCR rescued clones"
                     /clone_lib="NIH_MGC_315"
                     /note="Vector: pCR4-TOPO with reversed insert; Clone
                     identification sequence tag: TGCACACC"
     gene            1..2169
                     /gene="HDX"
                     /gene_synonym="D030011N01Rik"
                     /db_xref="GeneID:139324"
                     /db_xref="HGNC:HGNC:26411"
     CDS             51..2123
                     /gene="HDX"
                     /gene_synonym="D030011N01Rik"
                     /codon_start=1
                     /product="highly divergent homeobox"
                     /protein_id="AAI01721.1"
                     /db_xref="GeneID:139324"
                     /db_xref="HGNC:HGNC:26411"
                     /translation="MNLRSVFTVEQQRILQRYYENGMTNQSKNCFQLILQCAQETKLD
                     FSVVRTWVGNKRRKMSSKNSESGTATTGTSLSAPDITVRNVVNIARPSSQQSSWTSAN
                     NDVIVTGIYSPASSSSRQGTNKHTDTQITEAHKIPIQKTATKNDTEFQLHIPVQRQVA
                     HCKNASLLLGEKTIILSRQTSVLNAGNSVFNHAKKNYGNSSVQASEMTVPQKPSVCHR
                     PCKIEPVGIQRSYKPEHTGPALHNLCGQKPTIRDPYCRTQNLEIREVFSLAVSDYPQR
                     ILGGNAPQKPSSAEGNCLSIAMETGDAEDEYAREEELASMRAQIPSYSRFYESGSSLR
                     AENQSTTLPGPGRNMPNSQMVNIRDMSDNVLYQNRNYHLTPRTSLHTASSTMYSNTNP
                     LRSNFSPHFASSNQLRLSQNQNNYQISGNLTVPWITGCSRKRALQDRTQFSDRDLATL
                     KKYWDNGMTSLGSVCREKIEAVATELNVDCEIVRTWIGNRRRKYRLMGIEVPPPRGGP
                     ADFSEQPESGSLSALTPGEEAGPEVGEDNDRNDEVSICLSEGSSQEEPNEVVPNDARA
                     HKEEDHHAVTTDNVKIEIIDDEESDMISNSEVEQVNSFLDYKNEEVKFIENELEIQKQ
                     KYFKLQTFVRSLILAMKADDKEQQQALLSDLPPELEEMDFNHASLEPDDTSFSVSSLS
                     EKNVSESL"
BASE COUNT          743 a          424 c          461 g          541 t
ORIGIN      
        1 ggcctgatct gggttccgct gattcctttc gtaaccgcac cacacccgag atgaatctac
       61 gttctgtatt tactgtagaa caacaaagga ttttacagcg ttattatgaa aatggaatga
      121 caaatcaaag taaaaattgc tttcagctca tattacagtg tgcacaggag actaagctgg
      181 acttcagtgt agtcaggacg tgggttggca ataagagaag aaagatgagt agtaagaact
      241 ctgaatctgg aacagcaaca acaggaacct ctttgtcagc tccagacatc acagtcagaa
      301 atgtggttaa tattgctcga ccctcaagcc agcagtcttc ttggacatct gccaataatg
      361 atgtcattgt aactggtata tacagtccag ccagttcatc aagtaggcaa ggaacaaaca
      421 aacatacaga cacacaaatt acagaagcac ataaaatccc tattcagaaa acagccacta
      481 aaaatgatac tgagtttcag ttacacattc ctgtccaaag acaagtagca cactgtaaaa
      541 atgcttccct actcctaggt gaaaaaacaa ttattttgtc aagacagaca agtgtgctaa
      601 atgctggaaa ctcagtattc aatcacgcaa agaaaaacta tggaaactct tcagtacaag
      661 cttctgaaat gacagtacct caaaagcctt ctgtgtgcca ccgaccttgt aaaattgaac
      721 cagttgggat tcaaaggtca tataagcctg aacacacagg cccagcatta cataacttat
      781 gtgggcaaaa gccaactatt agagaccctt actgtagaac acaaaacttg gaaatccgtg
      841 aagtgttttc attggcagtt agcgattacc cccagagaat tctgggagga aatgccccac
      901 agaagcctag ctcagcagaa ggaaattgtt tgtccattgc aatggagact ggagatgctg
      961 aggatgaata tgccagagag gaagagctgg catcgatgag agcacagata ccaagctatt
     1021 cgagatttta tgaaagtggc agttcccttc gagctgagaa ccaaagtaca accttgcccg
     1081 gaccaggaag aaatatgcca aattcacaaa tggtgaatat tagagatatg tcagacaatg
     1141 tactgtatca aaacagaaac taccatttga caccacggac ctcattacat acagcatcta
     1201 gtacaatgta cagtaatacc aatccattac ggagtaattt ttctcctcat tttgcatcat
     1261 caaaccaatt gagattatca caaaaccaaa acaattacca gatttcagga aaccttactg
     1321 tgccttggat tacagggtgt tctagaaaaa gagcactaca ggaccgcact cagttcagtg
     1381 accgagactt agccaccctt aagaagtatt gggacaatgg catgaccagc ctgggctctg
     1441 tttgtagaga gaaaattgaa gctgtggcaa ctgaattaaa tgttgactgt gaaatagttc
     1501 ggacttggat tgggaatcga agaaggaaat atcgtttaat ggggattgaa gttccacctc
     1561 caagaggagg ccctgctgat ttctctgagc agcctgagtc tggttcttta tctgcactca
     1621 caccaggaga ggaagctggg cctgaagtag gagaggataa tgacagaaat gatgaagtat
     1681 ccatctgttt gtctgaagga agctctcaag aagagcccaa tgaagttgtt ccgaatgatg
     1741 caagggctca taaggaagag gaccaccatg cagtaaccac agataatgtg aaaatagaaa
     1801 ttattgatga tgaagaaagt gacatgataa gtaattctga agtagaacaa gtaaactctt
     1861 tcttggatta taagaatgaa gaagtcaaat tcattgaaaa tgagctcgag attcaaaagc
     1921 aaaaatactt taaacttcag acttttgtta gaagcttgat attagcaatg aaagctgatg
     1981 ataaggaaca acagcaggca ctgctgtcag atttacctcc tgaattagag gaaatggatt
     2041 tcaatcatgc ctcactggag cctgatgata cctcattcag tgtatcttct ttgtcagaga
     2101 aaaatgtctc agaaagtttg tgatttcagt tggagggaat atatgataca gtcttttggc
     2161 ttcgtaaca
//