LOCUS BC101720 2169 bp mRNA linear HUM 17-JUL-2007 DEFINITION Homo sapiens highly divergent homeobox, mRNA (cDNA clone MGC:126769 IMAGE:8069226), complete cds. ACCESSION BC101720 VERSION BC101720.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2169) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2169) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (12-AUG-2005) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Mike Brownstein, NIMH cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRCB Plate: 2 Row: K Column: 16. FEATURES Location/Qualifiers source 1..2169 /db_xref="H-InvDB:HIT000337072" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:126769 IMAGE:8069226" /tissue_type="Lung and heart, PCR rescued clones" /clone_lib="NIH_MGC_315" /note="Vector: pCR4-TOPO with reversed insert; Clone identification sequence tag: TGCACACC" gene 1..2169 /gene="HDX" /gene_synonym="D030011N01Rik" /db_xref="GeneID:139324" /db_xref="HGNC:HGNC:26411" CDS 51..2123 /gene="HDX" /gene_synonym="D030011N01Rik" /codon_start=1 /product="highly divergent homeobox" /protein_id="AAI01721.1" /db_xref="GeneID:139324" /db_xref="HGNC:HGNC:26411" /translation="MNLRSVFTVEQQRILQRYYENGMTNQSKNCFQLILQCAQETKLD FSVVRTWVGNKRRKMSSKNSESGTATTGTSLSAPDITVRNVVNIARPSSQQSSWTSAN NDVIVTGIYSPASSSSRQGTNKHTDTQITEAHKIPIQKTATKNDTEFQLHIPVQRQVA HCKNASLLLGEKTIILSRQTSVLNAGNSVFNHAKKNYGNSSVQASEMTVPQKPSVCHR PCKIEPVGIQRSYKPEHTGPALHNLCGQKPTIRDPYCRTQNLEIREVFSLAVSDYPQR ILGGNAPQKPSSAEGNCLSIAMETGDAEDEYAREEELASMRAQIPSYSRFYESGSSLR AENQSTTLPGPGRNMPNSQMVNIRDMSDNVLYQNRNYHLTPRTSLHTASSTMYSNTNP LRSNFSPHFASSNQLRLSQNQNNYQISGNLTVPWITGCSRKRALQDRTQFSDRDLATL KKYWDNGMTSLGSVCREKIEAVATELNVDCEIVRTWIGNRRRKYRLMGIEVPPPRGGP ADFSEQPESGSLSALTPGEEAGPEVGEDNDRNDEVSICLSEGSSQEEPNEVVPNDARA HKEEDHHAVTTDNVKIEIIDDEESDMISNSEVEQVNSFLDYKNEEVKFIENELEIQKQ KYFKLQTFVRSLILAMKADDKEQQQALLSDLPPELEEMDFNHASLEPDDTSFSVSSLS EKNVSESL" BASE COUNT 743 a 424 c 461 g 541 t ORIGIN 1 ggcctgatct gggttccgct gattcctttc gtaaccgcac cacacccgag atgaatctac 61 gttctgtatt tactgtagaa caacaaagga ttttacagcg ttattatgaa aatggaatga 121 caaatcaaag taaaaattgc tttcagctca tattacagtg tgcacaggag actaagctgg 181 acttcagtgt agtcaggacg tgggttggca ataagagaag aaagatgagt agtaagaact 241 ctgaatctgg aacagcaaca acaggaacct ctttgtcagc tccagacatc acagtcagaa 301 atgtggttaa tattgctcga ccctcaagcc agcagtcttc ttggacatct gccaataatg 361 atgtcattgt aactggtata tacagtccag ccagttcatc aagtaggcaa ggaacaaaca 421 aacatacaga cacacaaatt acagaagcac ataaaatccc tattcagaaa acagccacta 481 aaaatgatac tgagtttcag ttacacattc ctgtccaaag acaagtagca cactgtaaaa 541 atgcttccct actcctaggt gaaaaaacaa ttattttgtc aagacagaca agtgtgctaa 601 atgctggaaa ctcagtattc aatcacgcaa agaaaaacta tggaaactct tcagtacaag 661 cttctgaaat gacagtacct caaaagcctt ctgtgtgcca ccgaccttgt aaaattgaac 721 cagttgggat tcaaaggtca tataagcctg aacacacagg cccagcatta cataacttat 781 gtgggcaaaa gccaactatt agagaccctt actgtagaac acaaaacttg gaaatccgtg 841 aagtgttttc attggcagtt agcgattacc cccagagaat tctgggagga aatgccccac 901 agaagcctag ctcagcagaa ggaaattgtt tgtccattgc aatggagact ggagatgctg 961 aggatgaata tgccagagag gaagagctgg catcgatgag agcacagata ccaagctatt 1021 cgagatttta tgaaagtggc agttcccttc gagctgagaa ccaaagtaca accttgcccg 1081 gaccaggaag aaatatgcca aattcacaaa tggtgaatat tagagatatg tcagacaatg 1141 tactgtatca aaacagaaac taccatttga caccacggac ctcattacat acagcatcta 1201 gtacaatgta cagtaatacc aatccattac ggagtaattt ttctcctcat tttgcatcat 1261 caaaccaatt gagattatca caaaaccaaa acaattacca gatttcagga aaccttactg 1321 tgccttggat tacagggtgt tctagaaaaa gagcactaca ggaccgcact cagttcagtg 1381 accgagactt agccaccctt aagaagtatt gggacaatgg catgaccagc ctgggctctg 1441 tttgtagaga gaaaattgaa gctgtggcaa ctgaattaaa tgttgactgt gaaatagttc 1501 ggacttggat tgggaatcga agaaggaaat atcgtttaat ggggattgaa gttccacctc 1561 caagaggagg ccctgctgat ttctctgagc agcctgagtc tggttcttta tctgcactca 1621 caccaggaga ggaagctggg cctgaagtag gagaggataa tgacagaaat gatgaagtat 1681 ccatctgttt gtctgaagga agctctcaag aagagcccaa tgaagttgtt ccgaatgatg 1741 caagggctca taaggaagag gaccaccatg cagtaaccac agataatgtg aaaatagaaa 1801 ttattgatga tgaagaaagt gacatgataa gtaattctga agtagaacaa gtaaactctt 1861 tcttggatta taagaatgaa gaagtcaaat tcattgaaaa tgagctcgag attcaaaagc 1921 aaaaatactt taaacttcag acttttgtta gaagcttgat attagcaatg aaagctgatg 1981 ataaggaaca acagcaggca ctgctgtcag atttacctcc tgaattagag gaaatggatt 2041 tcaatcatgc ctcactggag cctgatgata cctcattcag tgtatcttct ttgtcagaga 2101 aaaatgtctc agaaagtttg tgatttcagt tggagggaat atatgataca gtcttttggc 2161 ttcgtaaca //