LOCUS       BC049204                2040 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens homeobox B4, mRNA (cDNA clone MGC:54130
            IMAGE:5533346), complete cds.
ACCESSION   BC049204
VERSION     BC049204.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2040)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2040)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J.,
            Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 93 Row: o Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            45580720.
FEATURES             Location/Qualifiers
     source          1..2040
                     /db_xref="H-InvDB:HIT000053359"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:54130 IMAGE:5533346"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2040
                     /gene="HOXB4"
                     /gene_synonym="HOX-2.6"
                     /db_xref="GeneID:3214"
                     /db_xref="HGNC:HGNC:5115"
                     /db_xref="MIM:142965"
     CDS             63..818
                     /gene="HOXB4"
                     /gene_synonym="HOX-2.6"
                     /codon_start=1
                     /product="homeobox B4"
                     /protein_id="AAH49204.1"
                     /db_xref="GeneID:3214"
                     /db_xref="HGNC:HGNC:5115"
                     /db_xref="MIM:142965"
                     /translation="MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRR
                     ESSFQPEAGFGRRAACTVQRYAACRDPGPPPPPPPPPPPPPPPGLSPRAPAPPPAGAL
                     LPEPGQRCEAVSSSPPPPPCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGG
                     EPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWK
                     KDHKLPNTKIRSGGAAGSAGGPPGRPNGGPRAL"
BASE COUNT      473 a    600 c    567 g    400 t
ORIGIN
        1 ggaaaacgag tcaggggtcg gaataaattt tagtatattt tgtgggcaat tcccagaaat
       61 taatggctat gagttctttt ttgatcaact caaactatgt cgaccccaag ttccctccat
      121 gcgaggaata ttcacagagc gattacctac ccagcgacca ctcgcccggg tactacgccg
      181 gcggccagag gcgagagagc agcttccagc cggaggcggg cttcgggcgg cgcgcggcgt
      241 gcaccgtgca gcgctacgcg gcctgccggg accctgggcc cccgccgcct ccgccaccac
      301 ccccgccgcc cccgccaccg cccggtctgt cccctcgggc tcctgcgccg ccacccgccg
      361 gggccctcct cccggagccc ggccagcgct gcgaggcggt cagcagcagc cccccgccgc
      421 ctccctgcgc ccagaacccc ctgcacccca gcccgtccca ctccgcgtgc aaagagcccg
      481 tcgtctaccc ctggatgcgc aaagttcacg tgagcacggt aaaccccaat tacgccggcg
      541 gggagcccaa gcgctctcgg accgcctaca cgcgccagca ggtcttggag ctggagaagg
      601 aatttcacta caaccgctac ctgacacggc gccggagggt ggagatcgcc cacgcgctct
      661 gcctctccga gcgccagatc aagatctggt tccagaaccg gcgcatgaag tggaaaaaag
      721 accacaagtt gcccaacacc aagatccgct cgggtggtgc ggcaggctca gccggagggc
      781 cccctggccg gcccaatgga ggcccccgcg cgctctagtg cccccgcacg cgggagccac
      841 gaacctcggg gtgggggtgg gcagtgagtg caggggatgg ggtgggggga caggaggggg
      901 ccctggggcc tgggccccgg aaaaatctat ctgccctccc ccacacttta tatacgaata
      961 aacgcagaag agggggaggg gaagctttat ttatagaaat gacaatagag ggccacgggg
     1021 aggccccccc agaagcaaga ttcaaatctc ttgctttctt tcttaaaaaa aagaaaaaga
     1081 aaaagcaaga agaaggaaga aagaaaaaga cagaaagaga aataggagga ggctgcagct
     1141 cctcgttttc agctttggcg aagatggatc cacgtttcat ctttaatcac gccaggtcca
     1201 ggcccatctg tcttgtttcc tctgccgagg agaagacggg cctcggtggc gaccattacc
     1261 tcgacacccg ctaacaaatg aggcccggct cggccgcctc cgcctctgct actgccgctg
     1321 ctggaagaca gcctggattt cctttctttg tcccccactc ccgataccca gcgaaagcac
     1381 cctctgactg ccagatagtg cagtgttttg gtcacggtaa cacacacaca ctctccctca
     1441 tctttcgtgc ccattcactg agggccagaa tgactgctca cccacttcca ccgtggggtt
     1501 gggggtgggc aacagaggag gggagcaagt agggaagggg gtggccttga caactcagga
     1561 gtgagcaggg aaattgagtc caaggaaaaa gagagactca gagacccggg agggccttcc
     1621 tctgaaaggc caagccaagc catgcttggc agggtgaggg gccagttgag ttctgggagc
     1681 tgggcactac tctgccagtc cagagttgta cagcagaagc ctctctccta gactgaaaat
     1741 gaatgtgaaa ctaggaaata aaatgtgccc ctcccagtct gggaggagga tgttgcagag
     1801 ccctctccca tagtttatta tgttgcatcg tttattatta ttattgataa tattattatt
     1861 actatttttt tgtgtcatgt gagtcctctc tccttttctc tttctgacat tccaaaacca
     1921 ggccccttcc tacctctggg gctgcttgag tctagaaccc ttcgtatgtg tgaatatctg
     1981 tgtgctgtac agagtgacaa tagaaataaa tgtttggttt cttgtgaaaa aaaaaaaaaa
//