LOCUS BC049204 2040 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens homeobox B4, mRNA (cDNA clone MGC:54130
IMAGE:5533346), complete cds.
ACCESSION BC049204
VERSION BC049204.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2040)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2040)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (21-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 93 Row: o Column: 6
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 45580720.
FEATURES Location/Qualifiers
source 1..2040
/db_xref="H-InvDB:HIT000053359"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:54130 IMAGE:5533346"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2040
/gene="HOXB4"
/gene_synonym="HOX-2.6"
/db_xref="GeneID:3214"
/db_xref="HGNC:HGNC:5115"
/db_xref="MIM:142965"
CDS 63..818
/gene="HOXB4"
/gene_synonym="HOX-2.6"
/codon_start=1
/product="homeobox B4"
/protein_id="AAH49204.1"
/db_xref="GeneID:3214"
/db_xref="HGNC:HGNC:5115"
/db_xref="MIM:142965"
/translation="MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRR
ESSFQPEAGFGRRAACTVQRYAACRDPGPPPPPPPPPPPPPPPGLSPRAPAPPPAGAL
LPEPGQRCEAVSSSPPPPPCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPNYAGG
EPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMKWK
KDHKLPNTKIRSGGAAGSAGGPPGRPNGGPRAL"
BASE COUNT 473 a 600 c 567 g 400 t
ORIGIN
1 ggaaaacgag tcaggggtcg gaataaattt tagtatattt tgtgggcaat tcccagaaat
61 taatggctat gagttctttt ttgatcaact caaactatgt cgaccccaag ttccctccat
121 gcgaggaata ttcacagagc gattacctac ccagcgacca ctcgcccggg tactacgccg
181 gcggccagag gcgagagagc agcttccagc cggaggcggg cttcgggcgg cgcgcggcgt
241 gcaccgtgca gcgctacgcg gcctgccggg accctgggcc cccgccgcct ccgccaccac
301 ccccgccgcc cccgccaccg cccggtctgt cccctcgggc tcctgcgccg ccacccgccg
361 gggccctcct cccggagccc ggccagcgct gcgaggcggt cagcagcagc cccccgccgc
421 ctccctgcgc ccagaacccc ctgcacccca gcccgtccca ctccgcgtgc aaagagcccg
481 tcgtctaccc ctggatgcgc aaagttcacg tgagcacggt aaaccccaat tacgccggcg
541 gggagcccaa gcgctctcgg accgcctaca cgcgccagca ggtcttggag ctggagaagg
601 aatttcacta caaccgctac ctgacacggc gccggagggt ggagatcgcc cacgcgctct
661 gcctctccga gcgccagatc aagatctggt tccagaaccg gcgcatgaag tggaaaaaag
721 accacaagtt gcccaacacc aagatccgct cgggtggtgc ggcaggctca gccggagggc
781 cccctggccg gcccaatgga ggcccccgcg cgctctagtg cccccgcacg cgggagccac
841 gaacctcggg gtgggggtgg gcagtgagtg caggggatgg ggtgggggga caggaggggg
901 ccctggggcc tgggccccgg aaaaatctat ctgccctccc ccacacttta tatacgaata
961 aacgcagaag agggggaggg gaagctttat ttatagaaat gacaatagag ggccacgggg
1021 aggccccccc agaagcaaga ttcaaatctc ttgctttctt tcttaaaaaa aagaaaaaga
1081 aaaagcaaga agaaggaaga aagaaaaaga cagaaagaga aataggagga ggctgcagct
1141 cctcgttttc agctttggcg aagatggatc cacgtttcat ctttaatcac gccaggtcca
1201 ggcccatctg tcttgtttcc tctgccgagg agaagacggg cctcggtggc gaccattacc
1261 tcgacacccg ctaacaaatg aggcccggct cggccgcctc cgcctctgct actgccgctg
1321 ctggaagaca gcctggattt cctttctttg tcccccactc ccgataccca gcgaaagcac
1381 cctctgactg ccagatagtg cagtgttttg gtcacggtaa cacacacaca ctctccctca
1441 tctttcgtgc ccattcactg agggccagaa tgactgctca cccacttcca ccgtggggtt
1501 gggggtgggc aacagaggag gggagcaagt agggaagggg gtggccttga caactcagga
1561 gtgagcaggg aaattgagtc caaggaaaaa gagagactca gagacccggg agggccttcc
1621 tctgaaaggc caagccaagc catgcttggc agggtgaggg gccagttgag ttctgggagc
1681 tgggcactac tctgccagtc cagagttgta cagcagaagc ctctctccta gactgaaaat
1741 gaatgtgaaa ctaggaaata aaatgtgccc ctcccagtct gggaggagga tgttgcagag
1801 ccctctccca tagtttatta tgttgcatcg tttattatta ttattgataa tattattatt
1861 actatttttt tgtgtcatgt gagtcctctc tccttttctc tttctgacat tccaaaacca
1921 ggccccttcc tacctctggg gctgcttgag tctagaaccc ttcgtatgtg tgaatatctg
1981 tgtgctgtac agagtgacaa tagaaataaa tgtttggttt cttgtgaaaa aaaaaaaaaa
//