LOCUS BC024201 3088 bp mRNA linear HUM 11-SEP-2007 DEFINITION Homo sapiens prospero homeobox 1, mRNA (cDNA clone MGC:3668 IMAGE:3532312), complete cds. ACCESSION BC024201 VERSION BC024201.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3088) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3088) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (19-FEB-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC024201.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 11 Row: n Column: 11 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 34147628. FEATURES Location/Qualifiers source 1..3088 /db_xref="H-InvDB:HIT000039766" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:3668 IMAGE:3532312" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3088 /gene="PROX1" /db_xref="GeneID:5629" /db_xref="HGNC:HGNC:9459" /db_xref="MIM:601546" CDS 273..2486 /gene="PROX1" /codon_start=1 /product="prospero homeobox 1" /protein_id="AAH24201.1" /db_xref="GeneID:5629" /db_xref="HGNC:HGNC:9459" /db_xref="MIM:601546" /translation="MPDHDSTALLSRQTKRRRVDIGVKRTVGTASAFFAKARATFFSA MNPQGSEQDVEYSVVQHADGEKSNVLRKLLKRANSYEDAMMPFPGATIISQLLKNNMN KNGGTEPSFQASGLSSTGSEVHQEDICSNSSRDSPPECLSPFGRPTMSQFDMDRLCDE HLRAKRARVENIIRGMSHSPSVALRGNENEREMAPQSVSPRESYRENKRKQKLPQQQQ QSFQQLVSARKEQKREERRQLKQQLEDMQKQLRQLQEKFYQIYDSTDSENDEDGNLSE DSMRSEILDARAQDSVGRSDNEMCELDPGQFIDRARALIREQEMAENKPKREGNNKER DHGPNSLQPEGKHLAETLKQELNTAMSQVVDTVVKVFSAKPSRQVPQVFPPLQIPQAR FAVNGENHNFHTANQRLQCFGDVIIPNPLDTFGNVQMASSTDQTEALPLVVRKNSSDQ SASGPAAGGHHQPLHQSPLSATTGFTTSTFRHPFPLPLMAYPFQSPLGAPSGSFSGKD RASPESLDLTRDTTSLRTKMSSHHLSHHPCSPAHPPSTAEGLSLSLIKSECGDLQDMS EISPYSGSAMQEGLSPNHLKKAKLMFFYTRYPSSNMLKTYFSDVKFNRCITSQLIKWF SNFREFYYIQMEKYARQAINDGVTSTEELSITRDCELYRALNMHYNKANDFEVPERFL EVAQITLREFFNAIIAGKDVDPSWKKAIYKVICKLDSEVPEIFKSPNCLQELLHE" BASE COUNT 876 a 772 c 700 g 740 t ORIGIN 1 ccccttttcc agaatcactt gcactgtctt gttcttgaat gagaaaggaa gaaaagagcc 61 tcccattact cagacccgtg taaacattat tccccccagg agaaaatggt gttattcaaa 121 tgaatcataa taaaatagcc tctaaacagt ttctaagcgg gagcctccgt ggaactcagc 181 gctccgctcc tcccagttcc taagaggtcc cgggattctt gagctgtgcc cagctgacga 241 gcttttgaag atggcacaat aaccgtccag tgatgcctga ccatgacagc acagccctct 301 taagccggca aaccaagagg agaagagttg acattggagt gaaaaggacg gtagggacag 361 catctgcatt ttttgctaag gcaagagcaa cgttttttag tgccatgaat ccccaaggtt 421 ctgagcagga tgttgagtat tcagtggtgc agcatgcaga tggggaaaag tcaaatgtac 481 tccgcaagct gctgaagagg gcgaactcgt atgaagatgc catgatgcct tttccaggag 541 caaccataat ttcccagctg ttgaaaaata acatgaacaa aaatggtggc acggagccca 601 gtttccaagc cagcggtctc tctagtacag gctccgaagt acatcaggag gatatatgca 661 gcaactcttc aagagacagc cccccagagt gtctttcccc ttttggcagg cctactatga 721 gccagtttga tatggatcgc ttatgtgatg agcacctgag agcaaagcgc gcccgggttg 781 agaatataat tcggggtatg agccattccc ccagtgtggc attaaggggc aatgaaaatg 841 aaagagagat ggccccgcag tctgtgagtc cccgagaaag ttacagagaa aacaaacgca 901 agcaaaagct tccccagcag cagcaacaga gtttccagca gctggtttca gcccgaaaag 961 aacagaagcg agaggagcgc cgacagctga aacagcagct ggaggacatg cagaaacagc 1021 tgcgccagct gcaggaaaag ttctaccaaa tctatgacag cactgattcg gaaaatgatg 1081 aagatggtaa cctgtctgaa gacagcatgc gctcggagat cctggatgcc agggcccagg 1141 actctgtcgg aaggtcagat aatgagatgt gcgagctaga cccaggacag tttattgacc 1201 gagctcgagc cctgatcaga gagcaggaaa tggctgaaaa caagccgaag cgagaaggca 1261 acaacaaaga aagagaccat gggccaaact ccttacaacc ggaaggcaaa catttggctg 1321 agaccttgaa acaggaactg aacactgcca tgtcgcaagt tgtggacact gtggtcaaag 1381 tcttttcggc caagccctcc cgccaggttc ctcaggtctt cccacctctc cagatccccc 1441 aggccagatt tgcagtcaat ggggaaaacc acaatttcca caccgccaac cagcgcctgc 1501 agtgctttgg cgacgtcatc attccgaacc ccctggacac ctttggcaat gtgcagatgg 1561 ccagttccac tgaccagaca gaagcactgc ccctggttgt ccgcaaaaac tcctctgacc 1621 agtctgcctc cggccctgcc gctggcggcc accaccagcc cctgcaccag tcgcctctct 1681 ctgccaccac gggcttcacc acgtccacct tccgccaccc cttccccctt cccttgatgg 1741 cctatccatt tcagagccca ttaggtgctc cctccggctc cttctctgga aaagacagag 1801 cctctcctga atccttagac ttaactaggg ataccacgag tctgaggacc aagatgtcat 1861 ctcaccacct gagccaccac ccttgttcac cagcacaccc gcccagcacc gccgaagggc 1921 tctccttgtc gctcataaag tccgagtgcg gcgatcttca agatatgtct gaaatatcac 1981 cttattcggg aagtgcaatg caggaaggat tgtcacccaa tcacttgaaa aaagcaaagc 2041 tcatgttttt ttatacccgt tatcccagct ccaatatgct gaagacctac ttctccgacg 2101 taaagttcaa cagatgcatt acctctcagc tcatcaagtg gtttagcaat ttccgtgagt 2161 tttactacat tcagatggag aagtacgcac gtcaagccat caacgatggg gtcaccagta 2221 ctgaagagct gtctataacc agagactgtg agctgtacag ggctctgaac atgcactaca 2281 ataaagcaaa tgactttgag gttccagaga gattcctgga agttgctcag atcacattac 2341 gggagttttt caatgccatt atcgcaggca aagatgttga tccttcctgg aagaaggcca 2401 tatacaaggt catctgcaag ctggatagtg aagtccctga gattttcaaa tccccgaact 2461 gcctacaaga gctgcttcat gagtagaaat ttcaacaact ctttttgaat gtatgaagag 2521 tagcagtccc ctttggatgt ccaagttata tgtgtctaga ttttgatttc atatatatgt 2581 gtatgggagg catggatatg ttatgaaatc agctggtaat tcctcctcat cacgtttctc 2641 tcattttctt ttgttttcca ttgcaagggg atggttgttt tctttctgcc tttagtttgc 2701 ttttgcccaa ggcccttaac atttggacac ttaaaatagg gttaattttc agggaaaaag 2761 aatgttggcg tgtgtaaagt ctctattagc aatgaaggga atttgttaac gatgcatcca 2821 cttgattgat gacttattgc aaatggcggt tggctgagga aaacccatga cacagcacaa 2881 ctctacagac agtgatgtgt ctcttgtttc tactgctaag aaggtctgaa aatttaatga 2941 aaccacttca tacatttaag tattttgttt ggtttgaact caatcagtag cttttcctta 3001 catgtttaaa aataattcca atgacagatg agcagctcac ttttccaaag taccccaaaa 3061 ggccaaatta aaaaaaaaaa aaaaaaaa //