LOCUS       BC024201                3088 bp    mRNA    linear   HUM 11-SEP-2007
DEFINITION  Homo sapiens prospero homeobox 1, mRNA (cDNA clone MGC:3668
            IMAGE:3532312), complete cds.
ACCESSION   BC024201
VERSION     BC024201.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3088)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3088)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (19-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC024201.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 11 Row: n Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34147628.
FEATURES             Location/Qualifiers
     source          1..3088
                     /db_xref="H-InvDB:HIT000039766"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3668 IMAGE:3532312"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3088
                     /gene="PROX1"
                     /db_xref="GeneID:5629"
                     /db_xref="HGNC:HGNC:9459"
                     /db_xref="MIM:601546"
     CDS             273..2486
                     /gene="PROX1"
                     /codon_start=1
                     /product="prospero homeobox 1"
                     /protein_id="AAH24201.1"
                     /db_xref="GeneID:5629"
                     /db_xref="HGNC:HGNC:9459"
                     /db_xref="MIM:601546"
                     /translation="MPDHDSTALLSRQTKRRRVDIGVKRTVGTASAFFAKARATFFSA
                     MNPQGSEQDVEYSVVQHADGEKSNVLRKLLKRANSYEDAMMPFPGATIISQLLKNNMN
                     KNGGTEPSFQASGLSSTGSEVHQEDICSNSSRDSPPECLSPFGRPTMSQFDMDRLCDE
                     HLRAKRARVENIIRGMSHSPSVALRGNENEREMAPQSVSPRESYRENKRKQKLPQQQQ
                     QSFQQLVSARKEQKREERRQLKQQLEDMQKQLRQLQEKFYQIYDSTDSENDEDGNLSE
                     DSMRSEILDARAQDSVGRSDNEMCELDPGQFIDRARALIREQEMAENKPKREGNNKER
                     DHGPNSLQPEGKHLAETLKQELNTAMSQVVDTVVKVFSAKPSRQVPQVFPPLQIPQAR
                     FAVNGENHNFHTANQRLQCFGDVIIPNPLDTFGNVQMASSTDQTEALPLVVRKNSSDQ
                     SASGPAAGGHHQPLHQSPLSATTGFTTSTFRHPFPLPLMAYPFQSPLGAPSGSFSGKD
                     RASPESLDLTRDTTSLRTKMSSHHLSHHPCSPAHPPSTAEGLSLSLIKSECGDLQDMS
                     EISPYSGSAMQEGLSPNHLKKAKLMFFYTRYPSSNMLKTYFSDVKFNRCITSQLIKWF
                     SNFREFYYIQMEKYARQAINDGVTSTEELSITRDCELYRALNMHYNKANDFEVPERFL
                     EVAQITLREFFNAIIAGKDVDPSWKKAIYKVICKLDSEVPEIFKSPNCLQELLHE"
BASE COUNT          876 a          772 c          700 g          740 t
ORIGIN      
        1 ccccttttcc agaatcactt gcactgtctt gttcttgaat gagaaaggaa gaaaagagcc
       61 tcccattact cagacccgtg taaacattat tccccccagg agaaaatggt gttattcaaa
      121 tgaatcataa taaaatagcc tctaaacagt ttctaagcgg gagcctccgt ggaactcagc
      181 gctccgctcc tcccagttcc taagaggtcc cgggattctt gagctgtgcc cagctgacga
      241 gcttttgaag atggcacaat aaccgtccag tgatgcctga ccatgacagc acagccctct
      301 taagccggca aaccaagagg agaagagttg acattggagt gaaaaggacg gtagggacag
      361 catctgcatt ttttgctaag gcaagagcaa cgttttttag tgccatgaat ccccaaggtt
      421 ctgagcagga tgttgagtat tcagtggtgc agcatgcaga tggggaaaag tcaaatgtac
      481 tccgcaagct gctgaagagg gcgaactcgt atgaagatgc catgatgcct tttccaggag
      541 caaccataat ttcccagctg ttgaaaaata acatgaacaa aaatggtggc acggagccca
      601 gtttccaagc cagcggtctc tctagtacag gctccgaagt acatcaggag gatatatgca
      661 gcaactcttc aagagacagc cccccagagt gtctttcccc ttttggcagg cctactatga
      721 gccagtttga tatggatcgc ttatgtgatg agcacctgag agcaaagcgc gcccgggttg
      781 agaatataat tcggggtatg agccattccc ccagtgtggc attaaggggc aatgaaaatg
      841 aaagagagat ggccccgcag tctgtgagtc cccgagaaag ttacagagaa aacaaacgca
      901 agcaaaagct tccccagcag cagcaacaga gtttccagca gctggtttca gcccgaaaag
      961 aacagaagcg agaggagcgc cgacagctga aacagcagct ggaggacatg cagaaacagc
     1021 tgcgccagct gcaggaaaag ttctaccaaa tctatgacag cactgattcg gaaaatgatg
     1081 aagatggtaa cctgtctgaa gacagcatgc gctcggagat cctggatgcc agggcccagg
     1141 actctgtcgg aaggtcagat aatgagatgt gcgagctaga cccaggacag tttattgacc
     1201 gagctcgagc cctgatcaga gagcaggaaa tggctgaaaa caagccgaag cgagaaggca
     1261 acaacaaaga aagagaccat gggccaaact ccttacaacc ggaaggcaaa catttggctg
     1321 agaccttgaa acaggaactg aacactgcca tgtcgcaagt tgtggacact gtggtcaaag
     1381 tcttttcggc caagccctcc cgccaggttc ctcaggtctt cccacctctc cagatccccc
     1441 aggccagatt tgcagtcaat ggggaaaacc acaatttcca caccgccaac cagcgcctgc
     1501 agtgctttgg cgacgtcatc attccgaacc ccctggacac ctttggcaat gtgcagatgg
     1561 ccagttccac tgaccagaca gaagcactgc ccctggttgt ccgcaaaaac tcctctgacc
     1621 agtctgcctc cggccctgcc gctggcggcc accaccagcc cctgcaccag tcgcctctct
     1681 ctgccaccac gggcttcacc acgtccacct tccgccaccc cttccccctt cccttgatgg
     1741 cctatccatt tcagagccca ttaggtgctc cctccggctc cttctctgga aaagacagag
     1801 cctctcctga atccttagac ttaactaggg ataccacgag tctgaggacc aagatgtcat
     1861 ctcaccacct gagccaccac ccttgttcac cagcacaccc gcccagcacc gccgaagggc
     1921 tctccttgtc gctcataaag tccgagtgcg gcgatcttca agatatgtct gaaatatcac
     1981 cttattcggg aagtgcaatg caggaaggat tgtcacccaa tcacttgaaa aaagcaaagc
     2041 tcatgttttt ttatacccgt tatcccagct ccaatatgct gaagacctac ttctccgacg
     2101 taaagttcaa cagatgcatt acctctcagc tcatcaagtg gtttagcaat ttccgtgagt
     2161 tttactacat tcagatggag aagtacgcac gtcaagccat caacgatggg gtcaccagta
     2221 ctgaagagct gtctataacc agagactgtg agctgtacag ggctctgaac atgcactaca
     2281 ataaagcaaa tgactttgag gttccagaga gattcctgga agttgctcag atcacattac
     2341 gggagttttt caatgccatt atcgcaggca aagatgttga tccttcctgg aagaaggcca
     2401 tatacaaggt catctgcaag ctggatagtg aagtccctga gattttcaaa tccccgaact
     2461 gcctacaaga gctgcttcat gagtagaaat ttcaacaact ctttttgaat gtatgaagag
     2521 tagcagtccc ctttggatgt ccaagttata tgtgtctaga ttttgatttc atatatatgt
     2581 gtatgggagg catggatatg ttatgaaatc agctggtaat tcctcctcat cacgtttctc
     2641 tcattttctt ttgttttcca ttgcaagggg atggttgttt tctttctgcc tttagtttgc
     2701 ttttgcccaa ggcccttaac atttggacac ttaaaatagg gttaattttc agggaaaaag
     2761 aatgttggcg tgtgtaaagt ctctattagc aatgaaggga atttgttaac gatgcatcca
     2821 cttgattgat gacttattgc aaatggcggt tggctgagga aaacccatga cacagcacaa
     2881 ctctacagac agtgatgtgt ctcttgtttc tactgctaag aaggtctgaa aatttaatga
     2941 aaccacttca tacatttaag tattttgttt ggtttgaact caatcagtag cttttcctta
     3001 catgtttaaa aataattcca atgacagatg agcagctcac ttttccaaag taccccaaaa
     3061 ggccaaatta aaaaaaaaaa aaaaaaaa
//