LOCUS       BC006511                3110 bp    mRNA    linear   HUM 16-SEP-2003
DEFINITION  Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone
            IMAGE:3010441), partial cds.
ACCESSION   BC006511
VERSION     BC006511.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3110)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3110)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC006511.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 3 Row: l Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: GenomeScan gene
            prediction.
FEATURES             Location/Qualifiers
     source          1..3110
                     /db_xref="H-InvDB:HIT000086683"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3010441"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            <1..>3110
                     /gene="C14orf43"
                     /gene_synonym="c14_5541"
                     /db_xref="GeneID:91748"
     CDS             <1..>3110
                     /gene="C14orf43"
                     /gene_synonym="c14_5541"
                     /codon_start=2
                     /product="C14orf43 protein"
                     /protein_id="AAH06511.1"
                     /db_xref="GeneID:91748"
                     /translation="GSHRQGSVGHIMNLQAQPKAQNKRKRCLFGGQEPAPKEQPPPLQ
                     PPQQSIRVKEEQYLGHEGPGGAVSTSQPVELPPPSSLALLNSVVYGPERTSAAMLSQQ
                     VASVKWPNSVMAPGRGPERGGGGGVSDSSWQQQPGQPPPHSTWNCHSLSLYSATKGSP
                     HPGVGVPTYYNHPEALKREKAGGPQLDRYVRPMMPQKVQLEVGRPQAPLNSFHAAKKP
                     PNQSLPLQPFQLAFGHQVNRQVFRQGPPPPNPVAAFPPQKQQQQQQPQQQQQQQQAAL
                     PQMPLFENFYSMPQQPSQQPQDFGLQPAGPLGQSHLAHHSMAPYPFPPNPDMNPELRK
                     ALLQDSAPQPALPQVQIPFPRRSRRLSKEGILPPSALDGAGTQPGQEATGNLFLHHWP
                     LQQPPPGSLGQPHPEALGFPLELRESQLLPDGERLAPNGREREAPAMGSEEGMRAVST
                     GDCGQVLRGGVIQSTRRRRRASQEANLLTLAQKAVELASLQNAKDGSGSEEKRKSVLA
                     STTKCGVEFSEPSLATKRAREDSGMVPLIIPVSVPVRTVDPTEAAQAGGLDEDGKGPE
                     QNPAEHKPSVIVTRRRSTRIPGTDAQAQAEDMNVKLEGEPSVRKPKQRPRPEPLIIPT
                     KAGTFIAPPVYSNITPYQSHLRSPVRLADHPSERSFELPPYTPPPILSPVREGSGLYF
                     NAIISTSTIPAPPPITPKSAHRTLLRTNSAEVTPPVLSVMGEATPVSIEPRINVGSRF
                     QAEIPLMRDRALAAADPHKADLVWQPWEDLESSREKQRQVEDLLTAACSSIFPGAGTN
                     QELALHCLHESRGDILETLNKLLLKKPLRPHNHPLATYHYTGSDQWKMAERKLFNKGI
                     AIYKKDFFLVQKLIQTKTVAQCVEFYYTYKKQVKIGRNGTLTFGDVDTSDEKSAQEEV
                     EVDIKTSQKFPRVPLPRRESPSEERLEPKREVKEPRKEGEEEVPEIQEKEEQEEGRER
                     SRRAAAVKATQTLQANESASDILILRSHESNAPGSAGGQASEKPREGTGKSRRALPFS
                     EKKKKK"
     misc_feature    2531..2662
                     /gene="C14orf43"
                     /gene_synonym="c14_5541"
                     /note="SANT; Region: SANT SWI3, ADA2, N-CoR and TFIIIB''
                     DNA-binding domains"
                     /db_xref="CDD:smart00717"
BASE COUNT          715 a         1004 c          914 g          477 t
ORIGIN      
        1 gggatctcac cgccagggct cagttggcca catcatgaac ctccaggccc agcccaaggc
       61 tcagaacaag cggaagcgtt gcctcttcgg gggccaggaa ccagctccca aggagcagcc
      121 ccctcccctg cagccccccc agcagtccat cagagtgaag gaggagcagt acctcgggca
      181 cgagggtcca ggaggggcag tctccacctc tcagcctgtg gaactgcccc ctcctagcag
      241 cctggccctg ctgaactctg tggtgtatgg gcctgagcgg acctcagcag ccatgctgtc
      301 ccagcaggtg gcctcagtaa agtggcccaa ctctgtgatg gctccagggc ggggcccgga
      361 gcgtggagga ggtgggggtg tcagtgacag cagctggcag cagcagccag gccagcctcc
      421 accccattca acatggaact gccacagtct gtccctctac agtgcaacca aggggagccc
      481 gcatcctgga gtgggagtcc cgacttacta taaccaccct gaggcactga agcgggagaa
      541 agcggggggc ccacagctgg accgctatgt gcgaccaatg atgccacaga aggtgcagct
      601 ggaggtaggg cggccccagg cacccctgaa ttctttccac gcagccaaga aacccccaaa
      661 ccagtcactg cccctgcaac ccttccagct ggcattcggc caccaggtga accggcaggt
      721 cttccggcag ggcccaccgc ccccaaaccc ggtggctgcc ttccctccac agaagcagca
      781 gcagcagcag caaccacagc agcagcagca gcagcagcag gcagccctac cccagatgcc
      841 gctctttgag aacttctatt ccatgccgca gcaaccctcg cagcaacccc aggactttgg
      901 cctgcagcca gctgggccac tgggacagtc ccacctggct caccacagca tggcacccta
      961 ccccttcccc cccaacccag atatgaaccc agaactgcgc aaggcccttc tgcaggactc
     1021 agccccgcag ccagcgctac ctcaggtcca gatccccttc ccccgccgct cccgccgcct
     1081 ctctaaggag ggtatcctgc ctcccagcgc cctggatggg gctggcaccc agcctgggca
     1141 ggaggccact ggcaacctgt tcctacatca ctggcccctg cagcagccgc cacctggctc
     1201 cctggggcag ccccatcctg aagctctggg attcccgctg gagctgaggg agtcgcagct
     1261 actgcctgat ggggagagac tagcacccaa tggccgggag cgagaggctc ctgccatggg
     1321 cagcgaggag ggcatgaggg cagtgagcac aggggactgt gggcaggtgc tacggggcgg
     1381 agtgatccag agcacgcgac ggaggcgccg ggcatcccag gaggccaatt tgctgaccct
     1441 ggcccagaag gctgtggagc tggcctcact gcagaatgca aaggatggca gtggttctga
     1501 agagaagcgg aaaagtgtat tggcctcaac taccaagtgt ggggtggagt tttctgagcc
     1561 ttccttagcc accaagcgag cacgagaaga cagtgggatg gtacccctca tcatcccagt
     1621 gtctgtgcct gtgcgaactg tggacccaac tgaggcagcc caggctggag gtcttgatga
     1681 ggacgggaag ggtcctgaac agaaccctgc tgagcacaag ccatcagtca tcgtcacccg
     1741 caggcggtcc acccgaatcc ccgggacaga tgctcaagct caggcggagg acatgaatgt
     1801 caagttggag ggggagcctt ccgtgcggaa accaaagcag cggcccaggc ccgagcccct
     1861 catcatcccc accaaggcgg gcactttcat cgcccctccc gtctactcca acatcacccc
     1921 ataccagagc cacctgcgct ctcccgtgcg cctagctgac cacccctctg agcggagctt
     1981 tgagctacct ccctacacgc cgccccccat cctcagccct gtgcgggaag gctctggcct
     2041 ctacttcaat gccatcatat caaccagcac catccctgcc cctcctccca tcacgcctaa
     2101 gagtgcccat cgcacgctgc tccggactaa cagtgctgaa gtaaccccgc ctgtcctctc
     2161 tgtgatgggg gaggccaccc cagtgagcat cgagccacgg atcaacgtgg gctcccggtt
     2221 ccaggcagaa atccccttga tgagggaccg tgccctggca gctgcagatc cccacaaggc
     2281 tgacttggtg tggcagccat gggaggacct agagagcagc cgggagaagc agaggcaagt
     2341 ggaagacctg ctgacagccg cctgctccag cattttccct ggtgctggca ccaaccagga
     2401 gctggccctg cactgtctgc acgaatccag aggagacatc ctggaaacgc tgaataagct
     2461 gctgctgaag aagcccctgc ggccccacaa ccatccgctg gcaacttatc actacacagg
     2521 ctctgaccag tggaagatgg ccgagaggaa gctgttcaac aaaggcattg ccatctacaa
     2581 gaaggatttc ttcctggtgc agaagctgat ccagaccaag accgtggccc agtgcgtgga
     2641 gttctactac acctacaaga agcaggtgaa aatcggccgc aatgggactc taacctttgg
     2701 ggatgtggat acgagcgatg agaagtcggc ccaggaagag gttgaagtgg atattaagac
     2761 ttcccaaaag ttcccaaggg tgcctcttcc cagaagagag tccccaagtg aagagaggct
     2821 ggagcccaag agggaggtga aggagcccag gaaggagggg gaggaggagg tgccagagat
     2881 ccaagagaag gaggagcagg aagaggggcg agagcgcagc aggcgggcag cggcagtcaa
     2941 agccacgcag acactacagg ccaatgagtc ggccagtgac atcctcatcc tccggagcca
     3001 cgagtccaac gcccctgggt ctgccggtgg ccaggcctcg gagaagccaa gggaagggac
     3061 agggaagtca cgaagggcac tacctttttc agaaaaaaaa aaaaaaaaaa
//