LOCUS BC006511 3110 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:3010441), partial cds. ACCESSION BC006511 VERSION BC006511.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3110) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3110) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (17-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC006511.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 3 Row: l Column: 4 This clone was selected for full length sequencing because it passed the following selection criteria: GenomeScan gene prediction. FEATURES Location/Qualifiers source 1..3110 /db_xref="H-InvDB:HIT000086683" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3010441" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..>3110 /gene="C14orf43" /gene_synonym="c14_5541" /db_xref="GeneID:91748" CDS <1..>3110 /gene="C14orf43" /gene_synonym="c14_5541" /codon_start=2 /product="C14orf43 protein" /protein_id="AAH06511.1" /db_xref="GeneID:91748" /translation="GSHRQGSVGHIMNLQAQPKAQNKRKRCLFGGQEPAPKEQPPPLQ PPQQSIRVKEEQYLGHEGPGGAVSTSQPVELPPPSSLALLNSVVYGPERTSAAMLSQQ VASVKWPNSVMAPGRGPERGGGGGVSDSSWQQQPGQPPPHSTWNCHSLSLYSATKGSP HPGVGVPTYYNHPEALKREKAGGPQLDRYVRPMMPQKVQLEVGRPQAPLNSFHAAKKP PNQSLPLQPFQLAFGHQVNRQVFRQGPPPPNPVAAFPPQKQQQQQQPQQQQQQQQAAL PQMPLFENFYSMPQQPSQQPQDFGLQPAGPLGQSHLAHHSMAPYPFPPNPDMNPELRK ALLQDSAPQPALPQVQIPFPRRSRRLSKEGILPPSALDGAGTQPGQEATGNLFLHHWP LQQPPPGSLGQPHPEALGFPLELRESQLLPDGERLAPNGREREAPAMGSEEGMRAVST GDCGQVLRGGVIQSTRRRRRASQEANLLTLAQKAVELASLQNAKDGSGSEEKRKSVLA STTKCGVEFSEPSLATKRAREDSGMVPLIIPVSVPVRTVDPTEAAQAGGLDEDGKGPE QNPAEHKPSVIVTRRRSTRIPGTDAQAQAEDMNVKLEGEPSVRKPKQRPRPEPLIIPT KAGTFIAPPVYSNITPYQSHLRSPVRLADHPSERSFELPPYTPPPILSPVREGSGLYF NAIISTSTIPAPPPITPKSAHRTLLRTNSAEVTPPVLSVMGEATPVSIEPRINVGSRF QAEIPLMRDRALAAADPHKADLVWQPWEDLESSREKQRQVEDLLTAACSSIFPGAGTN QELALHCLHESRGDILETLNKLLLKKPLRPHNHPLATYHYTGSDQWKMAERKLFNKGI AIYKKDFFLVQKLIQTKTVAQCVEFYYTYKKQVKIGRNGTLTFGDVDTSDEKSAQEEV EVDIKTSQKFPRVPLPRRESPSEERLEPKREVKEPRKEGEEEVPEIQEKEEQEEGRER SRRAAAVKATQTLQANESASDILILRSHESNAPGSAGGQASEKPREGTGKSRRALPFS EKKKKK" misc_feature 2531..2662 /gene="C14orf43" /gene_synonym="c14_5541" /note="SANT; Region: SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains" /db_xref="CDD:smart00717" BASE COUNT 715 a 1004 c 914 g 477 t ORIGIN 1 gggatctcac cgccagggct cagttggcca catcatgaac ctccaggccc agcccaaggc 61 tcagaacaag cggaagcgtt gcctcttcgg gggccaggaa ccagctccca aggagcagcc 121 ccctcccctg cagccccccc agcagtccat cagagtgaag gaggagcagt acctcgggca 181 cgagggtcca ggaggggcag tctccacctc tcagcctgtg gaactgcccc ctcctagcag 241 cctggccctg ctgaactctg tggtgtatgg gcctgagcgg acctcagcag ccatgctgtc 301 ccagcaggtg gcctcagtaa agtggcccaa ctctgtgatg gctccagggc ggggcccgga 361 gcgtggagga ggtgggggtg tcagtgacag cagctggcag cagcagccag gccagcctcc 421 accccattca acatggaact gccacagtct gtccctctac agtgcaacca aggggagccc 481 gcatcctgga gtgggagtcc cgacttacta taaccaccct gaggcactga agcgggagaa 541 agcggggggc ccacagctgg accgctatgt gcgaccaatg atgccacaga aggtgcagct 601 ggaggtaggg cggccccagg cacccctgaa ttctttccac gcagccaaga aacccccaaa 661 ccagtcactg cccctgcaac ccttccagct ggcattcggc caccaggtga accggcaggt 721 cttccggcag ggcccaccgc ccccaaaccc ggtggctgcc ttccctccac agaagcagca 781 gcagcagcag caaccacagc agcagcagca gcagcagcag gcagccctac cccagatgcc 841 gctctttgag aacttctatt ccatgccgca gcaaccctcg cagcaacccc aggactttgg 901 cctgcagcca gctgggccac tgggacagtc ccacctggct caccacagca tggcacccta 961 ccccttcccc cccaacccag atatgaaccc agaactgcgc aaggcccttc tgcaggactc 1021 agccccgcag ccagcgctac ctcaggtcca gatccccttc ccccgccgct cccgccgcct 1081 ctctaaggag ggtatcctgc ctcccagcgc cctggatggg gctggcaccc agcctgggca 1141 ggaggccact ggcaacctgt tcctacatca ctggcccctg cagcagccgc cacctggctc 1201 cctggggcag ccccatcctg aagctctggg attcccgctg gagctgaggg agtcgcagct 1261 actgcctgat ggggagagac tagcacccaa tggccgggag cgagaggctc ctgccatggg 1321 cagcgaggag ggcatgaggg cagtgagcac aggggactgt gggcaggtgc tacggggcgg 1381 agtgatccag agcacgcgac ggaggcgccg ggcatcccag gaggccaatt tgctgaccct 1441 ggcccagaag gctgtggagc tggcctcact gcagaatgca aaggatggca gtggttctga 1501 agagaagcgg aaaagtgtat tggcctcaac taccaagtgt ggggtggagt tttctgagcc 1561 ttccttagcc accaagcgag cacgagaaga cagtgggatg gtacccctca tcatcccagt 1621 gtctgtgcct gtgcgaactg tggacccaac tgaggcagcc caggctggag gtcttgatga 1681 ggacgggaag ggtcctgaac agaaccctgc tgagcacaag ccatcagtca tcgtcacccg 1741 caggcggtcc acccgaatcc ccgggacaga tgctcaagct caggcggagg acatgaatgt 1801 caagttggag ggggagcctt ccgtgcggaa accaaagcag cggcccaggc ccgagcccct 1861 catcatcccc accaaggcgg gcactttcat cgcccctccc gtctactcca acatcacccc 1921 ataccagagc cacctgcgct ctcccgtgcg cctagctgac cacccctctg agcggagctt 1981 tgagctacct ccctacacgc cgccccccat cctcagccct gtgcgggaag gctctggcct 2041 ctacttcaat gccatcatat caaccagcac catccctgcc cctcctccca tcacgcctaa 2101 gagtgcccat cgcacgctgc tccggactaa cagtgctgaa gtaaccccgc ctgtcctctc 2161 tgtgatgggg gaggccaccc cagtgagcat cgagccacgg atcaacgtgg gctcccggtt 2221 ccaggcagaa atccccttga tgagggaccg tgccctggca gctgcagatc cccacaaggc 2281 tgacttggtg tggcagccat gggaggacct agagagcagc cgggagaagc agaggcaagt 2341 ggaagacctg ctgacagccg cctgctccag cattttccct ggtgctggca ccaaccagga 2401 gctggccctg cactgtctgc acgaatccag aggagacatc ctggaaacgc tgaataagct 2461 gctgctgaag aagcccctgc ggccccacaa ccatccgctg gcaacttatc actacacagg 2521 ctctgaccag tggaagatgg ccgagaggaa gctgttcaac aaaggcattg ccatctacaa 2581 gaaggatttc ttcctggtgc agaagctgat ccagaccaag accgtggccc agtgcgtgga 2641 gttctactac acctacaaga agcaggtgaa aatcggccgc aatgggactc taacctttgg 2701 ggatgtggat acgagcgatg agaagtcggc ccaggaagag gttgaagtgg atattaagac 2761 ttcccaaaag ttcccaaggg tgcctcttcc cagaagagag tccccaagtg aagagaggct 2821 ggagcccaag agggaggtga aggagcccag gaaggagggg gaggaggagg tgccagagat 2881 ccaagagaag gaggagcagg aagaggggcg agagcgcagc aggcgggcag cggcagtcaa 2941 agccacgcag acactacagg ccaatgagtc ggccagtgac atcctcatcc tccggagcca 3001 cgagtccaac gcccctgggt ctgccggtgg ccaggcctcg gagaagccaa gggaagggac 3061 agggaagtca cgaagggcac tacctttttc agaaaaaaaa aaaaaaaaaa //