LOCUS BC015668 3489 bp mRNA linear HUM 30-SEP-2003 DEFINITION Homo sapiens chromosome 14 open reading frame 43, mRNA (cDNA clone IMAGE:4328688), partial cds. ACCESSION BC015668 VERSION BC015668.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3489) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3489) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (04-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC015668.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 33 Row: b Column: 19 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, GenomeScan gene prediction. FEATURES Location/Qualifiers source 1..3489 /db_xref="H-InvDB:HIT000088767" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4328688" /tissue_type="Uterus, leiomyosarcoma" /clone_lib="NIH_MGC_46" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..>3489 /gene="C14orf43" /gene_synonym="c14_5541" /db_xref="GeneID:91748" CDS 418..>3489 /gene="C14orf43" /gene_synonym="c14_5541" /codon_start=1 /product="C14orf43 protein" /protein_id="AAH15668.1" /db_xref="GeneID:91748" /translation="MNLQAQPKAQNKRKRCLFGGQEPAPKEQPPPLQPPQQSIRVKEE QYLGHEGPGGAVSTSQPVELPPPSSLALLNSVVYGPERTSAAMLSQQVASVKWPNSVM APGRGPERGGGGGVSDSSWQQQPGQPPPHSTWNCHSLSLYSATKGSPHPGVGVPTYYN HPEALKREKAGGPQLDRYVRPMMPQKVQLEVGRPQAPLNSFHAAKKPPNQSLPLQPFQ LAFGHQVNRQVFRQGPPPPNPVAAFPPQKQQQQQQPQQQQQQQQAALPQMPLFENFYS MPQQPSQQPQDFGLQPAGPLGQSHLAHHSMAPYPFPPNPDMNPELRKALLQDSAPQPA LPQVQIPFPRRSRRLSKEGILPPSALDGAGTQPGQEATGNLFLHHWPLQQPPPGSLGQ PHPEALGFPLELRESQLLPDGERLAPNGREREAPAMGSEEGMRAVSTGDCGQVLRGGV IQSTRRRRRASQEANLLTLAQKAVELASLQNAKDGSGSEEKRKSVLASTTKCGVEFSE PSLATKRAREDSGMVPLIIPVSVPVRTVDPTEAAQAGGLDEDGKGPEQNPAEHKPSVI VTRRRSTRIPGTDAQAQAEDMNVKLEGEPSVRKPKQRPRPEPLIIPTKAGTFIAPPVY SNITPYQSHLRSPVRLADHPSERSFELPPYTPPPILSPVREGSGLYFNAIISTSTIPA PPPITPKSAHRTLLRTNSAEVTPPVLSVMGEATPVSIEPRINVGSRFQAEIPLMRDRA LAAADPHKADLVWQPWEDLESSREKQRQVEDLLTAACSSIFPGAGTNQELALHCLHES RGDILETLNKLLLKKPLRPHNHPLATYHYTGSDQWKMAERKLFNKGIAIYKKDFFLVQ KLIQTKTVAQCVEFYYTYKKQVKIGRNGTLTFGDVDTSDEKSAQEEVEVDIKTSQKFP RVPLPRRESPSEERLEPKREVKEPRKEGEEEVPEIQEKEEQEEGRERSRRAAAVKATQ TLQANESASDILILRSHESNAPGSAGGQASEKPREGTGKSRRALPFSKKKKK" misc_feature 2914..3045 /gene="C14orf43" /gene_synonym="c14_5541" /note="SANT; Region: SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains" /db_xref="CDD:smart00717" BASE COUNT 781 a 1123 c 1048 g 537 t ORIGIN 1 cgcgggctgg cggaagcccc gcgagcgccg cggggaggcg acggcgcctg tttgttttta 61 aaatcgggag tgcgtgcagg cggctggagt cccggaggcg accgaaggcg gcgacccgcg 121 gcggaagggg gacagccgag cccggagccc ggagcccggg caagagctgg gtgccagaac 181 cctgtggagc atcatgaact gggaagagta gctgagcccc agagcctctc tggaagagaa 241 aggaagagcc agcagttctt tctcccagtg tcccacctca ctgtccagcg tcttcctctg 301 cccctgctct gccctccctg gctcctggac tagagcccgg cttccagcag gacgtttccc 361 caggggatgg gcgactgttg aaggggatct caccgccagg gctcagttgg ccacatcatg 421 aacctccagg cccagcccaa ggctcagaac aagcggaagc gttgcctctt cgggggccag 481 gaaccagctc ccaaggagca gccccctccc ctgcagcccc cccagcagtc catcagagtg 541 aaggaggagc agtacctcgg gcacgagggt ccaggagggg cagtctccac ctctcagcct 601 gtggaactgc cccctcctag cagcctggcc ctgctgaact ctgtggtgta tgggcctgag 661 cggacctcag cagccatgct gtcccagcag gtggcctcag taaagtggcc caactctgtg 721 atggctccag ggcggggccc ggagcgtgga ggaggtgggg gtgtcagtga cagcagctgg 781 cagcagcagc caggccagcc tccaccccat tcaacatgga actgccacag tctgtccctc 841 tacagtgcaa ccaaggggag cccgcatcct ggagtgggag tcccgactta ctataaccac 901 cctgaggcac tgaagcggga gaaagcgggg ggcccacagc tggaccgcta tgtgcgacca 961 atgatgccac agaaggtgca gctggaggta gggcggcccc aggcacccct gaattctttc 1021 cacgcagcca agaaaccccc aaaccagtca ctgcccctgc aacccttcca gctggcattc 1081 ggccaccagg tgaaccggca ggtcttccgg cagggcccac cgcccccaaa cccggtggct 1141 gccttccctc cacagaagca gcagcagcag cagcaaccac agcagcagca gcagcagcag 1201 caggcagccc taccccagat gccgctcttt gagaacttct attccatgcc gcagcaaccc 1261 tcgcagcaac cccaggactt tggcctgcag ccagctgggc cactgggaca gtcccacctg 1321 gctcaccaca gcatggcacc ctaccccttc ccccccaacc cagatatgaa cccagaactg 1381 cgcaaggccc ttctgcagga ctcagccccg cagccagcgc tacctcaggt ccagatcccc 1441 ttcccccgcc gctcccgccg cctctctaag gagggtatcc tgcctcccag cgccctggat 1501 ggggctggca cccagcctgg gcaggaggcc actggcaacc tgttcctaca tcactggccc 1561 ctgcagcagc cgccacctgg ctccctgggg cagccccatc ctgaagctct gggattcccg 1621 ctggagctga gggagtcgca gctactgcct gatggggaga gactagcacc caatggccgg 1681 gagcgagagg ctcctgccat gggcagcgag gagggcatga gggcagtgag cacaggggac 1741 tgtgggcagg tgctacgggg cggagtgatc cagagcacgc gacggaggcg ccgggcatcc 1801 caggaggcca atttgctgac cctggcccag aaggctgtgg agctggcctc actgcagaat 1861 gcaaaggatg gcagtggttc tgaagagaag cggaaaagtg tattggcctc aactaccaag 1921 tgtggggtgg agttttctga gccttcctta gccaccaagc gagcacgaga agacagtggg 1981 atggtacccc tcatcatccc agtgtctgtg cctgtgcgaa ctgtggaccc aactgaggca 2041 gcccaggctg gaggtcttga tgaggacggg aagggtcctg aacagaaccc tgctgagcac 2101 aagccatcag tcatcgtcac ccgcaggcgg tccacccgaa tccccgggac agatgctcaa 2161 gctcaggcag aggacatgaa tgtcaagttg gagggggagc cttccgtgcg gaaaccaaag 2221 cagcggccca ggcccgagcc cctcatcatc cccaccaagg cgggcacttt catcgcccct 2281 cccgtctact ccaacatcac cccataccag agccacctgc gctctcccgt gcgcctagct 2341 gaccacccct ctgagcggag ctttgagcta cctccctaca cgccgccccc catcctcagc 2401 cctgtgcggg aaggctctgg cctctacttc aatgccatca tatcaaccag caccatccct 2461 gcccctcctc ccatcacgcc taagagtgcc catcgcacgc tgctccggac taacagtgct 2521 gaagtaaccc cgcctgtcct ctctgtgatg ggggaggcca ccccagtgag catcgagcca 2581 cggatcaacg tgggctcccg gttccaggca gaaatcccct tgatgaggga ccgtgccctg 2641 gcagctgcag atccccacaa ggctgacttg gtgtggcagc catgggagga cctagagagc 2701 agccgggaga agcagaggca agtggaagac ctgctgacag ccgcctgctc cagcattttc 2761 cctggtgctg gcaccaacca ggagctggcc ctgcactgtc tgcacgaatc cagaggagac 2821 atcctggaaa cgctgaataa gctgctgctg aagaagcccc tgcggcccca caaccatccg 2881 ctggcaactt atcactacac aggctctgac cagtggaaga tggccgagag gaagctgttc 2941 aacaaaggca ttgccatcta caagaaggat ttcttcctgg tgcagaagct gatccagacc 3001 aagaccgtgg cccagtgcgt ggagttctac tacacctaca agaagcaggt gaaaatcggc 3061 cgcaatggga ctctaacctt tggggatgtg gatacgagcg atgagaagtc ggcccaggaa 3121 gaggttgaag tggatattaa gacttcccaa aagttcccaa gggtgcctct tcccagaaga 3181 gagtccccaa gtgaagagag gctggagccc aagagggagg tgaaggagcc caggaaggag 3241 ggggaggagg aggtgccaga gatccaagag aaggaggagc aggaagaggg gcgagagcgc 3301 agcaggcggg cagcggcagt caaagccacg cagacactac aggccaatga gtcggccagt 3361 gacatcctca tcctccggag ccacgagtcc aacgcccctg ggtctgccgg tggccaggcc 3421 tcggagaagc caagggaagg gacagggaag tcacgaaggg cactaccttt ttcaaaaaaa 3481 aaaaaaaaa //