LOCUS BC084567 3811 bp mRNA linear HUM 19-MAR-2007
DEFINITION Homo sapiens polymerase (DNA directed) sigma, mRNA (cDNA clone
MGC:99645 IMAGE:6380576), complete cds.
ACCESSION BC084567
VERSION BC084567.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3811)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3811)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (12-OCT-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP/Gazdar
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 59 Row: l Column: 14
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 62548868.
FEATURES Location/Qualifiers
source 1..3811
/db_xref="H-InvDB:HIT000266076"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:99645 IMAGE:6380576"
/tissue_type="Lung, large cell carcinoma"
/clone_lib="NIH_MGC_18"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3811
/gene="POLS"
/gene_synonym="LAK-1"
/gene_synonym="POLK"
/gene_synonym="TRF4"
/gene_synonym="TRF4-1"
/db_xref="GeneID:11044"
/db_xref="HGNC:HGNC:16705"
/db_xref="MIM:605198"
CDS 63..1691
/gene="POLS"
/gene_synonym="LAK-1"
/gene_synonym="POLK"
/gene_synonym="TRF4"
/gene_synonym="TRF4-1"
/codon_start=1
/product="polymerase (DNA directed) sigma"
/protein_id="AAH84567.2"
/db_xref="GeneID:11044"
/db_xref="HGNC:HGNC:16705"
/db_xref="MIM:605198"
/translation="MSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLP
TSDIDLVVFGKWERPPLQLLEQALRKHNVAEPCSIKVLDKATVPIIKLTDQETEVKVD
ISFNMETGVRAAEFIKNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMA
ISFLQLHPRIDARRADENLGMLLVEFFELYGRNFNYLKTGIRIKEGGAYIAKEEIMKA
MTSGYRPSMLCIEDPLLPGNDVGRSSYGAMQVKQVFDYAYIVLSHAVSPLARSYPNRD
AESTLGRIIKVTQEVIDYRRWIKEKWGSKAHPSPGMDSRIKIKERIATCNGEQTQNRE
PESPYGQRLTLSLSSPQLLSSGSSASSVSSLSGSDVDSDTPPCTTPSVYQFSLQAPAP
LMAGLPTALPMPSGKPQPTTSRTLIMTTNNQTRFTIPPPTLGVAPVPCRQAGVEGTAS
LKAVHHMSSPAIPSASPNPLSSPHLYHKQHNGMKLSMKGSHGHTQGGGYSSVGSGGVR
PPVGNRGHHQYNRTGWRRKKHTHTRDSLPVSLSR"
BASE COUNT 958 a 903 c 946 g 1004 t
ORIGIN
1 gccgcgcgta cagcccgggc atccagggac tacatgagga aataattgac ttttataact
61 tcatgtcccc ttgtcctgaa gaagcagcta tgagaagaga ggtggtgaaa cggatcgaaa
121 ctgtggtgaa agacctttgg ccgacggctg atgtacagat atttggcagc tttagtacag
181 gtctttatct tccaactagc gacatagacc tggtggtctt cgggaaatgg gagcgtcctc
241 ctttacagct gctggagcaa gccctgcgga agcacaacgt ggctgagccg tgttccatca
301 aagtccttga caaggctacg gtaccaataa taaagctcac agatcaggag actgaagtga
361 aagttgacat cagctttaac atggagacgg gcgtccgggc agcggagttc atcaagaatt
421 acatgaagaa atattcattg ctgccttact tgattttagt attgaaacag ttccttctgc
481 agagggacct gaatgaagtt tttacaggtg gaattagctc atacagccta attttaatgg
541 ccattagctt tctacagttg catccaagaa ttgatgcccg gagagctgat gaaaaccttg
601 gaatgcttct tgtagaattt tttgaactct atgggagaaa ttttaattac ttgaaaaccg
661 gtattagaat caaagaagga ggtgcctata tcgccaaaga ggagatcatg aaagccatga
721 ccagcgggta cagaccgtcg atgctgtgca ttgaggaccc cctgctgcca gggaatgacg
781 ttggccggag ctcctatggc gccatgcagg tgaagcaggt cttcgattat gcctacatag
841 tgctcagcca tgccgtgtca ccgctggcca ggtcctatcc aaacagagac gccgaaagta
901 ctttaggaag aatcatcaaa gtaactcagg aggtgattga ctaccggagg tggatcaaag
961 agaagtgggg cagcaaagcc cacccgtcgc caggcatgga cagcaggatc aagatcaaag
1021 agcgaatagc cacatgcaat ggggagcaga cgcagaaccg agagcccgag tctccctatg
1081 gccagcgctt gactttgtcg ctgtccagcc cccagctcct gtcttcaggc tcctcggcct
1141 cttctgtgtc ttcactttct gggagtgacg ttgattcaga cacaccgccc tgcacaacgc
1201 ccagtgttta ccagttcagt ctgcaagcgc cagctcctct catggccggc ttacccaccg
1261 ccttgccaat gcccagtggc aaacctcagc ccaccacttc cagaacactg atcatgacaa
1321 ccaacaatca gaccaggttt actatacctc caccgaccct aggggttgct cctgttcctt
1381 gcagacaagc tggtgtagaa ggaactgcgt ctttgaaagc cgtccaccac atgtcttccc
1441 cggccattcc ctcagcgtcc cccaacccgc tctcgagccc tcatctgtat cataagcagc
1501 acaacggcat gaaactgtcc atgaagggct ctcacggcca cacccaaggc ggcggctaca
1561 gctctgtggg tagcggaggt gtgcggcccc ctgtgggcaa caggggacac caccagtata
1621 accgcaccgg ctggaggagg aaaaaacaca cacacacacg ggacagtctg cccgtgagcc
1681 tcagcagata atggctcctg gctgcgtcag cctcccccac ccctctgcag actgccccgc
1741 ggcctcggcc accggcaggg gaaccgagac cagcaccccg cacgtcagcc gggctcgcgg
1801 cacgcccgcc gctgatcact ctgcatgttt cttcgtgtgg tggtcgcgtc catcttcaag
1861 aacagctcgt tgtgctcatc tgtgaagcct tattaaacgt ggacgttgtt ttctgccttc
1921 ccaggattct tccttcagtg ctgaggcagg ttgggctcag gaactgcagg gacgtgaaca
1981 tgcgcttgcg gtttgaggta gccgtgtctg ttccttcgcg gtttgctatt ttcatttcct
2041 gttcgtcaaa gcagcagagg agatcaaacc ccgttcgtgt gtctttcctc catggataag
2101 cttgggaggt cattgtttta ctgccctcac attttgtttg aaatttcaga actgtttttc
2161 tatgtaaata ttgaaaactt atgatttgtg caataactca gatatttttt atttaatttc
2221 ctattttcac ataagttata tttaagggag gagggaattt tttttaaaca agcttaggtc
2281 ctttcccgag ctgcattttc taagttgggt catcgtgtcg gctggttgtc tgacgagcat
2341 cgttacaaac accatgatga ggggtttggg gttttatttt gatgtctttt cttttggtcg
2401 gaagtgagtg aaggagccag gtcgccctga aggttttcca aagggcttgg ctccagagcc
2461 acctggcaga ctgcccgtgg ccctgctgtc gggccccagg ccgttgtcct gctctgacca
2521 cagagtttta atgttttggt tttcacttct tttaaactgg acaacaaatc cagcatttca
2581 agtgccagaa gtataacttt ctaaggagag aagggttgtc acattataaa atctttagga
2641 aaatgtgaac tggaaaacgc ttcggtcagt tttagtgaca tagcctgtga tgatgggtct
2701 ggtgactatt attgcggacc gtggtaccca gttttaggaa tgtggagaaa ggaattctgt
2761 tgattccgtt gaggaatctg tagcgtatgc attcgttccg ttaagagcaa atctaggaga
2821 agtgcttcag ctgcccagtg cgccgtgggg agtgttttaa cggatcgtgt cgcaggagag
2881 cacagcccag cgttggggcc gggaccgctg gcgcccgacg tcggaagcat acaggtatac
2941 tatgcaagtg tattctgcca caacaaccac tgtctttgtt accttttttt gaacaagaat
3001 atatccatcc tgcctaaccc tgagtttttg gagcaccaca gttgtcctgg gagttggttg
3061 catcttgtag gccatctgac ttcctgtttt taaaacgggg gtctggtctt gctaaacact
3121 acaggtaggt tggtctttga agtccactag tggagaatgt caagacaaga tacttattac
3181 catgacatct gatgcatgtg cagcagtggg gagttctaga ttgatctctg aatgtgatcg
3241 acgcccagca aggacaagct ttaaaatgtc tgcggtctgc ccttttgaag caggactggc
3301 tcactctgtc attgggagct gtcagctgcg actgcaggtt ctctaggagg cattccagaa
3361 tagagtagca cactgtgtct gcagttctcg atgaccgaaa gttatcaaaa atatttaaaa
3421 tatttaaatt gtgaacctat tgataaagaa tatttataaa aactgatctg taggcctgta
3481 ctaatctcta cgcattagca atattgactg taaacccaca ttaaggaaac cactacgggt
3541 ctggcagtgc gtgtcccgtg gggtgtgcat tttaaaactc gattcataga cacaggtacc
3601 atgttccatt tccgtcatgg tgaagcaaat gaattggcct ggctaccact gtggtcgcgt
3661 gctacaggtt tgacaaaaag atatcatgtt tcgatttttt tgtgtgtgga caacaatatg
3721 gaagctaaaa ttgacatatt tttatgtaaa gtttttctat tctttgattt ttaataaact
3781 ttggaaacca gaaaaaaaaa aaaaaaaaaa a
//