LOCUS BC103917 1921 bp mRNA linear HUM 04-OCT-2006 DEFINITION Homo sapiens keratin 71, mRNA (cDNA clone MGC:119390 IMAGE:40006780), complete cds. ACCESSION BC103917 VERSION BC103917.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1921) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1921) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-SEP-2005) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Baylor Human Genome Sequencing Center cDNA Library Preparation: Baylor Human Genome Sequencing Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAM Plate: 5 Row: g Column: 19. FEATURES Location/Qualifiers source 1..1921 /db_xref="H-InvDB:HIT000337469" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:119390 IMAGE:40006780" /tissue_type="PCR rescued clones" /clone_lib="NIH_MGC_283" /note="Vector: pCR-Blunt II-TOPO with reversed insert; Clone identification sequence tag: TAAAATAT sequenced from the forward primer" gene 1..1921 /gene="KRT71" /gene_synonym="K6IRS1" /gene_synonym="KRT6IRS" /gene_synonym="KRT6IRS1" /db_xref="GeneID:112802" /db_xref="HGNC:HGNC:28927" /db_xref="MIM:608245" CDS 21..1592 /gene="KRT71" /gene_synonym="K6IRS1" /gene_synonym="KRT6IRS" /gene_synonym="KRT6IRS1" /codon_start=1 /product="keratin 71" /protein_id="AAI03918.1" /db_xref="GeneID:112802" /db_xref="HGNC:HGNC:28927" /db_xref="MIM:608245" /translation="MSRQFTCKSGAAAKGGFSGCSAVLSGGSSSSFRAGSKGLSGGFG SRSLYSLGGVRSLNVASGSGKSGGYGFGRGRASGFAGSMFGSVALGPVCPTVCPPGGI HQVTVNESLLAPLNVELDPEIQKVRAQEREQIKALNNKFASFIDKVRFLEQQNQVLET KWELLQQLDLNNCKNNLEPILEGYISNLRKQLETLSGDRVRLDSELRNVRDVVEDYKK RYEEEINKRTAAENEFVLLKKDVDAAYANKVELQAKVESMDQEIKFFRCLFEAEITQI QSHISDMSVILSMDNNRNLDLDSIIDEVRTQYEEIALKSKAEAEALYQTKFQELQLAA GRHGDDLKNTKNEISELTRLIQRIRSEIENVKKQASNLETAIADAEQRGDNALKDARA KLDELEGALHQAKEELARMLREYQELMSLKLALDMEIATYRKLLESEECRMSGEFPSP VSISIISSTSGGSGYGFRPSMVSGGYVANSSNCISGVCSVRGGEGRSRGSANDYKDTL GKGSSLSAPSKKTSR" BASE COUNT 433 a 561 c 583 g 344 t ORIGIN 1 gctttgtctc caccagcaac atgagccgcc aattcacctg caagtcggga gctgccgcca 61 aggggggctt cagtggctgc tcagctgtgc tctcaggggg cagctcatcc tccttccggg 121 cagggagcaa agggctcagt gggggctttg gcagccggag cctctacagc ctggggggtg 181 tccggagcct caatgtggcc agtggcagcg ggaagagtgg aggctatgga tttggccggg 241 gccgggccag tggctttgct ggaagcatgt ttggcagtgt ggccctgggg cctgtgtgcc 301 caactgtatg cccacctgga ggcatccacc aggttaccgt caatgagagc ctcctggccc 361 ccctcaacgt ggagctggac cccgagatcc agaaagtgcg tgcccaggag cgagagcaga 421 tcaaggctct gaacaacaag ttcgcctcct tcatcgacaa ggtgcggttc ctggagcagc 481 agaaccaggt actggagacc aagtgggagc tgctgcagca gctggacctg aacaactgca 541 agaacaacct ggagcccatc ctcgagggct acatcagcaa cctgcggaag cagctggaga 601 cgctgtctgg ggacagggtg aggctggact cggagctgag gaatgtgcgg gacgtagtgg 661 aggactacaa gaagaggtat gaggaggaaa tcaacaagcg gacagcagca gagaacgagt 721 ttgtgctgct caagaaggat gtggatgctg cttacgccaa taaggtggaa ctgcaggcca 781 aggtggaatc catggaccag gagatcaagt tcttcaggtg tctctttgaa gccgagatca 841 ctcagatcca gtcccacatc agtgacatgt ctgtcatcct gtccatggac aacaaccgga 901 acctagacct ggacagcatc attgacgaag tccgcaccca gtatgaggag attgccttga 961 agagtaaggc cgaggctgag gccctgtacc agaccaagtt ccaagagctt cagctggcag 1021 ctggcaggca tggggacgac ctcaaaaaca ccaagaatga aatctcggag ctcactcggc 1081 tcatccagag aatccgctca gagatcgaga acgtgaagaa gcaggcttcc aacctggaga 1141 cagccatcgc tgatgctgag cagcggggag acaacgccct gaaggatgcc cgggccaagc 1201 tggacgagct ggagggcgcc ctgcaccagg ccaaggagga gctggcgcgg atgctgcgcg 1261 agtaccagga gctcatgagc ctgaagctgg ccctggacat ggagatcgcc acctatcgca 1321 agctactgga gagcgaggag tgcaggatgt caggagaatt tccctcccct gtcagcatct 1381 ccatcatcag cagcaccagt ggcggcagtg gctatggctt ccggcccagc atggtcagcg 1441 gtggctatgt ggccaacagc agcaactgca tctctggagt gtgcagcgtg agaggcgggg 1501 agggcaggag ccggggcagt gccaacgatt acaaagacac cctggggaag ggttccagcc 1561 tgagtgcacc ctccaagaaa accagtcggt agagaagact gccccgggcc ccgcctcatt 1621 ccatgacccg gctctggatc ccacactgta cttcccacag cccactctca gctccatctc 1681 caccctgctg gtcctgctcc catacacctg gcactggcct tggccaccca cttctcccag 1741 cctgtgtctt cctgatcctg ggaaggcctg gatgaccaag cttggtgaaa ttcctccctg 1801 tacacaccct attaactcct tggctgtggt cccccagcta caccaccagc ccaggtcctg 1861 gctgccagct ttcctcctct gcccggcctc tagcgcagtc gctaactact ctgctgggct 1921 c //