LOCUS BC002700 1668 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens keratin 7, mRNA (cDNA clone MGC:3625 IMAGE:3610347),
complete cds.
ACCESSION BC002700
VERSION BC002700.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1668)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1668)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Oct 31, 2003 this sequence version replaced BC002700.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 12 Row: o Column: 19
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 67782364.
FEATURES Location/Qualifiers
source 1..1668
/db_xref="H-InvDB:HIT000031109"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:3625 IMAGE:3610347"
/tissue_type="Pancreas, adenocarcinoma"
/clone_lib="NIH_MGC_39"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1668
/gene="KRT7"
/gene_synonym="CK7"
/gene_synonym="K2C7"
/gene_synonym="K7"
/gene_synonym="MGC3625"
/gene_synonym="SCL"
/db_xref="GeneID:3855"
/db_xref="HGNC:HGNC:6445"
/db_xref="MIM:148059"
CDS 43..1452
/gene="KRT7"
/gene_synonym="CK7"
/gene_synonym="K2C7"
/gene_synonym="K7"
/gene_synonym="MGC3625"
/gene_synonym="SCL"
/codon_start=1
/product="keratin 7"
/protein_id="AAH02700.1"
/db_xref="GeneID:3855"
/db_xref="HGNC:HGNC:6445"
/db_xref="MIM:148059"
/translation="MSIHFSSPVFTSRSAAFSGRGAQVRLSSARPGGLGSSSLYGLGA
SRPRVAVRSAYGGPVGAGIREVTINQSLLAPLRLDADPSLQRVRQEESEQIKTLNNKF
ASFIDKVRFLEQQNKLLETKWTLLQEQKSAKSSRLPDIFEAQIAGLRGQLEALQVDGG
RLEAELRSMQDVVEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVDAL
NDEINFLRTLNETELTELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEMAKCSR
AEAEAWYQTKFETLQAQAGKHGDDLRNTRNEISEMNRAIQRLQAEIDNIKNQRAKLEA
AIAEAEERGELALKDARAKQEELEAALQRAKQDMARQLREYQELMSVKLALDIEIATY
RKLLEGEESRLAGDGVGAVNISVMNSTGGSSSGGGIGLTLGGTMGSNALSFSSSAGPG
LLKAYSIRTASASRRSARD"
BASE COUNT 392 a 488 c 514 g 274 t
ORIGIN
1 ctcctcctcg cccgccgcta ggtccatccc ggcccagcca ccatgtccat ccacttcagc
61 tccccggtat tcacctcgcg ctcagccgcc ttctcgggcc gcggcgccca ggtgcgcctg
121 agctccgctc gccccggcgg ccttggcagc agcagcctct acggcctcgg cgcctcgcgg
181 ccgcgcgtgg ccgtgcgctc tgcctatggg ggcccggtgg gcgccggcat ccgcgaggtc
241 accattaacc agagcctgct ggccccgctg cggctggacg ccgacccctc cctccagcgg
301 gtgcgccagg aggagagcga gcagatcaag accctcaaca acaagtttgc ctccttcatc
361 gacaaggtgc ggtttctgga gcagcagaac aagctgctgg agaccaagtg gacgctgctg
421 caggagcaga agtcggccaa gagcagccgc ctcccagaca tctttgaggc ccagattgct
481 ggccttcggg gtcagcttga ggcactgcag gtggatgggg gccgcctgga ggcggagctg
541 cggagcatgc aggatgtggt ggaggacttc aagaataagt acgaagatga aattaaccgc
601 cgcacagctg ctgagaatga gtttgtggtg ctgaagaagg atgtggatgc tgcctacatg
661 agcaaggtgg agctggaggc caaggtggat gccctgaatg atgagatcaa cttcctcagg
721 accctcaatg agacggagtt gacagagctg cagtcccaga tctccgacac atctgtggtg
781 ctgtccatgg acaacagtcg ctccctggac ctggacggca tcatcgctga ggtcaaggca
841 cagtatgagg agatggccaa atgcagccgg gctgaggctg aagcctggta ccagaccaag
901 tttgagaccc tccaggccca ggctgggaag catggggacg acctccggaa tacccggaat
961 gagatttcag agatgaaccg ggccatccag aggctgcagg ctgagatcga caacatcaag
1021 aaccagcgtg ccaagttgga ggccgccatt gccgaggctg aggagcgtgg ggagctggcg
1081 ctcaaggatg ctcgtgccaa gcaggaggag ctggaagccg ccctgcagcg ggccaagcag
1141 gatatggcac ggcagctgcg tgagtaccag gaactcatga gcgtgaagct ggccctggac
1201 atcgagatcg ccacctaccg caagctgctg gagggcgagg agagccggtt ggctggagat
1261 ggagtgggag ccgtgaatat ctctgtgatg aattccactg gtggcagtag cagtggcggt
1321 ggcattgggc tgaccctcgg gggaaccatg ggcagcaatg ccctgagctt ctccagcagt
1381 gcgggtcctg ggctcctgaa ggcttattcc atccggaccg catccgccag tcgcaggagt
1441 gcccgcgact gagccgcctc ccaccactcc actcctccag ccaccaccca caatcacaag
1501 aagattccca cccctgcctc ccatgcctgg tcccaagaca gtgagacagt ctggaaagtg
1561 atgtcagaat agcttccaat aaagcagcct cattctgagg cctgagtgat ccacgtgaaa
1621 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//