LOCUS BC011561 4279 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens hephaestin, mRNA (cDNA clone MGC:20135 IMAGE:4644318),
complete cds.
ACCESSION BC011561
VERSION BC011561.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4279)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4279)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 29 Row: m Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 21166384.
FEATURES Location/Qualifiers
source 1..4279
/db_xref="H-InvDB:HIT000035290"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20135 IMAGE:4644318"
/tissue_type="Colon, adenocarcinoma"
/clone_lib="NIH_MGC_15"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4279
/gene="HEPH"
/gene_synonym="CPL"
/gene_synonym="KIAA0698"
/db_xref="GeneID:9843"
/db_xref="HGNC:HGNC:4866"
/db_xref="MIM:300167"
CDS 70..3546
/gene="HEPH"
/gene_synonym="CPL"
/gene_synonym="KIAA0698"
/codon_start=1
/product="hephaestin"
/protein_id="AAH11561.1"
/db_xref="GeneID:9843"
/db_xref="HGNC:HGNC:4866"
/db_xref="MIM:300167"
/translation="MESGHLLWALLFMQSLWPQLTDGATRVYYLGIRDVQWNYAPKGR
NVITNQPLDSDIVASSFLKSDKNRIGGTYKKTIYKEYKDDSYTDEVAQPAWLGFLGPV
LQAEVGDVILIHLKNFATRPYTIHPHGVFYEKDSEGSLYPDGSSGPLKADDSVPPGGS
HIYNWTIPEGHAPTDADPACLTWIYHSHVDAPRDIATGLIGPLITCKRGALDGNSPPQ
RQDVDHDFFLLFSVVDENLSWHLNENIATYCSDPASVDKEDETFQESNRMHAINGFVF
GNLPELNMCAQKRVAWHLFGMGNEIDVHTAFFHGQMLTTRGHHTDVANIFPATFVTAE
MVPWEPGTWLISCQVNSHFRDGMQALYKVKSCSMAPPVDLLTGKVRQYFIEAHEIQWD
YGPMGHDGSTGKNLREPGSISDKFFQKSSSRIGGTYWKVRYEAFQDETFQEKMHLEED
RHLGILGPVIRAEVGDTIQVVFYNRASQPFSMQPHGVFYEKDYEGTVYNDGSSYPGLV
AKPFEKVTYRWTVPPHAGPTAQDPACLTWMYFSAADPIRDTNSGLVGPLLVCRAGALG
ADGKQKGVDKEFFLLFTVLDENKSWYSNANQAAAMLDFRLLSEDIEGFQDSNRMHAIN
GFLFSNLPRLDMCKGDTVAWHLLGLGTETDVHGVMFQGNTVQLQGMRKGAAMLFPHTF
VMAIMQPDNLGTFEIYCQAGSHREAGMRAIYNVSQCPGHQATPRQRYQAARIYYIMAE
EVEWDYCPDRSWEREWHNQSEKDSYGYIFLSNKDGLLGSRYKKAVFREYTDGTFRIPR
PRTGPEEHLGILGPLIKGEVGDILTVVFKNNASRPYSVHAHGVLESTTVWPLAAEPGE
VVTYQWNIPERSGPGPNDSACVSWIYYSAVDPIKDMYSGLVGPLAICQKGILEPHGGR
SDMDREFALLFLIFDENKSWYLEENVATHGSQDPGSINLQDETFLESNKMHAINGKLY
ANLRGLTMYQGERVAWYMLAMGQDVDLHTIHFHAESFLYRNGENYRADVVDLFPGTFE
VVEMVASNPGTWLMHCHVTDHVHAGMETLFTVFSRTEHLSPLTVITKETEKAVPPRDI
EEGNVKMLGMQIPIKNVEMLASVLVAISVTLLLVVLALGGVVWYQHRQRKLRRNRRSI
LDDSFKLLSFKQ"
BASE COUNT 1099 a 1015 c 1087 g 1078 t
ORIGIN
1 gcccagcctg cctggagaaa agcgtctgct cctagccaag atctcctcat cacaaaagta
61 atgtgggcca tggagtcagg ccacctcctc tgggctctgc tgttcatgca gtccttgtgg
121 cctcaactga ctgatggagc cactcgagtc tactacctgg gcatccggga tgtgcagtgg
181 aactatgctc ccaagggaag aaatgtcatc acgaaccagc ctctggacag tgacatagtg
241 gcttccagct tcttaaagtc tgacaagaac cggatagggg gaacctacaa gaagaccatc
301 tataaagaat acaaggatga ctcatacaca gatgaagtgg cccagcctgc ctggttgggc
361 ttcctggggc cagtgttgca ggctgaagtg ggggatgtca ttcttattca cctgaagaat
421 tttgccactc gtccctatac catccaccct catggtgtct tctacgagaa ggactctgaa
481 ggttccctat acccagatgg ctcctctggg ccactgaaag ctgatgactc tgttcccccg
541 gggggcagcc atatctacaa ctggaccatt ccagaaggcc atgcacccac cgatgctgac
601 ccagcgtgcc tcacctggat ctaccattct catgtagatg ctccacgaga cattgcaact
661 ggcctaattg ggcctctcat cacctgtaaa agaggagccc tggatgggaa ctcccctcct
721 caacgccagg atgtagacca tgatttcttc ctcctcttca gtgtggtaga tgagaacctc
781 agctggcatc tcaatgagaa cattgccact tactgctcag atcctgcttc agtggacaaa
841 gaagatgaga catttcagga gagcaatagg atgcatgcaa tcaatggctt tgtttttggg
901 aatttacctg agctgaacat gtgtgcacag aaacgtgtgg cctggcactt gtttggcatg
961 ggcaatgaaa ttgatgtcca cacagcattt ttccatggac agatgctgac tacccgtgga
1021 caccacactg atgtggctaa catctttcca gccacctttg tgactgctga gatggtgccc
1081 tgggaacctg gtacctggtt aattagctgc caagtgaaca gtcactttcg agatggcatg
1141 caggcactct acaaggtcaa gtcttgctcc atggcccctc ctgtggacct gctcacaggc
1201 aaagttcgac agtacttcat tgaggcccat gagattcaat gggactatgg cccgatgggg
1261 catgatggga gtactgggaa gaatttgaga gagccaggca gtatctcaga taagtttttc
1321 cagaagagct ccagccgaat tgggggcact tactggaaag tgcgatatga agcctttcaa
1381 gatgagacat tccaagagaa gatgcatttg gaggaagata ggcatcttgg aatcctgggg
1441 ccagtgatcc gggctgaggt gggtgacacc attcaggtgg tcttctacaa ccgtgcctcc
1501 cagccattca gcatgcagcc ccatggggtc ttttatgaga aagactatga aggcactgtg
1561 tacaatgatg gctcatctta ccctggcttg gttgccaagc cctttgagaa agtaacatac
1621 cgctggacag tcccccctca tgccggtccc actgctcagg atcctgcttg tctcacttgg
1681 atgtacttct ctgctgcaga tcccataaga gacacaaatt ctggcctggt gggcccgctg
1741 ctggtgtgca gggctggtgc cttgggtgca gatggcaagc agaaaggggt ggataaagaa
1801 ttctttcttc tcttcactgt gttggatgag aacaagagct ggtacagcaa tgccaatcaa
1861 gcagctgcta tgttggattt ccgactgctt tcagaggata ttgagggctt ccaagactcc
1921 aatcggatgc atgccattaa tgggtttctg ttctctaacc tgcccaggct ggacatgtgc
1981 aagggtgaca cagtggcctg gcacctgctc ggcctgggca cagagactga tgtgcatgga
2041 gtcatgttcc agggcaacac tgtgcagctt cagggcatga ggaagggtgc agctatgctc
2101 tttcctcata cctttgtcat ggccatcatg cagcctgaca accttgggac atttgagatt
2161 tattgccagg caggcagcca tcgagaagca gggatgaggg caatctataa tgtctcccag
2221 tgtcctggcc accaagccac ccctcgccaa cgctaccaag ctgcaagaat ctactatatc
2281 atggcagaag aagtagagtg ggactattgc cctgaccgga gctgggaacg ggaatggcac
2341 aaccagtctg agaaggacag ttatggttac attttcctga gcaacaagga tgggctcctg
2401 ggttccagat acaagaaagc tgtattcagg gaatacactg atggtacatt caggatccct
2461 cggccaagga ctggaccaga agaacacttg ggaatcttgg gtccacttat caaaggtgaa
2521 gttggtgata tcctgactgt ggtattcaag aataatgcca gccgccccta ctctgtgcat
2581 gctcatggag tgctagaatc tactactgtc tggccactgg ctgctgagcc tggtgaggtg
2641 gtcacttatc agtggaacat cccagagagg tctggccctg ggcccaatga ctctgcttgt
2701 gtttcctgga tctattattc tgcagtggat cccatcaagg acatgtatag tggcctggtg
2761 gggcccttgg ctatctgcca aaagggcatc ctggagcccc atggaggacg gagtgacatg
2821 gatcgggaat ttgcattgtt gttcttgatt tttgatgaaa ataagtcttg gtatttggag
2881 gaaaatgtgg caacccatgg gtcccaggat ccaggcagta ttaacctaca ggatgaaact
2941 ttcttggaga gcaataaaat gcatgcaatc aatgggaaac tctatgccaa ccttaggggt
3001 cttaccatgt accaaggaga acgagtggcc tggtacatgc tggccatggg ccaagatgtg
3061 gatctacaca ccatccactt tcatgcagag agcttcctct atcggaatgg cgagaactac
3121 cgggcagatg tggtggatct gttcccaggg acttttgagg ttgtggagat ggtggccagc
3181 aaccctggga catggctgat gcactgccat gtgactgacc atgtccatgc tggcatggag
3241 accctcttca ctgttttttc tcgaacagaa cacttaagcc ctctcaccgt catcaccaaa
3301 gagactgaaa aagcagtgcc ccccagagac attgaagaag gcaatgtgaa gatgctgggc
3361 atgcagatcc ccataaagaa tgttgagatg ctggcctctg ttttggttgc cattagtgtc
3421 acccttctgc tcgttgttct ggctcttggt ggagtggttt ggtaccaaca tcgacagaga
3481 aagctacgac gcaataggag gtccatcctg gatgacagct tcaagcttct gtctttcaaa
3541 cagtaacatc tggagcctgg agatatcctc aggaagcaca tctgtagtgc actcccagca
3601 ggccatggac tagtcactaa ccccacactc aaaggggcat gggtggtgga gaagcagaag
3661 gagcaatcaa gcttatctgg atatttcttt ctttatttat tttacatgga aataatatga
3721 tttcactttt tctttagttt ctttgctcta cgtgggcacc tggcactaag ggagtacctt
3781 attatcctac atcgcaaatt tcaacagcta cattatattt ccttctgaca cttggaaggt
3841 attgaaattt ctagaaatgt atccttctca caaagtagag accaagagaa aaactcattg
3901 attgggtttc tacttctttc aaggactcag gaaatttcac tttgaactga ggccaagtga
3961 gctgttaaga taacccacac ttaaactaaa ggctaagaat ataggcttga tgggaaattg
4021 aaggtaggct gagtattggg aatccaaatt gaattttgat tctccttggc agtgaactac
4081 tttgaagaag tggtcaatgg gttgttgctg ccatgagcat gtacaacctc tggagctaga
4141 agctcctcag gaaagccagt tctccaagtt cttaacctgt ggcactgaaa ggaatgttga
4201 gttacctctt catgttttag acagcaaacc ctatccatta aagtacttgt tagaacactg
4261 aaaaaaaaaa aaaaaaaaa
//