LOCUS       BC011561                4279 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens hephaestin, mRNA (cDNA clone MGC:20135 IMAGE:4644318),
            complete cds.
ACCESSION   BC011561
VERSION     BC011561.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4279)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A.,
            Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4279)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang,
            Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney,
            Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John,
            Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong,
            Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio,
            Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin,
            Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran,
            Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai,
            George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt,
            Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: m Column: 7
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 21166384.
FEATURES             Location/Qualifiers
     source          1..4279
                     /db_xref="H-InvDB:HIT000035290"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:20135 IMAGE:4644318"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..4279
                     /gene="HEPH"
                     /gene_synonym="CPL"
                     /gene_synonym="KIAA0698"
                     /db_xref="GeneID:9843"
                     /db_xref="HGNC:HGNC:4866"
                     /db_xref="MIM:300167"
     CDS             70..3546
                     /gene="HEPH"
                     /gene_synonym="CPL"
                     /gene_synonym="KIAA0698"
                     /codon_start=1
                     /product="hephaestin"
                     /protein_id="AAH11561.1"
                     /db_xref="GeneID:9843"
                     /db_xref="HGNC:HGNC:4866"
                     /db_xref="MIM:300167"
                     /translation="MESGHLLWALLFMQSLWPQLTDGATRVYYLGIRDVQWNYAPKGR
                     NVITNQPLDSDIVASSFLKSDKNRIGGTYKKTIYKEYKDDSYTDEVAQPAWLGFLGPV
                     LQAEVGDVILIHLKNFATRPYTIHPHGVFYEKDSEGSLYPDGSSGPLKADDSVPPGGS
                     HIYNWTIPEGHAPTDADPACLTWIYHSHVDAPRDIATGLIGPLITCKRGALDGNSPPQ
                     RQDVDHDFFLLFSVVDENLSWHLNENIATYCSDPASVDKEDETFQESNRMHAINGFVF
                     GNLPELNMCAQKRVAWHLFGMGNEIDVHTAFFHGQMLTTRGHHTDVANIFPATFVTAE
                     MVPWEPGTWLISCQVNSHFRDGMQALYKVKSCSMAPPVDLLTGKVRQYFIEAHEIQWD
                     YGPMGHDGSTGKNLREPGSISDKFFQKSSSRIGGTYWKVRYEAFQDETFQEKMHLEED
                     RHLGILGPVIRAEVGDTIQVVFYNRASQPFSMQPHGVFYEKDYEGTVYNDGSSYPGLV
                     AKPFEKVTYRWTVPPHAGPTAQDPACLTWMYFSAADPIRDTNSGLVGPLLVCRAGALG
                     ADGKQKGVDKEFFLLFTVLDENKSWYSNANQAAAMLDFRLLSEDIEGFQDSNRMHAIN
                     GFLFSNLPRLDMCKGDTVAWHLLGLGTETDVHGVMFQGNTVQLQGMRKGAAMLFPHTF
                     VMAIMQPDNLGTFEIYCQAGSHREAGMRAIYNVSQCPGHQATPRQRYQAARIYYIMAE
                     EVEWDYCPDRSWEREWHNQSEKDSYGYIFLSNKDGLLGSRYKKAVFREYTDGTFRIPR
                     PRTGPEEHLGILGPLIKGEVGDILTVVFKNNASRPYSVHAHGVLESTTVWPLAAEPGE
                     VVTYQWNIPERSGPGPNDSACVSWIYYSAVDPIKDMYSGLVGPLAICQKGILEPHGGR
                     SDMDREFALLFLIFDENKSWYLEENVATHGSQDPGSINLQDETFLESNKMHAINGKLY
                     ANLRGLTMYQGERVAWYMLAMGQDVDLHTIHFHAESFLYRNGENYRADVVDLFPGTFE
                     VVEMVASNPGTWLMHCHVTDHVHAGMETLFTVFSRTEHLSPLTVITKETEKAVPPRDI
                     EEGNVKMLGMQIPIKNVEMLASVLVAISVTLLLVVLALGGVVWYQHRQRKLRRNRRSI
                     LDDSFKLLSFKQ"
BASE COUNT     1099 a   1015 c   1087 g   1078 t
ORIGIN
        1 gcccagcctg cctggagaaa agcgtctgct cctagccaag atctcctcat cacaaaagta
       61 atgtgggcca tggagtcagg ccacctcctc tgggctctgc tgttcatgca gtccttgtgg
      121 cctcaactga ctgatggagc cactcgagtc tactacctgg gcatccggga tgtgcagtgg
      181 aactatgctc ccaagggaag aaatgtcatc acgaaccagc ctctggacag tgacatagtg
      241 gcttccagct tcttaaagtc tgacaagaac cggatagggg gaacctacaa gaagaccatc
      301 tataaagaat acaaggatga ctcatacaca gatgaagtgg cccagcctgc ctggttgggc
      361 ttcctggggc cagtgttgca ggctgaagtg ggggatgtca ttcttattca cctgaagaat
      421 tttgccactc gtccctatac catccaccct catggtgtct tctacgagaa ggactctgaa
      481 ggttccctat acccagatgg ctcctctggg ccactgaaag ctgatgactc tgttcccccg
      541 gggggcagcc atatctacaa ctggaccatt ccagaaggcc atgcacccac cgatgctgac
      601 ccagcgtgcc tcacctggat ctaccattct catgtagatg ctccacgaga cattgcaact
      661 ggcctaattg ggcctctcat cacctgtaaa agaggagccc tggatgggaa ctcccctcct
      721 caacgccagg atgtagacca tgatttcttc ctcctcttca gtgtggtaga tgagaacctc
      781 agctggcatc tcaatgagaa cattgccact tactgctcag atcctgcttc agtggacaaa
      841 gaagatgaga catttcagga gagcaatagg atgcatgcaa tcaatggctt tgtttttggg
      901 aatttacctg agctgaacat gtgtgcacag aaacgtgtgg cctggcactt gtttggcatg
      961 ggcaatgaaa ttgatgtcca cacagcattt ttccatggac agatgctgac tacccgtgga
     1021 caccacactg atgtggctaa catctttcca gccacctttg tgactgctga gatggtgccc
     1081 tgggaacctg gtacctggtt aattagctgc caagtgaaca gtcactttcg agatggcatg
     1141 caggcactct acaaggtcaa gtcttgctcc atggcccctc ctgtggacct gctcacaggc
     1201 aaagttcgac agtacttcat tgaggcccat gagattcaat gggactatgg cccgatgggg
     1261 catgatggga gtactgggaa gaatttgaga gagccaggca gtatctcaga taagtttttc
     1321 cagaagagct ccagccgaat tgggggcact tactggaaag tgcgatatga agcctttcaa
     1381 gatgagacat tccaagagaa gatgcatttg gaggaagata ggcatcttgg aatcctgggg
     1441 ccagtgatcc gggctgaggt gggtgacacc attcaggtgg tcttctacaa ccgtgcctcc
     1501 cagccattca gcatgcagcc ccatggggtc ttttatgaga aagactatga aggcactgtg
     1561 tacaatgatg gctcatctta ccctggcttg gttgccaagc cctttgagaa agtaacatac
     1621 cgctggacag tcccccctca tgccggtccc actgctcagg atcctgcttg tctcacttgg
     1681 atgtacttct ctgctgcaga tcccataaga gacacaaatt ctggcctggt gggcccgctg
     1741 ctggtgtgca gggctggtgc cttgggtgca gatggcaagc agaaaggggt ggataaagaa
     1801 ttctttcttc tcttcactgt gttggatgag aacaagagct ggtacagcaa tgccaatcaa
     1861 gcagctgcta tgttggattt ccgactgctt tcagaggata ttgagggctt ccaagactcc
     1921 aatcggatgc atgccattaa tgggtttctg ttctctaacc tgcccaggct ggacatgtgc
     1981 aagggtgaca cagtggcctg gcacctgctc ggcctgggca cagagactga tgtgcatgga
     2041 gtcatgttcc agggcaacac tgtgcagctt cagggcatga ggaagggtgc agctatgctc
     2101 tttcctcata cctttgtcat ggccatcatg cagcctgaca accttgggac atttgagatt
     2161 tattgccagg caggcagcca tcgagaagca gggatgaggg caatctataa tgtctcccag
     2221 tgtcctggcc accaagccac ccctcgccaa cgctaccaag ctgcaagaat ctactatatc
     2281 atggcagaag aagtagagtg ggactattgc cctgaccgga gctgggaacg ggaatggcac
     2341 aaccagtctg agaaggacag ttatggttac attttcctga gcaacaagga tgggctcctg
     2401 ggttccagat acaagaaagc tgtattcagg gaatacactg atggtacatt caggatccct
     2461 cggccaagga ctggaccaga agaacacttg ggaatcttgg gtccacttat caaaggtgaa
     2521 gttggtgata tcctgactgt ggtattcaag aataatgcca gccgccccta ctctgtgcat
     2581 gctcatggag tgctagaatc tactactgtc tggccactgg ctgctgagcc tggtgaggtg
     2641 gtcacttatc agtggaacat cccagagagg tctggccctg ggcccaatga ctctgcttgt
     2701 gtttcctgga tctattattc tgcagtggat cccatcaagg acatgtatag tggcctggtg
     2761 gggcccttgg ctatctgcca aaagggcatc ctggagcccc atggaggacg gagtgacatg
     2821 gatcgggaat ttgcattgtt gttcttgatt tttgatgaaa ataagtcttg gtatttggag
     2881 gaaaatgtgg caacccatgg gtcccaggat ccaggcagta ttaacctaca ggatgaaact
     2941 ttcttggaga gcaataaaat gcatgcaatc aatgggaaac tctatgccaa ccttaggggt
     3001 cttaccatgt accaaggaga acgagtggcc tggtacatgc tggccatggg ccaagatgtg
     3061 gatctacaca ccatccactt tcatgcagag agcttcctct atcggaatgg cgagaactac
     3121 cgggcagatg tggtggatct gttcccaggg acttttgagg ttgtggagat ggtggccagc
     3181 aaccctggga catggctgat gcactgccat gtgactgacc atgtccatgc tggcatggag
     3241 accctcttca ctgttttttc tcgaacagaa cacttaagcc ctctcaccgt catcaccaaa
     3301 gagactgaaa aagcagtgcc ccccagagac attgaagaag gcaatgtgaa gatgctgggc
     3361 atgcagatcc ccataaagaa tgttgagatg ctggcctctg ttttggttgc cattagtgtc
     3421 acccttctgc tcgttgttct ggctcttggt ggagtggttt ggtaccaaca tcgacagaga
     3481 aagctacgac gcaataggag gtccatcctg gatgacagct tcaagcttct gtctttcaaa
     3541 cagtaacatc tggagcctgg agatatcctc aggaagcaca tctgtagtgc actcccagca
     3601 ggccatggac tagtcactaa ccccacactc aaaggggcat gggtggtgga gaagcagaag
     3661 gagcaatcaa gcttatctgg atatttcttt ctttatttat tttacatgga aataatatga
     3721 tttcactttt tctttagttt ctttgctcta cgtgggcacc tggcactaag ggagtacctt
     3781 attatcctac atcgcaaatt tcaacagcta cattatattt ccttctgaca cttggaaggt
     3841 attgaaattt ctagaaatgt atccttctca caaagtagag accaagagaa aaactcattg
     3901 attgggtttc tacttctttc aaggactcag gaaatttcac tttgaactga ggccaagtga
     3961 gctgttaaga taacccacac ttaaactaaa ggctaagaat ataggcttga tgggaaattg
     4021 aaggtaggct gagtattggg aatccaaatt gaattttgat tctccttggc agtgaactac
     4081 tttgaagaag tggtcaatgg gttgttgctg ccatgagcat gtacaacctc tggagctaga
     4141 agctcctcag gaaagccagt tctccaagtt cttaacctgt ggcactgaaa ggaatgttga
     4201 gttacctctt catgttttag acagcaaacc ctatccatta aagtacttgt tagaacactg
     4261 aaaaaaaaaa aaaaaaaaa
//