LOCUS       AL035291                5556 bp    mRNA    linear   HUM 15-OCT-2008
DEFINITION  H.sapiens gene from PACs 125H23 and 105D12.
ACCESSION   AL035291
VERSION     AL035291.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5556)
  AUTHORS   Rhodes S.
  JOURNAL   Submitted (06-JAN-1999) to the INSDC. E-mail contact:
            humquery@sanger.ac.uk
COMMENT     This sequence was generated from cDNA clones isolated using
            sequence from the bacterial clones 125H23 (Z94054) and 105D12
            (Z96050) and EST data. The EST sequences listed match this
            sequence with an identity of at least 95% between the coordinates
            shown. Further information can be found at
            http://www.sanger.ac.uk/HGP/Chr1/ Sanger Centre name:
            dJ125H23.C1.1. This experimentally determined gene has a CpG
            island.
FEATURES             Location/Qualifiers
     source          1..5556
                     /db_xref="H-InvDB:HIT000250042"
                     /organism="Homo sapiens"
                     /chromosome="1"
                     /map="1q24"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     exon            1..281
                     /number=1
     CDS             220..3984
                     /product="hypothetical protein"
                     /db_xref="GOA:Q9UBS9"
                     /db_xref="H-InvDB:HIT000250042.15"
                     /db_xref="HGNC:HGNC:1240"
                     /db_xref="InterPro:IPR008979"
                     /db_xref="InterPro:IPR012919"
                     /db_xref="UniProtKB/Swiss-Prot:Q9UBS9"
                     /protein_id="CAA22894.1"
                     /translation="MKKHRRALALVSCLFLCSLVWLPSWRVCCKESSSASASSYYSQD
                     DNCALENEDVQFQKKDEREGPINAESLGKSGSNLPISPKEHKLKDDSIVDVQNTESKK
                     LSPPVVETLPTVDLHEESSNAVVDSETVENISSSSTSEITPISKLDEIEKSGTIPIAK
                     PSETEQSETDCDVGEALDASAPIEQPSFVSPPDSLVGQHIENVSSSHGKGKITKSEFE
                     SKVSASEQGGGDPKSALNASDNLKNESSDYTKPGDIDPTSVASPKDPEDIPTFDEWKK
                     KVMEVEKEKSQSMHASSNGGSHATKKVQKNRNNYASVECGAKILAANPEAKSTSAILI
                     ENMDLYMLNPCSTKIWFVIELCEPIQVKQLDIANYELFSSTPKDFLVSISDRYPTNKW
                     IKLGTFHGRDERNVQSFPLDEQMYAKYVKMFIKYIKVELLSHFGSEHFCPLSLIRVFG
                     TSMVEEYEEIADSQYHSERQELFDEDYDYPLDYNTGEDKSSKNLLGSATNAILNMVNI
                     AANILGAKTEDLTEGNKSISENATATAAPKMPESTPVSTPVPSPEYVTTEVHTHDMEP
                     STPDTPKESPIVQLVQEEEEEASPSTVTLLGSGEQEDESSPWFESETQIFCSELTTIC
                     CISSFSEYIYKWCSVRVALYRQRSRTALSKGKDYLVLAQPPLLLPAESVDVSVLQPLS
                     GELENTNIEREAETVVLGDLSSSMHQDDLVNHTVDAVELEPSHSQTLSQSLLLDITPE
                     INPLPKIEVSESVEYEAGHIPSPVIPQESSVEIDNETEQKSESFSSIEKPSITYETNK
                     VNELMDNIIKEDVNSMQIFTKLSETIVPPINTATVPDNEDGEAKMNIADTAKQTLISV
                     VDSSSLPEVKEEEQSPEDALLRGLQRTATDFYAELQNSTDLGYANGNLVHGSNQKESV
                     FMRLNNRIKALEVNMSLSGRYLEELSQRYRKQMEEMQKAFNKTIVKLQNTSRIAEEQD
                     QRQTEAIQLLQAQLTNMTQLVSNLSATVAELKREVSDRQSYLVISLVLCVVLGLMLCM
                     QRCRNTSQFDGDYISKLPKSNQYPSPKRCFSSYDDMNLKRRTSFPLMRSKSLQLTGKE
                     VDPNDLYIVEPLKFSPEKKKKRCKYKIEKIETIKPEEPLHPIANGDIKGRKPFTNQRD
                     FSNMGEVYHSSYKGPPSEGSSETSSQSEESYFCGISACTSLCNGQSQKTKTEKRALKR
                     RRSKVQDQGKLIKTLIQTKSGSLPSLHDIIKGNKEITVGTFGVTAVSGHI"
     exon            282..396
                     /number=2
     exon            397..507
                     /number=3
     exon            508..662
                     /number=4
     exon            663..800
                     /number=5
     exon            801..951
                     /number=6
     exon            952..1075
                     /number=7
     exon            1076..1200
                     /number=8
     exon            1201..1268
                     /number=9
     exon            1269..1376
                     /number=10
     exon            1377..1482
                     /number=11
     exon            1483..1503
                     /number=12
     exon            1504..1559
                     /number=13
     exon            1560..1651
                     /number=14
     exon            1652..1717
                     /number=15
     exon            1718..1786
                     /number=16
     exon            1787..1877
                     /number=17
     exon            1878..3044
                     /number=18
     exon            3045..3132
                     /number=19
     misc_feature    3113..3500
                     /note="match: EST N28026"
     exon            3133..3237
                     /number=20
     exon            3238..3392
                     /number=21
     misc_feature    3323..3484
                     /note="match: EST AA481703"
     misc_feature    3372..3718
                     /note="match: EST AA336692"
     exon            3393..3484
                     /number=22
     exon            3485..3540
                     /number=23
     exon            3541..5556
                     /number=24
     misc_feature    3755..4174
                     /note="match: EST AA374971"
     misc_feature    3773..4226
                     /note="match: EST AA235936"
     misc_feature    3965..4169
                     /note="match: EST AA343970"
     misc_feature    4104..4368
                     /note="match: EST AA330203"
     misc_feature    4104..4531
                     /note="match: EST N32686"
     misc_feature    join(4114..4356,4376..4402)
                     /note="match: EST H26460"
     misc_feature    4160..4338
                     /note="match: EST N44164"
     misc_feature    complement(4316..4401)
                     /note="match: EST AA103178"
     misc_feature    4400..4670
                     /note="match: EST AA373313"
     misc_feature    join(4515..4695,4719..4949)
                     /note="match: EST AA278830"
     misc_feature    join(4692..4749,4784..4890,4890..4922)
                     /note="match: EST T27279"
     misc_feature    4785..4868
                     /note="match: EST T27289"
     misc_feature    4898..5157
                     /note="match: EST H88063"
     misc_feature    5042..5343
                     /note="match: EST D61948"
     misc_feature    complement(5087..5554)
                     /note="match: EST AA974378"
     misc_feature    complement(join(5092..5117,5146..5556))
                     /note="match: EST N45113"
     misc_feature    complement(5093..5554)
                     /note="match: EST AI203426"
     misc_feature    complement(join(5104..5366,5363..5556))
                     /note="match: EST N36176"
     misc_feature    complement(5117..5556)
                     /note="match: EST AI142894"
     misc_feature    complement(5134..5556)
                     /note="match: EST N22982"
     misc_feature    complement(5136..5554)
                     /note="match: EST AI203406"
     misc_feature    complement(5138..5554)
                     /note="match: EST AI206281"
     misc_feature    complement(5146..5556)
                     /note="match: EST AA864476"
     misc_feature    complement(5159..5556)
                     /note="match: EST AI217893"
     misc_feature    complement(5168..5556)
                     /note="match: EST AA830836"
     misc_feature    5188..5452
                     /note="match: EST H88718"
     misc_feature    complement(5191..5556)
                     /note="match: EST AA236951"
     misc_feature    complement(5231..5556)
                     /note="match: EST AA757158"
     misc_feature    complement(5239..5556)
                     /note="match: EST AA868446"
     misc_feature    5454..5556
                     /note="match: EST C00478"
     regulatory      5531..5536
                     /regulatory_class="polyA_signal_sequence"
BASE COUNT     1784 a   1008 c   1139 g   1625 t
ORIGIN
        1 ccggagtcct gtgaagcgcc cctgtccgcg cctctgtggg gccctcagag agggctgcca
       61 ggacgcgagc cactgaggag ccgctcagcc agcgccatag cccttaggac tatcggtcac
      121 attctcgcgc tcctgctccg gctcctccat cttggcctcg gcagtggcgg ctgccgggag
      181 gatgtgccgc cttctggcag ggggaagaag gaggagaaga tgaagaagca ccggcgggcc
      241 ttggccctgg tctcctgcct ctttctgtgc tctctggtct ggcttcccag ctggcgtgta
      301 tgttgtaaag agagttcttc agcttcagcg tcatcatatt actctcaaga tgacaactgc
      361 gcactagaaa atgaagatgt acaattccag aaaaaggatg aaagagaggg acctatcaat
      421 gccgaatcat tgggaaaatc aggttcaaat ttacctattt ctccaaaaga acataaatta
      481 aaagatgatt ctattgtgga tgtacaaaat acagagtcaa aaaagttaag tccaccggtg
      541 gtggagacac tccctacagt tgatttgcat gaagagtctt ccaatgcagt tgtggacagt
      601 gaaactgttg aaaatatttc cagctcatct acctcagaaa tcactccaat ctcaaagctt
      661 gatgaaatag aaaaatctgg tactattccg atagccaaac caagtgaaac tgagcagtct
      721 gaaactgatt gtgatgttgg tgaggccctt gatgctagtg ctccaattga acaaccttcc
      781 tttgtcagtc cacctgacag ccttgttggc cagcatatag aaaatgtatc atcttcacat
      841 ggtaaaggaa agataacaaa atcagaattt gaatcaaaag tttcagcaag tgaacagggc
      901 ggtggtgatc caaaatctgc attgaatgct tcagataatt taaaaaatga gagctctgat
      961 tatacaaaac caggagacat tgaccctaca tcagtagcaa gtcccaaaga tccagaagat
     1021 ataccaacat ttgatgaatg gaagaagaaa gttatggaag tagaaaaaga aaaaagtcag
     1081 tcgatgcatg catcttctaa tggaggttca catgccacca aaaaggtcca gaaaaatcga
     1141 aataattatg cctcagtaga atgtggtgcc aaaattctag cagctaatcc agaagccaag
     1201 agcacatctg ctattcttat agaaaatatg gatctttaca tgttgaatcc ttgcagcact
     1261 aaaatttggt ttgttattga actttgtgaa ccaattcaag taaaacagct tgatattgca
     1321 aattatgaat tattttcttc tactcctaaa gattttctgg tttctatcag tgacagatat
     1381 ccaacaaata agtggattaa gctgggtact tttcatggta gagatgagcg gaatgtacag
     1441 agtttccctt tagatgaaca gatgtatgca aaatatgtca agatgttcat caagtacata
     1501 aaggttgagt tgctatcaca ttttggatca gagcactttt gtccattaag ccttataagg
     1561 gtatttggca ctagcatggt ggaagaatat gaagaaattg ctgattccca gtatcactca
     1621 gaacgccagg aactatttga tgaggactat gattatccac tggattataa tactggagag
     1681 gataaatcct caaaaaatct tcttggttct gctacaaatg ccattctaaa tatggtgaat
     1741 attgctgcta atattctggg agcaaaaact gaagacctga cagaaggaaa taaaagtata
     1801 tctgagaatg ccactgccac agctgcacct aaaatgcctg aatcaactcc tgtttcaact
     1861 cctgttccat ctcctgagta tgtaaccact gaagtacaca cacatgacat ggagccgtca
     1921 acaccagata ctccaaaaga gagtcccatt gtacagttag ttcaagagga ggaagaggag
     1981 gcaagtccat ctacagtgac ccttctgggc agcggtgaac aggaagatga atcatcaccc
     2041 tggtttgagt cagagacaca aatattttgc agtgaactga ccacaatttg ttgtatttct
     2101 agtttttcag aatacatata taaatggtgt tcagttagag ttgctcttta tcggcagcgc
     2161 agccgaactg ctttgagtaa aggaaaagat tatcttgtgt tagctcaacc acccttacta
     2221 cttcctgcgg aatcagtaga tgtttcagta ttgcaacctc tgagtggaga attggaaaat
     2281 acgaatatag aaagggaagc tgaaactgtt gttctgggtg atttaagtag tagtatgcac
     2341 caggatgact tggtgaatca cactgtagat gcagttgaac ttgaaccaag ccattctcaa
     2401 actctttctc agtctcttct tttagatatt accccagaaa tcaatccctt gcctaaaata
     2461 gaagtatctg agtctgttga atatgaggca ggacatatac catcaccagt gattccccaa
     2521 gagagttctg ttgagatcga taatgaaaca gaacaaaagt ctgagagctt tagttctata
     2581 gagaaaccat ctattaccta tgaaacaaat aaagttaatg agttaatgga taatattata
     2641 aaagaagatg tgaactccat gcaaattttc acaaagctgt ctgaaacaat agtgccacca
     2701 ataaatacag ccactgtacc cgacaatgaa gatggggaag ccaaaatgaa tatagctgac
     2761 acagcaaagc aaactttgat ttctgttgtg gattcttctt cattacctga agtaaaagaa
     2821 gaagaacagt ctccagaaga tgcccttttg agagggttac agaggacagc tacagatttt
     2881 tatgctgaat tgcaaaattc tacagatcta ggatatgcta atggaaatct tgtacatgga
     2941 tcaaaccaaa aggagtcagt atttatgaga cttaataatc gtattaaagc cttagaagtt
     3001 aacatgtctc tcagtggtcg ctatctggag gagcttagcc aaaggtaccg aaaacaaatg
     3061 gaagaaatgc aaaaggcttt caataaaaca atcgtgaaac ttcagaatac ttcaagaata
     3121 gcagaggagc aggatcagcg gcaaactgaa gccatccagt tgctacaggc acagctgacc
     3181 aacatgacac agcttgtttc aaatttatca gcaacagtag cagaattgaa acgggaggtt
     3241 tcagatcgac aaagctatct tgtcatatct ttggttcttt gtgttgtctt gggactgatg
     3301 ctttgtatgc agcgttgtcg aaatacttct caatttgatg gagattatat ttcaaaactt
     3361 cctaaaagta atcagtatcc aagccctaaa aggtgtttct cttcctatga tgatatgaat
     3421 ttgaaaagaa gaacttcatt cccactcatg agatccaagt ctctacagtt aactggcaaa
     3481 gaagtagacc caaatgattt gtacattgta gaacccctca agttttctcc agaaaagaag
     3541 aagaagcgct gcaagtacaa aattgaaaaa attgagacca taaagcctga agaaccattg
     3601 caccccatag ccaatggcga cataaaagga agaaagccct ttacgaacca gagagatttt
     3661 tctaatatgg gagaagttta tcactcttct tataaaggtc ctccatctga aggaagctca
     3721 gaaacttcat cacagtcaga agagtcctat ttttgtggca tttcagcttg cacaagtctg
     3781 tgcaatggac agtctcaaaa gacaaaaact gagaagaggg ctttaaaacg aagacgatct
     3841 aaagtccaag accaaggaaa attgataaaa actctaatac agactaagtc gggatcattg
     3901 ccgagcctgc atgacataat caaaggaaac aaagagatca ccgtgggaac atttggtgtt
     3961 acagcagtct cgggacatat ctaaaattaa ttgaactttt catacagaag acttttttgt
     4021 tgttgttctt tgaagaacag tctgtagtat ttgaagggtt tgggggaggg agaaaatatt
     4081 aatgggaaag gcattcagaa attatggttt ctaccttttt aaaaagtaga tgggattgtg
     4141 tcaatcttgg ttaatgagct acagttttac aaagctgatc acttcctata aggacaatgg
     4201 tagacatttt ataaagatgt tttttcacaa gattaattac tgggacaaaa gtaatttgga
     4261 agcccagttc cttaggtggg ataggaatga aagcctaaac ctcttccttt agctttgttc
     4321 ctatttcttg caccttccca tatttatgtg ccttttgtct atttataatg ccactggaag
     4381 aggagggata actttttctg ttatttgatt tcttttataa ctttgttagg tttttgaagc
     4441 tgcaaacact acaatgcttt gagggggtct gtgcctgaag ctcaggagtg tggatcagac
     4501 agtctaaaga tcctaaaaac ttgccaactg gatctttgtt tagcaaactc actggaaatg
     4561 aacacttaat ggaattttta agtctgttct gttaggtaga tggtgatgct cttgttattt
     4621 tcacttattc agactggatt acttcttact tagttactaa ctcaatgagg aaaaatccct
     4681 acaggatctt tttttgcaaa caactgatat atgcagacaa atttttgaca aattcacctt
     4741 ttaaacacac ttaaccattt gtgaaggttt tctttagctt acattttaaa catacacaat
     4801 aaacactaat cctccaaact ttcactgttt ttattagtat gaatataaaa tttgaaggtt
     4861 tggccaatta gtacaagtct catgatataa tcactgcctg catacatatg cacagatcca
     4921 gttagtgagt ttgtcaagct taatctaatt ggttaagtct aaagagatta ttattccttg
     4981 atgtttgctt tgtattggct acaaatgtgc agaggtaata catatgtgat gtcgatgtct
     5041 ctgtcttttt ttttgtcttt aaaaaataat tggcagcaac tgtatttgaa taaaatgatt
     5101 tcttagtatg attgtacagt aatgaatgaa agtggaacat gtttcttttt gaaagggaga
     5161 gaattgacca tttattgttg tgatgtttaa gttataactt attgagcact tttagtagtg
     5221 ataactgttt ttaaacttgc ctaatacctt tcttgggtat tgtttgtaat gtgacttatt
     5281 taacgccttt tttgtttgtt taagttgctg ctttaggtta acagcgtgtt ttagaagatt
     5341 taaatttctt tcctgtctgc acaattagct attcagagca agagggcctg attttataga
     5401 agccccttga aaagaggtcc agatgagagc agagatacag tgagaaatta tgtgatctgt
     5461 gtgttgtggg aagagaattt tcaatatgta actacggagc tgtagtgcca ttagaaactg
     5521 tgaatttcca aataaatctg aacacttgtc tttatt
//