LOCUS CR456510 2514 bp mRNA linear HUM 16-OCT-2008 DEFINITION Homo sapiens LARGE full length open reading frame (ORF) cDNA clone (cDNA clone C22ORF:pGEM.LARGE). ACCESSION CR456510 VERSION CR456510.1 KEYWORDS CDNA; chromosome 22; ORF. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2514) AUTHORS Collins J.E., Wright C.L., Edwards C.A., Davis M.P., Grinham J.A., Cole C.G., Goward M.E., Aguado B., Mallya M., Mokrab Y., Huckle E.J., Beare D.M., Dunham I. TITLE A genome annotation-driven approach to cloning the human ORFeome JOURNAL Genome Biol. 5(10), R84-R84(2004). PUBMED 15461802 REFERENCE 2 (bases 1 to 2514) AUTHORS Collins J.E., Wright C.L., Edwards C.A., Davis M.P., Grinham J.A., Cole C.G., Goward M.E., Aguado B., Mallya M., Mokrab Y., Huckle E.J., Beare D.M., Dunham I. JOURNAL Submitted (23-MAR-2006) to the INSDC. Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail enquiries: c22g@sanger.ac.uk COMMENT Sanger Institute name : pGEM.LARGE Homo sapiens cDNA sequence. This sequence was generated as part of The Wellcome Trust Sanger Institute program to isolate cDNA clones representing the full length open reading frame of well annotated protein coding genes on human chromosome 22. For more information see http://www.sanger.ac.uk/HGP/Chr22/ORFcloning FEATURES Location/Qualifiers source 1..2514 /db_xref="H-InvDB:HIT000267444" /organism="Homo sapiens" /chromosome="22" /lab_host="Mach1" /mol_type="mRNA" /clone="pGEM.LARGE" /db_xref="taxon:9606" CDS 29..2299 /gene="LARGE" /db_xref="GOA:O95461" /db_xref="H-InvDB:HIT000267444.13" /db_xref="HGNC:HGNC:6511" /db_xref="InterPro:IPR002495" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:O95461" /protein_id="CAG30396.1" /translation="MLGICRGRRKFLAASLSLLCIPAITWIYLFSGSFEDGKPVSLSP LESQAHSPRYTASSQRERESLEVRMREVEEENRALRRQLSLAQGRAPSHRRGNHSKTY SMEEGTGDSENLRAGIVAGNSSECGQQPVVEKCETIHVAIVCAGYNASRDVVTLVKSV LFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSEVSWIPNKHYSG IYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLGLVENQSDW YLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLAD QDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQCYRDVSDLKVIHWNSPKKLRVKNK HVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQKQLSELDEDDLCYEFRRER FTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDA EAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDF LPMYGLYEYLRKSVIQLDLANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFT FRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNKV AHIMELDVQEYEFIVLPNAYMIHMPHAPSFDITKFRSNKQYRICLKTLKEEFQQDMSR RYGFAALKYLTAENNS" BASE COUNT 563 a 736 c 673 g 542 t ORIGIN 1 gggattaggg attgccactt ctgagaggat gctgggaatc tgcaggggga gacggaaatt 61 cttggctgcc tcgttgagtc ttctctgcat cccagccatc acctggattt acctgttttc 121 tgggagcttc gaagatggaa agcccgtgtc tctgtcaccg ctggagtccc aggcacacag 181 ccccaggtac acggcctcca gccagcggga gcgcgagagc ctggaggtgc gcatgcgcga 241 ggtggaggag gagaaccgcg ccctccgcag gcagctcagc ctggcccagg gccgagcccc 301 atcccatcgc cgaggcaacc actccaagac ctactccatg gaggagggca ctggagacag 361 cgagaacctt cgggctggca tcgtggcagg caacagctcc gagtgtgggc agcagccggt 421 cgtggagaaa tgcgagacaa tccacgttgc tattgtctgc gctggataca atgccagccg 481 ggatgtcgtc accctggtca aatccgtcct gttccataga cggaaccctc tgcacttcca 541 ccttattgct gactccattg cggagcagat cctggccacg ctcttccaga cctggatggt 601 gcccgctgtg cgtgtggact tctacaatgc agacgagctc aagtctgaag tttcctggat 661 ccccaataaa cattactctg ggatttatgg tctgatgaag cttgtcctga ccaagactct 721 tcctgccaac ctggagagag tcatcgtcct tgacacggat atcacctttg ccactgacat 781 tgcagagctg tgggctgtgt tccacaagtt caaaggtcag caagtcctgg gcttggtgga 841 gaaccagagt gactggtacc ttggaaacct gtggaaaaat caccgcccat ggccagccct 901 tggaagaggc tacaacacag gggtgatcct gttacttctg gataagctgc ggaagatgaa 961 atgggagcag atgtggaggc tgaccgcaga gagggagctc atgggcatgc tctctacatc 1021 cttagctgac caggatattt tcaatgccgt catcaaacaa aaccccttcc ttgtgtacca 1081 gctcccctgc ttctggaatg tgcagctgtc agaccacacc cgctccgagc agtgctacag 1141 agacgtgtct gatctaaagg tcattcactg gaactccccc aagaagctcc gggtgaagaa 1201 caagcatgtg gagttttttc gcaacctcta cctgaccttc ctggagtatg acggcaatct 1261 tctgaggcgg gaactgtttg gctgccccag tgaggctgat gtcaacagtg aaaacctcca 1321 gaagcagctg tctgagctgg acgaggacga cctgtgctat gagttccggc gagagcgctt 1381 cactgtccac cgcacccacc tgtacttcct gcactacgag tatgagcctg cagcagacag 1441 cacggacgtc accctggtcg ctcagctgtc catggacagg ctccagatgc tggaggccat 1501 ctgcaagcac tgggaggggc ccatcagcct ggccctctac ctgtcagacg ccgaggccca 1561 gcagttcctc cgctacgcac agggctctga ggtgcttatg agccgccaca acgtgggcta 1621 ccacatcgtg tacaaggagg gccagttcta ccccgtgaac ctgctgcgca acgtggccat 1681 gaagcacatc agcactccct acatgttcct gtctgacatt gacttcctgc ccatgtatgg 1741 gctctatgag tacctcagga agtctgtcat ccagctcgat cttgccaaca ccaagaaagc 1801 aatgattgtc cccgcgttcg agacactgcg ctaccggctg tccttcccca agtcaaaagc 1861 ggagttgctg tcaatgctgg acatggggac cctcttcaca ttcaggtacc acgtctggac 1921 gaaaggccac gcacccacaa acttcgccaa gtggcggacc gccaccacgc cttaccgggt 1981 tgagtgggag gccgattttg agccgtatgt tgttgtgaga cgtgactgcc cggagtacga 2041 ccggaggttt gtaggctttg gctggaacaa agtggctcat atcatggagc tggatgtgca 2101 ggagtatgag ttcattgtgc tgcccaacgc ctacatgatc cacatgcctc atgcccccag 2161 cttcgacatt accaagttcc gttccaacaa gcaataccgc atctgtctca aaaccctcaa 2221 ggaagagttt cagcaggaca tgtcccgccg ctacggcttt gctgccctga aatatctcac 2281 agccgagaac aacagctagc accaagaagc ccaccactag ggggagacat gctgtagggg 2341 aagtgccact cgctgtttgg ggcccggcct tcaaattcaa aattgagcca tgctttttcg 2401 gtttgttttt atttatctct ttggcccagc caagctgccc tcactacaga gaccttggac 2461 aaggatccag ccagtccctc tctgccccac aaccctgcat tcccagaggt tagc //