LOCUS       BC020567                4125 bp    mRNA    linear   HUM 06-OCT-2003
DEFINITION  Homo sapiens rho/rac guanine nucleotide exchange factor (GEF) 2,
            mRNA (cDNA clone MGC:21557 IMAGE:4157775), complete cds.
ACCESSION   BC020567
VERSION     BC020567.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4125)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4125)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: David N. Louis, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 20 Row: c Column: 24
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Similarity but not
            identity to protein.
FEATURES Location/Qualifiers source 1..4125 /db_xref="H-InvDB:HIT000038828" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21557 IMAGE:4157775" /tissue_type="Brain, anaplastic oligodendroglioma with 1p/19q loss" /clone_lib="NCI_CGAP_Brn67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..4125 /gene="ARHGEF2" /gene_synonym="DKFZp547L106" /gene_synonym="GEF" /gene_synonym="GEF-H1" /gene_synonym="GEFH1" /gene_synonym="KIAA0651" /gene_synonym="LFP40" /gene_synonym="P40" /db_xref="GeneID:9181" /db_xref="MIM:607560" CDS 182..3061 /gene="ARHGEF2" /gene_synonym="DKFZp547L106" /gene_synonym="GEF" /gene_synonym="GEF-H1" /gene_synonym="GEFH1" /gene_synonym="KIAA0651" /gene_synonym="LFP40" /gene_synonym="P40" /codon_start=1 /product="ARHGEF2 protein" /protein_id="AAH20567.1" /db_xref="GeneID:9181" /db_xref="MIM:607560" /translation="MKEAKDARYTNGHLFTTISVSGMTMCYACNKSITAKEALICPTC NVTIHNRCKDTLANCTKVKQKQQKAALLKNNTALQSVSLRSKTTIRERPSSAIYPSDS FRQSLLGSRRGRSSLSLAKSVSTTNIAGHFNDESPLGLRRILSQSTDSLNMRNRTLSV ESLIDEAEVIYSELMSDFEMDEKDFAADSWSLAVDSSFLQQHKKEVMKQQDVIYELIQ TELHHVRTLKIMTRLFRTGMLEELHLEPGVVQGLFPCVDELSDIHTRFLSQLLERRRQ ALCPGSTRNFVIHRLGDLLISQFSGPSAEQMCKTYSEFCSRHSKALKLYKELYARDKR FQQFIRKVTRPAVLKRHGVQECILLVTQRITKYPLLISRILQHSHGIEEERQDLTTAL GLVKELLSNVDEGIYQLEKGARLQEIYNRMDPRAQTPVPGKGPFGREELLRRKLIHDG CLLWKTATGRFKDVLVLLMTDVLVFLQEKDQKYIFPTLDKPSVVSLQNLIVRDIANQE KGMFLISAAPPEMYEVHTASRDDRSTWIRVIQQSVRTCPSREDFPLIETEDEAYLRRI KMELQQKDRALVELLREKVGLFAEMTHFQAEEDGGSGMALPTLPRGLFRSESLESPRG ERLLQDAIREVEGLKDLLVGPGVELLLTPREPALPLEPDSGGNTSPGVTANGEARTFN GSIELCRADSDSSQRDRNGNQLRSPQEEALQRLVNLYGLLHGLQAAVAQQDTLMEARF PEGPERREKLCRANSRDGEAGRAGAAPVAPEKQATELALLQRQHALLQEELRRCRRLG EERATEAGSLEARLRESEQARALLEREAEEARRQLAALGQTEPLPAEAPWARRPVDPR RRSLPAGDALYLSFNPPQPSRGTDRLDLPVTTRSVHRNFEDRERQELGSPEERLQDSS DPDTGSEEEGSSRLSPPHSPRDFTRMQDIPEETESRDGEAVASES" misc_feature 218..358 /gene="ARHGEF2" /gene_synonym="DKFZp547L106" /gene_synonym="GEF" /gene_synonym="GEF-H1" /gene_synonym="GEFH1" 
/gene_synonym="KIAA0651" /gene_synonym="LFP40" /gene_synonym="P40" /note="C1; Region: Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains)" /db_xref="CDD:smart00109" misc_feature 815..1393 /gene="ARHGEF2" /gene_synonym="DKFZp547L106" /gene_synonym="GEF" /gene_synonym="GEF-H1" /gene_synonym="GEFH1" /gene_synonym="KIAA0651" /gene_synonym="LFP40" /gene_synonym="P40" /note="RhoGEF; Region: Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases" /db_xref="CDD:smart00325" BASE COUNT 954 a 1183 c 1168 g 820 t ORIGIN 1 gagaccaacg cgtgcgggcc gaacccctcc ccccgccttc ccccaacaat acaggacgcc 61 ggggtccgcg ccgcgtcctc cctggtcccc ccgtccgatt atgtctcgga tcgaatccct 121 cacgcgggcg cggatcgacc ggagcagaga gctggcgagc aagacccggg aaaaggagaa 181 gatgaaggaa gccaaggatg cccgctatac caatgggcac ctcttcacca ccatttcagt 241 ttcaggcatg accatgtgct atgcctgtaa caagagcatc acagccaagg aagccctcat 301 ctgcccaacc tgcaatgtga ctatccacaa ccgctgtaaa gacaccctcg ccaactgtac 361 caaggtcaag cagaagcaac agaaagcggc cctgctgaag aacaacaccg ccttgcagtc 421 cgtttctctt cgaagtaaga caaccatccg ggagcggcca agctcggcca tctacccctc 481 cgacagcttc cggcagtccc tcctgggctc ccgccgtggc cgctcctcct tgtctttagc 541 caagagtgtt tctaccacca acattgctgg acatttcaat gatgagtctc ccctggggct 601 gcgccggatc ctctcacagt ccacagactc cctcaacatg cggaaccgaa ccctatccgt 661 ggaatccctc attgacgaag cagaggtaat ctacagtgag ctgatgagtg actttgagat 721 ggatgagaag gactttgcag ctgactcttg gagtcttgct gtggacagca gcttcctgca 781 gcagcataaa aaggaggtga tgaagcagca agatgtcatc tatgagctaa tccagacaga 841 gctgcaccat gtgaggacac tgaagatcat gacccgcctc ttccgcacgg ggatgctgga 901 agagctacac ttggagccag gagtggtcca gggcctgttc ccctgcgtgg acgagctcag 961 tgacatccat acacgcttcc tcagccagct attagaacgc cgacgccagg ccctgtgccc 1021 tggcagcacc cggaactttg tcatccatcg cttgggtgat ctgctcatca gccagttctc 1081 aggtcctagt gcggagcaga tgtgtaagac ctactcggag ttctgcagcc gccacagcaa 1141 ggccttaaag ctctataagg agctgtacgc ccgagacaaa cgcttccagc aattcatccg 1201 gaaagtgacc cgccccgccg tgctcaagcg gcacggggta 
caggagtgca tcctgctggt 1261 gactcagcgc atcaccaagt acccgttact catcagccgc atcctgcagc attcccacgg 1321 gatcgaggag gagcgccagg acctgaccac agcactgggg ctagtgaagg agctgctgtc 1381 caatgtggac gagggtattt atcagctgga gaaaggggcc cgtctgcagg agatctacaa 1441 ccgcatggac cctcgggccc aaaccccagt gcctggcaag ggcccctttg gccgagagga 1501 acttctgagg cgcaaactca tccacgatgg ctgcctgctc tggaagacag cgacggggcg 1561 cttcaaagat gtgctagtgc tgctgatgac agatgtactg gtgtttctcc aggaaaagga 1621 ccagaagtac atctttccta ccctggacaa gccttcagtg gtatcgctgc agaatctaat 1681 cgtacgagac attgccaacc aggagaaagg gatgtttctg atcagcgcag ccccacctga 1741 gatgtacgag gtgcacacag catcccggga tgaccggagc acctggatcc gggtcattca 1801 gcagagcgtg cgcacatgcc catccaggga ggacttcccc ctgattgaga cagaggatga 1861 ggcttacctg cggcgaatta agatggagtt gcagcagaag gaccgggcac tggtggagct 1921 gctgcgagag aaggtcgggc tgtttgctga gatgacccat ttccaggccg aagaggatgg 1981 tggcagtggg atggccctgc ccaccctgcc caggggcctt ttccgctctg agtcccttga 2041 gtcccctcgt ggcgagcggc tgctgcagga tgccatccgt gaggtggagg gtctgaaaga 2101 cctgctggtg gggccaggag tggaactgct cttgacaccc cgagagccag ccctgccctt 2161 ggaaccagac agcggtggta acacgagtcc tggggtcact gccaatggtg aggccagaac 2221 cttcaatggc tccattgaac tctgcagagc tgactcagac tctagccaga gggatcgaaa 2281 tggaaatcag ctgagatcac cgcaagagga ggcgttacag cgattggtca atctctatgg 2341 acttctacat ggcctacagg cagctgtggc ccagcaggac actctgatgg aagcccggtt 2401 ccctgagggc cctgagcggc gggagaagct gtgccgagcc aactctcggg atggggaggc 2461 tggcagggct ggggctgccc ctgtggcccc tgaaaagcag gccacggaac tggcattact 2521 gcagcggcaa catgcgctgc tgcaggagga gctacggcgc tgccggcggc taggtgaaga 2581 acgggcaacc gaagctggca gcctggaggc ccggctccgg gagagtgagc aggcccgggc 2641 actgctggag cgtgaggccg aagaggctcg aaggcagctg gccgccctgg gccagaccga 2701 gccactccca gctgaggccc cctgggcccg cagacctgtg gatcctcggc ggcgcagcct 2761 ccccgcaggc gatgccctgt acttgagttt caacccccca cagcccagcc gaggcactga 2821 ccgcctggat ctacctgtca ctactcgctc tgtccatcga aactttgagg accgagagag 2881 gcaggaactg gggagccccg aagagcggct gcaagacagc agtgaccctg 
acactggcag 2941 cgaggaggaa ggtagcagcc gtctgtctcc gccccacagt ccacgagact ttaccagaat 3001 gcaggacatc ccggaggaga cggagagccg cgacggggag gctgtagcct ccgagagcta 3061 agggggcccc tcccccctgc cccgtgcccc actgaagaac attactgagg gggctaacct 3121 tggggactcc aatttgccaa tgatgaggga acatttgaaa gaactgcaaa ttgtccttgc 3181 cagctcttgg gatccttgga tacctggggc catttaagaa gctaggggaa ttaggccaca 3241 acaccccctg ggacatccga aagctacacc acagatgcca gtggttcatg ccttcttccc 3301 gcaactttag gaaaatttat ttatttattg tttattagtt atggggggag aggggagatt 3361 taaaggacca gggacatggg aaccaagcca tagggatcag agggccttgt ccttgaacac 3421 tactggggta tattcaggct catccacgca gctgctgggt tcttgcccta acggccctcc 3481 cctgcaacat ccgtcttgga ggagaggctg cagccacagc accctactgc cctttaaata 3541 aaggagggct gtgggcaggg ccatgtccct ttctcctctc ccctcaacct cttactgctg 3601 ttctcccttt ctccgtcctt catggaagcc ctgggagata acctggcttc ctggagttga 3661 tggaataaag gttggggtgg ccataatggt ttgttggggg tgagggaaaa aacccacagg 3721 gaccagaatg ttttgttgtt cttttgtttt cttttttgta ccaaagtcaa ctgcacgtgt 3781 ttttatattt ttaagagatc gtaggcaatt agagatcgaa gcctcctatc tccacatctc 3841 tgaagaagtt gaggggtggg ggagagaatg acttctgcct tcatctgcag taacgggggg 3901 acctatactg acctcttccc cagccattta gaaacaagtt ctagggtggg ttggaaaatc 3961 tccaagagcc ctgacctcat cttccacctc agcaaccatg acctgaaacc tcagcgtgaa 4021 tttgggggat ttttcagtgg aacccttgcc cccaaatgtc gaccagcccc caaatgtcga 4081 agaattttct tcttgccaat tttgttgttt aaaaaaaaaa aaaaa //