LOCUS BC020567 4125 bp mRNA linear HUM 06-OCT-2003
DEFINITION Homo sapiens rho/rac guanine nucleotide exchange factor (GEF) 2,
mRNA (cDNA clone MGC:21557 IMAGE:4157775), complete cds.
ACCESSION BC020567
VERSION BC020567.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4125)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4125)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: David N. Louis, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 20 Row: c Column: 24
This clone was selected for full length sequencing because it
passed the following selection criteria: Similarity but not
identity to protein.
FEATURES Location/Qualifiers
source 1..4125
/db_xref="H-InvDB:HIT000038828"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21557 IMAGE:4157775"
/tissue_type="Brain, anaplastic oligodendroglioma with
1p/19q loss"
/clone_lib="NCI_CGAP_Brn67"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..4125
/gene="ARHGEF2"
/gene_synonym="DKFZp547L106"
/gene_synonym="GEF"
/gene_synonym="GEF-H1"
/gene_synonym="GEFH1"
/gene_synonym="KIAA0651"
/gene_synonym="LFP40"
/gene_synonym="P40"
/db_xref="GeneID:9181"
/db_xref="MIM:607560"
CDS 182..3061
/gene="ARHGEF2"
/gene_synonym="DKFZp547L106"
/gene_synonym="GEF"
/gene_synonym="GEF-H1"
/gene_synonym="GEFH1"
/gene_synonym="KIAA0651"
/gene_synonym="LFP40"
/gene_synonym="P40"
/codon_start=1
/product="ARHGEF2 protein"
/protein_id="AAH20567.1"
/db_xref="GeneID:9181"
/db_xref="MIM:607560"
/translation="MKEAKDARYTNGHLFTTISVSGMTMCYACNKSITAKEALICPTC
NVTIHNRCKDTLANCTKVKQKQQKAALLKNNTALQSVSLRSKTTIRERPSSAIYPSDS
FRQSLLGSRRGRSSLSLAKSVSTTNIAGHFNDESPLGLRRILSQSTDSLNMRNRTLSV
ESLIDEAEVIYSELMSDFEMDEKDFAADSWSLAVDSSFLQQHKKEVMKQQDVIYELIQ
TELHHVRTLKIMTRLFRTGMLEELHLEPGVVQGLFPCVDELSDIHTRFLSQLLERRRQ
ALCPGSTRNFVIHRLGDLLISQFSGPSAEQMCKTYSEFCSRHSKALKLYKELYARDKR
FQQFIRKVTRPAVLKRHGVQECILLVTQRITKYPLLISRILQHSHGIEEERQDLTTAL
GLVKELLSNVDEGIYQLEKGARLQEIYNRMDPRAQTPVPGKGPFGREELLRRKLIHDG
CLLWKTATGRFKDVLVLLMTDVLVFLQEKDQKYIFPTLDKPSVVSLQNLIVRDIANQE
KGMFLISAAPPEMYEVHTASRDDRSTWIRVIQQSVRTCPSREDFPLIETEDEAYLRRI
KMELQQKDRALVELLREKVGLFAEMTHFQAEEDGGSGMALPTLPRGLFRSESLESPRG
ERLLQDAIREVEGLKDLLVGPGVELLLTPREPALPLEPDSGGNTSPGVTANGEARTFN
GSIELCRADSDSSQRDRNGNQLRSPQEEALQRLVNLYGLLHGLQAAVAQQDTLMEARF
PEGPERREKLCRANSRDGEAGRAGAAPVAPEKQATELALLQRQHALLQEELRRCRRLG
EERATEAGSLEARLRESEQARALLEREAEEARRQLAALGQTEPLPAEAPWARRPVDPR
RRSLPAGDALYLSFNPPQPSRGTDRLDLPVTTRSVHRNFEDRERQELGSPEERLQDSS
DPDTGSEEEGSSRLSPPHSPRDFTRMQDIPEETESRDGEAVASES"
misc_feature 218..358
/gene="ARHGEF2"
/gene_synonym="DKFZp547L106"
/gene_synonym="GEF"
/gene_synonym="GEF-H1"
/gene_synonym="GEFH1"
/gene_synonym="KIAA0651"
/gene_synonym="LFP40"
/gene_synonym="P40"
/note="C1; Region: Protein kinase C conserved region 1
(C1) domains (Cysteine-rich domains)"
/db_xref="CDD:smart00109"
misc_feature 815..1393
/gene="ARHGEF2"
/gene_synonym="DKFZp547L106"
/gene_synonym="GEF"
/gene_synonym="GEF-H1"
/gene_synonym="GEFH1"
/gene_synonym="KIAA0651"
/gene_synonym="LFP40"
/gene_synonym="P40"
/note="RhoGEF; Region: Guanine nucleotide exchange factor
for Rho/Rac/Cdc42-like GTPases"
/db_xref="CDD:smart00325"
BASE COUNT 954 a 1183 c 1168 g 820 t
ORIGIN
1 gagaccaacg cgtgcgggcc gaacccctcc ccccgccttc ccccaacaat acaggacgcc
61 ggggtccgcg ccgcgtcctc cctggtcccc ccgtccgatt atgtctcgga tcgaatccct
121 cacgcgggcg cggatcgacc ggagcagaga gctggcgagc aagacccggg aaaaggagaa
181 gatgaaggaa gccaaggatg cccgctatac caatgggcac ctcttcacca ccatttcagt
241 ttcaggcatg accatgtgct atgcctgtaa caagagcatc acagccaagg aagccctcat
301 ctgcccaacc tgcaatgtga ctatccacaa ccgctgtaaa gacaccctcg ccaactgtac
361 caaggtcaag cagaagcaac agaaagcggc cctgctgaag aacaacaccg ccttgcagtc
421 cgtttctctt cgaagtaaga caaccatccg ggagcggcca agctcggcca tctacccctc
481 cgacagcttc cggcagtccc tcctgggctc ccgccgtggc cgctcctcct tgtctttagc
541 caagagtgtt tctaccacca acattgctgg acatttcaat gatgagtctc ccctggggct
601 gcgccggatc ctctcacagt ccacagactc cctcaacatg cggaaccgaa ccctatccgt
661 ggaatccctc attgacgaag cagaggtaat ctacagtgag ctgatgagtg actttgagat
721 ggatgagaag gactttgcag ctgactcttg gagtcttgct gtggacagca gcttcctgca
781 gcagcataaa aaggaggtga tgaagcagca agatgtcatc tatgagctaa tccagacaga
841 gctgcaccat gtgaggacac tgaagatcat gacccgcctc ttccgcacgg ggatgctgga
901 agagctacac ttggagccag gagtggtcca gggcctgttc ccctgcgtgg acgagctcag
961 tgacatccat acacgcttcc tcagccagct attagaacgc cgacgccagg ccctgtgccc
1021 tggcagcacc cggaactttg tcatccatcg cttgggtgat ctgctcatca gccagttctc
1081 aggtcctagt gcggagcaga tgtgtaagac ctactcggag ttctgcagcc gccacagcaa
1141 ggccttaaag ctctataagg agctgtacgc ccgagacaaa cgcttccagc aattcatccg
1201 gaaagtgacc cgccccgccg tgctcaagcg gcacggggta caggagtgca tcctgctggt
1261 gactcagcgc atcaccaagt acccgttact catcagccgc atcctgcagc attcccacgg
1321 gatcgaggag gagcgccagg acctgaccac agcactgggg ctagtgaagg agctgctgtc
1381 caatgtggac gagggtattt atcagctgga gaaaggggcc cgtctgcagg agatctacaa
1441 ccgcatggac cctcgggccc aaaccccagt gcctggcaag ggcccctttg gccgagagga
1501 acttctgagg cgcaaactca tccacgatgg ctgcctgctc tggaagacag cgacggggcg
1561 cttcaaagat gtgctagtgc tgctgatgac agatgtactg gtgtttctcc aggaaaagga
1621 ccagaagtac atctttccta ccctggacaa gccttcagtg gtatcgctgc agaatctaat
1681 cgtacgagac attgccaacc aggagaaagg gatgtttctg atcagcgcag ccccacctga
1741 gatgtacgag gtgcacacag catcccggga tgaccggagc acctggatcc gggtcattca
1801 gcagagcgtg cgcacatgcc catccaggga ggacttcccc ctgattgaga cagaggatga
1861 ggcttacctg cggcgaatta agatggagtt gcagcagaag gaccgggcac tggtggagct
1921 gctgcgagag aaggtcgggc tgtttgctga gatgacccat ttccaggccg aagaggatgg
1981 tggcagtggg atggccctgc ccaccctgcc caggggcctt ttccgctctg agtcccttga
2041 gtcccctcgt ggcgagcggc tgctgcagga tgccatccgt gaggtggagg gtctgaaaga
2101 cctgctggtg gggccaggag tggaactgct cttgacaccc cgagagccag ccctgccctt
2161 ggaaccagac agcggtggta acacgagtcc tggggtcact gccaatggtg aggccagaac
2221 cttcaatggc tccattgaac tctgcagagc tgactcagac tctagccaga gggatcgaaa
2281 tggaaatcag ctgagatcac cgcaagagga ggcgttacag cgattggtca atctctatgg
2341 acttctacat ggcctacagg cagctgtggc ccagcaggac actctgatgg aagcccggtt
2401 ccctgagggc cctgagcggc gggagaagct gtgccgagcc aactctcggg atggggaggc
2461 tggcagggct ggggctgccc ctgtggcccc tgaaaagcag gccacggaac tggcattact
2521 gcagcggcaa catgcgctgc tgcaggagga gctacggcgc tgccggcggc taggtgaaga
2581 acgggcaacc gaagctggca gcctggaggc ccggctccgg gagagtgagc aggcccgggc
2641 actgctggag cgtgaggccg aagaggctcg aaggcagctg gccgccctgg gccagaccga
2701 gccactccca gctgaggccc cctgggcccg cagacctgtg gatcctcggc ggcgcagcct
2761 ccccgcaggc gatgccctgt acttgagttt caacccccca cagcccagcc gaggcactga
2821 ccgcctggat ctacctgtca ctactcgctc tgtccatcga aactttgagg accgagagag
2881 gcaggaactg gggagccccg aagagcggct gcaagacagc agtgaccctg acactggcag
2941 cgaggaggaa ggtagcagcc gtctgtctcc gccccacagt ccacgagact ttaccagaat
3001 gcaggacatc ccggaggaga cggagagccg cgacggggag gctgtagcct ccgagagcta
3061 agggggcccc tcccccctgc cccgtgcccc actgaagaac attactgagg gggctaacct
3121 tggggactcc aatttgccaa tgatgaggga acatttgaaa gaactgcaaa ttgtccttgc
3181 cagctcttgg gatccttgga tacctggggc catttaagaa gctaggggaa ttaggccaca
3241 acaccccctg ggacatccga aagctacacc acagatgcca gtggttcatg ccttcttccc
3301 gcaactttag gaaaatttat ttatttattg tttattagtt atggggggag aggggagatt
3361 taaaggacca gggacatggg aaccaagcca tagggatcag agggccttgt ccttgaacac
3421 tactggggta tattcaggct catccacgca gctgctgggt tcttgcccta acggccctcc
3481 cctgcaacat ccgtcttgga ggagaggctg cagccacagc accctactgc cctttaaata
3541 aaggagggct gtgggcaggg ccatgtccct ttctcctctc ccctcaacct cttactgctg
3601 ttctcccttt ctccgtcctt catggaagcc ctgggagata acctggcttc ctggagttga
3661 tggaataaag gttggggtgg ccataatggt ttgttggggg tgagggaaaa aacccacagg
3721 gaccagaatg ttttgttgtt cttttgtttt cttttttgta ccaaagtcaa ctgcacgtgt
3781 ttttatattt ttaagagatc gtaggcaatt agagatcgaa gcctcctatc tccacatctc
3841 tgaagaagtt gaggggtggg ggagagaatg acttctgcct tcatctgcag taacgggggg
3901 acctatactg acctcttccc cagccattta gaaacaagtt ctagggtggg ttggaaaatc
3961 tccaagagcc ctgacctcat cttccacctc agcaaccatg acctgaaacc tcagcgtgaa
4021 tttgggggat ttttcagtgg aacccttgcc cccaaatgtc gaccagcccc caaatgtcga
4081 agaattttct tcttgccaat tttgttgttt aaaaaaaaaa aaaaa
//