LOCUS BC001751 2450 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens chromosome 20 open reading frame 4, mRNA (cDNA clone
MGC:1227 IMAGE:3536406), complete cds.
ACCESSION BC001751
VERSION BC001751.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2450)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2450)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (16-JAN-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 8 Row: n Column: 5
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 18034689.
FEATURES Location/Qualifiers
source 1..2450
/db_xref="H-InvDB:HIT000030625"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1227 IMAGE:3536406"
/tissue_type="Lung, small cell carcinoma"
/clone_lib="NIH_MGC_7"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2450
/gene="C20orf4"
/gene_synonym="bA234K24.2"
/gene_synonym="CGI-23"
/gene_synonym="DKFZp564N1363"
/db_xref="GeneID:25980"
/db_xref="HGNC:HGNC:15886"
CDS 63..1217
/gene="C20orf4"
/gene_synonym="bA234K24.2"
/gene_synonym="CGI-23"
/gene_synonym="DKFZp564N1363"
/codon_start=1
/product="chromosome 20 open reading frame 4"
/protein_id="AAH01751.1"
/db_xref="GeneID:25980"
/db_xref="HGNC:HGNC:15886"
/translation="MAAVQMDPELAKRLFFEGATVVILNMPKGTEFGIDYNSWEVGPK
FRGVKMIPPGIHFLHYSSVDKANPKEVGPRMGFFLSLHQRGLTVLRWSTLREEVDLSP
APESEVEAMRANLQELDQFLGPYPYATLKKWISLTNFISEATVEKLQPENRQICAFSD
VLPVLSMKHTKDRVGQNLPRCGIECKSYQEGLARLPEMKPRAGTEIRFSELPTQMFPE
GATPAEITKHSMDLSYALETVLNKQFPSSPQDVLGELQFAFVCFLLGNVYEAFEHWKR
LLNLLCRSEAAMMKHHTLYINLISILYHQLGEIPADFFVDIVSQDNFLTSTLQVFFSS
ACSIAVDATLRKKAEKFQAHLTKKFRWDFAAEPEDCAPVVVELPEGIEMG"
BASE COUNT 582 a 683 c 627 g 558 t
ORIGIN
1 cgtggggcga ggcgctgttc atcaaagaaa aagggttctt ttggtcaccc accactggcc
61 ccatggctgc cgtgcagatg gatcctgagc tagccaagcg cctcttcttt gaaggggcca
121 ctgtggtcat cctgaacatg cccaagggaa cagagtttgg gattgactat aactcctggg
181 aggtcgggcc caagttccgg ggcgtgaaga tgatccctcc aggcatccac ttcctccact
241 acagctctgt ggacaaggct aatccgaagg aagtaggccc tcgtatgggt ttcttcctta
301 gcctgcacca gcgggggctg acagtgctgc gctggagcac actcagggaa gaggtagacc
361 tgtccccagc cccagagtct gaggtggagg ccatgagggc caacctccag gagctggacc
421 agttcctggg gccttaccca tatgccaccc tgaagaagtg gatctcactc accaacttca
481 tcagcgaagc cacagtggag aagctacagc ccgagaatcg acagatctgt gccttttccg
541 atgtgctacc tgtgctctcc atgaagcaca ccaaggaccg cgtggggcag aatctacccc
601 gctgtggcat tgagtgcaaa agctaccaag agggcctggc ccggctacca gagatgaagc
661 ccagagccgg gacagagatc cgcttctcag agctgcccac gcagatgttc ccagagggtg
721 ccacgccagc tgagataacc aagcacagca tggacctgag ctatgccctg gagactgtgc
781 tcaacaagca gttccccagc agcccccagg atgtgcttgg tgaactccag tttgcttttg
841 tgtgcttcct gctggggaat gtgtacgagg catttgagca ttggaagcgg ctcctgaacc
901 tcctgtgccg gtcagaagca gccatgatga agcaccacac cctctacatc aacctcatct
961 ccatcctgta ccaccagctt ggtgagatcc ccgctgactt cttcgtagac attgtctccc
1021 aagacaactt cctcaccagc accttacagg ttttcttttc ctctgcctgc agcattgccg
1081 tggatgccac cctgagaaag aaagctgaaa agttccaagc tcacctgacc aagaagttcc
1141 ggtgggactt tgctgcggaa cctgaggact gtgccccggt ggtggtggag ctccctgagg
1201 gcatcgagat gggctaactc ggggagcgct ctcagctgcg aggggcccct tcccacaggg
1261 ctgcagtcct ggcctctcca tttacttctt cccatcctgg gacctgccag ggcagcaatc
1321 tctccaggtc ctgcaaagat ggagccagaa ttcccttttt cactgataaa tatatttctt
1381 cattgccaaa gaggctgtac ccatcctgaa ggcacatttg tgggttcccc atcagccagg
1441 ccttggtgct aacctggctg aatttcacac aggctcttac acacacacgc tcctaggaga
1501 catctgccta cacggcaacc atatttcctc tgaatgagaa ggaattgaac caaaagtcca
1561 agaaagaact gattgcttgt tccataggag cttaggaaac aagaaaccct ggattgccca
1621 gggggtctga gaagttggtt ggtgactttt tttgcggtta aatgaagggt gatggggaga
1681 tcagcccgaa ttgccgcctg cctcttgcta aataggagca gaggacttgg cctgcagctc
1741 cttgggagcc cttgattggg aagagagttt caagggaggc agctggattc aatctagcag
1801 gtggtcagct tcagctttct ccatcgaaat cccattctcc tgtccagagg cccagtgggt
1861 catctcccaa ggtgggtgtg gaccctggcc tcagaggcct tgctggtgct gtcacctccc
1921 acctgttcca ttccgaggcc tcacccagaa gtgggaccct ccccttcctc accagagcca
1981 ccgtgactgt ttctgatgac ctggagagtc aacaacaacc agaaaggttt ctgcccagag
2041 caggcttctt aaggccttta cgaagttttg tgccttccaa gtgctgaaga agacctggtc
2101 agcctaaatc ttcccagtcc cgctgtggag ctgtcagtca ccggagtaat gagctcctgg
2161 ttcctcggga gtccttcgtg ctgtgtggca gggttcctct ctagacaagt acacaggccc
2221 tgccaccctg acatcaaact gttgtactat gatcacagtc cctgtgccat ccttttccaa
2281 gactggggct cacaccatgt ttttgaatga gaatccctgc tggttgagac ttttgcttcc
2341 acttgtttcc ttggagatgt ttttccaaga gcataatgta cattaaagtc ttcgagttga
2401 gacaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//