LOCUS BC038406 3375 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens chromosome 3 open reading frame 20, mRNA (cDNA clone
MGC:35115 IMAGE:5167835), complete cds.
ACCESSION BC038406
VERSION BC038406.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3375)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3375)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-OCT-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 51 Row: a Column: 5
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 68163571.
FEATURES Location/Qualifiers
source 1..3375
/db_xref="H-InvDB:HIT000052026"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:35115 IMAGE:5167835"
/tissue_type="Brain, adult medulla"
/clone_lib="NIH_MGC_119"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3375
/gene="C3orf20"
/gene_synonym="DKFZP434N1817"
/db_xref="GeneID:84077"
/db_xref="HGNC:HGNC:25320"
CDS 373..3087
/gene="C3orf20"
/gene_synonym="DKFZP434N1817"
/codon_start=1
/product="chromosome 3 open reading frame 20"
/protein_id="AAH38406.2"
/db_xref="GeneID:84077"
/db_xref="HGNC:HGNC:25320"
/translation="MSYIKSNLELYQQYTAMAPKLLARISKLLMICQNAGISVPKGIR
NIFEFTWEELISDPSVPTPSDILGLEVSFGAPLVVLMEPTFVQVPTLKKPLPPPPPAP
PRPVLLATTGAAKRSTLSPTMARQVRTHQETLNRFQQQSIHLLTELLRLKMKAMVESM
SVGANPLDITRRFVEASQLLHLNAKEMAFNCLISTAGRSGYSSGQLWKESLANMSAIG
VNSPYQLIYHSSTACLSFSLSAGKEAKKKIGKSRTTEDVSMPPLHRGVGTPANSLEFS
DPCPEAREKLQELCRHIEAERATWKGRNISYPMILRNYKAKMPSHLMLARKGDSQTPG
LHYPPTAGAQTLSPTSHPSSANHHFSQHCQEGKAPKKAFKFHYTFYDGSSFVYYPSGN
VAVCQIPTCCRGRTITCLFNDIPGFSLLALFNTEGQGCVHYNLKTSCPYVLILDEEGG
TTNDQQGYVVHKWSWTSRTETLLSLEYKVNEEMKLKVLGQDSITVTFTSLNETVTLTV
SANNCPHGMAYDKRLNRRISNMDDKVYKMSRALAEIKKRFQKTVTQFINSILLAAGLF
TIEYPTKKEEEEFVRFKMRSRTHPERLPKLSLYSGESLLRSQSGHLESSIAETLKDEP
ESAPVSPVRKTTKIHTKAKVTSRGKAREGRSPTRWAALPSDCPLVLRKLMLKEDTRAG
CKCLVKAPLVSDVELERFLLAPRDPSQVLVFGIISSQNYTSTGQLQWLLNTLYNHQQR
GRGSPCIQCRYDSYRLLQYDLDSPLQEDPPLMVKKNSVVQGMILMFAGGKLIFGGRVL
NGYGLSKQNLLKQIFRSQQDYKMGYFLPDDYKFSVPNSVLSLEDSESVKKAESEDIQG
SSSSLALEDYVEKELSLEAEKTREPEVELHPLSRDSKITSWKKQASKK"
BASE COUNT 944 a 937 c 801 g 693 t
ORIGIN
1 gtccttttaa gtcagtaaat tgaactaagt cggttattcg gcaagcagtt cctataaaaa
61 actacatggc taagatgtac acgttggata attccatgcc catcttggca ttgcgaccag
121 gagcttctgc atcccttgct tccaaagaga accagtgaaa catcttttta ttcttgcagg
181 ttcttaatga ttgaccacaa gcagatcttt caccctcgga tctctagcta caaaaggaac
241 cactggctca atgacctgta agggccgttt cagcacatcc attctgtcca tctccaagcc
301 ttcaccgtag ggaagaactt ttgctctcag tcacctctca gagagctctc tttatagctg
361 aaggtccctc tcatgagtta catcaagagt aacctagaat tatatcagca atacacagcc
421 atggccccca agctactggc ccgcatctcc aaactcctca tgatctgcca gaatgcaggc
481 atttctgtac caaaaggcat cagaaacatc tttgagttca cttgggaaga gctcatcagt
541 gacccttcag tgcctacccc gtccgacatc ttgggcctgg aggtcagctt tggagccccc
601 ctggtggtgc tcatggaacc cacctttgtg caggtcccca cactgaagaa gccactacct
661 ccaccaccac cagcaccacc acgtccagtg ctgctggcaa ccactggggc agccaagcgc
721 tccaccctct ctcccaccat ggcccgtcag gtgcgcaccc accaggagac cctgaacagg
781 tttcagcagc agtccatcca cctgctgacg gagctcctca gactgaagat gaaggccatg
841 gtggagtcta tgtcggtggg tgccaacccc ttggacatca ccaggcgctt tgtggaggcc
901 agccagctcc tccacctcaa tgccaaggag atggccttca actgcctgat cagcacagcc
961 gggagaagtg gctacagcag cggacagttg tggaaagagt ccctcgcaaa catgtccgcc
1021 attggggtga actcgcctta ccagctgatc taccactctt ccacagcctg tctgagcttt
1081 tctctctctg ctggaaaaga agccaagaag aaaataggca aatctagaac tacagaagat
1141 gtcagcatgc cgcccctgca tcgaggagtg ggaacccctg ccaacagcct ggagttcagc
1201 gacccctgcc ctgaggcccg ggagaagctg caggagttgt gtcgccacat agaagctgaa
1261 agggccacat ggaaagggag gaatatctcc taccccatga tcttacgaaa ctacaaggca
1321 aagatgccct ctcatctaat gttggcccgc aaaggagact ctcagacccc gggtttacat
1381 taccctccca ctgcaggtgc tcagactctc agccccacct ctcacccatc ttctgccaac
1441 catcatttca gtcagcattg tcaagagggg aaggcaccca agaaggcctt caagtttcat
1501 tacaccttct atgatggctc ctccttcgtt tactatccct ctggaaacgt cgctgtatgt
1561 cagatcccca catgctgcag agggagaacc atcacctgcc tctttaatga catacctgga
1621 ttctccttgc tggccctatt caatactgaa ggccagggct gtgttcacta caacctaaaa
1681 accagttgcc catatgtctt aatcttggat gaggaaggtg ggaccaccaa tgaccagcag
1741 ggctatgtag tccacaagtg gagctggact tccaggacag agaccctgct ttccctggaa
1801 tacaaggtga atgaggaaat gaaactaaag gtactgggac aggactccat cacagtcacc
1861 ttcacctccc tgaatgagac agtaacactc actgtgtcgg ccaacaattg tccccatgga
1921 atggcatatg acaaacggct gaaccgcaga atcagcaaca tggacgacaa ggtgtataag
1981 atgagccgag ccctggctga gatcaagaag cggtttcaga agacagtgac tcagttcatt
2041 aattctatct tgctggccgc aggtctgttt accattgaat atcccaccaa aaaggaggag
2101 gaagaatttg ttcggttcaa gatgagatcc agaactcatc ccgagcggct ccccaagcta
2161 agtttatact caggagaaag tcttttacga tctcagtcag gccacctgga atcctcaatt
2221 gcagagactt tgaaggatga gcctgagtct gctcctgtga gcccagttcg gaagaccacc
2281 aaaatccaca ccaaagccaa ggtcacatcc agagggaagg cccgcgaggg gcgcagcccc
2341 accaggtggg cggccttgcc ctcagactgc ccgctggtgc tgcggaagct catgctcaag
2401 gaagacaccc gtgctggctg caagtgcctg gtgaaggcgc ccctggtctc tgacgtggag
2461 ctggagcgct tcctgttggc gccccgagac cccagccaag tgctggtgtt tgggatcatc
2521 tcaagccaga actacaccag cactgggcag ctccagtggc tgctgaacac tctctacaac
2581 caccagcagc ggggccgtgg ctccccctgc atccagtgcc ggtatgactc ctaccgcctg
2641 ctgcagtatg acctggacag ccccctgcag gaggaccctc ccctgatggt gaagaagaac
2701 tctgtggtgc aggggatgat tctgatgttt gccgggggga agctcatttt tgggggccgt
2761 gttttgaatg gatatggcct cagcaagcag aatctgctga aacagatctt ccggtctcaa
2821 caggattaca agatgggcta cttcctgccg gatgactaca aattcagtgt tcccaactct
2881 gtcctgagcc tggaggattc tgaatcagtc aagaaagccg agtcagaaga tatccaagga
2941 agcagctcct cattggccct ggaagactat gtggagaagg agttatctct ggaggctgag
3001 aagacaagag agcctgaagt ggagctacat cctctcagca gggacagcaa gataactagt
3061 tggaagaagc aggcctccaa gaagtagcgc catcctggca gcagccaagt gagccaggcc
3121 ccggcccggg gtgctggggc ttcttgccag cccagccctg cctccccggt ctcccaccct
3181 gtcctccaag cttctataat aaaccagcgg gcctccagca ttggggtgag gctctgggga
3241 aggacagaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3361 aaaaaaaaaa aaaaa
//