LOCUS BC043183 4192 bp mRNA linear HUM 01-SEP-2006
DEFINITION Homo sapiens collagen, type XX, alpha 1, mRNA (cDNA clone MGC:44186
IMAGE:5287981), complete cds.
ACCESSION BC043183
VERSION BC043183.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4192)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4192)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (09-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 76 Row: d Column: 17
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 44917609.
FEATURES Location/Qualifiers
source 1..4192
/db_xref="H-InvDB:HIT000097511"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:44186 IMAGE:5287981"
/tissue_type="Brain, hypothalamus"
/clone_lib="NIH_MGC_96"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..4192
/gene="COL20A1"
/gene_synonym="bA261N11.4"
/gene_synonym="KIAA1510"
/db_xref="GeneID:57642"
/db_xref="HGNC:HGNC:14670"
CDS 106..3960
/gene="COL20A1"
/gene_synonym="bA261N11.4"
/gene_synonym="KIAA1510"
/codon_start=1
/product="collagen, type XX, alpha 1"
/protein_id="AAH43183.1"
/db_xref="GeneID:57642"
/db_xref="HGNC:HGNC:14670"
/translation="MSSGDPAHLGLCVWLWLGATLGREQVQASGLLRLAVLPEDRLQM
KWRESEGSGLGYLVQVKPMAGDSEQEVILTTKTHKATVGGLSPSKGYTLQIFELTGSG
RFLLARREFVIEDLKSSSLDRSSQRPLGSGAPEPTPSHTGSPDPEQASEPQVAFTPSQ
DPRTPAGPQFRCLPPVPADMVFLVDGSWSIGHSHFQQVKDFLASVIAPFEIGPDKVQV
GLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVLGQNLQPAAGL
RPEAAKVVILVTDGKSQDDVHTAARVLKDLGVNVFAVGVKNADEAELRLLASPPRDIT
VHSVLDFLQLGALAGLLSRLICQRLQGGSPRQGPAAAPALDTLPAPTSLVLSQVTSSS
IRLSWTPAPRHPLKYLIVWRASRGGTPREVVVEGPAASTELHNLASRTEYLVSVFPIY
EGGVGEGLRGLVTTAPLPPPRALTLAAVTPRTVHLTWQPSAGATHYLVRCSPASPKGE
EEEREVQVGRPEVLLDGLEPGRDYEVSVQSLRGPEGSEARGIRARTPTLAPPRHLGFS
DVSHDAARVFWEGAPRPVRLVRVTYVSSEGGHSGQTEAPGNATSATLGPLSSSTTYTV
RVTCLYPGGGSSTLTGRVTTKKAPSPSQLSMTELPGDAVQLAWVAAAPSGVLVYQITW
TPLGEGKAHEISVPGNLGTAVLPGLGRHTEYDVTILAYYRDGARSDPVSLRYTPSTVS
RSPPSNLALASETPDSLQVSWTPPLGRVLHYWLTYAPASGLGPEKSVSVPGARSHVTL
PDLQAATKYRVLVSAIYAAGRSEAVSATGQTACPALRPDGSLPGFDLMVAFSLVEKAY
ASIRGVAMEPSAFGGTPTFTLFKDAQLTRRVSDVYPAPLPPEHTIVFLVRLLPETPRE
AFALWQMTAKDFQPLLGVLLDAGKKSLTYFHRDPRAALQEATFDPQEVRKIFFGSFHK
VHVAVGRSKVRLYVDCRKVAERPLGEMGSPPAAGFVTLGRLAKARGPRSSSAAFQLQM
LQIVCSDTWADEDRCCELPASRDGETCPAFVSACSCSSETPGPPGPQGPPGLPGRNGT
PGEQGFPGPRGPPGVKGEKGDHGLPGLQGHPGHQGIPGRVGLQGPKGMRGLEGTAGLP
GPPGPRGFQGMAGARGTSGERGPPGTVGPTGLPGPKGERGEKGEQQSLATLYQLVSQA
SHVSKFDSFHENTRPPMPILEQKLEPGTEPLGSPGTRSKALVPGEWGRGGRHLEGRGE
PGAVGQMGSPGQQGASTQGLWE"
BASE COUNT 761 a 1399 c 1354 g 678 t
ORIGIN
1 agcggataag ctccagcctt cctgtggcca cagcaggacc agagtggacc agcacacccc
61 aggagagagg actggggtcc caggagtagg aggagcccga gcaccatgag ctccggagac
121 cctgcacacc tcggcctctg cgtctggctg tggctgggcg ccaccctggg aagagagcaa
181 gttcaagcaa gcggtctcct gaggctggct gtgctgcctg aggaccggct gcagatgaag
241 tggagagagt cggaggggag cggcctcggc tacctggtgc aggtgaagcc catggcaggg
301 gactcggaac aggaggtgat actgaccacc aagacccata aggccacagt ggggggcctg
361 agcccctcca agggctacac cttgcagatc ttcgagctca ctggctctgg gcgcttcctg
421 ctagctcgga gggagtttgt gattgaggat ctgaagagta gctccctgga caggagcagc
481 cagaggcccc tcggctctgg agccccggag cccaccccct cccacacggg gagcccagac
541 cctgagcagg cttctgagcc ccaagttgcc ttcacaccaa gccaggatcc gcgcactcct
601 gccggccccc agttccgctg cctgcccccc gtgcctgctg acatggtctt cctggtggac
661 gggtcctgga gcattggcca cagtcacttc cagcaggtca aggacttcct ggccagtgtc
721 atcgcaccct ttgaaatcgg gccggataag gtccaagtag gcctgactca gtacagcggg
781 gatgctcaga ctgagtggga cctgaactcc ctcagcacca aggaacaggt gctggcagct
841 gtgcgccgcc tccgctacaa gggggggaac acgttcacag gccttgccct gacccacgtg
901 ctggggcaga acctgcagcc ggcggctggc ctccgtccag aggcagccaa ggtggtgatt
961 ctggtgacgg acggcaagtc ccaggacgat gtgcacactg ctgcccgtgt cctcaaggac
1021 ctgggcgtga acgtcttcgc tgtgggtgtg aagaacgccg atgaggctga gctgaggctc
1081 ctggcgtccc cgccgaggga catcaccgtc cacagcgtgc tggacttcct gcagctcggc
1141 gcgctggctg gcctgctcag ccgtctcatc tgccagaggc tccagggtgg gagcccgcgg
1201 cagggcccag cagcggctcc agccctggac accctccctg cccccaccag cctggtcctg
1261 agccaggtga cctcctccag catccgcctg tcctggactc cagccccccg gcaccccctc
1321 aagtatctga tcgtttggcg agcctctaga ggtggcaccc ccagggaggt ggtggtggag
1381 gggcccgccg cctccacgga gctgcacaac ctggcctccc gcacagagta cctggtctcc
1441 gtgttcccca tctatgaggg cggggttggc gaaggcctgc ggggcctggt gaccacagca
1501 cctctgcctc cgccccgggc gctgaccctg gccgcagtga cgcccagaac cgtccacctc
1561 acctggcagc catcggccgg ggccacccac tacctggtgc gatgttctcc tgcttccccc
1621 aagggtgaag aggaggagcg agaggtgcag gtcgggcggc ccgaggtgct gctggatggc
1681 ctggaacctg gcagggacta tgaggtctcg gtgcagagcc tgcgaggccc tgagggcagc
1741 gaggcccggg gcatccgtgc caggaccccc accctggccc ccccgagaca cctgggcttc
1801 tcagacgtga gccacgacgc ggcacgagtg ttctgggagg gtgccccgag gcctgtgcgc
1861 ctggtcaggg tcacctatgt gtccagcgag ggtggacact cggggcagac agaggctcct
1921 gggaacgcca cctcggccac gctggggcct ctctcttcct ccaccaccta cactgtccgt
1981 gtcacctgcc tctaccctgg gggtggctcc tctacgctga ctggccgggt gaccaccaag
2041 aaagctccca gcccaagcca gctgtccatg acggagctgc caggggatgc agtccagctg
2101 gcgtgggtgg ccgcagcccc gtctggcgtg cttgtctacc agatcacgtg gacgcccctg
2161 ggagagggga aggctcacga gatctctgtc ccagggaacc tcggcacggc cgtcctgcct
2221 ggcctaggga ggcacacaga gtacgacgtc accatcttgg cctactacag ggacggggcc
2281 cgcagtgacc ctgtgtccct ccgctatacc ccctccacgg tgagcaggag cccaccctcc
2341 aacctggccc tggcctcgga gacccccgac agcctgcagg tcagctggac gcccccgctt
2401 ggccgcgtgc tccattactg gctcacctac gcccccgcct ctggcttggg acccgagaaa
2461 tccgtctctg tgccaggagc caggagccac gtgacactgc ccgacctgca ggcagccacg
2521 aagtacaggg tcctggtctc agctatctat gcagcaggca ggagtgaggc tgtgtctgcc
2581 acgggccaga cagcctgccc agccctccgc cctgacggct ccctcccagg gtttgacctg
2641 atggtggcct tcagcctggt ggaaaaggct tatgcgtcca tccggggcgt ggccatggag
2701 ccctctgcct tcggtgggac cccgaccttc acgctcttca aggacgccca gctgacaaga
2761 cgggtcagtg acgtctaccc agccccccta cctccagagc acaccatcgt cttccttgtg
2821 cgcctacttc ccgagacacc ccgtgaggcc ttcgcgctgt ggcagatgac agccaaggac
2881 ttccagcccc tccttggggt tctgctggat gccgggaaga agtccctgac ctacttccac
2941 cgtgacccca gggctgcctt gcaggaggcc accttcgacc cgcaggaagt gaggaagatt
3001 ttcttcggga gcttccacaa ggtgcacgtg gctgtgggcc gctccaaggt caggctctat
3061 gtggactgcc ggaaggtggc tgagcggccc cttggggaga tgggcagccc acccgctgcc
3121 ggcttcgtca cgctggggag gctggccaag gccaggggcc ccaggagcag ttcggccgcg
3181 tttcagctcc agatgctgca gatcgtgtgc agtgacacct gggccgatga ggaccggtgc
3241 tgtgagctcc ctgcctcgag ggatggagag acctgccccg ccttcgtgtc tgcctgttcc
3301 tgttcctcag agacccctgg gcccccagga cctcaaggac ccccaggcct ccctgggagg
3361 aatggcaccc caggagagca gggcttccca gggcccaggg gtccaccagg ggtcaaagga
3421 gagaagggag accatgggct tccaggcttg cagggccacc ccggccacca gggcatcccc
3481 gggagagttg gcctccaggg accaaaggga atgagaggcc tggagggaac tgctggcctg
3541 cctggacccc ctggccccag ggggttccag ggcatggcag gggccagggg cactagtgga
3601 gagcgaggac ctccagggac cgtggggccc acaggactgc cagggcccaa aggggaacga
3661 ggagagaagg gcgagcagca gtcccttgcc accctctacc agcttgtgag ccaggcctca
3721 cacgtgtcaa agttcgactc cttccacgag aacaccaggc cccccatgcc catcttggag
3781 cagaagctgg agccgggcac tgagcccctg gggtcacctg gcacccgcag caaggccctg
3841 gttcctggag aatgggggcg tggtggccgc caccttgagg gcagagggga gcctggagct
3901 gttggtcaga tgggcagccc tgggcagcag ggggctagca cccagggcct ctgggagtga
3961 caggacattt tctgcactgc cccgaggaac gctgagcctt cctccctggg tttgtctgga
4021 caccgagagc gaccacatcc tggagaagcc aggagaaaag ctcaggaaga gcctgcaggt
4081 ggaaggagag ggaagcagcg gcctcggcca aggcccaccc catactcttg gctctgtagc
4141 atttccaagt tcagataaac ccctgagtgc tcacccaaaa aaaaaaaaaa aa
//