LOCUS       BC043183                4192 bp    mRNA    linear   HUM 01-SEP-2006
DEFINITION  Homo sapiens collagen, type XX, alpha 1, mRNA (cDNA clone MGC:44186
            IMAGE:5287981), complete cds.
ACCESSION   BC043183
VERSION     BC043183.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4192)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4192)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 76 Row: d Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            44917609.
FEATURES             Location/Qualifiers
     source          1..4192
                     /db_xref="H-InvDB:HIT000097511"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:44186 IMAGE:5287981"
                     /tissue_type="Brain, hypothalamus"
                     /clone_lib="NIH_MGC_96"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..4192
                     /gene="COL20A1"
                     /gene_synonym="bA261N11.4"
                     /gene_synonym="KIAA1510"
                     /db_xref="GeneID:57642"
                     /db_xref="HGNC:HGNC:14670"
     CDS             106..3960
                     /gene="COL20A1"
                     /gene_synonym="bA261N11.4"
                     /gene_synonym="KIAA1510"
                     /codon_start=1
                     /product="collagen, type XX, alpha 1"
                     /protein_id="AAH43183.1"
                     /db_xref="GeneID:57642"
                     /db_xref="HGNC:HGNC:14670"
                     /translation="MSSGDPAHLGLCVWLWLGATLGREQVQASGLLRLAVLPEDRLQM
                     KWRESEGSGLGYLVQVKPMAGDSEQEVILTTKTHKATVGGLSPSKGYTLQIFELTGSG
                     RFLLARREFVIEDLKSSSLDRSSQRPLGSGAPEPTPSHTGSPDPEQASEPQVAFTPSQ
                     DPRTPAGPQFRCLPPVPADMVFLVDGSWSIGHSHFQQVKDFLASVIAPFEIGPDKVQV
                     GLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVLGQNLQPAAGL
                     RPEAAKVVILVTDGKSQDDVHTAARVLKDLGVNVFAVGVKNADEAELRLLASPPRDIT
                     VHSVLDFLQLGALAGLLSRLICQRLQGGSPRQGPAAAPALDTLPAPTSLVLSQVTSSS
                     IRLSWTPAPRHPLKYLIVWRASRGGTPREVVVEGPAASTELHNLASRTEYLVSVFPIY
                     EGGVGEGLRGLVTTAPLPPPRALTLAAVTPRTVHLTWQPSAGATHYLVRCSPASPKGE
                     EEEREVQVGRPEVLLDGLEPGRDYEVSVQSLRGPEGSEARGIRARTPTLAPPRHLGFS
                     DVSHDAARVFWEGAPRPVRLVRVTYVSSEGGHSGQTEAPGNATSATLGPLSSSTTYTV
                     RVTCLYPGGGSSTLTGRVTTKKAPSPSQLSMTELPGDAVQLAWVAAAPSGVLVYQITW
                     TPLGEGKAHEISVPGNLGTAVLPGLGRHTEYDVTILAYYRDGARSDPVSLRYTPSTVS
                     RSPPSNLALASETPDSLQVSWTPPLGRVLHYWLTYAPASGLGPEKSVSVPGARSHVTL
                     PDLQAATKYRVLVSAIYAAGRSEAVSATGQTACPALRPDGSLPGFDLMVAFSLVEKAY
                     ASIRGVAMEPSAFGGTPTFTLFKDAQLTRRVSDVYPAPLPPEHTIVFLVRLLPETPRE
                     AFALWQMTAKDFQPLLGVLLDAGKKSLTYFHRDPRAALQEATFDPQEVRKIFFGSFHK
                     VHVAVGRSKVRLYVDCRKVAERPLGEMGSPPAAGFVTLGRLAKARGPRSSSAAFQLQM
                     LQIVCSDTWADEDRCCELPASRDGETCPAFVSACSCSSETPGPPGPQGPPGLPGRNGT
                     PGEQGFPGPRGPPGVKGEKGDHGLPGLQGHPGHQGIPGRVGLQGPKGMRGLEGTAGLP
                     GPPGPRGFQGMAGARGTSGERGPPGTVGPTGLPGPKGERGEKGEQQSLATLYQLVSQA
                     SHVSKFDSFHENTRPPMPILEQKLEPGTEPLGSPGTRSKALVPGEWGRGGRHLEGRGE
                     PGAVGQMGSPGQQGASTQGLWE"
BASE COUNT      761 a   1399 c   1354 g    678 t
ORIGIN
        1
agcggataag ctccagcctt cctgtggcca cagcaggacc agagtggacc agcacacccc 61 aggagagagg actggggtcc caggagtagg aggagcccga gcaccatgag ctccggagac 121 cctgcacacc tcggcctctg cgtctggctg tggctgggcg ccaccctggg aagagagcaa 181 gttcaagcaa gcggtctcct gaggctggct gtgctgcctg aggaccggct gcagatgaag 241 tggagagagt cggaggggag cggcctcggc tacctggtgc aggtgaagcc catggcaggg 301 gactcggaac aggaggtgat actgaccacc aagacccata aggccacagt ggggggcctg 361 agcccctcca agggctacac cttgcagatc ttcgagctca ctggctctgg gcgcttcctg 421 ctagctcgga gggagtttgt gattgaggat ctgaagagta gctccctgga caggagcagc 481 cagaggcccc tcggctctgg agccccggag cccaccccct cccacacggg gagcccagac 541 cctgagcagg cttctgagcc ccaagttgcc ttcacaccaa gccaggatcc gcgcactcct 601 gccggccccc agttccgctg cctgcccccc gtgcctgctg acatggtctt cctggtggac 661 gggtcctgga gcattggcca cagtcacttc cagcaggtca aggacttcct ggccagtgtc 721 atcgcaccct ttgaaatcgg gccggataag gtccaagtag gcctgactca gtacagcggg 781 gatgctcaga ctgagtggga cctgaactcc ctcagcacca aggaacaggt gctggcagct 841 gtgcgccgcc tccgctacaa gggggggaac acgttcacag gccttgccct gacccacgtg 901 ctggggcaga acctgcagcc ggcggctggc ctccgtccag aggcagccaa ggtggtgatt 961 ctggtgacgg acggcaagtc ccaggacgat gtgcacactg ctgcccgtgt cctcaaggac 1021 ctgggcgtga acgtcttcgc tgtgggtgtg aagaacgccg atgaggctga gctgaggctc 1081 ctggcgtccc cgccgaggga catcaccgtc cacagcgtgc tggacttcct gcagctcggc 1141 gcgctggctg gcctgctcag ccgtctcatc tgccagaggc tccagggtgg gagcccgcgg 1201 cagggcccag cagcggctcc agccctggac accctccctg cccccaccag cctggtcctg 1261 agccaggtga cctcctccag catccgcctg tcctggactc cagccccccg gcaccccctc 1321 aagtatctga tcgtttggcg agcctctaga ggtggcaccc ccagggaggt ggtggtggag 1381 gggcccgccg cctccacgga gctgcacaac ctggcctccc gcacagagta cctggtctcc 1441 gtgttcccca tctatgaggg cggggttggc gaaggcctgc ggggcctggt gaccacagca 1501 cctctgcctc cgccccgggc gctgaccctg gccgcagtga cgcccagaac cgtccacctc 1561 acctggcagc catcggccgg ggccacccac tacctggtgc gatgttctcc tgcttccccc 1621 aagggtgaag aggaggagcg agaggtgcag gtcgggcggc ccgaggtgct gctggatggc 1681 ctggaacctg gcagggacta 
tgaggtctcg gtgcagagcc tgcgaggccc tgagggcagc 1741 gaggcccggg gcatccgtgc caggaccccc accctggccc ccccgagaca cctgggcttc 1801 tcagacgtga gccacgacgc ggcacgagtg ttctgggagg gtgccccgag gcctgtgcgc 1861 ctggtcaggg tcacctatgt gtccagcgag ggtggacact cggggcagac agaggctcct 1921 gggaacgcca cctcggccac gctggggcct ctctcttcct ccaccaccta cactgtccgt 1981 gtcacctgcc tctaccctgg gggtggctcc tctacgctga ctggccgggt gaccaccaag 2041 aaagctccca gcccaagcca gctgtccatg acggagctgc caggggatgc agtccagctg 2101 gcgtgggtgg ccgcagcccc gtctggcgtg cttgtctacc agatcacgtg gacgcccctg 2161 ggagagggga aggctcacga gatctctgtc ccagggaacc tcggcacggc cgtcctgcct 2221 ggcctaggga ggcacacaga gtacgacgtc accatcttgg cctactacag ggacggggcc 2281 cgcagtgacc ctgtgtccct ccgctatacc ccctccacgg tgagcaggag cccaccctcc 2341 aacctggccc tggcctcgga gacccccgac agcctgcagg tcagctggac gcccccgctt 2401 ggccgcgtgc tccattactg gctcacctac gcccccgcct ctggcttggg acccgagaaa 2461 tccgtctctg tgccaggagc caggagccac gtgacactgc ccgacctgca ggcagccacg 2521 aagtacaggg tcctggtctc agctatctat gcagcaggca ggagtgaggc tgtgtctgcc 2581 acgggccaga cagcctgccc agccctccgc cctgacggct ccctcccagg gtttgacctg 2641 atggtggcct tcagcctggt ggaaaaggct tatgcgtcca tccggggcgt ggccatggag 2701 ccctctgcct tcggtgggac cccgaccttc acgctcttca aggacgccca gctgacaaga 2761 cgggtcagtg acgtctaccc agccccccta cctccagagc acaccatcgt cttccttgtg 2821 cgcctacttc ccgagacacc ccgtgaggcc ttcgcgctgt ggcagatgac agccaaggac 2881 ttccagcccc tccttggggt tctgctggat gccgggaaga agtccctgac ctacttccac 2941 cgtgacccca gggctgcctt gcaggaggcc accttcgacc cgcaggaagt gaggaagatt 3001 ttcttcggga gcttccacaa ggtgcacgtg gctgtgggcc gctccaaggt caggctctat 3061 gtggactgcc ggaaggtggc tgagcggccc cttggggaga tgggcagccc acccgctgcc 3121 ggcttcgtca cgctggggag gctggccaag gccaggggcc ccaggagcag ttcggccgcg 3181 tttcagctcc agatgctgca gatcgtgtgc agtgacacct gggccgatga ggaccggtgc 3241 tgtgagctcc ctgcctcgag ggatggagag acctgccccg ccttcgtgtc tgcctgttcc 3301 tgttcctcag agacccctgg gcccccagga cctcaaggac ccccaggcct ccctgggagg 3361 aatggcaccc caggagagca gggcttccca 
gggcccaggg gtccaccagg ggtcaaagga 3421 gagaagggag accatgggct tccaggcttg cagggccacc ccggccacca gggcatcccc 3481 gggagagttg gcctccaggg accaaaggga atgagaggcc tggagggaac tgctggcctg 3541 cctggacccc ctggccccag ggggttccag ggcatggcag gggccagggg cactagtgga 3601 gagcgaggac ctccagggac cgtggggccc acaggactgc cagggcccaa aggggaacga 3661 ggagagaagg gcgagcagca gtcccttgcc accctctacc agcttgtgag ccaggcctca 3721 cacgtgtcaa agttcgactc cttccacgag aacaccaggc cccccatgcc catcttggag 3781 cagaagctgg agccgggcac tgagcccctg gggtcacctg gcacccgcag caaggccctg 3841 gttcctggag aatgggggcg tggtggccgc caccttgagg gcagagggga gcctggagct 3901 gttggtcaga tgggcagccc tgggcagcag ggggctagca cccagggcct ctgggagtga 3961 caggacattt tctgcactgc cccgaggaac gctgagcctt cctccctggg tttgtctgga 4021 caccgagagc gaccacatcc tggagaagcc aggagaaaag ctcaggaaga gcctgcaggt 4081 ggaaggagag ggaagcagcg gcctcggcca aggcccaccc catactcttg gctctgtagc 4141 atttccaagt tcagataaac ccctgagtgc tcacccaaaa aaaaaaaaaa aa //