LOCUS       BC144535                4355 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens collagen, type XXII, alpha 1, mRNA (cDNA clone
            MGC:178079 IMAGE:9053062), complete cds.
ACCESSION   BC144535
VERSION     BC144535.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4355)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4355)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Mike Brownstein, NIMH
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: LLDM Plate: 677 Row: h Column: 20
            isoform, single-stranded.
FEATURES Location/Qualifiers source 1..4355 /db_xref="H-InvDB:HIT000502943" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:178079 IMAGE:9053062" /tissue_type="Brain, cerebellum, PCR rescued clones" /clone_lib="NIH_MGC_313" /note="Vector: pCR-XL-TOPO with reversed insert; Clone identification sequence tag: CGTTTTGC" gene 1..4355 /gene="COL22A1" /db_xref="GeneID:169044" /db_xref="HGNC:HGNC:22989" /db_xref="MIM:610026" CDS 40..3999 /gene="COL22A1" /codon_start=1 /product="COL22A1 protein" /protein_id="AAI44536.1" /db_xref="GeneID:169044" /db_xref="HGNC:HGNC:22989" /db_xref="MIM:610026" /translation="MAGLRGNAVAGLLWMLLLWSGGGGCQAQRAGCKSVHYDLVFLLD TSSSVGKEDFEKVRQWVANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVK AAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPRDRAYKQVAILLTDGRSQDLVLD AAAAAHRAGIRIFAVGVGEALKEELEEIASEPKSAHVFHVSDFNAIDKIRGKLRRRLC ENVLCPSVRVEGDRFKHTNGGTKEITGFDLMDLFSVKEILGKRENGAQSSYVRMGSFP VVQSTEDVFPQGLPDEYAFVTTFRFRKTSRKEDWYIWQVIDQYGIPQVSIRLDGENKA VEYNAVAGPQGEKGDVGPAGPPGVPGSVVQQEGLKGEQGAPGPRGHQGAPGPPGARGP IGPEGRDGPPGLQGLRGKKGDMGPPGIPGLLGLQGPPGPPGVPGPPGPGGSPGLPGEI GFPGKPGPPGPTGPPGKDGPNGPPGPPGTKGEPGERGEDGLPGKPGLRGEIGEQGLAG RPGEKGEAGLPGAPGFPGVRGEKGDQGEKGELGLPGLKGDRGEKGEAGPAGPPGLPGT TSLFTPHPRMPGEQGPKGEKGDPGLPGEPGLQGRPGELGPQGPTGPPGAKGQEGAHGA PGAAGNPGAPGHVGAPGPSGPPGSVGDPGLRGTPGKDGERGEKGAAGEEGSPGPVGPR GDPGAPGLPGPPGKGKDGEPGLRGSPGLPGPLGTKGDRGAPGIPGSPGSRGDPGIGVA GPPGPSGPPGDKGSPGSRGLPGFPGPQGPAGRDGAPGNPGERGPPGKPGLSSLLSPGD INLLAKDVCNDCPPGPPGLPGLPGFKGDKGVPGKPGREGTEGKKGEAGPPGLPGPPGI AGPQGSQGERGADGEVGQKGDQGHPGVPGFMGPPGNPGPPGADGIAGAAGPPGIQGSP GKEGPPGPQGPSGLPGIPGEEGKEGRDGKPGPPGEPGKAGEPGLPGPEGARGPPGFKG HTGDSGAPGPRGESGAMGLPGQEGLPGKDGDTGPTGPQGPQGPRGPPGKNGSPGSPGE PGPSGTPGQKGSKGENGSPGLPGFLGPRGPPGEPGEKGVPGKEGVPGKPGEPGFKGER GDPGIKGDKGPPGGKGQPGDPGIPGHKGHTGLMGPQGLPGENGPVGPPGPPGQPGFPG LRGESPSMETLRRLIQEELGKQLETRLAYLLAQMPPAYMKSSQGRPGPPGPPGKDGLP GRAGPMGEPGRPGQGGLEGPSGPIGPKGERGAKGDPGAPGVGLRGEMGPPGIPGQPGE PGYAKDGLPGIPGPQGETGPAGHPGLPGPPGPPGQCDPSQCAYFASLAARPGNVKGP" BASE COUNT 964 a 1222 c 
1433 g 736 t ORIGIN 1 cctggccaag agaagcctgt ccccaagaac aggagagcca tggccggcct ccgagggaac 61 gctgtggctg gcctcctctg gatgctgctg ctgtggagtg ggggcggcgg ctgccaggct 121 cagcgggcag gttgcaaaag tgtccactac gatctggtct tcctcctgga cacctcctcc 181 agcgtgggca aggaggactt tgagaaggtc cggcagtggg tggccaacct ggtggacacc 241 ttcgaggtgg gccccgaccg cacccgtgtg ggggtcgtgc gctacagcga ccggcccacc 301 acggccttcg agttgggact ctttggctcg caggaggagg tcaaggcggc tgcccggcgt 361 ctcgcctacc acgggggcaa caccaacacg ggagacgcgc tccgctacat cacggcccgc 421 agcttctccc cacacgccgg cggccgcccc agggaccgcg cctacaagca ggtggccatc 481 ctgctcaccg acggccgcag ccaggacctg gtgctggacg ccgcggcggc agcccaccgc 541 gctggcatcc gcatctttgc cgtgggcgtg ggcgaggcac tcaaggagga gctggaggag 601 atcgcctcag agcccaagtc cgcccacgtc ttccacgtgt ccgacttcaa tgccatcgac 661 aagatccggg gcaagctgcg gcgccgtctt tgtgaaaatg tgctctgtcc tagcgttcgt 721 gtagaaggag atcgctttaa gcacaccaat ggaggaacca aggaaatcac aggttttgac 781 ctgatggatt tgttcagtgt gaaggaaatc ttggggaaga gagagaatgg agctcagagt 841 tcctatgtac ggatgggatc cttccctgtg gtgcaaagta ctgaggatgt gttcccccaa 901 ggtttacctg atgagtacgc ctttgtcaca accttccggt tcaggaaaac ctctcggaag 961 gaagactggt atatctggca ggtcatcgac cagtacggca tcccacaggt ctccatccgg 1021 ctggatggtg aaaacaaggc agtcgagtac aacgctgtgg caggacccca gggagaaaag 1081 ggtgacgtgg gacctgcggg gccacctggt gtaccaggct cagtggtgca gcaagagggc 1141 ttgaaagggg aacagggagc tccaggaccc agaggtcacc aaggcgcccc cggtcctcca 1201 ggagctcggg gtccaatagg cccagaaggc agggatggac ctcctggttt gcaaggtctc 1261 cgagggaaga aaggtgacat gggaccacct ggaatccctg gattgctggg gctgcagggc 1321 cctccaggac cccctggtgt cccaggcccc cctggaccgg gaggttctcc gggtttgcct 1381 ggagagatcg gcttcccggg aaagcctgga cctcctgggc ccacgggacc ccctggaaag 1441 gacgggccaa atggaccacc aggtccgcca ggaaccaagg gagaaccagg agaaagaggg 1501 gaagatggtc tgcctggaaa accaggcctt cggggagaaa ttggggagca gggcctggca 1561 ggccgacctg gagagaaggg agaagcaggc ctcccagggg ctccaggctt cccaggtgtg 1621 agaggagaga aaggagacca gggagaaaaa ggtgaactgg gacttccagg actgaaaggt 1681 
gaccgaggtg aaaagggtga agctggtcct gcaggccctc ccgggttacc tggaactaca 1741 tccctgttca caccacatcc acggatgccc ggagaacaag ggcccaaagg agagaagggc 1801 gatccaggcc tgcctgggga accgggactg cagggccgtc ctggagaatt ggggcctcag 1861 ggacccactg gaccaccggg tgccaaggga caggaaggtg cacatggggc tcctggagca 1921 gctggaaacc ccggtgctcc cggacatgtc ggtgcccccg gtcccagtgg ccctccagga 1981 agtgtgggtg atcccggcct cagaggcacc ccagggaaag atggggagcg tggtgagaag 2041 ggtgcagcgg gggaagaagg cagcccaggg ccagttggtc ccaggggaga tcctggtgct 2101 cctgggctcc ctgggccacc cggaaaaggg aaggatggag agccgggact ccgtggatca 2161 cctggactcc ctggacccct aggaaccaag ggggatcgag gagctcctgg gatccctggt 2221 tctcctggca gccgtggtga cccaggcatt ggggttgctg gccctcctgg cccttccgga 2281 ccaccaggag acaaaggatc cccgggatca cgaggcttac ctggattccc tggcccccag 2341 ggcccagccg gccgggacgg tgcaccagga aatccaggag aaagagggcc tcctggcaag 2401 ccgggcctct cttcactact gtctccaggg gacataaatc tcttggctaa ggatgtgtgc 2461 aatgactgcc ctcctggccc cccaggcctc cctggtctac caggttttaa aggggacaaa 2521 ggtgtcccag gaaagccagg gagagaaggc acagaaggga aaaagggaga ggctgggcct 2581 ccaggcctac cagggccccc aggaatagct ggaccacagg gaagtcaagg agaacgtggt 2641 gcagatggtg aggttgggca gaaaggtgat cagggtcatc ctggagttcc aggtttcatg 2701 gggcccccag ggaaccctgg gccaccaggg gcagatggaa ttgcgggagc tgctggacca 2761 ccaggaatcc aagggtcacc tgggaaagaa ggccctcctg gcccccaagg cccatctgga 2821 ttacccggaa tcccaggaga agaaggcaaa gagggcagag atggaaagcc gggtccccct 2881 ggagagccgg gcaaagcagg agagccaggt ctaccaggac cagagggtgc ccgaggccca 2941 cctggcttca agggacacac aggcgattct ggtgcacccg gtccccgggg agagtctggt 3001 gccatggggc ttcctggtca ggaagggtta ccaggaaaag atggtgacac tggacccact 3061 gggccacagg gtccccaagg accaaggggc ccaccgggca agaatggatc accgggatct 3121 ccaggagagc ctggcccttc aggaacccct ggccagaaag gaagcaaagg ggaaaatggc 3181 agcccaggac ttcctggctt cctgggtccc cgtgggcctc cgggagaacc aggagagaaa 3241 ggagtcccag gcaaggaggg ggtccctggg aagcctggag agcctggatt caaaggagaa 3301 aggggagatc ctgggatcaa aggtgacaaa ggacctcctg gtggaaaagg ccagcctggg 3361 gaccctggaa 
tcccaggcca caaaggccac acaggcctga tgggtcccca aggactacct 3421 ggggagaatg gaccagttgg acccccaggg cctccaggcc agccgggatt tccaggactg 3481 aggggggagt ctccatccat ggaaaccctg cgtcggctta ttcaagaaga gctggggaag 3541 cagcttgaaa ccagactcgc ctacctcctg gcccagatgc ccccggcgta catgaagtca 3601 tctcaaggca gacctgggcc cccagggccc cctggaaaag atgggcttcc aggccgggcc 3661 ggccccatgg gggagccagg tcgtcctggg caggggggtc tggaaggacc ctctggaccc 3721 ataggtccca aaggtgagcg aggagccaaa ggtgacccag gtgcacctgg agttggcctc 3781 cgaggcgaga tgggaccccc tggaatccca ggtcaacccg gggaacctgg ctatgctaaa 3841 gatggacttc ctgggatccc tggccctcaa ggggagacag gaccagctgg acatcctggc 3901 ctcccaggac ctcccggtcc cccaggccaa tgtgaccctt cccagtgtgc ctacttcgcc 3961 agccttgctg cccggccggg taatgtgaag ggtccctaaa ggactctgga aagccagaag 4021 actgcagtgg atttctgaaa cttgaactca gagcccagtg ggaagccaga ggtcttgaaa 4081 gacttcagcc atgtgttcct tttttttttc tttcttttat cgtttgcttt ttgttttatt 4141 ttcttgagag acctcaaaat tattaaatcc aacagacgct gccggtcggt cagattatta 4201 ttaatattat tgttgttgtt aattattatt attatttcat atgctgatgc tttgtgagtt 4261 cttttccact cctttaaagt tgggaaaact tgattcgtgg ggcaggagat tgtttcttca 4321 ttcttctgac agcccccatc tgacgcgtaa ctgcc //