LOCUS       AAC41772.1              1487 aa    PRT              HUM 03-AUG-1995
DEFINITION  Homo sapiens alpha-1 type II collagen protein.
ACCESSION   L10347-1
PROTEIN_ID  AAC41772.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (sites)
  AUTHORS   Cheah,K.S., Stoker,N.G., Griffin,J.R., Grosveld,F.G. and Solomon,E.
  TITLE     Identification and characterization of the human type II collagen
            gene (COL2A1)
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82 (9), 2555-2559 (1985)
   PUBMED   3857598
REFERENCE   2  (sites)
  AUTHORS   Baldwin,C.T., Reginato,A.M., Smith,C., Jimenez,S.A. and
            Prockop,D.J.
  TITLE     Structure of cDNA clones coding for human type II procollagen. The
            alpha 1(II) chain is more similar to the alpha 1(I) chain than two
            other alpha chains of fibrillar collagens
  JOURNAL   Biochem. J. 262 (2), 521-528 (1989)
   PUBMED   2803268
REFERENCE   3  (sites)
  AUTHORS   Vikkula,M. and Peltonen,L.
  TITLE     Structural analyses of the polymorphic area in type II collagen
            gene
  JOURNAL   FEBS Lett. 250 (2), 171-174 (1989)
   PUBMED   2753125
REFERENCE   4  (sites)
  AUTHORS   Ryan,M.C., Sieraski,M. and Sandell,L.J.
  TITLE     The human type II procollagen gene: identification of an additional
            protein-coding domain and location of potential regulatory
            sequences in the promoter and first intron
  JOURNAL   Genomics 8 (1), 41-48 (1990)
   PUBMED   2081599
REFERENCE   5  (sites)
  AUTHORS   Huang,M.C., Seyer,J.M., Thompson,J.P., Spinella,D.G., Cheah,K.S.
            and Kang,A.H.
  TITLE     Genomic organization of the human procollagen alpha 1(II) collagen
            gene
  JOURNAL   Eur. J. Biochem. 195 (3), 593-600 (1991)
   PUBMED   1999183
REFERENCE   6  (sites)
  AUTHORS   Vikkula,M., Metsaranta,M., Syvanen,A.C., Ala-Kokko,L., Vuorio,E.
            and Peltonen,L.
  TITLE     Structural analysis of the regulatory elements of the type-II
            procollagen gene. Conservation of promoter and first intron
            sequences between human and mouse
  JOURNAL   Biochem. J. 285 (Pt 1), 287-294 (1992)
   PUBMED   1637314
REFERENCE   7  (sites)
  AUTHORS   Ala-Kokko,L., Kvist,A.P., Metsaranta,M., Kivirikko,K.I., de
            Crombrugghe,B., Prockop,D.J. and Vuorio,E.
  TITLE     Conservation of the sizes of 53 introns and over 100 intronic
            sequences for the binding of common transcription factors in the
            human and mouse genes for type II procollagen (COL2A1)
  JOURNAL   Biochem. J. 308 (Pt 3), 923-929 (1995)
   PUBMED   8948452
COMMENT     Original source text: Homo sapiens male adult blood DNA.
            Bases Reported in References
            REFERENCE   1 (bases 26401-26754,26809-26980,27089-27253,
            27308-27488,
            27597-27840,27895-28337,28446-31001)
            AUTHORS   Cheah, Kathryn S E, Stoker, Neil G, Griffin, Jane R,
            Grosveld, Frank G, and Solomon Ellen
            TITLE     Identification and Characterization of the Human Type II
            Collagen Gene (COL2A1)
            JOURNAL   Proc. Natl. Acad. Sci, USA 82, 2555-2559 (1985) REFERENCE
            2 (bases 1-85,5892-5908,6122-6154,6259-6291,6397-6450,
            6614-6715,7694-7771,8399-8443,8555-8608,9009-9062, 9838-9891,
            10266-10319,10451-10504,10811-10855, 11391-11444,14507-14551,
            15031-15084,16598-16696, 16993-17037,17130-17228,17418-17471,
            17996-18103, 18469-18522,18607-18705,18846-18899,19340-19438,
            19832-19885,20292-20345,20696-20749,20993-21046, 21289-21333,
            21480-21578,21813-21920,22263-22316, 22595-22648,23025-23078,
            23451-23504,23759-23866, 24358-24411,24856-24909,25656-25817,
            26015-26122, 26293-26400,26755-26808,26981-27088,27254-27307,
            27489-27596,27841-27894,28338-28445)
            AUTHORS   Baldwin, Clinton T, Reginato, Anthony M, Smith, Carol,
            Jiminez, Sergio A, and Prockop, Darwin J
            TITLE     Structure of cDNA clones coding for human type II
            procollagen. The alpha1(II) chain is more similar to the alpha1(I)
            chain two other alpha chains of fibrillar collagen JOURNAL
            Biochemical Journal 262, 521-528 (1989)
            REFERENCE   3 (bases 86-4190)
            AUTHORS   Vikkula, Miikka, Metsaranta, Marjo, Syvanen,
            Ann-Cristine, Ala-Kokko, Leena, Vuorio, Eero, and Peltonen, Leena
            TITLE     Structural analysis of the regulatory elements of the
            type-II procollagen gene
            JOURNAL   Biochemical Journal 285, 287-294 (1992)
            REFERENCE   4 (bases 4191-5891)
            AUTHORS   Ryan, Maureen C, Sieraski, Madelyn, and Sandell, Linda J
            TITLE     The Human Type II Procollagen Gene: Identification of an
            Additional Protein-Coding Domain and Location of Potential
            Regulatory Sequences in the Promoter and First Intron JOURNAL
            Genomics 8, 41-48 (1990)
            REFERENCE   5 (bases 20346-20695,20750-20992,21047-21288,
            21334-21479, 21579-21812,21921-22262)
            AUTHORS   Vikkula, Miikka and Peltonen, Leena
            TITLE     Structural Analyses of the Polymorphic Area in Type II
            Collagen Gene
            JOURNAL   FEBS LETTERS 250, 2:171-174 (1989)
            REFERENCE   6 (bases 5909-6121, 6155-6258, 6292-6396,6451-6613,
            6716-7693,7772-8398,8444-85541-30997)
            AUTHORS   Huang, Min-Chi, Seyer, Jerome M, Thompson, James P,
            Spinella, Dominic G, Ceah, Kathy S E, Kang, Andrew H TITLE
            Genomic Organization of the Human Procollagen a1(II) Collagen Gene
            JOURNAL   FEBS LETTERS 195, 593-600 (1991)
            REFERENCE   7 (bases 8609-9008,9063-9837,9892-10265,10320-10450,
            10505-10810,10856-11390,11445-14506,14552-15030,
            15085-16597,16697-16992,17038-17129,17229-17417,
            17472-17995,18104-18468,18523-18606,18706-18845, 18900-19339,
            19439-19831,19886-20291,22317-22594, 22649-23024,23079-23450,
            23505-23758,23867-24357, 24412-24855,24910-25655,25818-26014,
            26123-26292)
            AUTHORS   Leena Ala-Kokko, Ari-Pekka Kvist, Marjo Metsaranta, Kari
            Kivirikko,Benoit de Crombrugghe, Darwin J. Prockop, and Eero
            Vuorio.
            TITLE     Comparison of the Human and Mouse Genes for Type II
            Procollagen (COL2A1). conservation of the relative Sizes of 54
            Introns, about 70% of 25,000 Base Sequences of the Introns and Over
            One Hundred Sites Throughout the Gene for Binding of Common
            Transcription Factors
            JOURNAL   Manuscript, in preparation.
FEATURES             Qualifiers
     source          /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="12q13"
                     /sex="male"
                     /cell_type="leukocyte"
                     /tissue_type="blood"
                     /dev_stage="adult"
     protein         /gene="COL2A1"
                     /db_xref="GDB:G00-119-063"
     intron_pos      29:1 (1/53)
     intron_pos      98:1 (2/53)
     intron_pos      104:0 (3/53)
     intron_pos      115:0 (4/53)
     intron_pos      126:0 (5/53)
     intron_pos      144:0 (6/53)
     intron_pos      178:0 (7/53)
     intron_pos      204:0 (8/53)
     intron_pos      219:0 (9/53)
     intron_pos      237:0 (10/53)
     intron_pos      255:0 (11/53)
     intron_pos      273:0 (12/53)
     intron_pos      291:0 (13/53)
     intron_pos      309:0 (14/53)
     intron_pos      324:0 (15/53)
     intron_pos      342:0 (16/53)
     intron_pos      357:0 (17/53)
     intron_pos      375:0 (18/53)
     intron_pos      408:0 (19/53)
     intron_pos      423:0 (20/53)
     intron_pos      456:0 (21/53)
     intron_pos      474:0 (22/53)
     intron_pos      510:0 (23/53)
     intron_pos      528:0 (24/53)
     intron_pos      561:0 (25/53)
     intron_pos      579:0 (26/53)
     intron_pos      612:0 (27/53)
     intron_pos      630:0 (28/53)
     intron_pos      648:0 (29/53)
     intron_pos      666:0 (30/53)
     intron_pos      684:0 (31/53)
     intron_pos      699:0 (32/53)
     intron_pos      732:0 (33/53)
     intron_pos      768:0 (34/53)
     intron_pos      786:0 (35/53)
     intron_pos      804:0 (36/53)
     intron_pos      822:0 (37/53)
     intron_pos      840:0 (38/53)
     intron_pos      876:0 (39/53)
     intron_pos      894:0 (40/53)
     intron_pos      912:0 (41/53)
     intron_pos      966:0 (42/53)
     intron_pos      1002:0 (43/53)
     intron_pos      1038:0 (44/53)
     intron_pos      1056:0 (45/53)
     intron_pos      1092:0 (46/53)
     intron_pos      1110:0 (47/53)
     intron_pos      1146:0 (48/53)
     intron_pos      1164:0 (49/53)
     intron_pos      1200:0 (50/53)
     intron_pos      1296:1 (51/53)
     intron_pos      1359:0 (52/53)
     intron_pos      1440:0 (53/53)
BEGIN
        1 MIRLGAPQSL VLLTLLVAAV LRCQGQDVQE AGSCVQDGQR YNDKDVWKPE PCRICVCDTG
       61 TVLCDDIICE DVKDCLSPEI PFGECCPICP TDLATASGQP GPKGQKGEPG DIKDIVGPKG
      121 PPGPQGPAGE QGPRGDRGDK GEKGAPGPRG RDGEPGTPGN PGPPGPPGPP GPPGLGGNFA
      181 AQMAGGFDEK AGGAQLGVMQ GPMGPMGPRG PPGPAGAPGP QGFQGNPGEP GEPGVSGPMG
      241 PRGPPGPPGK PGDDGEAGKP GKAGERGPPG PQGARGFPGT PGLPGVKGHR GYPGLDGAKG
      301 EAGAPGVKGE SGSPGENGSP GPMGPRGLPG ERGRTGPAGA AGARGNDGQP GPAGPPGPVG
      361 PAGGPGFPGA PGAKGEAGPT GARGPEGAQG PRGEPGTPGS PGPAGASGNP GTDGIPGAKG
      421 SAGAPGIAGA PGFPGPRGPP GPQGATGPLG PKGQTGEPGI AGFKGEQGPK GEPGPAGPQG
      481 APGPAGEEGK RGARGEPGGV GPIGPPGERG APGNRGFPGQ DGLAGPKGAP GERGPSGLAG
      541 PKGANGDPGR PGEPGLPGAR GLTGRPGDAG PQGKVGPSGA PGEDGRPGPP GPQGARGQPG
      601 VMGFPGPKGA NGEPGKAGEK GLPGAPGLRG LPGKDGETGA AGPPGPAGPA GERGEQGAPG
      661 PSGFQGLPGP PGPPGEGGKP GDQGVPGEAG APGLVGPRGE RGFPGERGSP GAQGLQGPRG
      721 LPGTPGTDGP KGASGPAGPP GAQGPPGLQG MPGERGAAGI AGPKGDRGDV GEKGPEGAPG
      781 KDGGRGLTGP IGPPGPAGAN GEKGEVGPPG PAGSAGARGA PGERGETGPP GPAGFAGPPG
      841 ADGQPGAKGE QGEAGQKGDA GAPGPQGPSG APGPQGPTGV TGPKGARGAQ GPPGATGFPG
      901 AAGRVGPPGS NGNPGPPGPP GPSGKDGPKG ARGDSGPPGR AGEPGLQGPA GPPGEKGEPG
      961 DDGPSGAEGP PGPQGLAGQR GIVGLPGQRG ERGFPGLPGP SGEPGKQGAP GASGDRGPPG
     1021 PVGPPGLTGP AGEPGREGSP GADGPPGRDG AAGVKGDRGE TGAVGAPGAP GPPGSPGPAG
     1081 PTGKQGDRGE AGAQGPMGPS GPAGARGIQG PQGPRGDKGE AGEPGERGLK GHRGFTGLQG
     1141 LPGPPGPSGD QGASGPAGPS GPRGPPGPVG PSGKDGANGI PGPIGPPGPR GRSGETGPAG
     1201 PPGNPGPPGP PGPPGPGIDM SAFAGLGPRE KGPDPLQYMR ADQAAGGLRQ HDAEVDATLK
     1261 SLNNQIESIR SPEGSRKNPA RTCRDLKLCH PEWKSGDYWI DPNQGCTLDA MKVFCNMETG
     1321 ETCVYPNPAN VPKKNWWSSK SKEKKHIWFG ETINGGFHFS YGDDNLAPNT ANVQMTFLRL
     1381 LSTEGSQNIT YHCKNSIAYL DEAAGNLKKA LLIQGSNDVE IRAEGNSRFT YTALKDGCTK
     1441 HTGKWGKTVI EYRSQKTSRL PIIDIAPMDI GGPEQEFGVD IGPVCFL
//