LOCUS       BC052575                3331 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type VI, alpha 1, mRNA (cDNA clone MGC:59702
            IMAGE:6598940), complete cds.
ACCESSION   BC052575
VERSION     BC052575.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3331)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3331)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (16-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 47 Row: o Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 15011912.
FEATURES             Location/Qualifiers
     source          1..3331
                     /db_xref="H-InvDB:HIT000053797"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59702 IMAGE:6598940"
                     /tissue_type="Ovary, teratocarcinoma"
                     /clone_lib="NIH_MGC_109"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3331
                     /gene="COL6A1"
                     /gene_synonym="OPLL"
                     /db_xref="GeneID:1291"
                     /db_xref="HGNC:HGNC:2211"
                     /db_xref="MIM:120220"
     CDS             83..3169
                     /gene="COL6A1"
                     /gene_synonym="OPLL"
                     /codon_start=1
                     /product="collagen, type VI, alpha 1"
                     /protein_id="AAH52575.1"
                     /db_xref="GeneID:1291"
                     /db_xref="HGNC:HGNC:2211"
                     /db_xref="MIM:120220"
                     /translation="MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDT
                     SESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQG
                     LTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGH
                     PLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADW
                     GQSRDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGL
                     PGEKGEAGDPGRPGDLGPVGYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVK
                     GEMGYPGLPGCKGSPGFDGIQGPPGPKGDPGAFGLKGEKGEPGADGEAGRPGSSGPSG
                     DEGQPGEPGPPGEKGEAGDEGNPGPDGAPGERGGPGERGPRGTPGTRGPRGDPGEAGP
                     QGDQGREGPVGVPGDPGEAGPIGPKGYRGDEGPPGSEGARGAPGPAGPPGDPGLMGER
                     GEDGPAGNGTEGFPGFPGYPGNRGAPGINGTKGYPGLKGDEGEAGDPGDDNNDIAPRG
                     VKGAKGYRGPEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSE
                     SIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIR
                     NVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPNNRIALVITDGRSDTQRDTTP
                     LNVLCSPGIQVVSVGIKDVFDFIPGSDQLNVISCQGLAPSQGRPGLSLVKENYAELLE
                     DAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLA
                     ERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDAT
                     DVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQEAQRAGIEIFVV
                     VVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQALLRGVFHQTVSRKVALG"
BASE COUNT          754 a         1019 c         1100 g          458 t
ORIGIN      
        1 cagaaggcag cctcggtctc tgggcggcgg cggcggccca ctctgccctg gccgcgctgt
       61 gtggtgaccg caggcccgag acatgagggc ggcccgtgct ctgctgcccc tgctgctgca
      121 ggcctgctgg acagccgcgc aggatgagcc ggagaccccg agggccgtgg ccttccagga
      181 ctgccccgtg gacctgttct ttgtgctgga cacctctgag agcgtggccc tgaggctgaa
      241 gccctacggg gccctcgtgg acaaagtcaa gtccttcacc aagcgcttca tcgacaacct
      301 gagggacagg tactaccgct gtgaccgaaa cctggtgtgg aacgcaggcg cgctgcacta
      361 cagtgacgag gtggagatca tccaaggcct cacgcgcatg cctggcggcc gcgacgcact
      421 caaaagcagc gtggacgcgg tcaagtactt tgggaagggc acctacaccg actgcgctat
      481 caagaagggg ctggagcagc tcctcgtggg gggctcccac ctgaaggaga ataagtacct
      541 gattgtggtg accgacgggc accccctgga gggctacaag gaaccctgtg gggggctgga
      601 ggatgctgtg aacgaggcca agcacctggg cgtcaaagtc ttctcggtgg ccatcacacc
      661 cgaccacctg gagccgcgtc tgagcatcat cgccacggac cacacgtacc ggcgcaactt
      721 cacggcggct gactggggcc agagccgcga cgcagaggag gccatcagcc agaccatcga
      781 caccatcgtg gacatgatca aaaataacgt ggagcaagtg tgctgctcct tcgaatgcca
      841 gcctgcaaga ggacctccgg ggctccgggg cgaccccggc tttgagggag aacgaggcaa
      901 gccggggctc ccaggagaga agggagaagc cggagatcct ggaagacccg gggacctcgg
      961 acctgttggg taccagggaa tgaagggaga aaaagggagc cgtggggaga agggctccag
     1021 gggacccaag ggctacaagg gagagaaggg caagcgtggc atcgacgggg tggacggcgt
     1081 gaagggggag atggggtacc caggcctgcc aggctgcaag ggctcgcccg ggtttgacgg
     1141 cattcaagga ccccctggcc ccaagggaga ccccggcgcc tttggactga aaggagaaaa
     1201 gggcgagcct ggagctgacg gggaggcggg gagaccaggg agctcgggac catctggaga
     1261 cgagggccag ccgggagagc ctgggccccc cggagagaaa ggagaggcgg gcgacgaggg
     1321 gaacccagga cctgacggtg cccccgggga gcggggtggc cctggagaga gaggaccacg
     1381 ggggacccca ggcacacggg gaccaagagg agaccctggt gaagctggcc cgcagggtga
     1441 tcagggaaga gaaggccccg ttggtgtccc tggagacccg ggcgaggctg gccctatcgg
     1501 acctaaaggc taccgaggcg atgagggtcc cccagggtcc gagggtgcca gaggagcccc
     1561 aggacctgcc ggaccccctg gagacccggg gctgatgggt gaaaggggag aagacggccc
     1621 cgctggaaat ggcaccgagg gcttccccgg cttccccggg tatccgggca acaggggcgc
     1681 tcccgggata aacggcacga agggctaccc cggcctcaag ggggacgagg gagaagccgg
     1741 ggaccccgga gacgataaca acgacattgc accccgagga gtcaaaggag caaaggggta
     1801 ccggggtccc gagggccccc agggaccccc aggacaccaa ggaccgcctg ggccggacga
     1861 atgcgagatt ttggacatca tcatgaaaat gtgctcttgc tgtgaatgca agtgcggccc
     1921 catcgacctc ctgttcgtgc tggacagctc agagagcatt ggcctgcaga acttcgagat
     1981 tgccaaggac ttcgtcgtca aggtcatcga ccggctgagc cgggacgagc tggtcaagtt
     2041 cgagccaggg cagtcgtacg cgggtgtggt gcagtacagc cacagccaga tgcaggagca
     2101 cgtgagcctg cgcagcccca gcatccggaa cgtgcaggag ctcaaggaag ccatcaagag
     2161 cctgcagtgg atggcgggcg gcaccttcac gggggaggcc ctgcagtaca cgcgggacca
     2221 gctgctgccg cccagcccga acaaccgcat cgccctggtc atcactgacg ggcgctcaga
     2281 cactcagagg gacaccacac cgctcaacgt gctctgcagc cccggcatcc aggtggtctc
     2341 cgtgggcatc aaagacgtgt ttgacttcat cccaggctca gaccagctca atgtcatttc
     2401 ttgccaaggc ctggcaccat cccagggccg gcccggcctc tcgctggtca aggagaacta
     2461 tgcagagctg ctggaggatg ccttcctgaa gaatgtcacc gcccagatct gcatagacaa
     2521 gaagtgtcca gattacacct gccccatcac gttctcctcc ccggctgaca tcaccatcct
     2581 gctggacggc tccgccagcg tgggcagcca caactttgac accaccaagc gcttcgccaa
     2641 gcgcctggcc gagcgcttcc tcacagcggg caggacggac cccgcccacg acgtgcgggt
     2701 ggcggtggtg cagtacagcg gcacgggcca gcagcgccca gagcgggcgt cgctgcagtt
     2761 cctgcagaac tacacggccc tggccagtgc cgtcgatgcc atggacttta tcaacgacgc
     2821 caccgacgtc aacgatgccc tgggctatgt gacccgcttc taccgcgagg cctcgtccgg
     2881 cgctgccaag aagaggctgc tgctcttctc agatggcaac tcgcagggcg ccacgcccgc
     2941 tgccatcgag aaggccgtgc aggaagccca gcgggcaggc atcgagatct tcgtggtggt
     3001 cgtgggccgc caggtgaatg agccccacat ccgcgtcctg gtcaccggca agacggccga
     3061 gtacgacgtg gcctacggcg agagccacct gttccgtgtc cccagctacc aggccctgct
     3121 ccgcggtgtc ttccaccaga cagtctccag gaaggtggcg ctgggctagc ccaccctgca
     3181 cgccggcacc aaaccctgtc ctcccacccc tccccactca tcactaaaca gagtaaaatg
     3241 tgatgcgaat tttcccgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//