LOCUS       BC013581                2506 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens collagen, type VIII, alpha 1, mRNA (cDNA clone
            MGC:9568 IMAGE:3875911), complete cds.
ACCESSION   BC013581
VERSION     BC013581.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2506)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2506)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (04-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 14 Row: a Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 32895369.
FEATURES             Location/Qualifiers
     source          1..2506
                     /db_xref="H-InvDB:HIT000036288"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:9568 IMAGE:3875911"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_68"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2506
                     /gene="COL8A1"
                     /gene_synonym="MGC9568"
                     /db_xref="GeneID:1295"
                     /db_xref="HGNC:HGNC:2215"
                     /db_xref="MIM:120251"
     CDS             238..2472
                     /gene="COL8A1"
                     /gene_synonym="MGC9568"
                     /codon_start=1
                     /product="collagen, type VIII, alpha 1"
                     /protein_id="AAH13581.1"
                     /db_xref="GeneID:1295"
                     /db_xref="HGNC:HGNC:2215"
                     /db_xref="MIM:120251"
                     /translation="MAVLPGPLQLLGVLLTISLSSIRLIQAGAYYGIKPLPPQIPPQM
                     PPQIPQYQPLGQQVPHMPLAKDGLAMGKEMPHLQYGKEYPHLPQYMKEIQPAPRMGKE
                     AVPKKGKEIPLASLRGEQGPRGEPGPRGPPGPPGLPGHGIPGIKGKPGPQGYPGVGKP
                     GMPGMPGKPGAMGMPGAKGEIGQKGEIGPMGIPGPQGPPGPHGLPGIGKPGGPGLPGQ
                     PGPKGDRGPKGLPGPQGLRGPKGDKGFGMPGAPGVKGPPGMHGPPGPVGLPGVGKPGV
                     TGFPGPQGPLGKPGAPGEPGPQGPIGVPGVQGPPGIPGIGKPGQDGIPGQPGFPGGKG
                     EQGLPGLPGPPGLPGIGKPGFPGPKGDRGMGGVPGALGPRGEKGPIGAPGIGGPPGEP
                     GLPGIPGPMGPPGAIGFPGPKGEGGIVGPQGPPGPKGEPGLQGFPGKPGFLGEVGPPG
                     MRGLPGPIGPKGEAGQKGVPGLPGVPGLLGPKGEPGIPGDQGLQGPPGIPGIGGPSGP
                     IGPPGIPGPKGEPGLPGPPGFPGIGKPGVAGLHGPPGKPGALGPQGQPGLPGPPGPPG
                     PPGPPAVMPPTPPPQGEYLPDMGLGIDGVKPPHAYGAKKGKNGGPAYEMPAFTAELTA
                     PFPPVGAPVKFNKLLYNGRQNYNPQTGIFTCEVPGVYYFAYHVHCKGGNVWVALFKNN
                     EPVMYTYDEYKKGFLDQASGSAVLLLRPGDRVFLQMPSEQAAGLYAGQYVHSSFSGYL
                     LYPM"
BASE COUNT          624 a          698 c          753 g          431 t
ORIGIN      
        1 ctccgtggga gccagcgagc ctctctccct gatcttacgt gctcaaggga gctcacacgt
       61 tcaccaactc acccttgaag tcatctcaag aacaaaagac aactgaaaga agctgttgtg
      121 aaggcagagc agcatctgct gaagagacag aaaccagccc cagaggtgtc acaggaaggc
      181 accagcaagg acattggtct ttgatttgat tcagcagtcc tgtcaagtat aaatgtgatg
      241 gctgtgctgc ctggccctct gcagctgctg ggagtgctgc ttaccatttc cctgagttcc
      301 atcaggctca ttcaggctgg tgcctactat gggatcaagc cgctgccacc tcaaattcct
      361 cctcagatgc caccacaaat tccacaatac cagcccctgg gtcagcaagt acctcacatg
      421 cctttggcca aagatggcct tgccatgggc aaggagatgc cccacttgca gtatggcaaa
      481 gagtatccac acctacccca atatatgaag gaaattcaac cggcgccaag aatgggcaag
      541 gaagccgtac ccaagaaagg caaagaaata ccattagcca gtttacgagg ggaacaaggt
      601 ccccgtggag agcctggccc aagaggacca cctgggcccc ctggtttgcc aggtcatggg
      661 atacctggaa ttaaaggaaa accagggcca cagggatatc caggagttgg aaagccaggt
      721 atgcctggaa tgccagggaa gccaggagcc atgggcatgc ctggggcaaa aggagaaatt
      781 ggacagaaag gggaaattgg gcctatgggg atcccaggac cacaaggacc tccagggcct
      841 catggacttc ctggcattgg gaagccaggt gggccagggt taccagggca accaggacca
      901 aagggtgatc gaggacccaa aggactacca ggacctcaag gccttcgggg tcctaaagga
      961 gacaagggct tcgggatgcc aggtgcgcca ggtgtaaagg ggcctccagg gatgcacggc
     1021 cctcccggcc ctgttggact gccaggagtg ggcaaaccag gagtgacagg cttccctggg
     1081 ccccagggcc ccctgggaaa gccaggggct ccaggagaac ctgggccaca aggccctatt
     1141 ggggtaccgg gggttcaagg acctcctggg atacccggaa ttggaaagcc aggccaggat
     1201 gggatcccag gccagccagg atttccaggt ggcaaagggg agcaaggact gccagggcta
     1261 ccaggacccc caggccttcc agggattggg aaaccaggct tcccaggacc caaaggtgac
     1321 cggggcatgg gaggtgttcc tggggctctt ggaccaagag gggagaaagg accaataggt
     1381 gccccaggaa tagggggtcc tccaggagag ccaggcctgc ctggaatccc aggtcctatg
     1441 ggccctccag gtgctattgg ttttcctgga cccaaaggag aaggtgggat tgtagggcca
     1501 caggggccac caggtcccaa gggtgagcca gggcttcaag gcttcccagg aaagccaggt
     1561 ttccttggtg aagtagggcc tcctggcatg aggggtttgc caggtcccat agggcccaag
     1621 ggggaagctg ggcaaaaagg tgtaccagga ctccctggtg ttccagggct tctcggacct
     1681 aagggagagc caggaatccc aggggatcag ggtttacagg gccccccagg tatcccaggg
     1741 attgggggcc ctagtggccc cattggacca cctgggattc caggccccaa aggggagccg
     1801 ggcctcccag ggccccctgg gttccctggt atagggaaac ccggagtggc aggacttcat
     1861 ggccccccag ggaagcctgg tgcccttggt cctcaaggcc agcctggcct tccaggaccc
     1921 ccaggccctc caggacctcc aggaccccca gctgtgatgc cccctacacc accaccccag
     1981 ggagagtatc tgccagatat ggggctggga attgatggcg tgaaaccccc ccatgcctac
     2041 ggggctaaga aaggcaagaa tggagggcca gcctatgaga tgcctgcatt taccgccgag
     2101 ctaaccgcac ctttcccacc ggtgggggcc ccagtgaagt ttaacaaact gctgtataac
     2161 ggcagacaga actacaaccc gcagacaggc atcttcacct gtgaggtccc tggtgtctac
     2221 tactttgcat accacgttca ctgcaagggg ggcaacgtgt gggttgctct attcaagaac
     2281 aacgagcccg tgatgtacac gtacgacgag tacaaaaagg gcttcctgga ccaggcatct
     2341 gggagtgcag tgctgctgct caggcccgga gaccgggtgt tcctccagat gccctcagaa
     2401 caggctgcag gactgtatgc cgggcagtat gtccactcct ccttttcagg atatttattg
     2461 tatcccatgt aaaaacaaaa aaacaaaaaa caaaaaaaaa aaaaaa
//