LOCUS       BC151220                6517 bp    mRNA    linear   HUM 24-JUL-2007
DEFINITION  Homo sapiens collagen, type IV, alpha 1, mRNA (cDNA clone
            MGC:165004 IMAGE:40148649), complete cds.
ACCESSION   BC151220
VERSION     BC151220.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 6517)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 6517)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (23-JUL-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Novartis Institute for Biomedical Research
            cDNA Library Preparation: Novartis Institute for Biomedical
            Research
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 306 Row: l Column: 7.
FEATURES             Location/Qualifiers
     source          1..6517
                     /db_xref="H-InvDB:HIT000435929"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:165004 IMAGE:40148649"
                     /tissue_type="Donated clones,Novartis FGA collection"
                     /clone_lib="NIH_MGC_417"
                     /lab_host="DH5a"
                     /note="Vector: pCMV-SPORT6"
     gene            1..6517
                     /gene="COL4A1"
                     /gene_synonym="arresten"
                     /db_xref="GeneID:1282"
                     /db_xref="HGNC:HGNC:2202"
                     /db_xref="MIM:120130"
     CDS             115..5124
                     /gene="COL4A1"
                     /gene_synonym="arresten"
                     /codon_start=1
                     /product="COL4A1 protein"
                     /protein_id="AAI51221.1"
                     /db_xref="GeneID:1282"
                     /db_xref="HGNC:HGNC:2202"
                     /db_xref="MIM:120130"
                     /translation="MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGV
                     KGQKGERGLPGLQGVIGFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYP
                     GNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPG
                     EILGHVPGMLLKGERGFPGIPGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKG
                     QMGLSFQGPKGDKGDQGVSGPPGVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGE
                     KGEPGKPGPRGKPGKDGDKGEKGSPGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIV
                     IGTGPLGEKGERGYPGTPGPRGEPGPKGFPGLPGQPGPPGLPVPGQAGAPGFPGERGE
                     KGDRGFPGTSLPGPSGRDGLPGPPGSPGPPGQPGYTNGIVECQPGPPGDQGPPGIPGQ
                     PGFIGEIGEKGQKGESCLICDIDGYRGPPGPQGPPGEIGFPGQPGAKGDRGLPGRDGV
                     AGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLKGDKGDPGFPGQPGMPGRAGSPGRDGH
                     PGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDTGPPGPPGYGPAGPIGDKGQAGFPG
                     GPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKG
                     AVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPG
                     LKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGV
                     PGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQ
                     SGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSS
                     GPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPG
                     TPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGI
                     PGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIG
                     IPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGP
                     AGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPG
                     SKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGID
                     GVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
                     LPGLQGIKGDQGDHGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPG
                     PKGQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGP
                     PGTPSVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCL
                     RKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCE
                     APAMVMAVHSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRS
                     APFIECHGRGTCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRR
                     T"
BASE COUNT         1665 a         1656 c         1911 g         1285 t
ORIGIN      
        1 ccgccgcacc cgggacggtg cgtagcgctg gaagtccggc cttccgagag ctagctgtcc
       61 gccgcggccc ccgcacgccg ggcagccgtc cctcgccgcc tcgggcgcgc caccatgggg
      121 ccccggctca gcgtctggct gctgctgctg cccgccgccc ttctgctcca cgaggagcac
      181 agccgggccg ctgcgaaggg tggctgtgct ggctctggct gtggcaaatg tgactgccat
      241 ggagtgaagg gacaaaaggg tgaaagaggc ctcccggggt tacaaggtgt cattgggttt
      301 cctggaatgc aaggacctga ggggccacag ggaccaccag gacaaaaggg tgatactgga
      361 gaaccaggac tacctggaac aaaagggaca agaggacctc cgggagcatc tggctaccct
      421 ggaaacccag gacttcccgg aattcctggc caagacggcc cgccaggccc cccaggtatt
      481 ccaggatgca atggcacaaa gggggagaga gggccgctcg ggcctcctgg cttgcctggt
      541 ttcgctggaa atcccggacc accaggctta ccagggatga agggtgatcc aggtgagata
      601 cttggccatg tgcccgggat gctgttgaaa ggtgaaagag gatttcccgg aatcccaggg
      661 actccaggcc caccaggact gccagggctt caaggtcctg ttgggcctcc aggatttacc
      721 ggaccaccag gtcccccagg ccctcccggc cctccaggtg aaaagggaca aatgggctta
      781 agttttcaag gaccaaaagg tgacaagggt gaccaagggg tcagtgggcc tccaggagta
      841 ccaggacaag ctcaagttca agaaaaagga gacttcgcca ccaagggaga aaagggccaa
      901 aaaggtgaac ctggatttca ggggatgcca ggggtcggag agaaaggtga acccggaaaa
      961 ccaggaccca gaggcaaacc cggaaaagat ggtgacaaag gggaaaaagg gagtcccggt
     1021 tttcctggtg aacccgggta cccaggactc ataggccgcc agggcccgca gggagaaaag
     1081 ggtgaagcag gtcctcctgg cccacctgga attgttatag gcacaggacc tttgggagaa
     1141 aaaggagaga ggggctaccc tggaactccg gggccaagag gagagccagg cccaaaaggt
     1201 ttcccaggac taccaggcca acccggacct ccaggcctcc ctgtacctgg gcaggctggt
     1261 gcccctggct tccctggtga aagaggagaa aaaggtgacc gaggatttcc tggtacatct
     1321 ctgccaggac caagtggaag agatgggctc ccgggtcctc ctggttcccc tgggccccct
     1381 gggcagcctg gctacacaaa tggaattgtg gaatgtcagc ccggacctcc aggtgaccag
     1441 ggtcctcctg gaattccagg gcagccagga tttataggcg aaattggaga gaaaggtcaa
     1501 aaaggagaga gttgcctcat ctgtgatata gacggatatc gggggcctcc cgggccacag
     1561 ggacccccgg gagaaatagg tttcccaggg cagccagggg ccaagggcga cagaggtttg
     1621 cctggcagag atggtgttgc aggagtgcca ggccctcaag gtacaccagg gctgataggc
     1681 cagccaggag ccaaggggga gcctggtgag ttttatttcg acttgcggct caaaggtgac
     1741 aaaggagacc caggctttcc aggacagccc ggcatgccag ggagagcggg ttctcctgga
     1801 agagatggcc atccgggtct tcctggcccc aagggctcgc cgggttctgt aggattgaaa
     1861 ggagagcgtg gcccccctgg aggagttgga ttcccaggca gtcgtggtga caccggcccc
     1921 cctgggcccc caggatatgg tcctgctggt cccattggtg acaaaggaca agcaggcttt
     1981 cctggaggcc ctggatcccc aggcctgcca ggtccaaagg gtgaaccagg aaaaattgtt
     2041 cctttaccag gcccccctgg agcagaagga ctgccggggt ccccaggctt cccaggtccc
     2101 caaggagacc gaggctttcc cggaacccca ggaaggccag gcctgccagg agagaagggc
     2161 gctgtgggcc agccaggcat tggatttcca gggccccccg gccccaaagg tgttgacggc
     2221 ttacctggag acatggggcc accagggact ccaggtcgcc cgggatttaa tggcttacct
     2281 gggaacccag gtgtgcaggg ccagaaggga gagcctggag ttggtctacc gggactcaaa
     2341 ggtttgccag gtcttcccgg cattcctggc acacccgggg agaaggggag cattggggta
     2401 ccaggcgttc ctggagaaca tggagcgatc ggaccccctg ggcttcaggg gatcagaggt
     2461 gaaccgggac ctcctggatt gccaggctcc gtggggtctc caggagttcc aggaataggc
     2521 ccccctggag ctaggggtcc ccctggagga cagggaccac cggggttgtc aggccctcct
     2581 ggaataaaag gagagaaggg tttccccgga ttccctggac tggacatgcc gggccctaaa
     2641 ggagataaag gggctcaagg actccctggc ataacgggac agtcggggct ccctggcctt
     2701 cctggacagc agggggctcc tgggattcct gggtttccag gttccaaggg agaaatgggc
     2761 gtcatgggga cccccgggca gccgggctca ccaggaccag tgggtgctcc tggattaccg
     2821 ggtgaaaaag gggaccatgg ctttccgggc tcctcaggac ccaggggaga ccctggcttg
     2881 aaaggtgata agggggatgt cggtctccct ggcaagcctg gctccatgga taaggtggac
     2941 atgggcagca tgaagggcca gaaaggagac caaggagaga aaggacaaat tggaccaatt
     3001 ggtgagaagg gatcccgagg agaccctggg accccaggag tgcctggaaa ggacgggcag
     3061 gcaggacagc ctgggcagcc aggacctaaa ggtgatccag gtataagtgg aaccccaggt
     3121 gctccaggac ttccgggacc aaaaggatct gttggtggaa tgggcttgcc aggaacacct
     3181 ggagagaaag gtgtgcctgg catccctggc ccacaaggtt cacctggctt acctggagac
     3241 aaaggtgcaa aaggagagaa agggcaggca ggcccacctg gcataggcat cccaggactg
     3301 cgtggtgaaa agggagatca agggatagcg ggtttcccag gaagccctgg agagaaggga
     3361 gaaaaaggaa gcattgggat cccaggaatg ccagggtccc caggccttaa agggtctccc
     3421 gggagtgttg gctatccagg aagtcctggg ctacctggag aaaaaggtga caaaggcctc
     3481 ccaggattgg atggcatccc tggtgtcaaa ggagaagcag gtcttcctgg gactcctggc
     3541 cccacaggcc cagctggcca gaaaggggag ccaggcagtg atggaatccc ggggtcagca
     3601 ggagagaagg gtgaaccagg tctaccagga agaggattcc cagggtttcc aggggccaaa
     3661 ggagacaaag gttcaaaggg tgaggtgggt ttcccaggat tagccgggag cccaggaatt
     3721 cctggatcca aaggagagca aggattcatg ggtcctccgg ggccccaggg acagccgggg
     3781 ttaccgggat ccccaggcca tgccacggag gggcccaaag gagaccgcgg acctcagggc
     3841 cagcctggcc tgccaggact tccgggaccc atggggcctc cagggcttcc tgggattgat
     3901 ggagttaaag gtgacaaagg aaatccaggc tggccaggag cacccggtgt cccagggccc
     3961 aagggagacc ctggattcca gggcatgcct ggtattggtg gctctccagg aatcacaggc
     4021 tctaagggtg atatggggcc tccaggagtt ccaggatttc aaggtccaaa aggtcttcct
     4081 ggcctccagg gaattaaagg tgatcaaggc gatcacggcg tcccgggagc taaaggtctc
     4141 ccgggtcctc ctggcccccc aggtccttac gacatcatca aaggggagcc cgggctccct
     4201 ggtcctgagg gccccccagg gctgaaaggg cttcagggac tgccaggccc gaaaggccag
     4261 caaggtgtta caggattggt gggtatacct ggacctccag gtattcctgg gtttgacggt
     4321 gcccctggcc agaaaggaga gatgggacct gccgggccta ctggtccaag aggatttcca
     4381 ggtccaccag gccccgatgg gttgccagga tccatggggc ccccaggcac cccatctgtt
     4441 gatcacggct tccttgtgac caggcatagt caaacaatag atgacccaca gtgtccttct
     4501 gggaccaaaa ttctttacca cgggtactct ttgctctacg tgcaaggcaa tgaacgggcc
     4561 catggccagg acttgggcac ggctggcagc tgcctgcgca agttcagcac aatgcccttc
     4621 ctgttctgca atattaacaa cgtgtgcaac tttgcatcac gaaatgacta ctcgtactgg
     4681 ctgtccaccc ctgagcccat gcccatgtca atggcaccca tcacggggga aaacataaga
     4741 ccatttatta gtaggtgtgc tgtgtgtgag gcgcctgcca tggtgatggc cgtgcacagc
     4801 cagaccattc agatcccacc gtgccccagc gggtggtcct cgctgtggat cggctactct
     4861 tttgtgatgc acaccagcgc tggtgcagaa ggctctggcc aagccctggc gtcccccggc
     4921 tcctgcctgg aggagtttag aagtgcgcca ttcatcgagt gtcacggccg tgggacctgc
     4981 aattactacg caaacgctta cagcttttgg ctcgccacca tagagaggag cgagatgttc
     5041 aagaagccta cgccgtccac cttgaaggca ggggagctgc gcacgcacgt cagccgctgc
     5101 caagtctgta tgagaagaac ataatgaagc ctgactcagc taatgtcaca acatggtgct
     5161 acttcttctt ctttttgtta acagcaacga accctagaaa tatatcctgt gtacctcact
     5221 gtccaatatg aaaaccgtaa agtgccttat aggaatttgc gtaactaaca caccctgctt
     5281 cattgacctc tacttgctga aggagaaaaa gacagcgata agctttcaat agtggcatac
     5341 caaatggcac ttttgatgaa ataaaatatc aatattttct gcaatccaat gcactgatgt
     5401 gtgaagtgag aactccatca gaaaaccaaa gggtgctagg aggtgtgggt gccttccata
     5461 ctgtttgccc attttcattc ttgtattata attaattttc tacccccaga gataaatgtt
     5521 tgtttatatc actgtctagc tgtttcaaaa tttaggtccc ttggtctgta caaataatag
     5581 caatgtaaaa atggtttttt gaacctccaa atggaattac agactcagta gccatatctt
     5641 ccaacccccc agtataaatt tctgtctttc tgctatgtgt ggtactttgc agctgctttt
     5701 gcagaaatca caattttcct gtggaataaa gatggtccaa aaatagtcaa aaattaaata
     5761 tatatatata ttagtaattt atatagatgt cagcaattag gcagatcaag gtttagttta
     5821 acttccactg ttaaaataaa gcttacatag ttttcttcct ttgaaagact gtgctgtcct
     5881 ttaacatagg tttttaaaga ctaggatatt gaatgtgaaa catccgtttt cattgttcac
     5941 ttctaaacca aaaattatgt gttgccaaaa ccaaacccag gttcatgaat atggtgtcta
     6001 ttatagtgaa acatgtactt tgagcttatt gtttttattc tgtattaaat attttcaggg
     6061 ttttaaacac taatcacaaa ctgaatgact tgacttcaaa agcaacaacc ttaaaggccg
     6121 tcatttcatt agtattcctc attctgcatc ctggcttgaa aaacagctct gttgaatcac
     6181 agtatcagta ttttcacacg taagcacatt cgggccattt ccgtggtttc tcatgagctg
     6241 tgttcacaga cctcagcagg gcatcgcatg gaccgcagga gggcagattc ggaccactag
     6301 gcctgaaatg acatttcact aaaagtctcc aaaacatttc taagactact aaggcctttt
     6361 atgtaatttc tttaaatgtg tatttcttaa gaattcaaat ttgtaataaa actatttgta
     6421 taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     6481 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa
//