LOCUS       BC047305                3504 bp    mRNA    linear   HUM 04-NOV-2003
DEFINITION  Homo sapiens collagen, type IV, alpha 1, mRNA (cDNA clone
            IMAGE:4941939), partial cds.
ACCESSION   BC047305
VERSION     BC047305.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3504)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3504)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: David N. Louis, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 93 Row: e Column: 14.
FEATURES             Location/Qualifiers
     source          1..3504
                     /db_xref="H-InvDB:HIT000098508"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4941939"
                     /tissue_type="Brain, anaplastic oligodendroglioma with
                     1p/19q loss"
                     /clone_lib="NCI_CGAP_Brn67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            <1..3504
                     /gene="COL4A1"
                     /db_xref="GeneID:1282"
                     /db_xref="MIM:120130"
     CDS             <1..3229
                     /gene="COL4A1"
                     /codon_start=2
                     /product="COL4A1 protein"
                     /protein_id="AAH47305.1"
                     /db_xref="GeneID:1282"
                     /db_xref="MIM:120130"
                     /translation="GSRGDTGPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEP
                     GKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPG
                     PKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTP
                     GEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGVPGIGPPGARGPPGG
                     QGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQSGLPGLPGQQGAPG
                     IPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSSGPRGDPGLKGDKGD
                     VGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPGTPGVPGKDGQAGQP
                     GQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGIPGPQGSPGLPGDKG
                     AKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIGIPGMPGSPGLKGSP
                     GSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPG
                     SAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPGSKGEQGFMGPPGPQ
                     GQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGIDGVKGDKGNPGWPGA
                     PGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKGLPGLQGIKGDQGDH
                     GVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPKGQQGVTGLVGIP
                     GPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTPSVDHGFLVTR
                     HSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMPFLFCNIN
                     NVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAVHSQTIQ
                     IPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNY
                     YANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT"
     misc_feature    2552..2881
                     /gene="COL4A1"
                     /note="C4; Region: C-terminal tandem repeated domain in
                     type 4 procollagen. Duplicated domain in C-terminus of
                     type 4 collagens. Mutations in alpha-5 collagen IV are
                     associated with X-linked Alport syndrome"
                     /db_xref="CDD:pfam01413"
     misc_feature    2882..3223
                     /gene="COL4A1"
                     /note="C4; Region: C-terminal tandem repeated domain in
                     type 4 procollagens"
                     /db_xref="CDD:smart00111"
BASE COUNT          825 a          942 c         1105 g          632 t
ORIGIN      
        1 aggcagtcgt ggtgacaccg gcccccctgg gcctccagga tatggtcctg ctggtcccat
       61 tggtgacaaa ggacaagcag gctttcctgg aggccctgga tccccaggcc tgccaggtcc
      121 aaagggtgaa ccaggaaaaa ttgttccttt accaggcccc cctggagcag aaggactgcc
      181 ggggtcccca ggcttcccag gtccccaagg agaccgaggc tttcccggaa ccccaggaag
      241 gccaggcctg ccaggagaga agggcgctgt gggccagcca ggcattggat ttccagggcc
      301 ccccggcccc aaaggtgttg acggcttacc tggagacatg gggccaccgg ggactccagg
      361 tcgcccggga tttaatggct tacctgggaa cccaggtgtg cagggccaga agggagagcc
      421 tggagttggt ctaccgggac tcaaaggttt gccaggtctt cccggcattc ctggcacacc
      481 cggggagaag gggagcattg gggtaccagg cgttcctgga gaacatggag cgatcggacc
      541 ccctgggctt caggggatca gaggtgaacc gggacctcct ggattgccag gctccgtggg
      601 gtctccagga gttccaggaa taggcccccc tggagctagg ggtccccctg gaggacaggg
      661 accaccgggg ttgtcaggcc ctcctggaat aaaaggagag aagggtttcc ccggattccc
      721 tggactggac atgccgggcc ctaaaggaga taaaggggct caaggactcc ctggcataac
      781 gggacagtcg gggctccctg gccttcctgg acagcagggg gctcctggga ttcctgggtt
      841 tccaggttcc aagggagaaa tgggcgtcat ggggaccccc gggcagccgg gctcaccagg
      901 accagtgggt gctcctggat taccgggtga aaaaggggac catggctttc cgggctcctc
      961 aggacccagg ggagaccctg gcttgaaagg tgataagggg gatgtcggtc tccctggcaa
     1021 gcctggctcc atggataagg tggacatggg cagcatgaag ggccagaaag gagaccaagg
     1081 agagaaagga caaattggac caattggtga gaagggatcc cgaggagacc ctgggacccc
     1141 aggagtgcct ggaaaggacg ggcaggcagg acagcctggg cagccaggac ctaaaggtga
     1201 tccaggtata agtggaaccc caggtgctcc aggacttccg ggaccaaaag gatctgttgg
     1261 tggaatgggc ttgccaggaa cacctggaga gaaaggtgtg cctggcatcc ctggcccaca
     1321 aggttcacct ggcttacctg gagacaaagg tgcaaaagga gagaaagggc aggcaggccc
     1381 acctggcata ggcatcccag ggctgcgagg tgaaaaggga gatcaaggga tagcgggttt
     1441 cccaggaagc cctggagaga agggagaaaa aggaagcatt gggatcccag gaatgccagg
     1501 gtccccaggc cttaaagggt ctcccgggag tgttggctat ccaggaagtc ctgggctacc
     1561 tggagaaaaa ggtgacaaag gcctcccagg attggatggc atccctggtg tcaaaggaga
     1621 agcaggtctt cctgggactc ctggccccac aggcccagct ggccagaaag gggagccagg
     1681 cagtgatgga atcccggggt cagcaggaga gaagggtgaa ccaggtctac caggaagagg
     1741 attcccaggg tttccagggg ccaaaggaga caaaggttca aagggtgagg tgggtttccc
     1801 aggattagcc gggagcccag gaattcctgg atccaaagga gagcaaggat tcatgggtcc
     1861 tccggggccc cagggacagc cggggttacc gggatcccca ggccatgcca cggaggggcc
     1921 caaaggagac cgcggacctc agggccagcc tggcctgcca ggacttccgg gacccatggg
     1981 gcctccaggg cttcctggga ttgatggagt taaaggtgac aaaggaaatc caggctggcc
     2041 aggagcaccc ggtgtcccag ggcccaaggg agaccctgga ttccagggca tgcctggtat
     2101 tggtggctct ccaggaatca caggctctaa gggtgatatg gggcctccag gagttccagg
     2161 atttcaaggt ccaaaaggtc ttcctggcct ccagggaatt aaaggtgatc aaggcgatca
     2221 cggcgtcccg ggagctaaag gtctcccggg tcctcctggc cccccaggtc cttacgacat
     2281 catcaaaggg gagcccgggc tccctggtcc tgagggcccc ccagggctga aagggcttca
     2341 gggactgcca ggcccgaaag gccagcaagg tgttacagga ttggtgggta tacctggacc
     2401 tccaggtatt cctgggtttg acggtgcccc tggccagaaa ggagagatgg gacctgccgg
     2461 gcctactggt ccaagaggat ttccaggtcc accaggcccc gatgggttgc caggatccat
     2521 ggggccccca ggcaccccat ctgttgatca cggcttcctt gtgaccaggc atagtcaaac
     2581 aatagatgac ccacagtgtc cttctgggac caaaattctt taccacgggt actctttgct
     2641 ctacgtgcaa ggcaatgaac gggcccatgg ccaggacttg ggcacggccg gcagctgcct
     2701 gcgcaagttc agcacaatgc ccttcctgtt ctgcaatatt aacaacgtgt gcaactttgc
     2761 atcacgaaat gactactcgt actggctgtc cacccctgag cccatgccca tgtcaatggc
     2821 acccatcacg ggggaaaaca taagaccatt tattagtagg tgtgctgtgt gtgaggcgcc
     2881 tgccatggtg atggccgtgc acagccagac cattcagatc ccaccgtgcc ccagcgggtg
     2941 gtcctcgctg tggatcggct actcttttgt gatgcacacc agcgctggtg cagaaggctc
     3001 tggccaagcc ctggcgtccc ccggctcctg cctggaggag tttagaagtg cgccattcat
     3061 cgagtgtcac ggccgtggga cctgcaatta ctacgcaaac gcttacagct tttggctcgc
     3121 caccatagag aggagcgaga tgttcaagaa gcctacgccg tccaccttga aggcagggga
     3181 gctgcgcacg cacgtcagcc gctgccaagt ctgtatgaga agaacataat gaagcctgac
     3241 tcagctaatg tcacaacatg gtgctacttc ttcttctttt tgttaacagc aacgaaccct
     3301 agaaatatat cctgtgtacc tcactgtcca atatgaaaac cgtaaagtgc cttataggaa
     3361 tttgcgtaac taacacaccc tgcttcattg acctctactt gctgaaggag aaaaagacag
     3421 cgataagctt tcaatagtgg cataccaaat ggcacttttg atgaaataaa atatcaatat
     3481 tttctgcaaa aaaaaaaaaa aaaa
//