LOCUS       HUMCOL4A5X              2933 bp    mRNA    linear   HUM 01-NOV-1994
DEFINITION  Human collagen type IV alpha 5 chain (COL4A5) gene, 5' end.
ACCESSION   M90464
VERSION     M90464.1
KEYWORDS    collagen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2933)
  AUTHORS   Zhou,J., Hertz,J.M., Leinonen,A. and Tryggvason,K.
  TITLE     Complete amino acid sequence of the human alpha 5 (IV) collagen
            chain and identification of a single-base mutation in exon 23
            converting glycine 521 in the collagenous domain to cysteine in
            an Alport syndrome patient
  JOURNAL   J. Biol. Chem. 267 (18), 12475-12481 (1992)
   PUBMED   1352287
COMMENT     Original source text: Homo sapiens kidney cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..2933
                     /db_xref="H-InvDB:HIT000196773"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="Xq22"
                     /tissue_type="kidney"
     gene            1..2933
                     /gene="COL4A5"
     CDS             203..2933
                     /gene="COL4A5"
                     /codon_start=1
                     /product="alpha-5 type IV collagen"
                     /protein_id="AAA52046.1"
                     /db_xref="GDB:G00-120-596"
                     /translation="MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIK
                     GEKGERGFPGLEGHPGLPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPG
                     TPGLPGMPGHDGAPGPQGIPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGS
                     IIMSSLPGPKGNPGYPGPPGIQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNM
                     GLNFQGPKGEKGEQGLQGPPGPPGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRG
                     PPGPPGGEKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKG
                     DTGPPGPPGLVIPRPGTGITIGEKGNIGLPGLPGEKGERGFPGIQGPPGLPGPPGAAV
                     MGPPGPPGFPGERGQKGDEGPPGISIPGPPGLDGQPGAPGLPGPPGPAGPHIPPSDEI
                     CEPGPPGPPGSPGDKGLQGEQGVKGDKGDTCFNCIGTGISGPPGQPGLPGLPGPPGSL
                     GFPGQKGEKGQAGATGPKGLPGIPGAPGAPGFPGSKGEPGDILTFPGMKGDKGELGSP
                     GAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKGERGPPGNPGLPGLPGNIGPMGPPG
                     FGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTITQPGKPGLPGNPGRDGDVGLPGD
                     PGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGIPGPPGAPGTPGRIGLEGPPG
                     PPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFPGPPGPPGRTGLDGLPGPK
                     GDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHGIPGEKGDPGPPGLDVP
                     GPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGMMGPPGPPGPL"
     sig_peptide     203..322
                     /gene="COL4A5"
                     /note="G00-120-596; putative"
BASE COUNT      703 a    754 c    875 g    601 t
ORIGIN
        1 aggggggaag gaagagtagc tccttcttct tcttcttttt tttttcttcc actcttaaaa
       61 agcttctttc tcttcaccca agcctcactg tccctctccg gctctagctc tctccatata
      121 aaccctcaag attatgtcaa ttggttagag ccagccggga atttcgtgcg ggtgctgaag
      181 gagctgcggg agccggagaa gaatgaaact gcgtggagtc agcctggctg ccggcttgtt
      241 cttactggcc ctgagtcttt gggggcagcc tgcagaggct gcggcttgct atgggtgttc
      301 tccaggatca aagtgtgact gcagtggcat aaaaggggaa aagggagaga gagggtttcc
      361 aggtttggaa ggacacccag gattgcctgg atttccaggt ccagaagggc ctccggggcc
      421 tcggggacaa aagggtgatg atggaattcc agggccacca ggaccaaaag gaatcagagg
      481 tcctcctgga cttcctggat ttccagggac accaggtctt cctggaatgc caggccacga
      541 tggggcccca ggacctcaag gtattcccgg atgcaatgga accaagggag aacgtggatt
      601 tccaggcagt cccggttttc ctggtttaca gggtcctcca ggaccccctg ggatcccagg
      661 tatgaagggt gaaccaggta gtataattat gtcatcactg ccaggaccaa agggtaatcc
      721 aggatatcca ggtcctcctg gaatacaagg cctacctggt cccactggta taccagggcc
      781 aattggtccc ccaggaccac caggtttgat gggccctcct ggtccaccag gacttccagg
      841 acctaagggg aatatgggct taaatttcca gggacccaaa ggtgaaaaag gtgagcaagg
      901 tcttcagggc ccacctgggc cacctgggca gatcagtgaa cagaaaagac caattgatgt
      961 agagtttcag aaaggagatc agggacttcc tggtgaccga gggcctcctg gacctccagg
     1021 gatacgtggt cctccaggtc ccccaggtgg tgagaaaggt gagaagggtg agcaaggaga
     1081 gccaggcaaa agaggtaaac caggcaaaga tggagaaaat ggccaaccag gaattcctgg
     1141 tttgcctggt gatcctggtt accctggtga acccggaagg gatggtgaaa agggccaaaa
     1201 aggtgacact ggcccacctg gacctcctgg acttgtaatt cctagacctg ggactggtat
     1261 aactatagga gaaaaaggaa acattgggtt gcctgggttg cctggagaaa aaggagagcg
     1321 aggatttcct ggaatacagg gtccacctgg ccttcctgga cctccagggg ctgcagttat
     1381 gggtcctcct ggccctcctg gatttcctgg agaaaggggt cagaaaggtg atgaaggacc
     1441 acctggaatt tccattcctg gacctcctgg acttgacgga cagcctgggg ctcctgggct
     1501 tccagggcct cctggccctg ctggccctca cattcctcct agtgatgaga tatgtgaacc
     1561 aggccctcca ggccccccag gatctccagg tgataaagga ctccaaggag aacaaggagt
     1621 gaaaggtgac aaaggtgaca cttgcttcaa ctgcattgga actggtattt cagggcctcc
     1681 aggtcaacct ggtttgccag gtctcccagg tcctccagga tctcttggtt tccctggaca
     1741 gaaaggggaa aaaggacaag ctggtgcaac tggtcccaaa ggattaccag gcattccagg
     1801 agctccaggt gctccaggct ttcctggatc taaaggtgaa cctggtgata tcctcacttt
     1861 tccaggaatg aagggtgaca aaggagagtt gggttcccct ggagctccag ggcttcctgg
     1921 tttacctggc actcctggac aggatggatt gccagggctt cctggcccga aaggagagcc
     1981 tggtggaatt acttttaagg gtgaaagagg tccccctggg aacccaggtt taccaggcct
     2041 cccagggaat atagggccta tgggtccccc tggtttcggc cctccaggcc cagtaggtga
     2101 aaaaggcata caaggtgtgg caggaaatcc aggccagcca ggaataccag gtcctaaagg
     2161 ggatccaggt cagactataa cccagccggg gaagcctggc ttgcctggta acccaggcag
     2221 agatggtgat gtaggtcttc caggtgaccc tggacttcca gggcaaccag gcttgccagg
     2281 gatacctggt agcaaaggag aaccaggtat ccctggaatt gggcttcctg gaccacctgg
     2341 tcccaaaggc tttcctggaa ttccaggacc tccaggagca cctgggacac ctggaagaat
     2401 tggtctagaa ggccctcctg ggccacccgg ctttccagga ccaaagggtg aaccaggatt
     2461 tgcattacct gggccacctg ggccaccagg acttccaggt ttcaaaggag cacttggtcc
     2521 aaaaggtgat cgtggtttcc caggacctcc gggtcctcca ggacgcactg gcttagatgg
     2581 gctccctgga ccaaaaggtg atgttggacc aaatggacaa cctggaccaa tgggacctcc
     2641 tgggctgcca ggaataggtg ttcagggacc accaggacca ccagggattc ctgggccaat
     2701 aggtcaacct ggtttacatg gaataccagg agagaagggg gatccaggac ctcctggact
     2761 tgatgttcca ggacccccag gtgaaagagg cagtccaggg atccccggag cacctggtcc
     2821 tataggacct ccaggatcac cagggcttcc aggaaaagca ggtgcctctg gatttccagg
     2881 taccaaaggt gaaatgggta tgatgggacc tccaggccca ccaggacctt tgg
//