LOCUS BC151220 6517 bp mRNA linear HUM 24-JUL-2007 DEFINITION Homo sapiens collagen, type IV, alpha 1, mRNA (cDNA clone MGC:165004 IMAGE:40148649), complete cds. ACCESSION BC151220 VERSION BC151220.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6517) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 6517) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (23-JUL-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Novartis Institute for Biomedical Research cDNA Library Preparation: Novartis Institute for Biomedical Research cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 306 Row: l Column: 7. FEATURES Location/Qualifiers source 1..6517 /db_xref="H-InvDB:HIT000435929" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:165004 IMAGE:40148649" /tissue_type="Donated clones,Novartis FGA collection" /clone_lib="NIH_MGC_417" /lab_host="DH5a" /note="Vector: pCMV-SPORT6" gene 1..6517 /gene="COL4A1" /gene_synonym="arresten" /db_xref="GeneID:1282" /db_xref="HGNC:HGNC:2202" /db_xref="MIM:120130" CDS 115..5124 /gene="COL4A1" /gene_synonym="arresten" /codon_start=1 /product="COL4A1 protein" /protein_id="AAI51221.1" /db_xref="GeneID:1282" /db_xref="HGNC:HGNC:2202" /db_xref="MIM:120130" /translation="MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGV KGQKGERGLPGLQGVIGFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYP GNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPG EILGHVPGMLLKGERGFPGIPGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKG QMGLSFQGPKGDKGDQGVSGPPGVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGE KGEPGKPGPRGKPGKDGDKGEKGSPGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIV IGTGPLGEKGERGYPGTPGPRGEPGPKGFPGLPGQPGPPGLPVPGQAGAPGFPGERGE KGDRGFPGTSLPGPSGRDGLPGPPGSPGPPGQPGYTNGIVECQPGPPGDQGPPGIPGQ PGFIGEIGEKGQKGESCLICDIDGYRGPPGPQGPPGEIGFPGQPGAKGDRGLPGRDGV AGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLKGDKGDPGFPGQPGMPGRAGSPGRDGH PGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDTGPPGPPGYGPAGPIGDKGQAGFPG GPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKG AVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPG LKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGV PGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQ SGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSS GPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPG TPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGI PGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIG IPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGP AGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPG SKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGID GVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG LPGLQGIKGDQGDHGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPG PKGQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGP PGTPSVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCL RKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCE APAMVMAVHSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRS APFIECHGRGTCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRR T" BASE COUNT 1665 a 1656 c 1911 g 1285 t ORIGIN 1 ccgccgcacc cgggacggtg cgtagcgctg gaagtccggc cttccgagag ctagctgtcc 61 gccgcggccc ccgcacgccg ggcagccgtc cctcgccgcc tcgggcgcgc caccatgggg 121 ccccggctca gcgtctggct gctgctgctg cccgccgccc ttctgctcca cgaggagcac 181 agccgggccg ctgcgaaggg tggctgtgct ggctctggct gtggcaaatg tgactgccat 241 ggagtgaagg gacaaaaggg tgaaagaggc ctcccggggt tacaaggtgt cattgggttt 301 cctggaatgc aaggacctga ggggccacag ggaccaccag gacaaaaggg tgatactgga 361 gaaccaggac tacctggaac aaaagggaca agaggacctc cgggagcatc tggctaccct 421 ggaaacccag gacttcccgg aattcctggc caagacggcc cgccaggccc cccaggtatt 481 ccaggatgca atggcacaaa gggggagaga gggccgctcg ggcctcctgg cttgcctggt 541 ttcgctggaa atcccggacc accaggctta ccagggatga agggtgatcc aggtgagata 601 cttggccatg tgcccgggat gctgttgaaa ggtgaaagag gatttcccgg aatcccaggg 661 actccaggcc caccaggact gccagggctt caaggtcctg ttgggcctcc aggatttacc 721 ggaccaccag gtcccccagg ccctcccggc cctccaggtg aaaagggaca aatgggctta 781 agttttcaag gaccaaaagg tgacaagggt gaccaagggg tcagtgggcc tccaggagta 841 ccaggacaag ctcaagttca agaaaaagga gacttcgcca ccaagggaga aaagggccaa 901 aaaggtgaac ctggatttca ggggatgcca ggggtcggag agaaaggtga acccggaaaa 961 ccaggaccca gaggcaaacc cggaaaagat ggtgacaaag gggaaaaagg gagtcccggt 1021 tttcctggtg aacccgggta cccaggactc ataggccgcc agggcccgca gggagaaaag 1081 ggtgaagcag gtcctcctgg cccacctgga attgttatag gcacaggacc tttgggagaa 1141 aaaggagaga ggggctaccc tggaactccg gggccaagag gagagccagg cccaaaaggt 1201 ttcccaggac taccaggcca acccggacct ccaggcctcc ctgtacctgg gcaggctggt 1261 gcccctggct tccctggtga aagaggagaa aaaggtgacc gaggatttcc tggtacatct 1321 ctgccaggac caagtggaag agatgggctc ccgggtcctc ctggttcccc tgggccccct 1381 gggcagcctg gctacacaaa tggaattgtg gaatgtcagc ccggacctcc aggtgaccag 1441 ggtcctcctg gaattccagg gcagccagga tttataggcg aaattggaga gaaaggtcaa 1501 aaaggagaga gttgcctcat ctgtgatata gacggatatc gggggcctcc cgggccacag 1561 ggacccccgg gagaaatagg tttcccaggg cagccagggg ccaagggcga cagaggtttg 1621 cctggcagag atggtgttgc aggagtgcca ggccctcaag gtacaccagg gctgataggc 1681 cagccaggag ccaaggggga gcctggtgag ttttatttcg acttgcggct caaaggtgac 1741 aaaggagacc caggctttcc aggacagccc ggcatgccag ggagagcggg ttctcctgga 1801 agagatggcc atccgggtct tcctggcccc aagggctcgc cgggttctgt aggattgaaa 1861 ggagagcgtg gcccccctgg aggagttgga ttcccaggca gtcgtggtga caccggcccc 1921 cctgggcccc caggatatgg tcctgctggt cccattggtg acaaaggaca agcaggcttt 1981 cctggaggcc ctggatcccc aggcctgcca ggtccaaagg gtgaaccagg aaaaattgtt 2041 cctttaccag gcccccctgg agcagaagga ctgccggggt ccccaggctt cccaggtccc 2101 caaggagacc gaggctttcc cggaacccca ggaaggccag gcctgccagg agagaagggc 2161 gctgtgggcc agccaggcat tggatttcca gggccccccg gccccaaagg tgttgacggc 2221 ttacctggag acatggggcc accagggact ccaggtcgcc cgggatttaa tggcttacct 2281 gggaacccag gtgtgcaggg ccagaaggga gagcctggag ttggtctacc gggactcaaa 2341 ggtttgccag gtcttcccgg cattcctggc acacccgggg agaaggggag cattggggta 2401 ccaggcgttc ctggagaaca tggagcgatc ggaccccctg ggcttcaggg gatcagaggt 2461 gaaccgggac ctcctggatt gccaggctcc gtggggtctc caggagttcc aggaataggc 2521 ccccctggag ctaggggtcc ccctggagga cagggaccac cggggttgtc aggccctcct 2581 ggaataaaag gagagaaggg tttccccgga ttccctggac tggacatgcc gggccctaaa 2641 ggagataaag gggctcaagg actccctggc ataacgggac agtcggggct ccctggcctt 2701 cctggacagc agggggctcc tgggattcct gggtttccag gttccaaggg agaaatgggc 2761 gtcatgggga cccccgggca gccgggctca ccaggaccag tgggtgctcc tggattaccg 2821 ggtgaaaaag gggaccatgg ctttccgggc tcctcaggac ccaggggaga ccctggcttg 2881 aaaggtgata agggggatgt cggtctccct ggcaagcctg gctccatgga taaggtggac 2941 atgggcagca tgaagggcca gaaaggagac caaggagaga aaggacaaat tggaccaatt 3001 ggtgagaagg gatcccgagg agaccctggg accccaggag tgcctggaaa ggacgggcag 3061 gcaggacagc ctgggcagcc aggacctaaa ggtgatccag gtataagtgg aaccccaggt 3121 gctccaggac ttccgggacc aaaaggatct gttggtggaa tgggcttgcc aggaacacct 3181 ggagagaaag gtgtgcctgg catccctggc ccacaaggtt cacctggctt acctggagac 3241 aaaggtgcaa aaggagagaa agggcaggca ggcccacctg gcataggcat cccaggactg 3301 cgtggtgaaa agggagatca agggatagcg ggtttcccag gaagccctgg agagaaggga 3361 gaaaaaggaa gcattgggat cccaggaatg ccagggtccc caggccttaa agggtctccc 3421 gggagtgttg gctatccagg aagtcctggg ctacctggag aaaaaggtga caaaggcctc 3481 ccaggattgg atggcatccc tggtgtcaaa ggagaagcag gtcttcctgg gactcctggc 3541 cccacaggcc cagctggcca gaaaggggag ccaggcagtg atggaatccc ggggtcagca 3601 ggagagaagg gtgaaccagg tctaccagga agaggattcc cagggtttcc aggggccaaa 3661 ggagacaaag gttcaaaggg tgaggtgggt ttcccaggat tagccgggag cccaggaatt 3721 cctggatcca aaggagagca aggattcatg ggtcctccgg ggccccaggg acagccgggg 3781 ttaccgggat ccccaggcca tgccacggag gggcccaaag gagaccgcgg acctcagggc 3841 cagcctggcc tgccaggact tccgggaccc atggggcctc cagggcttcc tgggattgat 3901 ggagttaaag gtgacaaagg aaatccaggc tggccaggag cacccggtgt cccagggccc 3961 aagggagacc ctggattcca gggcatgcct ggtattggtg gctctccagg aatcacaggc 4021 tctaagggtg atatggggcc tccaggagtt ccaggatttc aaggtccaaa aggtcttcct 4081 ggcctccagg gaattaaagg tgatcaaggc gatcacggcg tcccgggagc taaaggtctc 4141 ccgggtcctc ctggcccccc aggtccttac gacatcatca aaggggagcc cgggctccct 4201 ggtcctgagg gccccccagg gctgaaaggg cttcagggac tgccaggccc gaaaggccag 4261 caaggtgtta caggattggt gggtatacct ggacctccag gtattcctgg gtttgacggt 4321 gcccctggcc agaaaggaga gatgggacct gccgggccta ctggtccaag aggatttcca 4381 ggtccaccag gccccgatgg gttgccagga tccatggggc ccccaggcac cccatctgtt 4441 gatcacggct tccttgtgac caggcatagt caaacaatag atgacccaca gtgtccttct 4501 gggaccaaaa ttctttacca cgggtactct ttgctctacg tgcaaggcaa tgaacgggcc 4561 catggccagg acttgggcac ggctggcagc tgcctgcgca agttcagcac aatgcccttc 4621 ctgttctgca atattaacaa cgtgtgcaac tttgcatcac gaaatgacta ctcgtactgg 4681 ctgtccaccc ctgagcccat gcccatgtca atggcaccca tcacggggga aaacataaga 4741 ccatttatta gtaggtgtgc tgtgtgtgag gcgcctgcca tggtgatggc cgtgcacagc 4801 cagaccattc agatcccacc gtgccccagc gggtggtcct cgctgtggat cggctactct 4861 tttgtgatgc acaccagcgc tggtgcagaa ggctctggcc aagccctggc gtcccccggc 4921 tcctgcctgg aggagtttag aagtgcgcca ttcatcgagt gtcacggccg tgggacctgc 4981 aattactacg caaacgctta cagcttttgg ctcgccacca tagagaggag cgagatgttc 5041 aagaagccta cgccgtccac cttgaaggca ggggagctgc gcacgcacgt cagccgctgc 5101 caagtctgta tgagaagaac ataatgaagc ctgactcagc taatgtcaca acatggtgct 5161 acttcttctt ctttttgtta acagcaacga accctagaaa tatatcctgt gtacctcact 5221 gtccaatatg aaaaccgtaa agtgccttat aggaatttgc gtaactaaca caccctgctt 5281 cattgacctc tacttgctga aggagaaaaa gacagcgata agctttcaat agtggcatac 5341 caaatggcac ttttgatgaa ataaaatatc aatattttct gcaatccaat gcactgatgt 5401 gtgaagtgag aactccatca gaaaaccaaa gggtgctagg aggtgtgggt gccttccata 5461 ctgtttgccc attttcattc ttgtattata attaattttc tacccccaga gataaatgtt 5521 tgtttatatc actgtctagc tgtttcaaaa tttaggtccc ttggtctgta caaataatag 5581 caatgtaaaa atggtttttt gaacctccaa atggaattac agactcagta gccatatctt 5641 ccaacccccc agtataaatt tctgtctttc tgctatgtgt ggtactttgc agctgctttt 5701 gcagaaatca caattttcct gtggaataaa gatggtccaa aaatagtcaa aaattaaata 5761 tatatatata ttagtaattt atatagatgt cagcaattag gcagatcaag gtttagttta 5821 acttccactg ttaaaataaa gcttacatag ttttcttcct ttgaaagact gtgctgtcct 5881 ttaacatagg tttttaaaga ctaggatatt gaatgtgaaa catccgtttt cattgttcac 5941 ttctaaacca aaaattatgt gttgccaaaa ccaaacccag gttcatgaat atggtgtcta 6001 ttatagtgaa acatgtactt tgagcttatt gtttttattc tgtattaaat attttcaggg 6061 ttttaaacac taatcacaaa ctgaatgact tgacttcaaa agcaacaacc ttaaaggccg 6121 tcatttcatt agtattcctc attctgcatc ctggcttgaa aaacagctct gttgaatcac 6181 agtatcagta ttttcacacg taagcacatt cgggccattt ccgtggtttc tcatgagctg 6241 tgttcacaga cctcagcagg gcatcgcatg gaccgcagga gggcagattc ggaccactag 6301 gcctgaaatg acatttcact aaaagtctcc aaaacatttc taagactact aaggcctttt 6361 atgtaatttc tttaaatgtg tatttcttaa gaattcaaat ttgtaataaa actatttgta 6421 taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6481 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa //