LOCUS BC047305 3504 bp mRNA linear HUM 04-NOV-2003 DEFINITION Homo sapiens collagen, type IV, alpha 1, mRNA (cDNA clone IMAGE:4941939), partial cds. ACCESSION BC047305 VERSION BC047305.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3504) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3504) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (28-FEB-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: David N. Louis, M.D. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 93 Row: e Column: 14. FEATURES Location/Qualifiers source 1..3504 /db_xref="H-InvDB:HIT000098508" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4941939" /tissue_type="Brain, anaplastic oligodendroglioma with 1p/19q loss" /clone_lib="NCI_CGAP_Brn67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..3504 /gene="COL4A1" /db_xref="GeneID:1282" /db_xref="MIM:120130" CDS <1..3229 /gene="COL4A1" /codon_start=2 /product="COL4A1 protein" /protein_id="AAH47305.1" /db_xref="GeneID:1282" /db_xref="MIM:120130" /translation="GSRGDTGPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEP GKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPG PKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTP GEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGVPGIGPPGARGPPGG QGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQSGLPGLPGQQGAPG IPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSSGPRGDPGLKGDKGD VGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPGTPGVPGKDGQAGQP GQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGIPGPQGSPGLPGDKG AKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIGIPGMPGSPGLKGSP GSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPG SAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPGSKGEQGFMGPPGPQ GQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGIDGVKGDKGNPGWPGA PGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKGLPGLQGIKGDQGDH GVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPKGQQGVTGLVGIP GPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTPSVDHGFLVTR HSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMPFLFCNIN NVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAVHSQTIQ IPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNY YANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT" misc_feature 2552..2881 /gene="COL4A1" /note="C4; Region: C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome" /db_xref="CDD:pfam01413" misc_feature 2882..3223 /gene="COL4A1" /note="C4; Region: C-terminal tandem repeated domain in type 4 procollagens" /db_xref="CDD:smart00111" BASE COUNT 825 a 942 c 1105 g 632 t ORIGIN 1 aggcagtcgt ggtgacaccg gcccccctgg gcctccagga tatggtcctg ctggtcccat 61 tggtgacaaa ggacaagcag gctttcctgg aggccctgga tccccaggcc tgccaggtcc 121 aaagggtgaa ccaggaaaaa ttgttccttt accaggcccc cctggagcag aaggactgcc 181 ggggtcccca ggcttcccag gtccccaagg agaccgaggc tttcccggaa ccccaggaag 241 gccaggcctg ccaggagaga agggcgctgt gggccagcca ggcattggat ttccagggcc 301 ccccggcccc aaaggtgttg acggcttacc tggagacatg gggccaccgg ggactccagg 361 tcgcccggga tttaatggct tacctgggaa cccaggtgtg cagggccaga agggagagcc 421 tggagttggt ctaccgggac tcaaaggttt gccaggtctt cccggcattc ctggcacacc 481 cggggagaag gggagcattg gggtaccagg cgttcctgga gaacatggag cgatcggacc 541 ccctgggctt caggggatca gaggtgaacc gggacctcct ggattgccag gctccgtggg 601 gtctccagga gttccaggaa taggcccccc tggagctagg ggtccccctg gaggacaggg 661 accaccgggg ttgtcaggcc ctcctggaat aaaaggagag aagggtttcc ccggattccc 721 tggactggac atgccgggcc ctaaaggaga taaaggggct caaggactcc ctggcataac 781 gggacagtcg gggctccctg gccttcctgg acagcagggg gctcctggga ttcctgggtt 841 tccaggttcc aagggagaaa tgggcgtcat ggggaccccc gggcagccgg gctcaccagg 901 accagtgggt gctcctggat taccgggtga aaaaggggac catggctttc cgggctcctc 961 aggacccagg ggagaccctg gcttgaaagg tgataagggg gatgtcggtc tccctggcaa 1021 gcctggctcc atggataagg tggacatggg cagcatgaag ggccagaaag gagaccaagg 1081 agagaaagga caaattggac caattggtga gaagggatcc cgaggagacc ctgggacccc 1141 aggagtgcct ggaaaggacg ggcaggcagg acagcctggg cagccaggac ctaaaggtga 1201 tccaggtata agtggaaccc caggtgctcc aggacttccg ggaccaaaag gatctgttgg 1261 tggaatgggc ttgccaggaa cacctggaga gaaaggtgtg cctggcatcc ctggcccaca 1321 aggttcacct ggcttacctg gagacaaagg tgcaaaagga gagaaagggc aggcaggccc 1381 acctggcata ggcatcccag ggctgcgagg tgaaaaggga gatcaaggga tagcgggttt 1441 cccaggaagc cctggagaga agggagaaaa aggaagcatt gggatcccag gaatgccagg 1501 gtccccaggc cttaaagggt ctcccgggag tgttggctat ccaggaagtc ctgggctacc 1561 tggagaaaaa ggtgacaaag gcctcccagg attggatggc atccctggtg tcaaaggaga 1621 agcaggtctt cctgggactc ctggccccac aggcccagct ggccagaaag gggagccagg 1681 cagtgatgga atcccggggt cagcaggaga gaagggtgaa ccaggtctac caggaagagg 1741 attcccaggg tttccagggg ccaaaggaga caaaggttca aagggtgagg tgggtttccc 1801 aggattagcc gggagcccag gaattcctgg atccaaagga gagcaaggat tcatgggtcc 1861 tccggggccc cagggacagc cggggttacc gggatcccca ggccatgcca cggaggggcc 1921 caaaggagac cgcggacctc agggccagcc tggcctgcca ggacttccgg gacccatggg 1981 gcctccaggg cttcctggga ttgatggagt taaaggtgac aaaggaaatc caggctggcc 2041 aggagcaccc ggtgtcccag ggcccaaggg agaccctgga ttccagggca tgcctggtat 2101 tggtggctct ccaggaatca caggctctaa gggtgatatg gggcctccag gagttccagg 2161 atttcaaggt ccaaaaggtc ttcctggcct ccagggaatt aaaggtgatc aaggcgatca 2221 cggcgtcccg ggagctaaag gtctcccggg tcctcctggc cccccaggtc cttacgacat 2281 catcaaaggg gagcccgggc tccctggtcc tgagggcccc ccagggctga aagggcttca 2341 gggactgcca ggcccgaaag gccagcaagg tgttacagga ttggtgggta tacctggacc 2401 tccaggtatt cctgggtttg acggtgcccc tggccagaaa ggagagatgg gacctgccgg 2461 gcctactggt ccaagaggat ttccaggtcc accaggcccc gatgggttgc caggatccat 2521 ggggccccca ggcaccccat ctgttgatca cggcttcctt gtgaccaggc atagtcaaac 2581 aatagatgac ccacagtgtc cttctgggac caaaattctt taccacgggt actctttgct 2641 ctacgtgcaa ggcaatgaac gggcccatgg ccaggacttg ggcacggccg gcagctgcct 2701 gcgcaagttc agcacaatgc ccttcctgtt ctgcaatatt aacaacgtgt gcaactttgc 2761 atcacgaaat gactactcgt actggctgtc cacccctgag cccatgccca tgtcaatggc 2821 acccatcacg ggggaaaaca taagaccatt tattagtagg tgtgctgtgt gtgaggcgcc 2881 tgccatggtg atggccgtgc acagccagac cattcagatc ccaccgtgcc ccagcgggtg 2941 gtcctcgctg tggatcggct actcttttgt gatgcacacc agcgctggtg cagaaggctc 3001 tggccaagcc ctggcgtccc ccggctcctg cctggaggag tttagaagtg cgccattcat 3061 cgagtgtcac ggccgtggga cctgcaatta ctacgcaaac gcttacagct tttggctcgc 3121 caccatagag aggagcgaga tgttcaagaa gcctacgccg tccaccttga aggcagggga 3181 gctgcgcacg cacgtcagcc gctgccaagt ctgtatgaga agaacataat gaagcctgac 3241 tcagctaatg tcacaacatg gtgctacttc ttcttctttt tgttaacagc aacgaaccct 3301 agaaatatat cctgtgtacc tcactgtcca atatgaaaac cgtaaagtgc cttataggaa 3361 tttgcgtaac taacacaccc tgcttcattg acctctactt gctgaaggag aaaaagacag 3421 cgataagctt tcaatagtgg cataccaaat ggcacttttg atgaaataaa atatcaatat 3481 tttctgcaaa aaaaaaaaaa aaaa //