LOCUS BC047305 3504 bp mRNA linear HUM 04-NOV-2003
DEFINITION Homo sapiens collagen, type IV, alpha 1, mRNA (cDNA clone
IMAGE:4941939), partial cds.
ACCESSION BC047305
VERSION BC047305.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3504)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3504)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (28-FEB-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: David N. Louis, M.D.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 93 Row: e Column: 14.
FEATURES Location/Qualifiers
source 1..3504
/db_xref="H-InvDB:HIT000098508"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4941939"
/tissue_type="Brain, anaplastic oligodendroglioma with
1p/19q loss"
/clone_lib="NCI_CGAP_Brn67"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..3504
/gene="COL4A1"
/db_xref="GeneID:1282"
/db_xref="MIM:120130"
CDS <1..3229
/gene="COL4A1"
/codon_start=2
/product="COL4A1 protein"
/protein_id="AAH47305.1"
/db_xref="GeneID:1282"
/db_xref="MIM:120130"
/translation="GSRGDTGPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEP
GKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPG
PKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTP
GEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGVPGIGPPGARGPPGG
QGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQSGLPGLPGQQGAPG
IPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSSGPRGDPGLKGDKGD
VGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPGTPGVPGKDGQAGQP
GQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGIPGPQGSPGLPGDKG
AKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIGIPGMPGSPGLKGSP
GSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPG
SAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPGSKGEQGFMGPPGPQ
GQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGIDGVKGDKGNPGWPGA
PGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKGLPGLQGIKGDQGDH
GVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPKGQQGVTGLVGIP
GPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTPSVDHGFLVTR
HSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMPFLFCNIN
NVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAVHSQTIQ
IPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNY
YANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT"
misc_feature 2552..2881
/gene="COL4A1"
/note="C4; Region: C-terminal tandem repeated domain in
type 4 procollagen. Duplicated domain in C-terminus of
type 4 collagens. Mutations in alpha-5 collagen IV are
associated with X-linked Alport syndrome"
/db_xref="CDD:pfam01413"
misc_feature 2882..3223
/gene="COL4A1"
/note="C4; Region: C-terminal tandem repeated domain in
type 4 procollagens"
/db_xref="CDD:smart00111"
BASE COUNT 825 a 942 c 1105 g 632 t
ORIGIN
1 aggcagtcgt ggtgacaccg gcccccctgg gcctccagga tatggtcctg ctggtcccat
61 tggtgacaaa ggacaagcag gctttcctgg aggccctgga tccccaggcc tgccaggtcc
121 aaagggtgaa ccaggaaaaa ttgttccttt accaggcccc cctggagcag aaggactgcc
181 ggggtcccca ggcttcccag gtccccaagg agaccgaggc tttcccggaa ccccaggaag
241 gccaggcctg ccaggagaga agggcgctgt gggccagcca ggcattggat ttccagggcc
301 ccccggcccc aaaggtgttg acggcttacc tggagacatg gggccaccgg ggactccagg
361 tcgcccggga tttaatggct tacctgggaa cccaggtgtg cagggccaga agggagagcc
421 tggagttggt ctaccgggac tcaaaggttt gccaggtctt cccggcattc ctggcacacc
481 cggggagaag gggagcattg gggtaccagg cgttcctgga gaacatggag cgatcggacc
541 ccctgggctt caggggatca gaggtgaacc gggacctcct ggattgccag gctccgtggg
601 gtctccagga gttccaggaa taggcccccc tggagctagg ggtccccctg gaggacaggg
661 accaccgggg ttgtcaggcc ctcctggaat aaaaggagag aagggtttcc ccggattccc
721 tggactggac atgccgggcc ctaaaggaga taaaggggct caaggactcc ctggcataac
781 gggacagtcg gggctccctg gccttcctgg acagcagggg gctcctggga ttcctgggtt
841 tccaggttcc aagggagaaa tgggcgtcat ggggaccccc gggcagccgg gctcaccagg
901 accagtgggt gctcctggat taccgggtga aaaaggggac catggctttc cgggctcctc
961 aggacccagg ggagaccctg gcttgaaagg tgataagggg gatgtcggtc tccctggcaa
1021 gcctggctcc atggataagg tggacatggg cagcatgaag ggccagaaag gagaccaagg
1081 agagaaagga caaattggac caattggtga gaagggatcc cgaggagacc ctgggacccc
1141 aggagtgcct ggaaaggacg ggcaggcagg acagcctggg cagccaggac ctaaaggtga
1201 tccaggtata agtggaaccc caggtgctcc aggacttccg ggaccaaaag gatctgttgg
1261 tggaatgggc ttgccaggaa cacctggaga gaaaggtgtg cctggcatcc ctggcccaca
1321 aggttcacct ggcttacctg gagacaaagg tgcaaaagga gagaaagggc aggcaggccc
1381 acctggcata ggcatcccag ggctgcgagg tgaaaaggga gatcaaggga tagcgggttt
1441 cccaggaagc cctggagaga agggagaaaa aggaagcatt gggatcccag gaatgccagg
1501 gtccccaggc cttaaagggt ctcccgggag tgttggctat ccaggaagtc ctgggctacc
1561 tggagaaaaa ggtgacaaag gcctcccagg attggatggc atccctggtg tcaaaggaga
1621 agcaggtctt cctgggactc ctggccccac aggcccagct ggccagaaag gggagccagg
1681 cagtgatgga atcccggggt cagcaggaga gaagggtgaa ccaggtctac caggaagagg
1741 attcccaggg tttccagggg ccaaaggaga caaaggttca aagggtgagg tgggtttccc
1801 aggattagcc gggagcccag gaattcctgg atccaaagga gagcaaggat tcatgggtcc
1861 tccggggccc cagggacagc cggggttacc gggatcccca ggccatgcca cggaggggcc
1921 caaaggagac cgcggacctc agggccagcc tggcctgcca ggacttccgg gacccatggg
1981 gcctccaggg cttcctggga ttgatggagt taaaggtgac aaaggaaatc caggctggcc
2041 aggagcaccc ggtgtcccag ggcccaaggg agaccctgga ttccagggca tgcctggtat
2101 tggtggctct ccaggaatca caggctctaa gggtgatatg gggcctccag gagttccagg
2161 atttcaaggt ccaaaaggtc ttcctggcct ccagggaatt aaaggtgatc aaggcgatca
2221 cggcgtcccg ggagctaaag gtctcccggg tcctcctggc cccccaggtc cttacgacat
2281 catcaaaggg gagcccgggc tccctggtcc tgagggcccc ccagggctga aagggcttca
2341 gggactgcca ggcccgaaag gccagcaagg tgttacagga ttggtgggta tacctggacc
2401 tccaggtatt cctgggtttg acggtgcccc tggccagaaa ggagagatgg gacctgccgg
2461 gcctactggt ccaagaggat ttccaggtcc accaggcccc gatgggttgc caggatccat
2521 ggggccccca ggcaccccat ctgttgatca cggcttcctt gtgaccaggc atagtcaaac
2581 aatagatgac ccacagtgtc cttctgggac caaaattctt taccacgggt actctttgct
2641 ctacgtgcaa ggcaatgaac gggcccatgg ccaggacttg ggcacggccg gcagctgcct
2701 gcgcaagttc agcacaatgc ccttcctgtt ctgcaatatt aacaacgtgt gcaactttgc
2761 atcacgaaat gactactcgt actggctgtc cacccctgag cccatgccca tgtcaatggc
2821 acccatcacg ggggaaaaca taagaccatt tattagtagg tgtgctgtgt gtgaggcgcc
2881 tgccatggtg atggccgtgc acagccagac cattcagatc ccaccgtgcc ccagcgggtg
2941 gtcctcgctg tggatcggct actcttttgt gatgcacacc agcgctggtg cagaaggctc
3001 tggccaagcc ctggcgtccc ccggctcctg cctggaggag tttagaagtg cgccattcat
3061 cgagtgtcac ggccgtggga cctgcaatta ctacgcaaac gcttacagct tttggctcgc
3121 caccatagag aggagcgaga tgttcaagaa gcctacgccg tccaccttga aggcagggga
3181 gctgcgcacg cacgtcagcc gctgccaagt ctgtatgaga agaacataat gaagcctgac
3241 tcagctaatg tcacaacatg gtgctacttc ttcttctttt tgttaacagc aacgaaccct
3301 agaaatatat cctgtgtacc tcactgtcca atatgaaaac cgtaaagtgc cttataggaa
3361 tttgcgtaac taacacaccc tgcttcattg acctctactt gctgaaggag aaaaagacag
3421 cgataagctt tcaatagtgg cataccaaat ggcacttttg atgaaataaa atatcaatat
3481 tttctgcaaa aaaaaaaaaa aaaa
//