LOCUS HUMCOLVIIA 1986 bp mRNA linear HUM 27-FEB-1996 DEFINITION Human type VII collagen (COL7A1) mRNA, 3' end. ACCESSION L06862 VERSION L06862.1 KEYWORDS structural protein; type VII collagen. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1986) AUTHORS Greenspan,D.S. TITLE The carboxyl-terminal half of type VII collagen, including the non-collagenous NC-2 domain and intron/exon organization of the corresponding region of the COL7A1 gene JOURNAL Hum. Mol. Genet. 2 (3), 273-278 (1993) PUBMED 8499916 COMMENT Original source text: Homo sapiens cDNA to mRNA. FEATURES Location/Qualifiers source 1..1986 /db_xref="H-InvDB:HIT000191727" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /tissue_type="placenta" /tissue_lib="lambda-gt10" gene 1..1986 /gene="COL7A1" CDS <1..1653 /gene="COL7A1" /note="base pairs 1-1167: triple-helix encoding region; base pairs 1168-1653: non-collagenous NC-2 domain encoding region" /codon_start=1 /product="type VII collagen" /protein_id="AAA89196.1" /translation="GLPGAPGVVGFPGQTGPRGEMGQPGPSGERGLAGPPGREGIPGP LGPPGPPGSVGPPGASGLKGDKGDPGVGLPGPRGERGEPGIRGEDGRPGQEGPRGLTG PPGSRGERGEKGDVGSAGLKGDKGDSAVILGPPGPRGAKGDMGERGPRGLDGDKGPRG DNGDPGDKGSKGEPGDKGSAGLPGLRGLLGPQGQPGAAGIPGDPGSPGKDGVPGIRGE KGDVGFMGPRGLKGERGVKGACGLDGEKGDKGEAGPPGRPGLAGHKGEMGEPGVPGQS GAPGKEGLIGPKGDRGFDGQPGPKGDQGEKGERGTPGIGGFPGPSGNDGSAGPPGPPG SVGPRGPEGLQGQKGERGPPGERVVGAPGVPGAPGERGEQGRPGPAGPRGEKGEAALT EDDIRGFVRQEMSQHCACQGQFIASGSRPLPSYAADTAGSQLHAVPVLRVSHAEEEER VPPEDDEYSEYSEYSVEEYQDPEAPWDSDDPCSLPLDEGSCTAYTLRWYHRAVTGSTE ACHPFVYGGCGGNANRFGTREACERRCPPRVVQSQGTGTAQD" 3'UTR 1654..1986 /gene="COL7A1" regulatory 1963..1968 /regulatory_class="polyA_signal_sequence" /gene="COL7A1" polyA_site 1986 /gene="COL7A1" BASE COUNT 377 a 539 c 742 g 328 t ORIGIN 1 ggcctgcccg gtgctcctgg tgttgttggg ttcccgggtc agacaggccc tcgaggagag 61 atgggtcagc caggccctag tggagagcgg ggtctggcag gccccccagg gagagaagga 121 atcccaggac ccctggggcc acctggacca ccggggtcag tgggaccacc tggggcctct 181 ggactcaaag gagacaaggg agaccctgga gtagggctgc ctgggccccg aggcgagcgt 241 ggggagccag gcatccgggg tgaagatggc cgccccggcc aggagggacc ccgaggactc 301 acggggcccc ctggcagcag gggagagcgt ggggagaagg gtgatgttgg gagtgcagga 361 ctaaagggtg acaagggaga ctcagctgtg atcctggggc ctccaggccc acggggtgcc 421 aagggggaca tgggtgaacg agggcctcgg ggcttggatg gtgacaaagg acctcgggga 481 gacaatgggg accctggtga caagggcagc aagggagagc ctggtgacaa gggctcagcc 541 gggttgccag gactgcgtgg actcctggga ccccagggtc aacctggtgc agcagggatc 601 cctggtgacc cgggatcccc aggaaaggat ggagtgcctg gtatccgagg agaaaaagga 661 gatgttggct tcatgggtcc ccggggcctc aagggtgaac ggggagtgaa gggagcctgt 721 ggccttgatg gagagaaggg agacaaggga gaagctggtc ccccaggccg ccccgggctg 781 gcaggacaca aaggagagat gggggagcct ggtgtgccgg gccagtcggg ggcccctggc 841 aaggagggcc tgatcggtcc caagggtgac cgaggctttg acgggcagcc aggccccaag 901 ggtgaccagg gcgagaaagg ggagcgggga accccaggaa ttgggggctt cccaggcccc 961 agtggaaatg atggctctgc tggtccccca gggccacctg gcagtgttgg tcccagaggc 1021 cccgaaggac ttcagggcca gaagggtgag cgaggtcccc ccggagagag agtggtgggg 1081 gctcctgggg tccctggagc tcctggcgag agaggggagc aggggcggcc agggcctgcc 1141 ggtcctcgag gcgagaaggg agaagctgca ctgacggagg atgacatccg gggctttgtg 1201 cgccaagaga tgagtcagca ctgtgcctgc cagggccagt tcatcgcatc tggatcacga 1261 cccctcccta gttatgctgc agacactgcc ggctcccagc tccatgctgt gcctgtgctc 1321 cgcgtctctc atgcagagga ggaagagcgg gtaccccctg aggatgatga gtactctgaa 1381 tactccgagt attctgtgga ggagtaccag gaccctgaag ctccttggga tagtgatgac 1441 ccctgttccc tgccactgga tgagggctcc tgcactgcct acaccctgcg ctggtaccat 1501 cgggctgtga caggcagcac agaggcctgt cacccttttg tctatggtgg ctgtggaggg 1561 aatgccaacc gttttgggac ccgtgaggcc tgcgagcgcc gctgcccacc ccgggtggtc 1621 cagagccagg ggacaggtac tgcccaggac tgaggcccag ataatgagct gagattcagc 1681 atcccctgga ggagtcgggg tctcagcaga accccactgt ccctcccctt ggtgctagag 1741 gcttgtgtgc acgtgagcgt gcgagtgcac gtccgttatt tcagtgactt ggtcccgtgg 1801 gtctagcctt cccccctgtg gacaaacccc cattgtggct cctgccaccc tggcagatga 1861 ctcactgtgg gggggtggct gtgggcagtg agcggatgtg actggcgtct gacccgcccc 1921 ttgacccaag cctgtgatga catggtgctg attctggggg gcattaaagc tgctgtttta 1981 aaaggc //