LOCUS       HUMNC1A                 2661 bp    mRNA    linear   HUM 26-JUL-2016
DEFINITION  Homo sapiens type VII collagen mRNA, partial cds.
ACCESSION   M96984
VERSION     M96984.1
KEYWORDS    extracellular matrix adhesion protein; type VII collagen.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2661)
  AUTHORS   Gammon,W.R., Abernethy,M.L., Padilla,K.M., Prisayanh,P.S.,
            Cook,M.E., Wright,J., Briggaman,R.A. and Hunt,S.W. III.
  TITLE     Structural similarity to multidomain extracellular matrix adhesion
            proteins supports a role for Type VII collagen in the organization
            and adherence of epithelial basement membrane
  JOURNAL   Unpublished
FEATURES             Location/Qualifiers
     source          1..2661
                     /db_xref="H-InvDB:HIT000196946"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /germline
                     /tissue_lib="WISH"
     CDS             <1..>2661
                     /note="NC1 region"
                     /codon_start=1
                     /product="type VII collagen"
                     /protein_id="AAA36357.2"
                     /translation="EFRELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVGPATSLMAR
                     TDASVEQTLRPVILGPTSILLSWNLVPEARGYRLEWRRETGLEPPQKVVLPSDVTRYQ
                     LDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATDVPGQRVRVSW
                     SPVPGATQYRIIWRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPREGSA
                     SVLTVRREPETPLAVPGLRVVVSDATRVRVAWGPVPGASGFRISWSTGSGPESSQTLP
                     PDSTATDITGLQPGTTYQVAVSVLRGREEGPAAVIVARTDPLGPVRTVHVTQASSSSV
                     TITWTRVPGATGYRVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGV
                     DGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGVTGATAYRLAWGRSEGGPMR
                     HQILPGNTDSAEIRGLEGGVSYSVRVTALVGDREGTPVSIVVTTPPEAPPALGTLHVV
                     QRGEHSLRLRWEPVPRAQGFLLHWQPEGGQEQSRVLGPELSSYHLDGLEPATQYRVRL
                     SVLGPAGEGPSAEVTARTESPRVPSIELRVVDTSIDSVTLAWTPVSRASSYILSWRPL
                     RGPGQEVPGSPQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGVRGPEASVTQTPVCPR
                     GLADVVFLPHATQDNAHRAEATRRVLERLVLALGPLGPQAVQVGLLSYSHRPSPLFPL
                     NGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDAPGRRQHVPGVMVLLVD
                     EPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLAPGMDSVQTFFAVDDGPSLDQ
                     AVSGLATALCQASFTTQPRPEPCPVYCPKGQ"
     repeat_region   8..135
                     /rpt_family="Type III homology region"
     repeat_region   136..405
                     /rpt_family="Type III homology region"
     repeat_region   406..681
                     /rpt_family="Type III homology region"
     repeat_region   682..951
                     /rpt_family="Type III homology region"
     repeat_region   952..1209
                     /rpt_family="Type III homology region"
     repeat_region   1210..1491
                     /rpt_family="Type III homology region"
     misc_feature    1252..1260
                     /note="putative N-linked glycosylation site"
     repeat_region   1492..1758
                     /rpt_family="Type III homology region"
     repeat_region   1759..2034
                     /rpt_family="Type III homology region"
     misc_feature    2035..2661
                     /note="similar to von Willebrand factor A domain"
     misc_feature    2221..2229
                     /note="putative N-linked glycosylation site"
     misc_feature    2404..2412
                     /standard_name="RGD site"
BASE COUNT          464 a          801 c          865 g          531 t
ORIGIN      
        1 gaattccggg agctgggccc tgggcagggt tcagtgttgc tgcgtgactt ggagcctggc
       61 acggactatg aggtgaccgt gagcacccta tttggccgca gtgtggggcc cgccacttcc
      121 ctgatggctc gcactgacgc ttctgttgag cagaccctgc gcccggtcat cctgggcccc
      181 acatccatcc tcctttcctg gaacttggtg cctgaggccc gtggctaccg gttggaatgg
      241 cggcgtgaga ctggcttgga gccaccgcag aaggtggtac tgccctctga tgtgacccgc
      301 taccagttgg atgggctgca gccgggcact gagtaccgcc tcacactcta cactctgctg
      361 gagggccacg aggtggccac ccctgcaacc gtggttccca ctggaccaga gctgcctgtg
      421 agccctgtaa cagacctgca agccaccgac gtgcccgggc agcgggtgcg agtgtcctgg
      481 agcccagtcc ctggtgccac ccagtaccgc atcatttggc gcagcaccca gggggttgag
      541 cggaccctgg tgcttcctgg gagtcagaca gcattcgact tggatgacgt tcaggctggg
      601 cttagctaca ctgtgcgggt gtctgctcga gtgggtcccc gtgagggcag tgccagtgtc
      661 ctcactgtcc gccgggagcc ggaaactcca cttgctgttc cagggctgcg ggttgtggtg
      721 tcagatgcaa cgcgagtgag ggtggcctgg ggacccgtcc ctggagccag tggatttcgg
      781 attagctgga gcacaggcag tggtccggag tccagccaga cactgccccc agactctact
      841 gccacagaca tcacagggct gcagcctgga accacctacc aggtggctgt gtcggtactg
      901 cgaggcagag aggagggccc tgctgcagtc atcgtggctc gaacggaccc actgggccca
      961 gtgaggacgg tccatgtgac tcaggccagc agctcatctg tcaccattac ctggaccagg
     1021 gttcctggcg ccacaggata cagggtttcc tggcactcag cccacggccc agagaaatcc
     1081 cagttggttt ctggggaggc cacggtggct gagctggatg gactggagcc agatactgag
     1141 tatacggtgc atgtgagggc ccatgtggct ggcgtggatg ggccccctgc ctctgtggtt
     1201 gtgaggactg cccctgagcc tgtgggtcgt gtgtcgaggc tgcagatcct caatgcttcc
     1261 agcgacgttc tacggatcac ctgggtaggg gtcactggag ccacagctta cagactggcc
     1321 tggggccgga gtgaaggcgg ccccatgagg caccagatac tcccaggaaa cacagactct
     1381 gcagagatcc ggggtctcga aggtggagtc agctactcag tgcgagtgac tgcacttgtc
     1441 ggggaccgcg agggcacacc tgtctccatt gttgtcacta cgccgcctga ggctccgcca
     1501 gccctgggga cgcttcacgt ggtgcagcgc ggggagcact cgctgaggct gcgctgggag
     1561 ccggtgccca gagcgcaggg cttccttctg cactggcaac ctgagggtgg ccaggaacag
     1621 tcccgggtcc tggggcccga gctcagcagc tatcacctgg acgggctgga gccagcgaca
     1681 cagtaccgcg tgaggctgag tgtcctaggg ccggctggag aagggccctc tgcagaggtg
     1741 actgcgcgca ctgagtcacc tcgtgttcca agcattgaac tacgtgtggt ggacacctcg
     1801 atcgactcgg tgactttggc ctggactcca gtgtccaggg catccagtta catcctatcc
     1861 tggcggccac tcagaggccc tggccaggaa gtgcctgggt ccccgcagac acttccaggg
     1921 atctcaagct cccagcgggt gacagggcta gagcctggcg tctcttacat cttctccctg
     1981 acgcctgtcc tggatggtgt gcggggtcct gaggcatctg tcacacagac gccagtgtgc
     2041 ccccgtggcc tggcggatgt ggtgttccta ccacatgcca ctcaagacaa tgctcaccgt
     2101 gcggaggcta cgaggagggt cctggagcgt ctggtgttgg cacttgggcc tcttgggcca
     2161 caggcagttc aggttggcct gctgtcttac agtcatcggc cctccccact gttcccactg
     2221 aatggctccc atgaccttgg cattatcttg caaaggatcc gtgacatgcc ctacatggac
     2281 ccaagtggga acaacctggg cacagccgtg gtcacagctc acagatacat gttggcacca
     2341 gatgctcctg ggcgccgcca gcacgtacca ggggtgatgg ttctgctagt ggatgaaccc
     2401 ttgagaggtg acatattcag ccccatccgt gaggcccagg cttctgggct taatgtggtg
     2461 atgttgggaa tggctggagc ggacccagag cagctgcgtc gcttggcgcc gggtatggac
     2521 tctgtccaga ccttcttcgc cgtggatgat gggccaagcc tggaccaggc agtcagtggt
     2581 ctggccacag ccctgtgtca ggcatccttc actactcagc cccggccaga gccctgccca
     2641 gtgtattgtc caaagggcca g
//