LOCUS BC013581 2506 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens collagen, type VIII, alpha 1, mRNA (cDNA clone
MGC:9568 IMAGE:3875911), complete cds.
ACCESSION BC013581
VERSION BC013581.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2506)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2506)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (04-SEP-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP/Gazdar
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 14 Row: a Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 32895369.
FEATURES Location/Qualifiers
source 1..2506
/db_xref="H-InvDB:HIT000036288"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:9568 IMAGE:3875911"
/tissue_type="Lung, large cell carcinoma"
/clone_lib="NIH_MGC_68"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2506
/gene="COL8A1"
/gene_synonym="MGC9568"
/db_xref="GeneID:1295"
/db_xref="HGNC:HGNC:2215"
/db_xref="MIM:120251"
CDS 238..2472
/gene="COL8A1"
/gene_synonym="MGC9568"
/codon_start=1
/product="collagen, type VIII, alpha 1"
/protein_id="AAH13581.1"
/db_xref="GeneID:1295"
/db_xref="HGNC:HGNC:2215"
/db_xref="MIM:120251"
/translation="MAVLPGPLQLLGVLLTISLSSIRLIQAGAYYGIKPLPPQIPPQM
PPQIPQYQPLGQQVPHMPLAKDGLAMGKEMPHLQYGKEYPHLPQYMKEIQPAPRMGKE
AVPKKGKEIPLASLRGEQGPRGEPGPRGPPGPPGLPGHGIPGIKGKPGPQGYPGVGKP
GMPGMPGKPGAMGMPGAKGEIGQKGEIGPMGIPGPQGPPGPHGLPGIGKPGGPGLPGQ
PGPKGDRGPKGLPGPQGLRGPKGDKGFGMPGAPGVKGPPGMHGPPGPVGLPGVGKPGV
TGFPGPQGPLGKPGAPGEPGPQGPIGVPGVQGPPGIPGIGKPGQDGIPGQPGFPGGKG
EQGLPGLPGPPGLPGIGKPGFPGPKGDRGMGGVPGALGPRGEKGPIGAPGIGGPPGEP
GLPGIPGPMGPPGAIGFPGPKGEGGIVGPQGPPGPKGEPGLQGFPGKPGFLGEVGPPG
MRGLPGPIGPKGEAGQKGVPGLPGVPGLLGPKGEPGIPGDQGLQGPPGIPGIGGPSGP
IGPPGIPGPKGEPGLPGPPGFPGIGKPGVAGLHGPPGKPGALGPQGQPGLPGPPGPPG
PPGPPAVMPPTPPPQGEYLPDMGLGIDGVKPPHAYGAKKGKNGGPAYEMPAFTAELTA
PFPPVGAPVKFNKLLYNGRQNYNPQTGIFTCEVPGVYYFAYHVHCKGGNVWVALFKNN
EPVMYTYDEYKKGFLDQASGSAVLLLRPGDRVFLQMPSEQAAGLYAGQYVHSSFSGYL
LYPM"
BASE COUNT 624 a 698 c 753 g 431 t
ORIGIN
1 ctccgtggga gccagcgagc ctctctccct gatcttacgt gctcaaggga gctcacacgt
61 tcaccaactc acccttgaag tcatctcaag aacaaaagac aactgaaaga agctgttgtg
121 aaggcagagc agcatctgct gaagagacag aaaccagccc cagaggtgtc acaggaaggc
181 accagcaagg acattggtct ttgatttgat tcagcagtcc tgtcaagtat aaatgtgatg
241 gctgtgctgc ctggccctct gcagctgctg ggagtgctgc ttaccatttc cctgagttcc
301 atcaggctca ttcaggctgg tgcctactat gggatcaagc cgctgccacc tcaaattcct
361 cctcagatgc caccacaaat tccacaatac cagcccctgg gtcagcaagt acctcacatg
421 cctttggcca aagatggcct tgccatgggc aaggagatgc cccacttgca gtatggcaaa
481 gagtatccac acctacccca atatatgaag gaaattcaac cggcgccaag aatgggcaag
541 gaagccgtac ccaagaaagg caaagaaata ccattagcca gtttacgagg ggaacaaggt
601 ccccgtggag agcctggccc aagaggacca cctgggcccc ctggtttgcc aggtcatggg
661 atacctggaa ttaaaggaaa accagggcca cagggatatc caggagttgg aaagccaggt
721 atgcctggaa tgccagggaa gccaggagcc atgggcatgc ctggggcaaa aggagaaatt
781 ggacagaaag gggaaattgg gcctatgggg atcccaggac cacaaggacc tccagggcct
841 catggacttc ctggcattgg gaagccaggt gggccagggt taccagggca accaggacca
901 aagggtgatc gaggacccaa aggactacca ggacctcaag gccttcgggg tcctaaagga
961 gacaagggct tcgggatgcc aggtgcgcca ggtgtaaagg ggcctccagg gatgcacggc
1021 cctcccggcc ctgttggact gccaggagtg ggcaaaccag gagtgacagg cttccctggg
1081 ccccagggcc ccctgggaaa gccaggggct ccaggagaac ctgggccaca aggccctatt
1141 ggggtaccgg gggttcaagg acctcctggg atacccggaa ttggaaagcc aggccaggat
1201 gggatcccag gccagccagg atttccaggt ggcaaagggg agcaaggact gccagggcta
1261 ccaggacccc caggccttcc agggattggg aaaccaggct tcccaggacc caaaggtgac
1321 cggggcatgg gaggtgttcc tggggctctt ggaccaagag gggagaaagg accaataggt
1381 gccccaggaa tagggggtcc tccaggagag ccaggcctgc ctggaatccc aggtcctatg
1441 ggccctccag gtgctattgg ttttcctgga cccaaaggag aaggtgggat tgtagggcca
1501 caggggccac caggtcccaa gggtgagcca gggcttcaag gcttcccagg aaagccaggt
1561 ttccttggtg aagtagggcc tcctggcatg aggggtttgc caggtcccat agggcccaag
1621 ggggaagctg ggcaaaaagg tgtaccagga ctccctggtg ttccagggct tctcggacct
1681 aagggagagc caggaatccc aggggatcag ggtttacagg gccccccagg tatcccaggg
1741 attgggggcc ctagtggccc cattggacca cctgggattc caggccccaa aggggagccg
1801 ggcctcccag ggccccctgg gttccctggt atagggaaac ccggagtggc aggacttcat
1861 ggccccccag ggaagcctgg tgcccttggt cctcaaggcc agcctggcct tccaggaccc
1921 ccaggccctc caggacctcc aggaccccca gctgtgatgc cccctacacc accaccccag
1981 ggagagtatc tgccagatat ggggctggga attgatggcg tgaaaccccc ccatgcctac
2041 ggggctaaga aaggcaagaa tggagggcca gcctatgaga tgcctgcatt taccgccgag
2101 ctaaccgcac ctttcccacc ggtgggggcc ccagtgaagt ttaacaaact gctgtataac
2161 ggcagacaga actacaaccc gcagacaggc atcttcacct gtgaggtccc tggtgtctac
2221 tactttgcat accacgttca ctgcaagggg ggcaacgtgt gggttgctct attcaagaac
2281 aacgagcccg tgatgtacac gtacgacgag tacaaaaagg gcttcctgga ccaggcatct
2341 gggagtgcag tgctgctgct caggcccgga gaccgggtgt tcctccagat gccctcagaa
2401 caggctgcag gactgtatgc cgggcagtat gtccactcct ccttttcagg atatttattg
2461 tatcccatgt aaaaacaaaa aaacaaaaaa caaaaaaaaa aaaaaa
//