LOCUS       AH005286                4923 bp    DNA     linear   HUM 01-AUG-2016
DEFINITION  Homo sapiens surfactant protein D (SP-D) gene, complete cds.
ACCESSION   AH005286 L05483 L05484 L05485
VERSION     AH005286.2
KEYWORDS    collagen; lectin; surfactant protein D.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4923)
  AUTHORS   Rust,K., Grosso,L., Zhang,V., Chang,D., Persson,A., Longmore,W.,
            Cai,G.Z. and Crouch,E.
  TITLE     Human surfactant protein D: SP-D contains a C-type lectin
            carbohydrate recognition domain
  JOURNAL   Arch. Biochem. Biophys. 290 (1), 116-126 (1991)
   PUBMED   1898081
REFERENCE   2  (bases 1 to 694)
  AUTHORS   Lu,J., Willis,A.C. and Reid,K.B.
  TITLE     Purification, characterization and cDNA cloning of human lung
            surfactant protein D
  JOURNAL   Biochem. J. 284 (PT 3), 795-802 (1992)
   PUBMED   1339284
REFERENCE   3  (bases 1 to 4923)
  AUTHORS   Crouch,E., Rust,K., Veile,R., Donis-Keller,H. and Grosso,L.
  TITLE     Genomic organization of human surfactant protein D (SP-D). SP-D is
            encoded on chromosome 10q22.2-23.1
  JOURNAL   J. Biol. Chem. 268 (4), 2976-2983 (1993)
   PUBMED   8428971
COMMENT     On or before Aug 1, 2016 this sequence version replaced L05483.1,
            L05484.1, L05485.1, AH005286.1.
FEATURES             Location/Qualifiers
     source          1..4923
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
                     /map="10q22.2-23.1"
                     /sex="male"
                     /tissue_type="placenta"
                     /tissue_lib="Stratagene; lambda FIX II"
     gene            1..4910
                     /gene="SP-D"
     intron          <1..246
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /citation=[3]
     exon            247..448
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /citation=[3]
                     /number=1
     CDS             join(250..448,1630..1746,2007..2123,2439..2555,2998..3114,
                     3725..3808,4446..4822)
                     /gene="SP-D"
                     /codon_start=1
                     /product="surfactant protein D"
                     /protein_id="AAB59450.1"
                     /translation="MLLFLLSALVLLTQPLGYLEAEMKTYSHRTMPSACTLVMCSSVE
                     SGLPGRDGRDGREGPRGEKGDPGLPGAAGQAGMPGQAGPVGPKGDNGSVGEPGPKGDT
                     GPSGPPGPPGVPGPAGREGALGKQGNIGPQGKPGPKGEAGPKGEVGAPGMQGSAGARG
                     LAGPKGERGVPGERGVPGNTGAAGSAGAMGPQGSPGARGPPGLKGDKGIPGDKGAKGE
                     SGLPDVASLRQQVEALQGQVQHLQAAFSQYKKVELFPNGQSVGEKIFKTAGFVKPFTE
                     AQLLCTQAGGQLASPRSAAENAALQQLVVAKNEAAFLSMTDSKTEGKFTYPTGESLVY
                     SNWAPGEPNDDGGSEDCVEIFTNGKWNDRACGEKRLVVCEF"
     misc_difference 282
                     /gene="SP-D"
                     /note="base 282 is an 'a' in cDNA sequence reported by Lu
                     et al.,1992"
                     /replace="a"
     misc_difference 341
                     /gene="SP-D"
                     /note="base 341 is a 'c' in cDNA sequence reported by Lu
                     et al.,1992"
                     /replace="c"
     intron          order(449..694,795..1629)
                     /gene="SP-D"
                     /number=1
     gap             695..794
                     /estimated_length=unknown
     exon            1630..1746
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=2
                     /function="collagen encoding exon"
     intron          1747..2006
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=2
     exon            2007..2123
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=3
                     /function="collagen encoding exon"
     intron          2124..2438
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=3
     exon            2439..2555
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=4
                     /function="collagen encoding exon"
     intron          2556..2997
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=4
     exon            2998..3114
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=5
                     /function="collagen encoding exon"
     intron          3115..3724
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=5
     exon            3725..3808
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /number=6
                     /function="encodes linking segment between collagen and
                     lectin domains"
     intron          order(3809..4056,4157..4445)
                     /gene="SP-D"
                     /number=6
     gap             4057..4156
                     /estimated_length=unknown
     exon            4446..4904
                     /gene="SP-D"
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /citation=[1]
                     /citation=[3]
                     /number=7
                     /function="lectin or carbohydrate recognition domain
                     (CRD)"
     regulatory      4905..4910
                     /regulatory_class="polyA_signal_sequence"
                     /gene="SP-D"
BASE COUNT     1090 a   1174 c   1376 g   1083 t
ORIGIN
        1 cacctgcaat gggatgctgc tgtaacctat gtctagagtt ccagctggta actcatgctc
       61 tccatccaag acctttggcc ctccttgcct aacgctttga ttcctggcgg ctcttaaggc
      121 ccattagcag aggacagaag ggagggaaag gggtcttctt gtcacctcta gaagcttgag
      181 ccaagtccct aaaccatgtc catgaagtca taagcatgtc ttttcctttt tctgttcacc
      241 tctcaggcca tgctgctctt cctcctctct gcactggtcc tgctcacaca gcccctgggc
      301 tacctggaag cagaaatgaa gacctactcc cacagaacaa tgcccagtgc ttgcaccctg
      361 gtcatgtgta gctcagtgga gagtggcctg cctggtcgcg atggacggga tgggagagag
      421 ggccctcggg gcgagaaggg ggacccaggt agggtggggc cctgggctta tcctgctggg
      481 gaggaatggt cattggaact gtaactagcc cagcaactct gggtactttg ttatggaagt
      541 ctggagacgt gtttcttcct cccaagaagg cgagttccac actgctccac cagctggcaa
      601 acgtggctgt gtgctgagcc ccatgtgcct cctccagcac caactgccta gctcctccac
      661 taggatccga tcttctatca gagcagcaac gtgtnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nnnngaattc agatgtttcc aggtggtggt gctgctggga tacgccagca
      841 cagattcttc cacccatcca gcaaatttac ccacaaccac atgctcaccc cccatggagg
      901 cctggctctc tgccacggga gcatggcttc caagctgaaa tgtcacctcc tgtcgtctgc
      961 aaatctccct cgattctcct tgactcttaa aatgcccact ttctctttga agaagaaagc
     1021 tcgtgagtgt gtgagtgctg ggtggggacg atcagaatga gatgagggac acttatgcat
     1081 gtcttccgtt gttgcccttg aatcacagtg tattatgacc cctgctgagc aggacctgag
     1141 gctaagctca gatttgcatc ttgcctagtg ctaggtataa agtagatgct gcatagatgg
     1201 ctcttagtga acaaataaat gagcaaatcg accaataact gaatgaacaa gttagcacca
     1261 cctgtctggt gccatttctg gttggcttcc ctaatatata tatatggaat gcaagaagtc
     1321 catgatcaca ggtaagagct gacaaattgc ctgccatcaa tacagtggcc ttggggtagg
     1381 gcaaggagca ccgcatagaa ggtcaggcag gtcttggctt taatcgctct gaaggcttct
     1441 tcctcatcta taaaaatggg gatgtttgat acccatttcc tggatcacca ctgggcctgg
     1501 gtgtgtgacg ggtggaagtc tcatgctctg cagaatccta aaggggaatg ctaggggcct
     1561 gatccagcag gaggctggcc ctttctgggt ctagggccag gtctttgtcc atttcttctg
     1621 ctgctacagg tttgccagga gctgcagggc aagcagggat gcctggacaa gctggcccag
     1681 ttgggccaaa aggggacaat ggctctgttg gagaacctgg accaaaggga gacactgggc
     1741 caagtggtgg aggccaactg ggtgtggact tcaggtgttt ccaggtggtg cagctgcaag
     1801 gatatgccag tgggcacgca cagagcccca tcccaggaag aaggttgtct gaaggtggca
     1861 ctgggtcatt tcaggacaga tgggacctgc tgttgggagg acaacaatgc tgcctccctt
     1921 gcctctgtta ttaccctcca cctttactac acactcctcc tgccaagagg attcattgtt
     1981 actgaccggc tggtgctttt gctcaggacc tccaggacct cccggtgtgc ctggtccagc
     2041 tggaagagaa ggtgccctgg ggaagcaggg gaacatagga cctcagggca agccaggccc
     2101 aaaaggagaa gctgggccta aaggtaggtg gctggtctgt ggccaggtct gagaagggaa
     2161 aggaggaagg acaagggtcc atgatatgac tgaggcgtgc tctagagtgc caggagagcc
     2221 agcccttgat gctgcaaagg cttgccatgc accctcccaa tccccacaac tgcaatattt
     2281 tgggtgtggg tggaggagag gagcggcaga aagcaaggag agtctcccca cttctctctc
     2341 tgaccctaaa gttgcgggca aaaagctcca ttgctaacaa aaccacacct gctgaacgcg
     2401 ctcttagctg tgtttgttcc tgtgtgttcc ttcttcagga gaagtaggtg ccccaggcat
     2461 gcagggctcg gcaggggcaa gaggcctcgc aggccctaag ggagagcgag gtgtccctgg
     2521 tgagcgtgga gtccctggaa acacaggggc agcaggtgag cagtggaaag agctgggctg
     2581 ggctgggtcc agttctcctg tgcctgcttg gagctgtcaa tgcctacacc ccctccaccc
     2641 accacaaagg cagaggtggt atcgctggga acatggagag aatctgtatc aagaagggtg
     2701 catgaggggc cttccttggg gactgttcct atctaggaga gtgaagcgcc aagcaatgga
     2761 gaaggtctgt atacaggaag ttagttgttg ggtggcgggc agttttcaca gcgggttgat
     2821 tgctgggtga cagggacagt cagtgtacag ggagctgcat ggctggggta ctggggaccc
     2881 atctgtctat ttctgggcaa ataaaggata gttagtccat gtatgggaag gtaaaaatgc
     2941 taaaaacaaa aaaaataaaa atgagtacat agctcaccca gtcttctttt accccagggt
     3001 ctgctggagc catgggtccc cagggaagtc caggtgccag gggacccccg ggattgaagg
     3061 gggacaaagg cattcctgga gacaaaggag caaagggaga aagtgggctt ccaggtaagg
     3121 tagcagctcc tggcccaggg aagccccgct gggatgggtg ggcagagagg aagccctcct
     3181 catcccagag gaggggctct gggatgtgtg ccccattaat ggcagctggt gggtggaact
     3241 caggaaaggc ggacaggctg gaaggcagga gcatggggct ctggtcttag ccacttagta
     3301 ctcatgggaa agcaacctgg gcagcttctc catctgacta tcgggaaaac tacaactcgc
     3361 cttcccaatt cataggacag ttttgggcat taaaaaccca gtggatagga aacacatgag
     3421 tgtgtattcc tgcccagcaa cccccccacc cacccaatgg agatgccatt atattccaag
     3481 actttaaagt cacaatgtca tctggttcag tgtcagtgaa gtacaggcag caatttgttc
     3541 atagcagaac cataagacat ccttacacag cagaggccag ctgtgtcagt agatgtgggc
     3601 ttgctgtggg tcagccccac aattgtgccc gaaatcagag acacccttcc tttggggccc
     3661 gagactctga gctcttacct tcgctctctg ggaagtcttt acttggcctt tttctcccct
     3721 ccagatgttg cttctctgag gcagcaggtt gaggccttac agggacaagt acagcacctc
     3781 caggctgctt tctctcagta taagaaaggt gagttcctgg atctgaacct ggacccagca
     3841 ctggcattgg gccccacact agatcttgac cttggtttgg cctttggctg gggctagacc
     3901 tttggctgga cctgaggcag agctgccctg ggtctaggat gggaatgagc tgggacctgt
     3961 gtagggctgg acctgggact gtgcaggggt agggctcagg ctgggatggg ctaagaggcg
     4021 acacgtctcg gtgaactcaa gtgaatcaag ggaagannnn nnnnnnnnnn nnnnnnnnnn
     4081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
     4141 nnnnnnnnnn nnnnnngaat tcctaactgc acgtttgctg cttgtgtgac tttgagcctc
     4201 tagcttctct aaactccagt actctccatc tgtaagtggg gaatcatggt gtcacctcac
     4261 agggtaggtc atggtggacg ggggctgttc tggaacaatc tctcttgcat tctctctggc
     4321 atcctgatgc ggtgcctgtt tgggaggtct gaaaggctca ggttggagca cagaggtctc
     4381 acagcccctg agctttggcc aggtgatgcc acaaccccaa cctgactttc ttcttctcct
     4441 actagttgag ctcttcccaa atggccaaag tgtgggggag aagattttca agacagcagg
     4501 ctttgtaaaa ccatttacgg aggcacagct gctgtgcaca caggctggtg gacagttggc
     4561 ctctccacgc tctgccgctg agaatgccgc cttgcaacag ctggtcgtag ctaagaacga
     4621 ggctgctttc ctgagcatga ctgattccaa gacagagggc aagttcacct accccacagg
     4681 agagtccctg gtctattcca actgggcccc aggggagccc aacgatgatg gcgggtcaga
     4741 ggactgtgtg gagatcttca ccaatggcaa gtggaatgac agggcttgtg gagaaaagcg
     4801 tcttgtggtc tgcgagttct gagccaactg gggtgggtgg ggcagtgctt ggcccaggag
     4861 tttggcaaga agtcaaggct tagaccctca tgctgccaat atctaataaa aatggagacc
     4921 atc
//