LOCUS       HUMAPOB                 4634 bp    mRNA    linear   HUM 31-OCT-1994
DEFINITION  Human apolipoprotein B100 mRNA, partial cds.
ACCESSION   M10374
VERSION     M10374.1
KEYWORDS    apolipoprotein; apolipoprotein B; glycoprotein; lipoprotein; low
            density lipoprotein; very low density lipoprotein.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4634)
  AUTHORS   Knott,T.J., Rall,S.C. Jr., Innerarity,T.L., Jacobson,S.F.,
            Urdea,M.S., Levy-Wilson,B., Powell,L.M., Pease,R.J., Eddy,R.,
            Nakai,H., Byers,M., Priestley,L.M., Robertson,E., Rall,L.B.,
            Betsholtz,C., Shows,T.B., Mahley,R.W. and Scott,J.
  TITLE     Human apolipoprotein B: structure of carboxyl-terminal domains,
            sites of gene expression, and chromosomal localization
  JOURNAL   Science 230 (4721), 37-43 (1985)
   PUBMED   2994225
COMMENT     Original source text: Human (adult human liver cDNA library), cDNA
            to mRNA.
            Apolipoprotein B100 is the largest of the lipoproteins. It is a
            constituent of the low density and very low density lipoproteins
            (LDL, VLDL) and of the chylomicrons. In [1], protein modification
            sites and binding sites are tentatively identified, and the apo B
            gene is assigned to the p24 region of chromosome 2. The end of
            this sequence is upstream from the mRNA terminus.
FEATURES             Location/Qualifiers
     source          1..4634
                     /db_xref="H-InvDB:HIT000194080"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="2p24-p23"
     gene            1..4634
                     /gene="APOB"
     CDS             <1..4368
                     /gene="APOB"
                     /note="apolipoprotein B100"
                     /codon_start=1
                     /protein_id="AAA51750.1"
                     /db_xref="GDB:G00-119-686"
                     /translation="NIMEAHVGINGEANLDFLNIPLTIPEMRLPYTIITTPPLKDFSL
                     WEKTGLKEFLKTTKQSFDLSVKAQYKKNKHRHSITNPLAVLCEFISQSIKSFDRHFEK
                     NRNNALDFVTKSYNETKIKFDKYKAEKSHDELPRTFQIPGYTVPVVNVEVSPFTIEMS
                     AFGYVFPKAVSMPSFSILGSDVRVPSYTLILPSLELPVLHVPRNLKLSLPHFKELCTI
                     SHIFIPAMGNITYDFSFKSSVITLNTNAELFNQSDIVAHLLSSSSSVIDALQYKLEGT
                     TRLTRKRGLKLATALSLSNKFVEGSHNSTVSLTTKNMEVSVAKTTKAEIPILRMNFKQ
                     ELNGNTKSKPTVSSSMEFKYDFNSSMLYSTAKGAVDHKLSLESLTSYFSIESSTKGDV
                     KGSVLSREYSGTIASEANTYLNSKSTRSSVKLQGTSKIDDIWNLEVKENFAGEATLQR
                     IYSLWEHSTKNHLQLEGLFFTNGEHTSKATLELSPWQMSALVQVHASQPSSFHDFPDL
                     GQEVALNANTKNQKIRWKNEVRIHSGSFQSQVELSNDQEKAHLDIAGSLEGHLRFLKN
                     IILPVYDKSLWDFLKLDVTTSIGRRQHLRVSTAFVYTKNPNGYSFSIPVKVLADKFIT
                     PGLKLNDLNSVLVMPTFHVPFTDLQVPSCKLDFREIQIYKKLRTSSFALNLPTLPEVK
                     FPEVDVLTKYSQPEDSLIPFFEITVPESQLTVSRFTLPKSVSDGIAALDLNAVANKIA
                     DFELPTIIVPEQTIEIPSIKFSVPAGIVIPSFQALTARFEVDSPVYNATWSASLKNKA
                     DYVETVLDSTCSSTVQFLEYELNVLGTHKIEDGTLASKTKGTLAHRDFSAEYEEDGKF
                     EGLQEWEGKAHLNIKSPAFTDLHLRYQKDKKGISTSAASPAVGTVGMDMDEDDDFSKW
                     NFYYSPQSSPDKKLTIFKTELRVRESDEETQIKVNWEEEAASGLLTSLKDNVPKATGV
                     LYDYVNKYHWEHTGLTLREVSSKLRRNLQNNAEWVYQGAIRQIDDIDVRFQKAASGTT
                     GTYQEWKDKAQNLYQELLTQEGQASFQGLKDNVFDGLVRVTQKFHMKVKHLIDSLIDF
                     LNFPRFQFPGKPGIYTREELCTMFIREVGTVLSQVYSKVHNGSEILFSYFQDLVITLP
                     FELRKHKLIDVISMYRELLKDLSKEAQEVFKAIQSLKTTEVLRNLQDLLQFIFQLIED
                     NIKQLKEMKFTYLINYIQDEINTIFNDYIPYVFKLLKENLCLNLHKFNEFIQNELQEA
                     SQELQQIHQYIMALREEYFDPSIVGWTVKYYELEEKIVSLIKNLLVALKDFHSEYIVS
                     ASNFTSQLSSQVEQFLHRNIQEYLSILTDPDGKGKEKIAELSATAQEIIKSQAIATKK
                     IISDYHQQFRYKLQDFSDQLSDYYEKFIAESKRLIDLSIQNYHTFLIYITELLKKLQS
                     TTVMNPYMKLAPGELTIIL"
BASE COUNT     1479 a   1029 c    885 g   1241 t
ORIGIN      154 bp upstream of EcoRI site; chromosome 2pter-p23.
1 aacattatgg aggcccatgt aggaataaat ggagaagcaa atctggattt cttaaacatt 61 cctttaacaa ttcctgaaat gcgtctacct tacacaataa tcacaactcc tccactgaaa 121 gatttctctc tatgggaaaa aacaggcttg aaggaattct tgaaaacgac aaagcaatca 181 tttgatttaa gtgtaaaagc tcagtataag aaaaacaaac acaggcattc catcacaaat 241 cctttggctg tgctttgtga gtttatcagt cagagcatca aatcctttga caggcatttt 301 gaaaaaaaca gaaacaatgc attagatttt gtcaccaaat cctataatga aacaaaaatt 361 aagtttgata agtacaaagc tgaaaaatct cacgacgagc tccccaggac ctttcaaatt 421 cctggataca ctgttccagt tgtcaatgtt gaagtgtctc cattcaccat agagatgtcg 481 gcattcggct atgtgttccc aaaagcagtc agcatgccta gtttctccat cctaggttct 541 gacgtccgtg tgccttcata cacattaatc ctgccatcat tagagctgcc agtccttcat 601 gtccctagaa atctcaagct ttctcttcca catttcaagg aattgtgtac cataagccat 661 atttttattc ctgccatggg caatattacc tatgatttct cctttaaatc aagtgtcatc 721 acactgaata ccaatgctga actttttaac cagtcagata ttgttgctca tctcctttct 781 tcatcttcat ctgtcattga tgcactgcag tacaaattag agggcaccac aagattgaca 841 agaaaaaggg gattgaagtt agccacagct ctgtctctga gcaacaaatt tgtggagggt 901 agtcataaca gtactgtgag cttaaccacg aaaaatatgg aagtgtcagt ggcaaaaacc 961 acaaaagccg aaattccaat tttgagaatg aatttcaagc aagaacttaa tggaaatacc 1021 aagtcaaaac ctactgtctc ttcctccatg gaatttaagt atgatttcaa ttcttcaatg 1081 ctgtactcta ccgctaaagg agcagttgac cacaagctta gcttggaaag cctcacctct 1141 tacttttcca ttgagtcatc taccaaagga gatgtcaagg gttcggttct ttctcgggaa 1201 tattcaggaa ctattgctag tgaggccaac acttacttga attccaagag cacacggtct 1261 tcagtgaagc tgcagggcac ttccaaaatt gatgatatct ggaaccttga agtaaaagaa 1321 aattttgctg gagaagccac actccaacgc atatattccc tctgggagca cagtacgaaa 1381 aaccacttac agctagaggg cctctttttc accaacggag aacatacaag caaagccacc 1441 ctggaactct ctccatggca aatgtcagct cttgttcagg tccatgcaag tcagcccagt 1501 tccttccatg atttccctga ccttggccag gaagtggccc tgaatgctaa cactaagaac 1561 cagaagatca gatggaaaaa tgaagtccgg attcattctg ggtctttcca gagccaggtc 1621 gagctttcca atgaccaaga aaaggcacac cttgacattg caggatcctt agaaggacac 1681 ctaaggttcc tcaaaaatat 
catcctacca gtctatgaca agagcttatg ggatttccta 1741 aagctggatg taaccaccag cattggtagg agacagcatc ttcgtgtttc aactgccttt 1801 gtgtacacca aaaaccccaa tggctattca ttctccatcc ctgtaaaagt tttggctgat 1861 aaattcatta ctcctgggct gaaactaaat gatctaaatt cagttcttgt catgcctacg 1921 ttccatgtcc catttacaga tcttcaggtt ccatcgtgca aacttgactt cagagaaata 1981 caaatctata agaagctgag aacttcatca tttgccctca acctaccaac actccccgag 2041 gtaaaattcc ctgaagttga tgtgttaaca aaatattctc aaccagaaga ctccttgatt 2101 cccttttttg agataaccgt gcctgaatct cagttaactg tgtcccgatt cacgcttcca 2161 aaaagtgttt cagatggcat tgctgctttg gatctaaatg cagtagccaa caagatcgca 2221 gactttgagt tgcccaccat catcgtgcct gagcagacca ttgagattcc ctccattaag 2281 ttctctgtac ctgctggaat tgtcattcct tcctttcaag cactgactgc acgctttgag 2341 gtagactctc ccgtgtataa tgccacttgg agtgccagtt tgaaaaacaa agcagattat 2401 gttgaaacag tcctggattc cacatgcagc tcaaccgtac agttcctaga atatgaacta 2461 aatgttttgg gaacacacaa aatcgaagat ggtacgttag cctctaagac taaaggaaca 2521 cttgcacacc gtgacttcag tgcagaatat gaagaagatg gcaaatttga aggacttcag 2581 gaatgggaag gaaaagcgca cctcaatatc aaaagcccag cgttcaccga tctccatctg 2641 cgctaccaga aagacaagaa aggcatctcc acctcagcag cctccccagc cgtaggcacc 2701 gtgggcatgg atatggatga agatgacgac ttttctaaat ggaacttcta ctacagccct 2761 cagtcctctc cagataaaaa actcaccata ttcaaaactg agttgagggt ccgggaatct 2821 gatgaggaaa ctcagatcaa agttaattgg gaagaagagg cagcttctgg cttgctaacc 2881 tctctgaaag acaacgtgcc caaggccaca ggggtccttt atgattatgt caacaagtac 2941 cactgggaac acacagggct caccctgaga gaagtgtctt caaagctgag aagaaatctg 3001 cagaacaatg ctgagtgggt ttatcaaggg gccattaggc aaattgatga tatcgacgtg 3061 aggttccaga aagcagccag tggcaccact gggacctacc aagagtggaa ggacaaggcc 3121 cagaatctgt accaggaact gttgactcag gaaggccaag ccagtttcca gggactcaag 3181 gataacgtgt ttgatggctt ggtacgagtt actcaaaaat tccatatgaa agtcaagcat 3241 ctgattgact cactcattga ttttctgaac ttccccagat tccagtttcc ggggaaacct 3301 gggatataca ctagggagga actttgcact atgttcataa gggaggtagg gacggtactg 3361 tcccaggtat attcgaaagt ccataatggt 
tcagaaatac tgttttccta tttccaagac 3421 ctagtgatta cacttccttt cgagttaagg aaacataaac taatagatgt aatctcgatg 3481 tatagggaac tgttgaaaga tttatcaaaa gaagcccaag aggtatttaa agccattcag 3541 tctctcaaga ccacagaggt gctacgtaat cttcaggacc ttttacaatt cattttccaa 3601 ctaatagaag ataacattaa acagctgaaa gagatgaaat ttacttatct tattaattat 3661 atccaagatg agatcaacac aatcttcaat gattatatcc catatgtttt taaattgttg 3721 aaagaaaacc tatgccttaa tcttcataag ttcaatgaat ttattcaaaa cgagcttcag 3781 gaagcttctc aagagttaca gcagatccat caatacatta tggcccttcg tgaagaatat 3841 tttgatccaa gtatagttgg ctggacagtg aaatattatg aacttgaaga aaagatagtc 3901 agtctgatca agaacctgtt agttgctctt aaggacttcc attctgaata tattgtcagt 3961 gcctctaact ttacttccca actctcaagt caagttgagc aatttctgca cagaaatatt 4021 caggaatatc ttagcatcct taccgatcca gatggaaaag ggaaagagaa gattgcagag 4081 ctttctgcca ctgctcagga aataattaaa agccaggcca ttgcgacgaa gaaaataatt 4141 tctgattacc accagcagtt tagatataaa ctgcaagatt tttcagacca actctctgat 4201 tactatgaaa aatttattgc tgaatccaaa agattgattg acctgtccat tcaaaactac 4261 cacacatttc tgatatacat cacggagtta ctgaaaaagc tgcaatcaac cacagtcatg 4321 aacccctaca tgaagcttgc tccaggagaa cttactatca tcctctaatt ttttaaaaga 4381 aatctcatta tctctttcca atgaacttca catagcacag aaaaaatcaa actgcctata 4441 ttgataaaac catacagtga gccagccttg cagtaggcag tagactataa gcagaagcac 4501 atatgaactg gacctgcacc aaagctggca ccagggctcg gaaggtctct gaactcagaa 4561 ggatggcatt ttttgcaagt taaagaaaat caggatctga gttattttgc taaacttggg 4621 ggaggaggaa caaa //