LOCUS AC133439 4006 bp DNA linear HUM 30-SEP-2002 DEFINITION Homo sapiens 3 BAC RP11-577M5 (Roswell Park Cancer Institute Human BAC Library) complete sequence. ACCESSION AC133439 VERSION AC133439.2 KEYWORDS HTG. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4006) AUTHORS Muzny,D.M., Adams,C., Adio-Oduola,B., Ali-osman,F.R., Allen,C., Alsbrooks,S.L., Amaratunge,H.C., Are,J.R., Ayele,M., Banks,T., Barbaria,J., Benton,J., Bimage,K., Blankenburg,K., Bonnin,D., Bouck,J., Bowie,S., Brieva,M., Brown,E., Brown,M., Bryant,N.P., Buhay,C., Burch,P., Burkett,C., Burrell,K.L., Byrd,N.C., Carron,T.F., Carter,M., Cavazos,S.R., Chacko,J., Chavez,D., Chen,G., Chen,R., Chen,Z., Chowdhry,I., Christopoulos,C., Cleveland,C.D., Cox,C., Coyle,M.D., Dathorne,S.R., David,R., Davila,M.L., Davis,C., Davy-Carroll,L., Dederich,D.A., Delaney,K.R., Delgado,O., Denn,A.L., Ding,Y., Dinh,H.H., Douthwaite,K.J., Draper,H., Dugan-Rocha,S., Durbin,K.J., Earnhart,C., Edgar,D., Edwards,C.C., Elhaj,C., Escotto,M., Falls,T., Ferraguto,D., Flagg,N., Ford,J., Foster,P., Frantz,P., Gabisi,A., Gao,J., Garcia,A., Garner,T., Garza,N., Gill,R., Gorrell,J.H., Guevara,W., Gunaratne,P., Hale,S., Hamilton,K., Harris,C., Harris,K., Hart,M., Havlak,P., Hawes,A., He,X., Hernandez,J., Hernandez,O., Hodgson,A., Hogues,M., Holloway,C., Hollins,B., Homsi,F., Howard,S., Huber,J., Hulyk,S., Hume,J., Jackson,L.E., Jacobson,B., Jia,Y., Johnson,R., Jolivet,S., Joudah,S., Karlsson,E., Kelly,S., Khan,U., King,L., Korvah,J., Kovar,C., Kratovic,J., Kureshi,A., Landry,N., Leal,B., Lewis,L.C., Lewis,L., Li,J., Li,Z., Lichtarge,O., Lieu,C., Liu,J., Liu,W., Loulseged,H., Lozado,R.J., Lu,X., Lucier,A., Lucier,R., Luna,R., Ma,J., Maheshwari,M., Mapua,P., Martin,R., Martindale,A., Martinez,E., Massey,E., Mawhiney,E., McLeod,M.P., Meador,M., Mei,G., 
Metzker,M., Miner,G., Miner,Z., Mitchell,T., Mohabbat,K., Moore,S., Morgan,M., Moorish,T., Morris,S., Moser,M., Neal,D., Nelson,D., Newtson,J., Newtson,N., Nguyen,A., Nguyen,N., Nguyen,N., Nickerson,E., Nwokenkwo,S., Oguh,M., Okwuonu,G., Oragunye,N., Oviedo,R., Pace,A., Payton,B., Peery,J., Perez,L., Peters,L., Pickens,R., Primus,E., Pu,L.L., Quiles,M., Ren,Y., Rives,M., Rojas,A., Rojubokan,I., Rolfe,M., Ruiz,S., Savery,G., Scherer,S., Scott,G., Shen,H., Shooshtari,N., Sisson,I., Sodergren,E., Sonaike,T., Sparks,A., Stanley,H., Stone,H., Sutton,A., Svatek,A., Tabor,P., Tamerisa,A., Tamerisa,K., Tang,H., Tansey,J., Taylor,C., Taylor,T., Telfrod,B., Thomas,N., Thomas,S., Usmani,K., Vasquez,L., Vera,V., Villalon,D., Vinson,R., Wang,Q., Wang,S., Ward-Moore,S., Warren,R., Washington,C., Watlington,S., Williams,G., Williamson,A., Wleczyk,R., Wooden,S., Worley,K., Wu,C., Wu,Y., Wu,Y.F., Zhou,J., Zorrilla,S., Naylor,S.L., Weinstock,G. and Gibbs,R. TITLE Direct Submission JOURNAL Unpublished REFERENCE 2 (bases 1 to 4006) AUTHORS Worley,K.C. TITLE Direct Submission JOURNAL Submitted (12-SEP-2002) Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA REFERENCE 3 (bases 1 to 4006) AUTHORS Worley,K.C. TITLE Direct Submission JOURNAL Submitted (30-SEP-2002) Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA COMMENT On Sep 30, 2002 this sequence version replaced AC133439.1. INFORMATION: http://www.hgsc.bcm.tmc.edu/ or email gc-help@bcm.tmc.edu CLONE LENGTH: This sequence does not necessarily represent the entire insert of this clone. Overlapping regions of clones are only sequenced and submitted once, so the sequence for the remainder of the insert may be found in the record for the adjacent clones. Overlapping clones are noted at the beginning and end of the Features listing. 
ANNOTATION OF FEATURES: STSs are identified using ePCR (Genome Res. 7:541-550) searches of a local database that includes entries from dbSTS, GDB, and local mapping efforts. Repeats are identified using RepeatMasker (A. Smit and P. Green, unpublished.) for Human and Mouse sequences. Genes and Region of sequence similarity are identified by BLAST (Nuc. Acids Res. 25:3389-3402) similarity (expect < 1e-34) to the EST and cDNA sequences. Genes demonstrate at least two exons flanked by consensus splice sites that maintained sequence continuity across the splice junctions. Sequences that are not identical matches are annotated as similar. SEQUENCING READ COVERAGE: Sequencing is completed to a minimum standard of double strand coverage with a minimum of 2 clones and 2 reads with no ambiguities or 2 chemistries with a minimum of 2 clones and 3 reads with no ambiguities. If the sequence quality for a region does not meet this standard, it will be indicated in the annotation as Low Coverage. QUALITY OF INDIVIDUAL BASES: This sequence meets stringent quality standards - estimated error rate less than 1 per 10,000 bases. Reports of lowest quality individual bases and measures of base quality are listed below. Description of the metrics can be found at URL: http://www.hgsc.bcm.tmc.edu:8088/quality.info/genbank.annotation.html. 
FEATURES Location/Qualifiers source 1..4006 /db_xref="H-InvDB:HIT000383791" /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /chromosome="3" /clone="RP11-577M5" misc_feature complement(1..2021) /note="overlaps bases 1..2022 of clone AC024892" /function="clone overlap" repeat_region complement(631..934) /rpt_family="L1MB5" repeat_region complement(1110..1257) /rpt_family="L1M4" repeat_region 1273..1567 /rpt_family="AluY" repeat_region 1807..2292 /rpt_family="MER34" misc_feature complement(2002..4006) /note="overlaps bases 181591..183595 of clone AC117460" /function="clone overlap" BASE COUNT 1234 a 707 c 658 g 1407 t ORIGIN 1 aaggaacaaa ggagattata tggtcatttc aaaatatttg ggaaattcta ctctgggtat 61 tttttcagag atagtttata accaagtcat agaagaaaat caatcttatt acttcatgaa 121 aatatcagat actatattca taatagcaat agaacctatg aagcccttat gagtaaaatt 181 taataagaaa agtgttgaaa gtgtatgaaa aaactatgaa aacaactata cgtggatcta 241 tcttacataa tttccagata tttttgacta tttcaaaaaa tcattcctgc tctcaatagt 301 tgtctattac caaatattta aatgcaaatt atcatgaaac ataaaagatc attaatagat 361 attttctaag aaaaatttac aggttattta ttactaattt tacattagaa ctatcaatgc 421 caaatattcc ttactaaagg taaaaatagc attttattat tcttatttct attaacatgc 481 aggtaatcat cattttgtac acatttaccc caaaacattt gggaagctct acgctggaaa 541 ttctttcaaa gatagtttat aagttataga aaaaaattaa tctttatcta tgttcatatc 601 atatttatat ttatacgcaa aaaattaaaa atataaatgg aactatgtaa tttacatttc 661 tttctatttt gattttttta cttaataatg cattgagaaa atgctccgat ttttaacagg 721 tacagctcaa atccattttc tttatatctg aagactattc cacagtcaga ctgtaccata 781 atttattcag tcattctata attgatgagt attctttttg tttcctggct ttgtcttcat 841 gaacaatgct ttattaggta tccttgtgtt tcacgtttgt atctgggtga caatatttct 901 agaatatata tccagaagtg agattgttag atcaaagata tataaattac taattttgat 961 agatgtcagc tttgaaaggc agttaaaatt tatgtcagtt attttagatt gcaactcttt 1021 ttcacctttc tatcatcagt agatattatt tctcatttta cttttttgtt tatttgacaa 1081 atgaaaacga tgcgacaatt taaaaaatat ccttaattat taaggaggtt gaacattgtt 1141 tcatatattt 
atcggttttt ttttacatat ttattaatca tggtaaacat atctgttcat 1201 atttttcact catttttttc attaggttgt ctacttgttc ctgccaagtt gtaagagagt 1261 atatgtgttg tgggccgggc gcggtggctc acgcctgtaa tcctagcact ttgggaggcc 1321 gaggcggacg gatcatgagg tcaggagatc gagaccacgg tgaaaccccg tctctactaa 1381 aaatacaaaa agttagccgg gcgtagtggc gggcgcctgt agtcccagct actcgggagg 1441 ctgaggcagg agaatggcgt gaacccggga ggcggagctt gcagtgagcc gagatggcgc 1501 ccctgcactc cagcctgggt gacagagcga gactccgtct caaaaaaaaa aaaaaaaaaa 1561 aagagaatat atgtgttgtg tatatatttt tataagtatt aatgcaaatt atgttacaga 1621 tatttttcca acatgttttg tcctttccca ttttttatga attcttttgt catgccataa 1681 aaattaggta gctatgccta tattcaatga ttagcctaat tctctttctc atgcaatggt 1741 tttggatttg catctttttc cattgcaatg gtcttccaca tggttttatt gcctctggag 1801 ggtggtagaa tatgccacac caaaatatgc cactttggca gatggattat tttgagctaa 1861 aggtacataa agaaacagca gatggaagaa gggcattcta atttcacctc ttcttcctca 1921 aaacaagtgg ttaaaaactc ccatgtaaaa gatgccctcc ctgtaccaga agaaatgaaa 1981 cattattttc attttttaat gaattctatc caagagaatt ctatacaaac aacgcttgtt 2041 aaaataattc taatcttcct tttacctccc tgcataattt agctgctttt ccacaattgc 2101 ttttctttgt tcaacctagt atgaaagcct ttaggttttg ccacatgttc aggtcttcat 2161 ttacttatga ggacttttat gttatgtaaa acttttatac atttgtatgc ttttctcttc 2221 tgaatctggc ttatgtccat ctgattctca ggccaactaa gagagtagag gcaaaatttt 2281 gcctccccta cacctcaaat atatgcattc atttcagtat tttccaattt tagtgtcaaa 2341 tacctaaaca cacactttgt aaacagattt tactaatgga tgtaagttag actttaaaaa 2401 ctgttaatta aatgggaaac tcaaaggtat aaatataata aaatgctacc gctttgtgtt 2461 atctttagtg attgtctttg taacatgttt tcttattaca gtgacgagta tggatatatc 2521 agagggaaat aagactcttg tgacagagtt tgttctcaca ggacttacag atcgaccatg 2581 gctgcacgtc ctcttctttg ttgtgttttt ggtggtctat ctcatcacca tggtgggcaa 2641 ccttggactg atagttctaa tttggaacga cccccatctt catatgccca tgtacttatt 2701 ccttggtggt ttagcctttt cagatgcttg tacttcaacc tctataaccc ctaggatgct 2761 ggtcaatttc ttagacaaga ctgcaatgat atccctagct gagtgcatca cccagtttta 2821 cttttttgct tccagtgcaa 
ctacagaatg cttcctcctg gtgatgatgg cctatgaccg 2881 ctatgtagcc atatgtaatc ccttgcttta tccagtgatg atgtccaaca aactcagcgc 2941 tcagttgcta agtatttcat atgtaattgg tttcctgcat cctctggttc atgtgagttt 3001 actattgcga ctaactttct gcaggtttaa cataatacat tatttctact gtgaaatttt 3061 acaactgttc aaaatttcat gcaatggtcc atctattaac gcactaatga tatttatttt 3121 tggtgctttt atacaaatac ccactttaat gactatcata atctcttata ctcgtgtgct 3181 ctttgatatt ctgaaaaaaa agtctgaaaa gggcagaagc aaagccttct ccacatgcgg 3241 cgcccatctg ctttctgtct cattgtacta cggaactctg atcttcatgt atgtgcgtcc 3301 tgcatctggc ttagctgaag accaagacaa agtgtattct ctgttttaca cgattataat 3361 tcccctgcta aacccattta tttacagctt gagaaataaa aaagtcatgc atgcattgag 3421 aagagttata aggaagtaaa cagttccaaa gggaaatgtc aaatcattta ttttttcacc 3481 ctttgcataa ataagtcaac aagtcttgtg gttagccatg gctctgccat ttcatcagag 3541 ggctagtggt cagggtggtc actgagcact acgtgaagaa acccagcttt gtacaatgtt 3601 tacttggatt tgggtccagt tgaagtgtct ttacatcaac catggttgat ttctttaaaa 3661 acattttact tccttgtttt ctgaacattt aatcttgaaa atttattatt ttccagggac 3721 cctacatcta aattccattg aataaccgac aataattgaa actgttatat aagagccatt 3781 tagttcacaa acctagttta ctgcactagg ctctgtgata ccctaccata caagaataaa 3841 atgataaaaa ctcagagtct tgcagaaaaa gtgttctaga aataaaagtg agaaatagta 3901 ataaaaatta agagtttaaa aaatggtgta ttttttaatt tttaatttct ggtatatctt 3961 aatgacaaaa atttttaaaa gcttgtcttg aacaatggac catcaa //