eScholarship Repository eScholarship Repository California Digital Library
eScholarship > CBMB > Paper yeh_endseq

CBMB Papers

CBMB Website

Policies

Search CBMB

Submit a Paper

Notify me of new papers

institute_logo

Center for Bioinformatics & Molecular Biostatistics
University of California, San Francisco

CBMB Papers  •  CBMB Website  •  Policies  •  Search CBMB  •  Submit a Paper

Predicting Progress in Shotgun Sequencing with Paired Ends
Ru-Fang Yeh, University of California, San Francisco
Terence P. Speed, University of California, Berkeley
Michael S. Waterman, University of Southern California
Xiaoman Li, University of Southern California

Screen display is blurry, but will print out clearly.

Download the Paper (302 K, PDF file) - October 11, 2002 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
Paired-end shotgun sequencing has become widely used for large-scale sequencing projects in recent years, including whole genome shot-gun sequencing and map-based BAC clone sequencing. Under this scheme, sequences from both ends of random clones are determined and assembled into sequence contigs. The sequence data and their linking information are used to construct clone maps in the form of scaffolds. In order to plan a cost-effective sequencing project utilizing such an approach, it is crucial to have knowledge of the expected project progress in relation to parameters such as insert size, clone lengths and redundancy. There has been a lack of theoretical analysis for the paired-end sequencing strategy due to the difficulty of correlated ends. Here we present a mathematical analysis for the progress of a sequencing project employing such a scheme. Formulae for various measures of the expected progress such as expected number and size of scaffolds are derived and assessed by Monte Carlo simulations for parameter sets used in the human genome project.

SUGGESTED CITATION:
Ru-Fang Yeh, Terence P. Speed, Michael S. Waterman, and Xiaoman Li, "Predicting Progress in Shotgun Sequencing with Paired Ends" (October 11, 2002). Center for Bioinformatics & Molecular Biostatistics. Paper yeh_endseq.
http://repositories.cdlib.org/cbmb/yeh_endseq

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress