|
Statistics Papers
Statistics Website
Policies
Search Statistics
Submit a Paper
Notify me of new papers
|
 |

Genomewide Motif Identification Using a Dictionary Model
Chiara Sabatti, Department of Statistics, UCLA
Kenneth Lange, Department of Statistics, UCLA
ABSTRACT: This paper surveys and extends models and algorithms for identifying binding sites in non-coding
regions of DNA. These sites control the transcription of genes into messenger RNA in preparation for
translation into proteins. We summarize the underlying biology, review three different models for binding
site identification, and present a unified model that borrows from the previous models and integrates their
main features. We then describe maximum likelihood and maximum a posteriori algorithms for fitting
the unified model to data. Finally, we conclude with a prospectus of future data analyses and theoretical
research.
SUGGESTED CITATION: Chiara Sabatti and Kenneth Lange,
"Genomewide Motif Identification Using a Dictionary Model"
(May 3, 2002).
Department of Statistics, UCLA.
Department of Statistics Papers.
Paper 2002050301.
http://repositories.cdlib.org/uclastat/papers/2002050301
|