Change-point detection and sequence alignment: Statistical problems of genomics
Nancy R. Zhang
ProQuest / UMI
Saturday, March 18, 2006
In Part I of this thesis, we will study the problem of estimating the number of change-points in a data series that is hypothesized to have undergone abrupt changes. We examine two different models: Gaussian data points with a changing mean, and a Poisson process with a changing rate parameter. This problem can be approached from the model selection perspective, where model complexity grows with the number of change-points. The classic Bayes Information Criterion (BIC) statistic cannot be used because of irregularities in the likelihood function. By asymptotic approximation of the Bayes factor, we derive a “modified BIC” that is theoretically justified for the change-point models that we study. One application of the Gaussian model, and a source of inspiration for it, is the analysis of array comparative genomic hybridization (array-CGH) data. Array-CGH measures the number of chromosome copies at each genome location of a cell sample, and is useful for finding regions of genome deletion and amplification in tumor cells. The new modified BIC statistic will be tested on array-CGH data sets and compared to existing methods. Variations on the basic change-point model that are inspired by array-CGH data will also be discussed.
In Part II, we will switch to a different problem: the characterization of scores of optimal local sequence alignments. This problem was inspired by the comparison of protein and DNA sequences in biology. The specific question that we will ask is: for which scoring functions does the optimal local alignment score grow logarithmically with sequence length? We will define the concept of “Local Optimality” and use it to prove a sufficient condition on the scoring function for logarithmic growth of the optimal score for gapped alignments. “Local Optimality” refers to the fact that in an optimal alignment, any local changes around gaps should not increase the overall score. We will use numerical studies to compare our local-optimality-based result to previous results, and also draw some theoretical connections.
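For readers unfamiliar with the quantity studied in Part II, the optimal gapped local alignment score of two sequences can be computed with the standard Smith-Waterman dynamic program. The sketch below is illustrative only and is not taken from the thesis: the function name `local_alignment_score`, the linear gap penalty, and the default values for `match`, `mismatch`, and `gap` are all assumptions chosen for the example.

```python
def local_alignment_score(a, b, match=1.0, mismatch=-1.0, gap=-2.0):
    """Optimal local alignment score of sequences a and b.

    Standard Smith-Waterman recursion with a linear gap penalty.
    The scoring parameters are illustrative assumptions, not values
    from the thesis.
    """
    # prev[j] holds the best score of an alignment ending at the
    # previous character of a and the j-th character of b.
    prev = [0.0] * (len(b) + 1)
    best = 0.0
    for ca in a:
        curr = [0.0]  # column 0: empty prefix of b
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            up = prev[j] + gap        # gap inserted in b
            left = curr[j - 1] + gap  # gap inserted in a
            # Flooring at zero lets an alignment restart anywhere,
            # which is what makes the alignment local.
            cell = max(0.0, diag, up, left)
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


if __name__ == "__main__":
    print(local_alignment_score("ACGTACGT", "ACGTTACGT"))
```

Evaluating this score on random sequences of increasing length, for different choices of the scoring parameters, is one simple way to observe the logarithmic versus linear growth regimes that the abstract refers to.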
More Genomics Books
Cats Are Not Peas: A Calico History of Genetics
Sudden Origins: Fossils, Genes, and the Emergence of Species
Chromosomes: The Complex Code
The Dependent Gene: The Fallacy of "Nature vs. Nurture"
Mycobacterium: Molecular Microbiology
Developmental Plasticity and Evolution
A Dictionary of Genetics
Evolution of Sameness and Difference: Perspectives on the Human Genome Project
Genetics, Paleontology and Macroevolution
The Century of the Gene
Copyright © 2010 Biohealthmatics.com. All Rights Reserved.
Last Updated: 24 November 2007.