
Single-nucleotide polymorphism (SNP) markers are abundant on the human genome and can be used in fine mapping of genes. Although it is useful to choose markers spaced at regular intervals between two boundaries of a chromosomal region for mapping, it can be a difficult task since intervals vary from one residue to hundreds of thousands.
Our algorithms choose markers based on interval regularity within a genomic region containing a total of n markers, when given the desired number of select markers (m) or the desired interval (I). The solution set has a minimum variance of intervals or minimum sum of squared deviations from I. Although the number of all possible sets increases exponentially with respect to n, our algorithms cover them all with only O(n2) calculations and comparisons by inductively eliminating all redundancy.
These algorithms were implemented in a Windows program named CHOISS. It has a simple graphic interface and is equipped with several functions for convenience (automatic construction of an input file from a text form of NCBI Genbank contig data, graphic visualization of maps with total and chosen markers, etc.). It provides an option to choose between m- and I-value given cases. The program performs well on Windows 98, Me, 2000 and XP versions. It can be downloaded freely from this web page.

Paper
Published in Bioinformatics
CHOISS for
selection of single nucleotide polymorphism markers on interval regularity
Soohyun
Lee and Changwon Kang
Bioinformatics
Vol.
20, Issue 4, pp. 581-582 (March 1, 2004)
(Bioinformatics
Advance Access published online, January 22, 2004)
DOI:
10.1093/bioinformatics/btg458
Supplementary information for the
Bioinformatics paper

Contact: ckang@kaist.ac.kr