dispersion. NMF 0.17.6

Description

Computes the dispersion coefficient of a -- consensus -- matrix object, generally obtained from multiple NMF runs.

Usage

dispersion(object, ...)

Arguments

object: an object from which the dispersion is computed
...: extra arguments to allow extension

Details

The dispersion coefficient is based on the consensus matrix (i.e. the average of connectivity matrices) and was proposed by Kim et al. (2007) to measure the reproducibility of the clusters obtained from NMF.

It is defined as:

\rho = \sum_{i,j=1}^n 4 (C_{ij} -
  \frac{1}{2})^2 ,  where n is the total number of
  samples.
  
      By construction, 0 \leq \rho \leq 1 and \rho =
  1 only for a perfect consensus matrix, where all entries
  0 or 1. A perfect consensus matrix is obtained only when
  all the connectivity matrices are the same, meaning that
  the algorithm gave the same clusters at each run. See
  Kim et al. (2007).

Methods

dispersionsignature(object = "matrix"): Workhorse method that computes the dispersion on a given matrix.
dispersionsignature(object = "NMFfitX"): Computes the dispersion on the consensus matrix obtained from multiple NMF runs.

References