The new variance introduced in this article fixes two big data problems associated with the traditional variance and the way it is computed in Hadoop, using a numerically unstable formula. The following aspects are discussed:
- Synthetic Metrics: Definition, Usage
- Hadoop, numerical and statistical stability
- The abstract concept of variance
- A New Big Data Theorem
- Transformation-Invariant Metrics
- Implementation: Communications versus Computational Costs
- Bayesian Models
Read article at http://bit.ly/K2nLDM
You may leave the list at any time by sending the command
SIGNOFF allstat
to [log in to unmask], leaving the subject line blank.
|