This is our new weekly challenge, and it is about detecting spurious correlations in big data. There's a theoretical solution (I think, not sure) but the problem can be solved using simulations.
The purpose here is to show that with big data, the risk associated with spurious correlations is high. If you are anti big-data (you don't like the hype), this is your chance to make a valid point about reckless processing of big data.
Participate or read answers at http://bit.ly/1oTLm8K
You may leave the list at any time by sending the command
SIGNOFF allstat
to [log in to unmask], leaving the subject line blank.
|