Thursday, June 24, 2010

Sergey Brin's Search for a Parkinson's Cure

Article at Wired magazine

Brin’s tolerance for “noisy data” is especially telling, since medical science tends to consider it poisonous. Biomedical researchers often limit their experiments to narrow questions that can be rigorously measured. But the emphasis on purity can mean fewer patients to study, which results in small data sets. That limits the research’s “power”—a statistical term that generally means the probability that a finding is actually true. And by design it means the data almost never turn up insights beyond what the study set out to examine.

Increasingly, though, scientists—especially those with a background in computing and information theory—are starting to wonder if that model could be inverted. Why not start with tons of data, a deluge of information, and then wade in, searching for patterns and correlations?

This is what Jim Gray, the late Microsoft researcher and computer scientist, called the fourth paradigm of science, the inevitable evolution away from hypothesis and toward patterns. Gray predicted that an “exaflood” of data would overwhelm scientists in all disciplines, unless they reconceived their notion of the scientific process and applied massive computing tools to engage with the data. “The world of science has changed,” Gray said in a 2007 speech—from now on, the data would come first.

Wednesday, June 2, 2010

What is data science?

Article at O'Reilly Radar

A well-written, dense article covering the rise of data science.

I'm not even going to try to summarize the article with excerpts, but I have picked out the portion that best describes what I do.

A picture may or may not be worth a thousand words, but a picture is certainly worth a thousand numbers. The problem with most data analysis algorithms is that they generate a set of numbers. To understand what the numbers mean, the stories they are really telling, you need to generate a graph.

Tuesday, June 1, 2010

Cognitive Biases - A Visual Compendium

Slide deck at Scribd

Content mostly from Wikipedia's page on cognitive biases

Why bother looking at these?
Two words: Know thyself

Rest assured that competent marketers know these biases and use them, and I don't just mean people trying to sell you stuff.