Article at Wired magazine
Brin’s tolerance for “noisy data” is especially telling, since medical science tends to consider it poisonous. Biomedical researchers often limit their experiments to narrow questions that can be rigorously measured. But the emphasis on purity can mean fewer patients to study, which results in small data sets. That limits the research’s “power”—a statistical term that generally means the probability that a finding is actually true. And by design it means the data almost never turn up insights beyond what the study set out to examine.
Increasingly, though, scientists—especially those with a background in computing and information theory—are starting to wonder if that model could be inverted. Why not start with tons of data, a deluge of information, and then wade in, searching for patterns and correlations?
This is what Jim Gray, the late Microsoft researcher and computer scientist, called the fourth paradigm of science, the inevitable evolution away from hypothesis and toward patterns. Gray predicted that an “exaflood” of data would overwhelm scientists in all disciplines, unless they reconceived their notion of the scientific process and applied massive computing tools to engage with the data. “The world of science has changed,” Gray said in a 2007 speech—from now on, the data would come first.
Thursday, June 24, 2010
Wednesday, June 2, 2010
What is data science?
Article at O'Reilly Radar
An well-written, dense article covering the rise of data science.
I'm not even going to try to summarize the article with excerpts, but I have picked out a portion that best summarizes what I do.
A picture may or may not be worth a thousand words, but a picture is certainly worth a thousand numbers. The problem with most data analysis algorithms is that they generate a set of numbers. To understand what the numbers mean, the stories they are really telling, you need to generate a graph.
An well-written, dense article covering the rise of data science.
I'm not even going to try to summarize the article with excerpts, but I have picked out a portion that best summarizes what I do.
A picture may or may not be worth a thousand words, but a picture is certainly worth a thousand numbers. The problem with most data analysis algorithms is that they generate a set of numbers. To understand what the numbers mean, the stories they are really telling, you need to generate a graph.
Tuesday, June 1, 2010
Cognitive Biases - A Visual Compendium
Slide deck at Scribd
Content mostly from Wikipedia's page on cognitive biases
Why bother looking at these?
Two words: Know thyself
Rest assured that competent marketers know these biases and utilize them, and I'm not just referring to people trying to sell you stuff.
Content mostly from Wikipedia's page on cognitive biases
Why bother looking at these?
Two words: Know thyself
Rest assured that competent marketers know these biases and utilize them, and I'm not just referring to people trying to sell you stuff.
Subscribe to:
Posts (Atom)