Print print this page (PC only) Close close window

The Signal and the Noise
The Art and Science of Prediction
Nate Silver - Author
£25.00

Book: Hardback | 153 x 234mm | 544 pages | ISBN 9781846147524 | 27 Sep 2012 | Allen Lane
The Signal and the Noise

Every time we choose a route to work, decide whether to go on a second date, or set aside money for a rainy day, we are making a prediction about the future. Yet from the global financial crisis to 9/11 to the Fukushima disaster, we often fail to foresee hugely significant events. In The Signal and the Noise, the New York Times' political forecaster and statistics guru Nate Silver explores the art of prediction, revealing how we can all build a better crystal ball.

In his quest to distinguish the true signal from a universe of noisy data, Silver visits hundreds of expert forecasters, in fields ranging from the stock market to the poker table, from earthquakes to terrorism. What lies behind their success? And why do so many predictions still fail? By analysing the rare prescient forecasts, and applying a more quantitative lens to everyday life, Silver distils the essential lessons of prediction.

We live in an increasingly data-driven world, but it is harder than ever to detect the true patterns amid the noise of information. In this dazzling insider's tour of the world of forecasting, Silver reveals how we can all develop better foresight in our everyday lives.

1. THE FINANCIAL CRISIS:
A Catastrophic Failure of Prediction

It was October 23, 2008. The stock market was in free fall, having plummeted almost 30 percent over the previous five weeks. Once-esteemed companies like Lehman Brothers had gone bankrupt. Credit markets had all but ceased to function. Houses in Las Vegas had lost 40 percent of their value. Unemployment was skyrocketing. Hundreds of billions of dollars had been committed to failing financial firms. Confidence in government was the lowest that pollsters had ever measured. The presidential election was less than two weeks away.

Congress, normally dormant so close to an election, was abuzz with activity. The bailout bills it had passed were sure to be unpopular and it needed to create every impression that the wrongdoers would be punished. The House Oversight Committee had called the heads of the three major credit-rating agencies, Standard & Poor’s (S&P), Moody’s, and Fitch Ratings, to testify before them. The ratings agencies were charged with assessing the likelihood that trillions of dollars in mortgage-backed securities would go into default. To put it mildly, it appeared they had blown the call.


The Worst Prediction of a Sorry Lot

The financial crisis of the late 2000s can be understood in a number of ways: as a moral failure or as a regulatory failure or as a failure of institutions. Most obviously, it was an economic failure of massive proportions. By late 2011, four years after the Great Recession officially began, the average American was about $2,500 poorer than she would have been otherwise.

I am convinced, however, that the best way to view the financial crisis is as a catastrophic failure of prediction. The predictive failures were widespread, occurring at virtually every stage during, before, and after the crisis and involving everyone from the mortgage brokers to the White House.

The most calamitous failures of prediction usually have a lot in common. We focus on those signals that tell a story about the world as we would like it to be, not how it really is. We ignore the risks that are hardest to measure, even when they pose the greatest threats to our well-being. We make approximations and assumptions about the world that are much cruder than we realize. We abhor uncertainty, even when it is an irreducible part of the problem we are trying to solve. If we want to get at the heart of the financial crisis, we should begin by identifying the greatest predictive failure of all, a prediction that committed all these mistakes.

The ratings agencies had given their AAA rating, normally reserved for a handful of the world’s most solvent governments and best-run businesses, to thousands of mortgage-backed securities, financial instruments that allowed investors to bet on the likelihood of someone else defaulting on their home. The ratings issued by these companies are quite explicitly meant to be predictions: estimates of the likelihood that a piece of debt will go into default. Standard & Poor’s told investors, for instance, that when it rated a particularly complex type of security known as a credit-default option (CDO) at AAA, there was only a 0.12 percent probability—about 1 chance in 850—that it would fail to pay out over the next five years. This supposedly made it as safe as a AAA-rated corporate bond and safer than S&P now assumes U.S. Treasury bonds to be.8 The ratings agencies do not grade on a curve.

In fact, around 28 percent of the AAA-rated CDOs defaulted, according to S&P’s internal fi gures. (Some independent estimates are even higher.) That means that the actual default rates for CDOs were more than two hundred times higher than S&P had predicted.

This is just about as complete a failure as it is possible to make in a prediction: trillions of dollars in investments that were rated as being almost completely safe instead turned out to be almost completely unsafe. It was as if the weather forecast had been 86 degrees and sunny, and instead there was a blizzard.

When you make a prediction that goes that so badly, you have a choice of how to explain it. One path is to blame external circumstances—what we might think of as “bad luck.” Sometimes this is a reasonable choice, or even the correct one. When the National Weather Service says there is a 90 percent chance of clear skies, but it rains instead and spoils your golf outing, you can’t really blame them. Decades of historical data show that when the Weather Service says there is a 1 in 10 chance of rain, it really does rain about 10 percent of the time over the long run.

This explanation becomes less credible, however, when the forecaster does not have a history of successful predictions and when the magnitude of his error is larger. In these cases, it is much more likely that the fault lies with the forecaster’s model of the world and not with the world itself.

In the instance of CDOs, the ratings agencies had no track record at all: these were new and highly novel securities, and the default rates claimed by S&P were not derived from historical data but instead were assumptions based on a faulty statistical model. Meanwhile, the magnitude of their error was enormous: AAA-rated CDOs were two hundred times more likely to default in practice than they were in theory.

The ratings agencies’ shot at redemption would be to admit that the models had been flawed and the mistake had been theirs. But at the congressional hearing, they shirked responsibility and claimed to have been unlucky. They blamed an external contingency: the housing bubble.

“S&P is not alone in having been taken by surprise by the extreme decline in the housing and mortgage markets,” Deven Sharma, the head of Standard & Poor’s, told Congress that October. “Virtually no one, be they homeowners, financial institutions, rating agencies, regulators or investors, anticipated what is coming.”

Nobody saw it coming. When you can’t state your innocence, proclaim your ignorance: this is often the first line of defense when there is a failed forecast. But Sharma’s statement was a lie, in the grand congressional tradition of “I did not have sexual relations with that woman” and “I have never used steroids.”

What is remarkable about the housing bubble is the number of people who did see it coming—and who said so well in advance. Robert Shiller, the Yale economist, had noted its beginnings as early as 2000 in his book Irrational Exuberance. Dean Baker, a caustic economist at the Center for Economic and Policy Research, had written about the bubble in August 2002. A correspondent at The Economist magazine, normally known for its staid prose, had spoken of the “biggest bubble in history” in June 2005. Paul Krugman, the Nobel Prize–winning economist, wrote of the bubble and its inevitable end in August 2005. “This was baked into the system,” Krugman later told me. “The housing crash was not a black swan. The housing crash was the elephant in the room.”

Ordinary Americans were also concerned. Google searches on the term “housing bubble” increased roughly tenfold from January 2004 through summer 2005. Interest in the term was heaviest in those states, like California, that had seen the largest run-up in housing prices—and which were about to experience the largest decline. In fact, discussion of the bubble was remarkably widespread. Instances of the two-word phrase “housing bubble” had appeared in just eight news accounts in 2001 but jumped to 3,447 references by 2005. The housing bubble was discussed about ten times per day in reputable newspapers and periodicals.

And yet, the ratings agencies—whose job it is to measure risk in financial markets—say that they missed it. It should tell you something that they seem to think of this as their best line of defense. The problems with their predictions ran very deep.

1. Can you explain the title of your book, THE SIGNAL AND THE NOISE?

It’s a metaphor that comes from electrical engineering. The signal is the sound that you want to transmit: say, a recording of Beethoven’s Moonlight Sonata. The noise is anything that interferes with it: say, the cracking from a nearby radio tower.

I found that this metaphor was coming up again and again in my research. Intelligence officials trying to detect terrorist activity will speak of a terrorist’s signal. Or an economist might speak of the noise in financial markets.

Their goal is to isolate the signal from the noise. But I call the book “The Signal and The Noise” because I found that there is a sort of duality between them. We can trick ourselves into thinking that random patterns are meaningful ones – that noise is signal -- and sometimes vice versa.


2. Why do you think statistics books continue to capture the popular imagination, from Freakonomics to Moneyball?

We encounter so much information today that people are naturally curious about what in the heck we should do with all of it. And we’re becoming less trusting of institutions that mediate information, like the news media. We have all this data, and we want to learn for ourselves what it all means.

A little bit of math and statistics and probability and logic helps us with our information-processing goals. But what’s great about books like Moneyball and Freakonomics is that they make statistics approachable. Subjects like English and history are taught in very hands-on ways – you read great books, discuss the ideas and characters, and it’s easy to understand their relevance. Whereas math is taught in very abstract and technical ways -- even though it’s just as relevant to our everyday lives, and just as intuitive if it’s taught well.

Books like Freakonomics and Moneyball help to bridge that gap. They’re sort of making up for the calculus teacher that had you memorize one too many derivatives and turned you off to the subject as a result. Not that there’s anything wrong with calculus.


3. Even before you came on the scene as a political forecaster, you developed an innovative system for predicting baseball performance – how did you go from sports to politics?

Partly because of Moneyball, the competition in baseball was getting very fierce. Most of the inefficiencies that Michael Lewis described in the book were exploited a long time ago. There’s hardly a team left that doesn’t employ a statistical analyst of some kind, or which doesn’t know that on-base percentage is more important than batting average.

Whereas in politics – well, you don’t need watch Fox News or MSNBC for very long to know that there’s a lot of hot air when it comes to political coverage. Your average political pundit is pretty detached from the things that normal voters care about. Reporters are a lot better than pundits, but they still need to weave messy information into neat narratives. So I thought there was room to apply a little bit more rigor to the topic.


4. What is the easiest professional sport to predict?

I’d argue that baseball is both the easiest and the hardest to predict, depending on how you’re thinking about the question. The statistical methods are the most complete in baseball. Pretty much everything that has happened on a baseball diamond within the past 150 years has been dutifully and accurately recorded. And for the most part, these statistics provide a very good description of what’s really going on in the game.

But we also know that there is a lot of luck in baseball. Even a world-class team will lose one third of its ballgames. And a .300 hitter will go through plenty slumps where he can’t hit a lick. So we know a lot about how to isolate the signal from the noise in baseball – but there is an awful lot of noise!

So maybe the best answer is something like tennis, which meets both definitions of predictability. It’s quite simple structurally, so you can describe it pretty well with statistics. And the best players – Roger Federer, Rafael Nadal – are absolutely dominant.


5. Politics and Baseball, the two subjects you are best known for, are just pat of the book. Why was it important to include so many different fields - economics, earth and life sciences, games, even terrorism?

One thing that baseball fans know is to be wary of small sample sizes. If you show up at the ballpark, and the catcher gets three hits that day, that doesn’t really tell you very much about how good he really is. It takes a long time – hundreds of at-bats -- for the signal to emerge since there’s so much luck in the game.

But in the same way, I thought, perhaps baseball is an exceptional case. Are there Moneyball-like success stories in other fields in which statistics and analysis and prediction is pertinent?

In fact, I found that there are entire disciplines in which our analysis has failed to produce much progress, at least as measured by our ability to make reliable predictions. Finance and economics are obvious examples of this, for instance. Economists have literally tens of thousands of data series to mine – more statistics than baseball geeks do. But they still aren’t able to predict recessions more than a few months in advance.

The book needed to cover a diverse enough range of examples that I could to some systematic conclusions about why predictions succeed and why they fail in different fields. And that meant going beyond the cases that were most familiar to me when I started to write and research the book.

Often, the conclusions were surprising. I’d thought that weather forecasting was a hopeless case, for instance, but it turned out to be a huge success story. Meteorologists and professional gamblers basically emerge as the heroes of the book.


6. Rather controversially you say in the book “we can never make perfectly objective predictions. They will always be tainted by our subjective point of view.” How so?

Well, I think human beings are pretty darned smart. Our brains can store about three terabytes of information, which is just an enormous amount. And a three-year old is able to do a lot of things that a supercomputer can’t.

Still, three terabytes represents only about one one-millionth of the information that IBM says is now being produced in the world each day. We have to be terribly selective about the information that we choose to remember. That necessarily implies that we have a point of view – the set of facts that I have at my command won’t be the same as yours. And we have to make approximations, whether it’s in the form of our language, or the mathematical models that we design.

Even the things we take most for granted, like our sensory inputs (vision, hearing, etc.) rely heavily on making approximations about the objective world. We’re just taking in way more information out there than our brains can process.

So it’s absolutely delusional to think that any one of us has a monopoly on the truth – that our beliefs about the world aren’t flawed in any number of large and small ways. The book, in some ways, is about accepting our flaws, as well as recognizing the things that we’re good at.


7. What distinguishes those who are good at forecasting to those who seem to get it all wrong?

The whole book is an answer to that question. But here’s one big thing that weather forecasters and gamblers -- two of our success stories -- have in common. They

recognize that their knowledge of the world is imperfect. They express their predictions in terms of probabilities: there’s a 40 percent chance of rain tomorrow; there’s a 30 percent chance that I’ll catch a card to make a flush and win a huge poker hand.

This type of thinking turns out to be extremely important when it comes to sorting through the enormous amount of information that we encounter today.


8. What do you feel most accounted for the grand failure of forecasting the 2008 financial crisis?

A lot of things had to go wrong to create such a gigantic mess. The credit-rating agencies built models that assumed that the status of one’s person mortgage wasn’t much related to another’s: if a carpenter in Cleveland defaults on his mortgage, that has no effect on whether a dentist in Denver does. That assumption fails miserably when you have a massive housing bubble, and mortgages to go underwater all across the country.

What’s worse, people doubled-down rather than hedge their bets. For every dollar invested in purchasing a home, Wall Street was making about $50 in bets on the side. So a collapse in housing prices brought down the entire financial system. But it began with having a naïve trust in models that made exact-seeming predictions based on utterly flawed assumptions.


9. In 2008 you correctly predicted Barak Obama’s victory in 49 of 50 states as well as the winner of all 35 U.S. Senate races. In the four years since, has your methodology changed at all when it comes to politics? What’s toughest about political forecasting?

What’s tricky about presidential elections is simply that they don’t happen very often – just one every four years. And there are dozens of factors that go into determining the winner. It’s very much the opposite of something like baseball, where each team plays 162 games per season, but the structure of the game is pretty simple. In presidential elections, we’re really just making educated guesses about which factors determine the winners and losers.

With that said, my methodology is increasingly starting to incorporate some of those structural factors in addition to polling. The evidence is pretty clear that a bad economy, for instance, makes life challenging for the incumbent. If there’s a bad jobs report, but a good poll for Obama in Ohio, the jobs report is often the more important data point.

Let me protest, incidentally, that some of those forecasts were a bit fortunate in 2008. We had Al Franken with a 50.1 percent chance of winning the Senate race in Minnesota – basically a coin flip. The coin came up the right way for me after a prolonged recount there. But that’s basically just luck, so it’s no reason to rest on one’s laurels.


10. In the era of Big Data and the advent of faster, better technology, are we getting better at forecasting?

In certain circumstances, yes, the technology is helping -- like in baseball prediction or weather forecasting. But it shouldn’t be assumed that this is the default case.

Think of all the disasters that we’ve had in the new millennium – from the financial crisis to the September 11 attacks to the Japan earthquake (some seismologists thought we couldn’t possibly have so large an earthquake there). All of these involved some substantial failure of prediction, and all of these occurred in information-rich fields.

More information and better technology aren’t all that useful if we don’t know how to use it. And all that shiny the new technology can sometimes make us overconfident – we think we have mastery of a subject when we just don’t.


11. How is forecasting a natural phenomenon (weather, earthquakes) different from predicting human performance (financial markets, sports, politics)?

Many natural systems, including weather and earthquakes, are quite complex. But at least they aren’t changing much, at least on the scale of human lifespans. Slowly, if not always steadily, we’ve made progress in predicting natural phenomenon.

When it comes to systems that involve how humans interact with one another, however, they’re not just complex -- but also growing more complex all the time. There’s just no comparison between the globalized economy of today and the localized, agrarian economy of a couple of centuries ago. Seven billion human beings, who can catch a flight to anywhere in the world if they have the means, can spread the flu around the planet much faster they did a generation ago.

So we’re always running against a moving target. Prediction in these areas requires a somewhat more defensive posture: we have to prepare more for “unknown unknowns” as human beings keep finding new and ingenuous ways of interacting with one another.


12. You write about hedgehogs and foxes. Can you explain the difference? Which one are you? Obama? Romney?

These terms come from a quote attributed to the Greek poet Archilochus: "the fox knows many things, but the hedgehog knows one big thing". Foxes tend to be multidisciplinary and adaptable, always looking for little gains around the margin. They’re comfortable with probability and uncertainty. They’re a little more pragmatic.

Hedgehogs like to swing for the fences and seek out the big kill. They want to find some grand unifying principle that explains the world. But they can be stubborn and ideological as a result.

I’m sure that we need both these types of people in the world. But in the book, I cite research to suggest that the foxes are a lot better at making predictions. They’re more likely to know their limits, and less likely to mistake noise for a signal because it happens to fit some theory they’ve concocted.

Obama and Romney are both foxes, I think. I’m sure that some partisans will disagree, but I think they’re both quite pragmatic – perhaps unusually so for presidential candidates. Certainly as compared to George W. Bush. I’m a fox too.