[Jdm-society] an editorial on confidence/probability assessment in
national intelligence
David Budescu
dbudescu at cyrus.psych.uiuc.edu
Thu Mar 10 21:57:42 CST 2005
Editorial from the 2/20/05 Washington Post
What Percent Is 'Slam Dunk'?
By Michael Schrage
The controversial decision to reorganize America's sprawling intelligence
establishment has set in motion the most sweeping bureaucratic change
for sensors, spies and satellites since the end of World War II.
Unfortunately, the odds are excellent that this multibillion dollar
structural shuffle -- capped last week by the appointment of veteran
diplomat John Negroponte as the new national intelligence director --
will do little to improve the quality of intelligence analysis
for this country.
Why? Because America's intelligence community doesn't like odds. Yet
the simplest and most cost-effective innovation that community could
adopt would be to embrace them. It's time to require national security
analysts to assign numerical probabilities to their professional
estimates and assessments as both a matter of rigor and of record.
Policymakers can't weigh the risks associated with their decisions if
they can't see how confident analysts are in the evidence and
conclusions used to justify those decisions. The notion of imposing
intelligence accountability without intelligent counting -- without
numbers -- is a fool's errand.
World-class investment banks, insurance companies and public health
practitioners are increasingly bringing greater quantitative
sophistication to their risk analyses. For reasons having chiefly to
do with custom, culture and practice -- not competence or cost -- the
CIA, Defense Intelligence Agency, FBI and the federal government's
other analytic agencies have shied away from simple mathematical
tools that would let them better weigh conflicting evidence and data.
That bureaucratic shortsightedness undermines our ability to even see
the dots, let alone connect them.
Consider the National Intelligence Estimates, the Presidential Daily
Briefings or many of the critical classified and unclassified analyses
flowing through Washington's national security establishment. Key
estimates and analytic insights rarely come with explicit probabilities
attached. The nation's most knowledgeable experts on the Middle East,
counterterrorism, nuclear proliferation, etc., are seldom asked to
quantify, in writing, precisely how much confidence they have in their
evidence or their conclusions. Your personal financial planner does a
better job, on average, of quantitative risk assessment for your
investments than the typical intelligence analyst does for our
national security.
For example, when the State Department's Bureau of Intelligence and
Research "predicted" terrorism and insurgency in the wake of the
invasion of Iraq, its forecasts avoided explicit probabilities. But
precisely how confident were the bureau's experts in their assessments
of the breadth and intensity of the projected opposition? Did they
believe that there was a 60 percent or a 40 percent chance that Sunni
Triangle violence would spread north? Did they foresee a 20 percent or
a 75 percent chance that car bombings of Shiite mosques would provoke
widespread retaliation against Sunnis?
The cultural bias against numbers deprives congressional and White House
decision-makers of essential metrics to weigh the analytical
community's own credibility. Naming a national intelligence director
doesn't change that.
More than 40 years ago, Sherman Kent -- the godfather of the vital National
Intelligence Estimates and the man for whom the CIA's analyst school is
named -- penned a classified memo attempting to describe how vague
words like "probable" and "serious probability" could be translated
into meaningful numbers. His "Words of Estimative Probability" proved a
rhetorically awkward and ultimately futile exercise in encouraging
more disciplined discussions of probability in the analytic community.
Passive-aggressive organizational resistance to quantitative rigor
continues to this day. Former acting CIA director and longtime analyst
John McLaughlin tried to promote greater internal efforts at
assigning probabilities to intelligence assessments during the 1990s, but
they never took. Intelligence analysts "would rather use words than
numbers to describe how confident we are in our analysis," a senior CIA
officer who's served for more than 20 years told me. Moreover, "most
consumers of intelligence aren't particularly sophisticated when it
comes to probabilistic analysis. They like words and pictures, too.
My experience is that [they] prefer briefings that don't center on
numerical calculation. That's not to say we can't do it, but there's
really not that much demand for it."
That doesn't mean it shouldn't happen. Fortunately, there's no need
for a dramatic revolution; subtler measures will do. Here's a
suggestion: The simplest, easiest, cheapest and most powerful way to
transform the quality of intelligence would be to insist that
analysts attach two little numbers to every report they file.
The first number would state their confidence in the quality of the
evidence they've used for their analysis: 0.1 would be the lowest level
of personal/professional confidence; 1.0 would be -- former CIA
director George Tenet should pardon the expression -- a "slam dunk,"
an absolute certainty.
The second number would represent the analyst's own confidence in his
or her conclusions. Is the analyst 0.5 -- the "courage of a coin toss"
confident -- or a bolder 0.75 confident in his or her analysis? Or is
the evidence and environment so befogged with uncertainty that the
best analysts can offer the National Security Council is a 0.3 level of
confidence?
These two little numbers would provoke intelligence analysts and
intelligence consumers alike to think extra hard about analytical
quality, creativity and accountability. Policymakers could swiftly
determine where their analysts had both the greatest -- and the least --
confidence in their data and conclusions. Decision-makers could quickly
assess where "high confidence" interpretations were based on
"low-confidence" evidence and vice versa. That's important information
for decision-makers to have. Then their ability to push, prod and poke
the intelligence community would be firmly grounded in their own
perception of the strength and weakness of the work coming out of it.
Seeing the areas in which top analysts consistently rate their
confidence in evidence below a 0.5 might evoke new thinking from the
covert operations and "sigint" crowds in Langley and Fort Meade as to
what data they should be procuring. More significantly, these two
numbers would build a record -- an ongoing audit trail of probabilities
and odds -- to revisit and review.
Pushing analysts to weight their intelligence assessments creates a less
ambiguous standard of accountability, which might explain why the
consensus in the analytical community is to avoid disclosing their
odds. But House and Senate intelligence committees seeking greater
accountability and better quality from the newly reorganized
intelligence bureaucracies should insist that analyses brought in for
congressional review -- classified or not, publicly disclosed or not --
include confidence rankings.
Yes, analysts and their agencies will attempt to "game" the numbers. Yes,
policymakers will apply political pressure for analysts and agencies to
alter their declared odds. All risk-assessment methods are
corruptible. But these mechanisms can self-correct. Too many risk analyses
that hover uselessly below 0.5 or provide too few assessments of 0.7 and
higher that later turn out to be accurate tend to create their own
pressures for fundamental change. Better accountability promotes better
analysis. And better analysis comes from the explicit explanations and
conversations around probability and risk.
But even greater analytical accountability isn't good enough. A
growing number of fields ranging from medical diagnostics to Internet
spam filtering, for example, increasingly rely upon Bayesian analysis --
a probability theory that predicts the likelihood of future events
based on knowledge of prior events -- as a powerful tool to weigh new
evidence. Bruce Blair, director of the non-partisan Center for Defense
Information, argues convincingly on the CDI Web site that Bayesian
analysis goes a long way toward explaining the seemingly flawed risk
assessments made by the intelligence community during the run-up to
9/11 and the Iraq war. Nonetheless, although the CIA is familiar with
Bayesian analysis and its computational cousins, these techniques
haven't seeped into the national security community's analytical
mainstream.
Medical doctors and Wall Street traders today do a better job of challenging
themselves to explore the growing diversity of analytic options. Even
Major League Baseball teams, as Michael Lewis documents in his
best-selling book "Moneyball," are grasping that data-driven analyses
can lead to better talent acquisition and management decisions. Why
should professional baseball executives be doing more innovative
statistical analyses than professional intelligence analysts?
Mathematics is not a substitute for judgment, nor do equations define
analysis. But analyzing risk without probabilities is akin to
discussing art without colors. You can do it, but don't be surprised at
the sterility of the results. Unfortunately, with evidence I'd weight
at 0.8 and a conclusion to which I'd assign a confidence rating of 1.0,
I predict that until the intelligence community overcomes its
reluctance to go the probability route, it will continue to compromise
its ability to adequately assess national security risks and threats.
Author's e-mail: schrage at media.mit.edu
Michael Schrage is a senior adviser to MIT's Security Studies program. He has participated in non-classified CIA workshops on intelligence analysis.
Would you like to send this article to a friend? Go to
http://www.washingtonpost.com/ac2/wp-dyn/admin/emailfriend?contentId=A37115-2005Feb19&sent=no&referrer=emailarticle
More information about the Jdm-society
mailing list