In 2003, the United States chose to invade Iraq. Most now agree this decision was deeply flawed, costing trillions of dollars and hundreds of thousands of lives.

Exactly what went wrong here is a contested and controversial issue. At best, the decision-making process severely lacked rigour, and at worst, it was heavily biased.

The government justified the invasion by appealing to the intelligence community’s claim that it was “highly probable” that Iraq possessed weapons of mass destruction (WMD) — but this statement was ambiguous. Policymakers took it to indicate near-100% certainty, and made decisions accordingly.1 But “highly probable” could just as easily be interpreted as 80% certainty, or 70% — with very different practical implications. Those involved didn’t really think through the relevant probabilities, consider how likely the estimates were to be wrong, or weigh up the implications if they were.

Others have suggested that the US had already decided to invade Iraq, and that this decision influenced intelligence collection — not the other way around. This is a particularly extreme example of what’s known as motivated reasoning — a tendency to reason in ways that support whatever conclusion one wants to be true.

The call to invade Iraq hinged on the subjective impressions of a few key people with complex motivations — impressions that later turned out to be wrong.

What if you could help prevent similar mistakes in the future?

When we think about doing good in the world, we usually think about solving specific problems, and doing so better than existing institutions and organisations. But you could also improve the world in a different way: by making it easier for key institutions and decision-makers to learn about the world and solve problems. This might involve helping people have better ‘epistemics’ — ways of gathering information and using it in reasoning — for example, by helping them avoid common thinking errors, better evaluate expertise, or make more accurate predictions. It might also mean finding ways to change the incentives of big organisations to make it easier to do all these things.

One advantage of this approach is that, if successful, it could enable humanity to better tackle many different problems — including those we haven’t even noticed yet.

The key here is to figure out which ways of improving epistemics and decision-making — e.g. which actors or institutions to focus on, which decisions to focus on, and which aspects of decision-making — are most important. There is, after all, a lot of work that broadly aims at ‘improving decisions’ — consulting, psychology, education, and arguably much of science generally. But there do seem to be other important and neglected ways we might be able to improve decision-making in key situations, such as improving long-range forecasting and aggregating expert opinions, as well as making these resources available to decision-makers. There may also be other especially effective ways of improving decision-making that we’re not aware of yet.

In this profile, we cover some of the different ways to help important institutions have a much greater positive impact through improved decision-making.

Summary

Governments and other important institutions frequently have to make complex, high-stakes decisions, often based on the judgement calls of just a handful of people.

There’s reason to believe that human judgements can be flawed in a number of ways, but can be substantially improved using more systematic processes and techniques. Improving the quality of decision-making in high-stakes situations — which often take place in important institutions — could improve our ability to solve many other problems. It could also improve society’s ability to identify ‘unknown unknowns’ — problems we haven’t even thought of yet — and to mitigate all global catastrophic risks, which we believe are extremely important.

This seems like a very promising career option if you have a strong personal fit for the kind of research required to develop new ways of improving decision-making, or if you’re well-placed to work in influential institutions and test out what we already know. It’s also a good option if you’re currently unsure about what specific problems are most pressing, since improved decision-making can be applied to almost any area.

Our overall view

Recommended

Working on this issue seems to be among the best ways we know of to improve the long-term future, but all else equal, we think it’s less pressing than our highest priority areas.

Scale  

We think work to improve decision-making — especially within powerful institutions and at pivotal times — could have a large positive impact, by potentially leading to much more effective allocation of resources by governments, faster progress on some of the world’s most pressing problems, reduced risks from emerging technologies, and reduced risks of conflict or global catastrophe. We estimate that making important institutional decision-making near-optimal (within the bounds of human ability) would substantially increase the expected value of the future — perhaps by between 0.1% and 1%.

Neglectedness  

Parts of this issue seem moderately neglected. Current spending is unknown. For the sorts of interventions we’re most excited about, there are ~100–1,000 people working on them full-time, depending on how you count. A much, much larger number of researchers and consultancies work on improving decision-making broadly (e.g. by helping companies hire better), but relatively few focus on robustly testing the most promising techniques, or implementing proven strategies in the most important areas.

Solvability  

Making progress on improving epistemics and decision-making in high-stakes situations seems moderately tractable. There are already techniques that we have strong evidence can improve decision-making, and past track records suggest more research funding directed to the best researchers in this area could yield additional insights quite quickly. However, it’s currently unclear how easy it will be to get improved decision-making practices implemented in crucial institutions, and this second step could turn out to be a large challenge. We’d guess that doubling the effort directed toward optimising institutional decision-making would take us around 1% of the way there.

Profile depth

Shallow 

This is one of many profiles we've written to help people find the most pressing problems they can solve with their careers. Learn more about how we compare different problems, see how we try to score them numerically, and see how this problem compares to the others we've considered so far.

What is this problem?

Our ability to solve problems in the world relies heavily on our ability to understand it and make high-quality decisions. We need to be able to identify what problems to work on, to understand what factors contribute towards these problems, to predict which of our actions will have the desired outcomes, and to respond to feedback and change our minds.

There’s plenty of reason to think that our decision-making competence is currently less than perfect, as psychology research over the past few decades has documented a whole host of cognitive biases affecting judgements and decisions.

For example, when we try to judge our chances of success, we focus too much on all the reasons why our case will be different from average: despite the fact most startups fail, most prospective entrepreneurs are convinced they will be the unusual case that succeeds.2

We’re also often overconfident in our predictions3 — it’s been argued that unwarranted confidence contributed to the explosion of the space shuttle Challenger, for example, where NASA overruled the safety concerns expressed by an engineer.4

Many of the most important problems in the world are incredibly complicated, requiring an understanding of complex interrelated systems, an ability to predict the outcome of different actions, and an ability to balance competing considerations. That means there’s all the more room for errors in judgement to slip in. Even experts in political forecasting often do worse than simple actuarial models when estimating the probabilities of events up to five years into the future.5 The organisations best placed to solve the world’s most important problems are also often highly bureaucratic, meaning that decision-makers face many constraints and competing incentives, not always aligned with better decision-making.

We think that improving the decision-making competence of key institutions may be particularly crucial, as the risks we face as a society are substantial, and institutions often have a large role in managing the risks.

With technological developments in nuclear weapons, autonomous weapons, bioengineering, and artificial intelligence, our destructive power is quickly increasing. Crises resulting from war, malicious actors, or even accidents could claim billions of lives or more.6

It’s not clear that individual decision-making, or the structure of key institutions, has evolved at anything near the pace needed to manage these potential crises — our institutions and decision-making processes look pretty similar to those that failed in the First and Second World Wars, and yet the worst-case scenarios they need to mitigate are several orders of magnitude larger.

But there’s some good news. Researchers are studying how to improve our ability to make predictions about the future, how to better think probabilistically, and how to think about complex problems in a more structured way.

It seems like developing and applying strategies that improve human judgement and decision-making could be very valuable — especially if focused on institutions working on particularly important problems, and combined with a thorough understanding of how such institutions operate.

Why work on this problem?

Improving decision-making could help us to solve many other problems

Improving humanity’s ability to gather information, as well as our judgement and decision-making, improves our ability to solve almost all other problems. It is thus an extremely robust and high-leverage way of solving problems. Some of the most important revolutions in decision-making include the inventions of the scientific method, probability theory, and Bayesian reasoning.

But these tools — as well as more modest but still impactful innovations such as better forecasting methods — aren’t always at the fingertips of key decision-makers, like those high up in governmental, corporate, or philanthropic institutions. Developing reasoning tools, making them available, and promoting their use in a variety of settings seems like it could help with a large range of issues.

This also means it’s a particularly good approach to focus on if you’re uncertain about what specific problems are most important to work on.

There are several ways to make progress on improving decision-making

Good Judgment Inc runs prediction contests where participants compete to forecast the future.

Research so far has made some progress identifying techniques that reliably improve judgements and decision-making — and at least where there is good evidence, there does seem to be growing interest in getting these techniques implemented in practice.

Philip Tetlock’s work on forecasting,7 for example, has identified a number of ways to improve the accuracy of predictions on real-world events, which have been tested in large-scale randomised controlled trials (RCTs). A separate research programme on prediction markets suggests that getting people to ‘bet’ on their predictions can increase the accuracy of political forecasts, corporate predictions, and statistical weather forecasts.8
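
To make ‘accuracy’ concrete: forecasting tournaments like these typically score each prediction with a proper scoring rule such as the Brier score, the squared gap between the stated probability and what actually happened (lower is better). Below is a minimal Python sketch using invented forecasts; it isn’t the Good Judgment Project’s exact scoring pipeline, just an illustration of the idea.

```python
def brier_score(probability: float, outcome: int) -> float:
    """Squared error between a stated probability and the outcome
    (1 = the event happened, 0 = it didn't). Lower is better;
    always answering 50% scores 0.25."""
    return (probability - outcome) ** 2

# Invented forecasts from two forecasters on the same three events.
outcomes = [1, 0, 1]               # what actually happened
forecaster_a = [0.9, 0.2, 0.7]     # confident and usually right
forecaster_b = [0.6, 0.5, 0.55]    # hedges everything near 50%

for name, forecasts in [("A", forecaster_a), ("B", forecaster_b)]:
    mean_score = sum(brier_score(p, o) for p, o in zip(forecasts, outcomes)) / len(outcomes)
    print(f"Forecaster {name}: mean Brier score = {mean_score:.3f}")
```

On these made-up numbers, forecaster A averages roughly 0.05 and forecaster B roughly 0.20, which is how tournaments can rank forecasters (and compare techniques) objectively once events resolve.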

(Incidentally, we interviewed Tetlock about his research, when to trust experts, and his career advice for people working on improving institutional decision-making on our podcast.)

With increasingly solid evidence for these effects, getting them implemented in practice seems achievable — there is already interest in forecasting techniques in the intelligence community, for example.9 One promising next step here would be to take findings that we have solid evidence for, and run smaller-scale pilots in specific organisations or parts of government.

For instance, people’s judgements can be badly overconfident, leading to poorly informed decisions — unjustified confidence in Iraq’s possession of WMDs had dramatic consequences, as we’ve seen. Part of the problem here is that people aren’t very well calibrated — our brains don’t seem to have a very good intuitive sense of what it means to say something is, say, “70% likely” — and as a consequence, the statements we make with 70% confidence turn out to be true much less than 70% of the time.
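
To illustrate what calibration means in practice, here is a small Python sketch (with invented judgements) that buckets statements by stated confidence and checks how often they turned out to be true, the kind of feedback loop that calibration training is built around.

```python
from collections import defaultdict

# Invented data: (stated confidence, whether the claim turned out to be true)
judgements = [
    (0.7, True), (0.7, False), (0.7, False), (0.7, True), (0.7, False),
    (0.9, True), (0.9, True), (0.9, False), (0.9, True),
]

by_confidence = defaultdict(list)
for confidence, correct in judgements:
    by_confidence[confidence].append(correct)

for confidence in sorted(by_confidence):
    results = by_confidence[confidence]
    hit_rate = sum(results) / len(results)
    print(f"Stated {confidence:.0%} confident -> right {hit_rate:.0%} of the time "
          f"({len(results)} judgements)")
```

A well-calibrated person’s 70% statements would come out true roughly 70% of the time; in this invented example they are true only 40% of the time, which is the overconfidence pattern described above.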

But there’s some evidence that it’s possible to improve people’s calibration through training. Hubbard Decision Research has trained over 1,000 people in calibration and found that 80% of participants were ideally calibrated after five exercises.10

More research here — particularly larger-scale RCTs, and studies that look at the effects of calibration training on real-world judgements (rather than just trivia questions) — could help strengthen the case for implementing these techniques in practice.

Other techniques that might be promising include structured analytic techniques (SATs)11 for reducing biases in judgement, and the Delphi method12 for aggregating different viewpoints — we discuss these and other promising approaches in more detail later.

Work in this area is relatively neglected

Work on ‘improving decision-making’ very broadly isn’t all that neglected. There are a lot of people, in both industry and academia, trying out different techniques to improve decision-making. The British government spends a lot of money on management consulting13, for example, and there are researchers working on questions related to decision-making at all major universities across a range of different disciplines — including psychology, economics, business, marketing, and political science.14

However, there seems to be very little work focused on rigorously testing different techniques to get strong evidence of effectiveness, or putting the best proven techniques into practice in the most influential institutions.

For example, there has been a lot of work trying out ‘scenario analysis’ techniques, with thousands of papers discussing various methods of correcting for overconfidence by considering a wider variety of possible scenarios. But it wasn’t until 2005 that someone published an experiment actually testing whether scenario analysis improved prediction accuracy — and as it turned out, it didn’t.15 This illustrates how, without good evidence on the effectiveness of different techniques, a lot of well-intentioned effort on ‘improving decision-making’ might be wasted.

Similarly, though a lot of effort goes into gathering knowledge generally, much less goes into figuring out the best ways to gather, aggregate, and use knowledge.

Improving epistemics and decision-making also seems more neglected than other ways of trying to ‘improve the system,’ such as general science and education, which suggests that additional work here may have higher returns. People often argue for investing in science, education, or certain kinds of political reform for similar reasons to those we’ve given here: because these things will help us better tackle all kinds of problems.

For instance, the US government spends around 4.6% of GDP on education ($800 billion),16 and in a survey of the top 100 US foundations by GiveWell, US education accounted for 15% of spending, beaten only by healthcare.17 By contrast, there doesn’t seem to be much government funding or charitable effort explicitly directed at improving institutional decision-making processes in the ways we’ve discussed. And, despite the potential importance of artificial intelligence in the 21st century, we could only identify a handful of people working on systematic methods to forecast its speed of development and likely impacts (we interviewed one of those researchers on our podcast).

What’s more, there’s reason to think that focusing on institutions directly might be a more effective way to improve decision-making than a broad approach to improved education, as it targets a smaller set of people who already have a lot of influence, and focuses more on institutional processes which can often have a big impact on how high-stakes decisions actually get made.

It’s worth considering why this work hasn’t received a lot of attention so far, and whether it might be neglected for good reason. One such reason might be if it really is incredibly difficult to innovate in this area or get key actors and large institutions to adopt new practices — we discuss some of these potential barriers in the next section. In addition, the kinds of large-scale controlled trials we need to conduct to rigorously test techniques can be expensive and time-consuming.

What are the major arguments against this being pressing?

It’s difficult to get change in practice

Perhaps the main concern with this area is that it’s not clear how easy it is to actually get better decision-making strategies implemented in practice — especially in bureaucratic organisations, and where incentives are not geared towards accuracy.

It’s often hard to get groups to implement practices that are costly to them in the short term — requiring training, resources, and changes from the status quo — and only promise abstract or long-term benefits. For example, there’s been some resistance to getting prediction markets implemented in practice, as running prediction markets with ‘real money’ is probably an illegal form of gambling (and even if you could get around this, it’s legally complex).18

However, these problems may be surmountable if we can find ways to show decision-makers that the techniques will help them achieve the objectives they care about.

You might also be able to help shift decision-making practices by setting up institutional incentives that favour deliberation and truth-seeking.

But the difficulty of getting change in practice also means it seems especially valuable for people thinking about this issue to develop an in-depth understanding of how important groups and institutions operate, and the kinds of incentives and barriers they face. It seems plausible that overcoming bureaucratic barriers to better decision-making may be even more important than developing better techniques.

You might want to work on a more specific problem

Suppose you think that climate change is the most important problem in the world today. You might believe that a huge part of why we’re failing to tackle climate change effectively is that people have a bias towards working on concrete, near-term problems over those that are more likely to affect future generations. And so you might consider doing research on how to overcome this bias, with the hope that you could make important institutions more likely to tackle climate change.

However, if you think the threat of climate change is especially pressing compared to other problems, this might not be the best way for you to make a difference — even if you discover something useful about reducing the bias to work on immediate problems, it might be very hard to get that implemented in a way that’s going to directly make a difference to climate change.

Instead, it’s likely better to focus your efforts on climate change more directly — for example, by working for a think tank doing research into the most effective ways to cut carbon emissions. In general, more direct interventions seem more likely to move the needle.

That said, if you can’t implement solutions to a problem without improving the decision-making processes involved, it may be a necessary step.

The advantage of broad interventions like improving decision-making is that they can be applied to a wide range of problems. The disadvantage of working in this area is that it might be harder to target your efforts towards a specific problem. So if you think one specific problem is significantly more urgent than others, and you have an opportunity to work on that problem more directly, then it is likely more effective to do the direct work.

There might be better ways to improve our ability to solve the world’s problems

One of the biggest arguments for working in this area is that if you can improve the productivity or judgement of people working on important problems, then this increases the effectiveness of everything they do to solve those problems.

But you might think there are better ways to increase the speed or effectiveness of work on the world’s most important problems.

For example, perhaps the biggest bottleneck on solving the world’s problems isn’t poor decision-making, but simply lack of information: people may not be working on the world’s biggest problems because they’re lacking crucial information about what those problems are. Being more rational won’t help them if they don’t have that information.

A lot of work on promoting effective altruism might fall in this category: giving people better information about the effectiveness of different causes, interventions, and careers.

What can you do in this area?

We can think of work in this area as falling into several broad categories:

  1. More rigorously testing existing techniques that seem promising.
  2. Doing more fundamental research to identify new techniques.
  3. Fostering adoption of the best proven techniques in high-impact areas.
  4. Directing more funding towards all of the above.

All of these areas seem pressing, and all seem to have room for immediate progress (we already know enough to start trying to implement better techniques, but stronger evidence will make adoption easier, for example). This means that which area to focus on will depend quite a lot on your personal fit and opportunities — we discuss each in more detail below.

1. More rigorously testing existing techniques that seem promising

Prof Tetlock’s work has identified thinking styles that lead to significantly more accurate predictions, as detailed in his book, Superforecasting.

The idea here would be to take techniques that seem promising, but haven’t been rigorously tested yet, and try to get stronger evidence of where and whether they are effective. Some techniques or areas of research that fall into this category:

  • Calibration training — improving the accuracy of probability judgements — has a reasonable amount of evidence suggesting it is effective. However, most calibration training focuses on trivia questions — testing whether this training actually improves judgement in real-world scenarios could be promising, and could help to get these techniques applied more widely.
  • Structured Analytic Techniques (SATs) are a number of techniques developed for reducing cognitive biases in intelligence analysis. Examples of SATs include checking key assumptions and challenging consensus views. These seem to be grounded in an understanding of the psychological literature, but few have been tested rigorously (i.e. with a control group and looking at the impact on accuracy). It could be useful to select some of the techniques that look most promising, and try to test which are actually effective at improving real-world judgements.19 It might be particularly interesting and useful to directly pit some of these techniques against each other and compare their levels of success.
  • Methods of aggregating expert judgements, including Roger Cooke’s classical model for structured expert judgement (which scores different judgements according to their accuracy and informativeness and then uses these scores to combine them),20 and the Delphi method (a method for building consensus in groups by using multiple iterations of questions to collect data from different group members). A toy illustration of performance-weighted aggregation appears in the sketch just after this list.
  • Better expert surveying in important areas — for example, on the model of the IGM Economic Experts Panel.
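
To give a flavour of the aggregation methods mentioned above, here is a toy Python sketch of performance weighting: experts who scored better on earlier ‘seed’ questions with known answers get more weight on a new question. The data and the weighting scheme are invented for illustration (Cooke’s classical model computes calibration and information scores quite differently), so treat this as a sketch of the general approach rather than an implementation of any particular method.

```python
# Toy performance-weighted aggregation (invented data and weighting scheme).

# Mean Brier scores on past "seed" questions with known answers (lower = better).
past_scores = {"expert_1": 0.05, "expert_2": 0.20, "expert_3": 0.35}

# Each expert's probability for the new question of interest.
new_forecasts = {"expert_1": 0.80, "expert_2": 0.60, "expert_3": 0.30}

# Convert past performance into normalised weights: better score -> more weight.
raw_weights = {name: 1.0 / score for name, score in past_scores.items()}
total = sum(raw_weights.values())
weights = {name: w / total for name, w in raw_weights.items()}

# Weighted average of the experts' probabilities on the new question.
aggregate = sum(weights[name] * new_forecasts[name] for name in new_forecasts)

print(f"Performance-weighted aggregate forecast: {aggregate:.2f}")
for name in sorted(weights):
    print(f"  {name}: weight {weights[name]:.2f}, forecast {new_forecasts[name]:.2f}")
```

On these invented numbers, an unweighted average of the three forecasts would be about 0.57, while weighting by track record pulls the aggregate up to roughly 0.71, reflecting the historically most accurate expert’s higher forecast.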

Academic researchers and groups working in this area:

There may also be some non-academic organisations with funding for, and interest in running, more rigorous tests of known decision-making techniques:

  • Intelligence Advanced Research Projects Activity (IARPA) is probably the biggest funder of research in this area right now, especially with a focus on improving high-level decisions.
  • Consultancies with a behavioural science focus, such as the Behavioural Insights Team, may also have funding and interest in doing this kind of research. These organisations generally focus on improving lots of small decisions, rather than on improving the quality of a few very important decisions, but they may do some work on the latter.22

2. Doing more fundamental research to identify new techniques

You could also try to do more fundamental research: developing new techniques and approaches to improved epistemics and decision-making, and then testing them. This is more pressing if you don’t think the existing techniques are very good.

One example of an open question in this area is: how do we judge ‘good reasoning’ when we don’t have objective answers to a question? That is, when we can’t just judge answers or contributions based on whether they lead to accurate predictions or answers we know to be true?23 Two examples of current research programmes related to this question are IARPA’s Crowdsourcing Evidence, Argumentation, Thinking and Evaluation (CREATE) programme and Philip Tetlock’s Making Conversations Smarter, Faster (MCSF) project. You could try to get involved with one of the teams working on these projects.

The academics and institutions listed above might also be promising places to work if you’re interested in developing new decision-making techniques.

3. Fostering adoption of the best proven techniques in high-impact areas

The website FiveThirtyEight.com has popularised data-driven forecasting methods.

Alternatively, you could focus more on implementing those techniques we currently think are most likely to improve collective decision-making (such as the research on forecasting by Tetlock, prediction markets, or calibration training).24 If you think one specific problem is particularly important, you might prefer to focus on the implementation of techniques (rather than developing new ones), as this is easier to target towards specific areas.

As mentioned above, a large part of ‘fostering adoption’ might first require better understanding the practical constraints and incentives of different groups working on important problems, in order to understand what changes are likely to be feasible. For this reason, working in any of the organisations or groups listed below — with the aim of better understanding the barriers they face — might be valuable, even if you don’t expect to be in a position to change decision-making practices immediately.

These efforts might be particularly impactful if focused on organisations that control a large number of resources, or organisations working on important problems. Here are some examples of specific places that might be good to work if you want to do this:

  • Any major government, perhaps especially in the following areas: (1) intelligence/national security and foreign policy; (2) parts of government working on technology policy (the Government Office for Science in the UK, for example, or the Office of Science and Technology Policy in the US); (3) development policy (DFID in the UK, USAID in the US); or (4) defence (the Department of Defense in the US, or the Ministry of Defence in the UK). Many parts of government are also now hiring for ‘behavioural science’ specialist roles,25 which might be a good option if you’re qualified for them (which generally means a master’s or PhD in psychology or a related discipline, and ideally some policy experience).
  • Organisations that direct large amounts of money towards solving important world problems — such as foundations or grants agencies like the Gates Foundation or Open Philanthropy. See also our profile on working as a foundation grantmaker.
  • Think tanks and academic institutions focused on questions about the long-run future of humanity, which require making judgements about speculative long-term scenarios, such as the Future of Humanity Institute in Oxford and the Centre for the Study of Existential Risk in Cambridge.

You could also try to test and implement improved decision-making techniques in a range of organisations as a consultant. Some specific organisations where you might be able to do this, or at least build up relevant experience, include:

  • Good Judgment is an organisation founded on the basis of Tetlock’s successful research project on forecasting, and now runs training for individuals and organisations to apply these findings to improve predictions.
  • Working at a specialist ‘behavioural science’ consultancy, such as the Behavioural Insights Team or Ideas42. Some successful academics in this field have also set up smaller consultancies — such as Hubbard Decision Research (which has worked extensively on calibration training).26
  • HyperMind, an organisation focused on wider adoption of prediction markets.
  • Going into more general consultancy, with the aim of trying to specialise in behavioural science or decision-making — see our profile on management consulting for more details.

Finally, you could also try to advocate for the adoption of better practices across government and organisations, or for improved decision-making more generally — if you think you can get a good platform for doing so — working as a journalist, speaker, or perhaps an academic in this area.

Julia Galef is a good example of someone who has followed this kind of path. Julia worked as a freelance journalist before cofounding the Center for Applied Rationality. She co-hosts the podcast Rationally Speaking and has a YouTube channel with hundreds of thousands of followers. She’s also written a book on improving your own judgement, while running the Update Project, which focuses on helping decision-makers improve their models of the world and resolve disagreements more productively. More broadly, she’s aiming to build an intellectual community of influential people across a range of different fields, who are genuinely trying to be truth-seeking and resolve disagreements in a productive way. You can learn more about Julia’s career path by checking out our interview with her.

4. Directing more funding towards research in this area

One challenge for all of the above areas is that it may be difficult to get funding for the kinds of work and research involved. Therefore, another approach might be to move a step backwards in the chain and try to direct more funding towards work in all of the aforementioned areas: developing, testing, and implementing better decision-making strategies.

The main place we know of that seems particularly interested in directing more funding towards improving decision-making research is IARPA in the US.27 Becoming a programme manager at IARPA — if you’re a good fit and have ideas about areas of research that could do with more funding — is therefore a very promising opportunity. There’s also some chance Open Philanthropy will invest more time or funds in exploring this area (they have previously funded some of Tetlock’s work on forecasting). Otherwise you could try to work at any other large foundation with an interest in funding scientific research where there might be room to direct funds towards this area.

If you work at any of the organisations listed above, you might also try to advocate for more funds to be directed towards testing and implementing improved decision-making practices, even if you’re not in a position to do the work yourself.

If you don’t have the relevant background to do research or implement better practices yourself, but you think this is important and you’re in (or think you could work up to) an influential position in an important organisation, then you might be able to allocate more funding towards improving decision-making practices where you work (e.g. by funding tests or hiring someone who would be able to do this work). If you’re in a position to do this, this could be even higher impact than working somewhere like IARPA, where there already seems to be a lot of motivation to direct funding towards these problems.

We don’t currently think that there are many great direct donation opportunities in this area, so this probably isn’t the best way to have an impact — at least not for relatively minor donations. If you’re a larger donor, though, you might consider funding academics to do the sort of research outlined in points 1 and 2, or even trying to set up an organisation to conduct and/or fund more of this kind of research.

Learn more

Get free, one-on-one career advice

We’ve helped hundreds of people compare their options, get introductions, and get jobs important for the long-run future. Find out if our coaching can help you.

Read more

Notes and references

  1. “The intelligence community stated it was highly probable that WMD programs existed in Iraq — statements interpreted as 100% certainty by policymakers such as President Bush.” Chang et al (2016). Developing expert political judgement: The impact of training and practice on judgmental accuracy in geopolitical forecasting tournaments. Judgement and Decision Making, 11, 5

  2. Moore, D., Oesch, J., Zietsma, C. (2007). What Competition? Myopic Self-Focus in Market-Entry Decisions. Organization Science. 18, 3, pp. 440-454

  3. Moore, D. A., Tenney, E. R., & Haran, U. (2015). Overprecision in judgment. The Wiley Blackwell handbook of judgment and decision making, 182-209. “We review some of the evidence on overprecision in beliefs. This evidence comes from the lab and the field, from professionals and novices, with consequences ranging from the trivial to the tragic. The evidence reveals individuals’ judgments to be overly precise—they are too sure they know the truth.”

  4. “What really happened was typical I think in large bureaucratic organizations, and any big organization where you’re frankly trying to be a hero in doing your job. And NASA had two strikes against it from the start, which one of those is they were too successful. They had gotten [sic] by for a quarter of a century now and had never lost a single person going into space, which was considered a very hazardous thing to do. And they had rescued the Apollo 13 halfway to the moon when part of the vehicle blew up. Seemed like it was an impossible task, but they did it. […] it gives you a little bit of arrogance you shouldn’t have. And a huge amount of money [was] involved. But they hadn’t stumbled yet and they just pressed on. So you really had to quote “prove that it would fail” and nobody could do that.” Shuttle Challenger: Does overconfidence impede decision making?

  5. “In political forecasting, Tetlock (2005) asked professionals to estimate the probabilities of events up to 5 years into the future – from the standpoint of 1988. Would there be a nonviolent end to apartheid in South Africa? Would Gorbachev be ousted in a coup? Would the United States go to war in the Persian Gulf? Experts were frequently hard-pressed to beat simple actuarial models or even chance baselines (see also Green and Armstrong, 2007).” Mellers et al. (2015). The Psychology of Intelligence Analysis: Drivers of Prediction Accuracy in World Politics. Journal of Experimental Psychology: Applied, 21, 1, pp. 1-14

  6. In particular, the development of nuclear weapons means that we now have the ability to kill millions or perhaps even billions with one decision.

  7. Tetlock’s book Superforecasting outlines the lessons learned from the Good Judgment Project, where different teams of people made predictions about real-world events, and randomised controlled trials were used to identify some of the most effective prediction methods. The most accurate 2% of predictors were dubbed “superforecasters.” When superforecasters were grouped together into teams, it’s been claimed that their predictions were more accurate than predictions made by professional intelligence analysts with access to classified information.

  8. Arrow, K. J., et al (2008). The Promise of Prediction Markets. Science Magazine.

  9. The Good Judgment Project and much of Tetlock’s other research on forecasting was funded by the Intelligence Advanced Research Projects Activity (IARPA) as part of the Aggregative Contingent Estimation (ACE) programme.

  10. Hubbard, D., Seiersen, R., and Geer, D. (2016). How to Measure Anything in Cybersecurity Risk.

  11. A Tradecraft Primer: Structured Analytic Techniques for Improving Intelligence Analysis. Prepared by the US Government.

  12. Helmer-Hirschberg, O. Analysis of the Future: The Delphi Method. Santa Monica, CA: RAND Corporation, 1967.

  13. While management consultancies typically have a lot of experience with ‘business strategy’ — improving the strategy, structure, management, and operations of an organisation — they’re not typically experts in the errors and biases of human judgement, or in finding ways to overcome them. See What does a management consultant do, exactly? in The Guardian (2013).

  14. This research tends to focus on developing more accurate descriptive accounts of human decision-making, while prescriptive accounts focusing on how judgement could be improved are relatively rare within social science research (though this is gradually changing). “If we want to influence business and government, we must have useful advice to share. Yet most researchers in the other social sciences offer only descriptive research… As a graduate student in the late 1970s, I was trained to be descriptive, prescription was for consultants, not for serious researchers.” Bazerman, M. (2005). Conducting Influential Research: The Need for Prescriptive Implications. The Academy of Management Review, 30, 1, pp. 25-31

  15. “Scenario exercises are promoted in the political and business worlds as correctives to dogmatism and overconfidence. And by this point in the book, the need for such correctives should not be in question. But the scenario experiments show that scenario exercises are not cure-alls. Indeed, the experiments give us grounds for fearing that such exercises will often fail to open the minds of the inclined-to-be-closed-minded hedgehogs but succeed in confusing the already-inclined-to-be-open-minded foxes—confusing foxes so much that their open-mindedness starts to look like credulousness.” Tetlock, P. E. (2017). Expert political judgment: How good is it? How can we know?. Princeton University Press. Chapter 7.

  16. Source: OECD

  17. Source: GiveWell

  18. See Are prediction markets an illegal form of gambling?

  19. The biggest existing programme testing the effectiveness of different SATs is the CREATE program, funded by IARPA.

  20. Source

  21. Source

  22. These organisations currently focus mostly on using behavioural science to ‘nudge’ people or inform public policy interventions — as opposed to actually informing the way decisions are made in public policy — but there may well be increasing interest in the latter.

  23. Many of the most important kinds of decisions policymakers have to make aren’t questions with clear, objective answers, and we don’t currently have very good ways to judge the quality of reasoning in these cases. Effective solutions to this seem like they could be very high impact, but we don’t currently know whether this is possible or what they would look like.

  24. Note there’s some overlap here with point 1 above (more rigorously testing existing techniques) since part of what’s required to foster adoption will be providing people with evidence that they work! So going to work at a more ‘applied’ organisation with an interest in rigorous evaluation might provide an opportunity to work on both getting better evidence for existing techniques, and getting them implemented.

  25. Besides the Behavioural Insights Team who work across government departments (in both the UK and the US), there are behavioural science teams in many government departments in the UK (including the Department for Education, Department for Transport, and the intelligence agencies), and in the White House.

  26. It’s worth noting, though, that many of these consultancies work mostly with corporate clients, and so this might not be the best opportunity for immediate impact — but it might be a good way to test out improved decision-making strategies in different environments, from which we could learn about how to apply these techniques in more important areas.

  27. IARPA was the main funder behind a lot of the research on forecasting, for example, and has a number of past and open projects focused on this area, including ACE, FUSE, SHARP, and OSI.