Here is the full transcript of Robin Kramer’s talk titled “Are You Really As Good at Something As You Think?” at TED conference.
TRANSCRIPT:
I don’t mean to brag, but there are lots of things that I’m pretty average at. From playing table tennis, cooking risotto, finding countries on a map, just to name a few. Now, in our everyday lives, we’re not typically assessed on our skills and abilities, so we’re forced to rely on our own judgments. I may think I’m pretty decent with Italian cuisine, but how accurate is my assessment?
Metacognition and Self-Assessment
Now, what we’re talking about here is metacognition: our insight into our own thought processes. If I have good metacognitive insight, then how good I think I am at a particular task should line up pretty well with how good I actually am. Of course, in the real world this is often not the case. And indeed, we probably all know someone who thinks they’re great at navigating maps, when in fact the reality is often the opposite.
Not to name any names, of course, but still. Perhaps you think this applies to other people and that you, yourself, wouldn’t make this sort of mistake. So let’s try a quick experiment. I want you to think about how you would rate yourself in terms of your driving ability.
The “Better Than Average” Effect
Would you rate yourself as below average, average, or perhaps even above average? So most people rate themselves as above average, which, of course, is mathematically impossible, and something that we call the “better than average” effect. This is just one of a number of cognitive biases that we see when people judge their own abilities. Today, I’m going to focus on a related bias, the Dunning-Kruger effect.
The Dunning-Kruger Effect
So back in 1999, two psychologists at Cornell University, Dunning and Kruger, described the mistakes people make when estimating their own abilities. So if we take a sample of people and we divide them into four groups based on their scores on a test, and order those groups from lowest to highest, and then plot those scores on a graph along with their self-estimates, so how well they thought they did on the test, this is the pattern that we see. So the red line is a steep slope representing their actual scores, as it must be, since we ordered the groups based on their scores in the first place.
Now what’s interesting is the blue shallower line. This represents their self-estimates, so how well they thought they did on the test. Now the Dunning-Kruger effect describes how the weakest performers significantly overestimate their performance, shown here in the green oval.
Insight, Ability, and Statistical Effects
The explanation for this, according to Dunning and Kruger, is that insight and ability rely on the same thing. So if I’m poor at a task, I also lack the metacognitive insight to accurately assess my ability. Now this pattern has been seen again and again across a number of domains, from driving skill to exam-taking, even chess-playing. However, in recent years, a number of criticisms have been leveled at this approach, and we now have reason to believe that this pattern of results is virtually unavoidable.
One reason for this is the statistical effect, regression to the mean. Now this is something that comes about when we have two measures that are related but not perfectly so. So imagine we have a sample of people and we measure their heights and their weights. Now height and weight are related, tall people are typically heavier, but the relationship is far from perfect. So unlike in the figure at the top here, the shortest people in red won’t all be the lightest people.
Some of them will be overweight or particularly muscular, for example. Similarly at the top end, the tallest people in blue won’t all be the heaviest people. Some of them will be underweight, and so on. Now as a result, on average, the shortest people will rank higher for weight than they do for height, and the tallest people will rank lower for weight than they do for height, producing this blue line here and the crossover pattern you’re now becoming familiar with.
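The height and weight example above can be reproduced with simulated data. The following is a minimal sketch, not anything shown in the talk: the sample size, the correlation of 0.5, and the function names (`percentile_ranks`, `quartile_means`, `simulate_height_weight`) are all assumptions chosen for illustration. It draws two related but imperfectly correlated measures, converts each to a percentile rank, groups people into four groups by height, and compares the average height rank with the average weight rank in each group.

```python
import random
import statistics


def percentile_ranks(values):
    """Return each value's percentile rank (0 to 100) within the list."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    for position, index in enumerate(order):
        ranks[index] = 100.0 * position / (len(values) - 1)
    return ranks


def quartile_means(group_by, other):
    """Split people into four equal groups ordered by `group_by`,
    then return (mean of group_by, mean of other) for each group."""
    order = sorted(range(len(group_by)), key=lambda i: group_by[i])
    size = len(order) // 4  # any remainder at the top end is dropped
    rows = []
    for q in range(4):
        members = order[q * size:(q + 1) * size]
        rows.append((
            statistics.mean(group_by[i] for i in members),
            statistics.mean(other[i] for i in members),
        ))
    return rows


def simulate_height_weight(n=4000, correlation=0.5, seed=0):
    """Simulate standardized heights and weights with an assumed correlation."""
    rng = random.Random(seed)
    heights = [rng.gauss(0, 1) for _ in range(n)]
    noise_scale = (1 - correlation ** 2) ** 0.5
    weights = [correlation * h + noise_scale * rng.gauss(0, 1) for h in heights]
    return percentile_ranks(heights), percentile_ranks(weights)


height_rank, weight_rank = simulate_height_weight()
rows = quartile_means(height_rank, weight_rank)
for q, (h, w) in enumerate(rows, start=1):
    print(f"height group {q}: mean height rank {h:.1f}, mean weight rank {w:.1f}")
```

In this simulation the shortest group should rank higher for weight than for height, and the tallest group lower, with no assumption about either group built in. Raising `correlation` toward 1 brings the two lines closer together, and lowering it flattens the weight line.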
Questioning the Dunning-Kruger Effect
Now, some people might put forward a spurious explanation for why short people are relatively overweight or tall people relatively underweight, when in fact no explanation is needed. Perhaps more compelling a reason to doubt the Dunning-Kruger effect is that we can produce the same pattern in our data when our data is entirely meaningless.
So if we collect people’s test scores along with their self-estimates of those scores, but then we shuffle those self-estimates and then analyze as before, then we still find that same pattern in the data. Of course, any effect that we can find with shuffled or randomized data is one that we should surely be suspicious of. So, given these and other issues with the Dunning-Kruger approach, I was saddened and disappointed and, frankly, a little annoyed to discover that the same approach was now being applied in my field of expertise, which is face-matching.
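The shuffling check described above can also be sketched in a few lines. This is an illustrative simulation under stated assumptions, not the speaker's analysis: the scores are made-up uniform values between 0 and 100, the self-estimates start out perfectly accurate (an assumption chosen so that any mismatch comes from the shuffle alone), and `mean_by_score_quartile` is a hypothetical helper name.

```python
import random
import statistics


def mean_by_score_quartile(scores, estimates):
    """Order people by actual score, split them into four equal groups,
    and return (mean score, mean self-estimate) for each group."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    size = len(order) // 4  # any remainder at the top end is dropped
    rows = []
    for q in range(4):
        members = order[q * size:(q + 1) * size]
        rows.append((
            statistics.mean(scores[i] for i in members),
            statistics.mean(estimates[i] for i in members),
        ))
    return rows


rng = random.Random(1)
n = 2000

# Made-up test scores; these are simulated, not real data.
scores = [rng.uniform(0, 100) for _ in range(n)]

# Assumption: everyone starts with a perfectly accurate self-estimate.
estimates = list(scores)

# Shuffling breaks any link between a person's score and their estimate.
rng.shuffle(estimates)

shuffled_rows = mean_by_score_quartile(scores, estimates)
for q, (score, estimate) in enumerate(shuffled_rows, start=1):
    print(f"score group {q}: mean score {score:.1f}, mean self-estimate {estimate:.1f}")
```

After shuffling, every group's average self-estimate sits near the overall average, so the lowest-scoring group appears to overestimate and the highest-scoring group appears to underestimate, even though the estimates now carry no information about the individuals they are attached to.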
Face-Matching and Metacognitive Insight
Now, this is a task where we’re shown two images of faces or an image and a live person, and we’re asked to decide whether they show the same person or two different people. Now, we’ve all stood in line at passport control, anxiously awaiting the passport officer’s decision as to whether our ID photos look sufficiently like us or not. Indeed, I’ve included at the top here some examples of ID images from my own life, just to illustrate some variability.
Some proud moments in photographic history, I’m sure you’ll agree. And so what I’d like to do now is first see how well you might perform as passport officers. So here are four pairs of images, some students’ ID images and some student photos. For each pair, I’d like you to decide whether it’s a match, so two images of the same person, or a mismatch, two images of different people.
The Challenge of Face-Matching
Some of you might be surprised to hear that the top two pairs are matches, so images of the same people, and the bottom two pairs show mismatches, so two different people.