Skip to content
Home » ARC 2026: Anthropic’s Chloe Lubinski on AI (Transcript)

ARC 2026: Anthropic’s Chloe Lubinski on AI (Transcript)

The following is the full transcript of Anthropic’s Chloe Lubinski’s talk on AI at Alliance for Responsible Citizenship 2026.

Listen to the audio version here:

Understanding AI: Essentials From Anthropic’s Research Partnerships

CHLOE LUBINSKI: I work at Anthropic, where I lead the research partnerships with the world’s wisdom traditions. And my job really has two parts to it. So the first, it’s to help these experts in these various fields and disciplines actually understand AI, what it is, what’s happening now, and where it’s going. And the second part is to listen and to learn and to funnel wisdom back into the organization, back to the people that are building this technology.

Just last week, I was walking my little red-haired cocker spaniel in San Francisco, thinking of what might be most helpful to you today in having these conversations. And the thing is, I’ve probably had hundreds of conversations now across 20 or so traditions and disciplines, and I’ve found, again and again, just how important it is for folks to really understand the basics before we can even start to talk about how this can go well. So my hope today in this short time is to give you some of those essentials as quickly as I can. So I’m going to jump right in.

This Technology Is Real and Moving Fast

The first thing that I really want to tell you is my goodness, to know that this technology is real and that it’s coming faster than you think, and the force behind it is enormous. Now you may or may not have heard of the scaling laws, which is really what kicked off this whole race to begin with, and you really don’t need to understand anything about this graph other than this, which is these models get predictably better with more compute, and the more energy, the more data, the more training that goes into them, and they get smarter, and they get smarter about everything.

And so with more money, which buys compute, you can essentially purchase intelligence, and that’s kicked off a cycle that is very hard to stop. A better model does more economically valuable work, which attracts more capital, which buys more compute, which trains a better model, and around and around it goes.

Recursive Self-Improvement and What “More Capable” Really Means

And now there’s a further turn of the wheel. These systems are starting to build their own successors, what researchers call recursive self-improvement, or helping to build. But when Claude 8 can build Claude 9, which can build Claude 10, things will begin to move even more quickly.

And just to be concrete about what more capable actually means, our most capable model, in its first month of only limited release, found over 10,000 serious security vulnerabilities across partner software, flaws that human experts had missed for years, and sometimes decades. Now the same trajectory there also holds in biology, which is why we have entire teams at Anthropic dedicated to safeguarding against it. And we also think other domains will soon follow.

The Case for Slowing Down — and Why It’s So Difficult

So Anthropic stated just a few weeks ago that if it were possible to slow down so that our laws and our institutions and guardrails that we actually need have time to catch up, it would be a very good thing. But absent a coordinated global slowdown, what we’re left with is this extraordinary technology built at breakneck speed by many actors in many countries locked in a competition where commercial and geopolitical rivalry is drowning out the part of this that could actually be most consequential and even existential for our species. And any individual company stepping off the wheel doesn’t slow the wheel, it just means that you’re not on the wheel.

ALSO READ:  Michael Huemer: The Irrationality of Politics at TEDxMileHighSalon (Full Transcript)

So the question that I encourage you to sit with over these next few days is not just how to stop this, maybe you’re not asking that, but the question that I want you to think about is if it’s coming and if it’s coming this fast, how do we ensure that it goes well? Because the risks are very, very real and so are the possibilities. So if AI is coming, then what is an actual good outcome and what would it take to get there? We must imagine this together.

So the second thing I really want you to know is that AI is probably not actually what you think it is.

Neural Networks and How They Learn

Most people hear AI and think of a computer program, something coded line by line that does exactly what you tell it. But that’s not actually what this is. What we’re building are called neural networks and they’re loosely based on the architecture of the human brain, not exactly the same, but inspired by, and they’re machines that learn primarily by guessing answers and getting corrected over and over again across enormous, unfathomable amounts of data.

And the data that they’re trained on is human language. And I really want you to sit with that for a second before we move on because there is no language that exists separate from us. Language is us. Language is our thoughts and our values and our fears and our wisdom. So when you train a model on language, you’re training it on us.

What We Find Inside These Models

And because of this, when we look inside these models, and we can now through a science called interpretability, which I honestly think is the coolest new science in the world, we can find things that are quite surprising. So for example, this is where things get really weird. When you ask a model the same question in three different languages, “What’s the opposite of small?” and then you trace what activates inside the neural network, you find that the same internal thing lights up every time.