GEEK HAUS
피드로 돌아가기
2026/04/27/partnering-with-ineffable-intelligence-a

Sequoia, David Silver의 런던 AI 연구소 Ineffable Intelligence와 협력해 경험 기반 ‘superlearner’ 개발 지원

·Sequoia Capital
원문 보기

편집자 요약

Sequoia는 David Silver가 이끄는 런던 기반 AI 연구소 Ineffable Intelligence와의 협력을 발표했습니다. Ineffable은 사전학습이나 인간 데이터 모방 없이, 환경 속 행동 결과만으로 지식을 축적하는 reinforcement learning 기반 ‘superlearner’를 개발하고 있습니다.

맥락

이번 접근은 인터넷 규모 데이터와 imitation learning에 의존해 온 현세대 LLM 중심 AI와 다른 경로를 제시합니다. 성공할 경우 AI 연구의 무게중심은 데이터 수집과 모델 확장에서, 스스로 실험하고 일반화하는 경험 기반 에이전트로 이동할 가능성이 있습니다.

본문

What does it mean to build a mind that is not a copy of our own?

Today, we are excited to announce that Sequoia is partnering with David Silver and Ineffable Intelligence, a new AI research lab based in London with a singular mission: to make first contact with superintelligence.

Ineffable is building what David calls a superlearner: a system that discovers all knowledge directly through its own experience, from elementary motor skills to profound intellectual breakthroughs. No pre-training. No imitation. Just an agent learning endlessly from the consequences of its own actions in a world built to teach it.

A Reinforcement Learning-based superlearner has the potential to rediscover and then transcend the greatest inventions in human history: language, science, mathematics and technology. Imagine a machine that derives the laws of physics from first principles. That invents new branches of mathematics we never thought to ask about. That designs materials, medicines and computers we don’t yet have the vocabulary to describe. This is the prize David is reaching for.

A Different Path

The current generation of AI was built by training on the entirety of the human internet. It is an extraordinary achievement. But a system trained on human data may also have fundamental limitations.

Ineffable Intelligence is scaling reinforcement learning from a clean base: no pre-training, no human data to shortcut the system. Guided by the Era of Experience as a north star, David is proving that agents trained purely from an environment can develop non-human strategies for reasoning about problems we don’t yet know how to solve.

David led some of the defining breakthroughs in deep reinforcement learning, most famously in the devilishly difficult game of Go. Go is the ultimate machine intelligence test because it cannot be brute-forced by a computer: a combinatorially explosive O(10^170) possible legal board positions vastly exceeds the O(10^80) atoms in the observable universe. It was thought to be simply too hard for machines to solve.

At DeepMind, David drove the key breakthrough that finally solved the game of Go: self-play. Self-play drove the ~800 ELO-point leap that led to the historic AlphaGo vs. Lee Sedol showdown in March 2016. David pushed the idea further still with AlphaGo Zero: removing human pre-training entirely and learning purely through self-play increased the system’s ELO rating from ~3,700 to 5,000+. The result was a system that reached decisively superhuman performance, and with somewhat non-human mannerisms.

That’s the lineage David has spent his career building. He was the lead researcher and technical force behind the Alpha series at DeepMind, where, for a brilliant period, his approach was the dominant paradigm: AlphaGo, AlphaZero, AlphaStar, AlphaProof and more.

Even with the arrival of LLMs, David never stopped believing. He is one of the very few people on earth with the conviction, the technical depth and the team to scale reinforcement learning.

What’s Ahead

The work ahead is hard, the timeline to superintelligence is uncertain, and the bet is genuinely contrarian. That is exactly what excites us. The largest leaps in AI have always come from people willing to ignore the consensus. David has ignored more consensus, more correctly, than almost anyone in the field.

We are honored to co-lead Ineffable’s first round and to partner with David on what may be the most ambitious scientific mission of our generation.

댓글

토론

> geekhaus:~$ 다음 읽을거리?

다음 읽을거리 추천