Nov

10th

—

Nov

12th

@

5:45pm

The Hinton Lectures™

Whether you’re a developer, entrepreneur, investor, career seeker, student, corporate or member of the press register for Brooklyn Tech Week. Get connected to startups, early adopters, and unique partnership opportunities.

Ticket Sales Are Closed

THE AI SAFETY Foundation, Global risk institute, & manulife are pleased to PRESENT

HOSTED BY Professor GEOFFREY HINTON, NOBEL LAUREATE

_________

Lecturer: Owain Evans, Berkeley AI Expert & Founder of Truthful AI

A transformative lecture series charting the future of artificial intelligence, exploring the risks, revelations, and revolutionary solutions shaping tomorrow’s world.

Watch on demand

WHAT's happening

A transformative lecture series charting the future of artificial intelligence, exploring the risks,revelations, and revolutionary solutions shaping tomorrow’s world.

november 10th

Making AI Agents Safe: Risks and Opportunities

AI has made humbling progress in the last decade, matching or exceeding human abilities in many domains. This lecture will identify drivers of this progress and explore what lies ahead. There is a breakneck race to build autonomous, goal-driven AI systems, though these systems could cause large-scale harm if humans fail to control them. Owain will outline the challenge of developing safe AI systems. There has been meaningful progress on this challenge, however we currently lack a detailed solution or plan.

learn more

PRESENTED BY

november 11th

Language Models Behaving Badly

Language models can behave in deceptive and harmful ways even after applying our best safety techniques. Owain will present failure cases in public models such as Claude and Gemini, showing that safety does not generalize to all contexts. Aligned models can also be corrupted. Training on small, narrow datasets can transform them from reliably helpful to broadly malicious ("emergent misalignment"). By looking at the internal mechanisms of these models, we can start to gain a richer understanding of how this corruption occurs.

learn more

PRESENTED BY

november 12th

Can we read the thoughts of language models?

Frontier language models reason aloud before answering questions. Currently, this reasoning is in plain English, allowing us to catch models in attempts to cheat or take shortcuts. This visibility may be fragile. Future models may reason in opaque formats or encode malicious plans inside normal-looking text. Current models exhibit what Owain refers to as "subliminal learning". Preferences can be transferred between models via apparently meaningless numbers. This includes innocuous preferences (like a love for eagles) but also malicious preferences towards humans.

learn more

presented by

HOST

Professor Geoffrey Hinton

Nobel Laureate

Considered the “Godfather of AI,” Geoffrey Hinton is a distinguished British-Canadian computer scientist and cognitive psychologist, celebrated for his ground-breaking work in artificial neural networks. As a leading figure in tackling AI's most profound and disruptive dilemmas, Hinton has played pivotal roles at Google and the University of Toronto. With over 800,000 citations and collaborations with the most influential minds in AI, his career continues to be a relentless pursuit of innovation and excellence in the field.

LECTURER

Owain Evans, Ph.D.

Founder and Director of Truthful AI

Owain Evans is a leading machine learning researcher specializing in AI alignment and AGI risk. His current work focuses on emergent misalignment, deception, and situational awareness in advanced AI systems. He is the founder of Truthful AI, a research non-profit based in Berkeley, and an affiliate of the Center for Human-Compatible AI at UC Berkeley. Previously, he conducted alignment research at the University of Oxford and earned his PhD at MIT. His work has been featured in the Economist, BBC News, and the Financial Times. He has served as an advisor to nonprofits and foundations in the AI Safety space. A frequent speaker at major academic and industry events, he has presented at over 20 conferences and is recognized for his expertise and leadership in the field of AI safety.

John W. H. Bassett Theatre is located inside the Metro Toronto Convention Centre (MTCC) in downtown Toronto. The MTCC is a large complex, so it might be helpful to enter through the North Building at 255 Front Street West, where the John Bassett Theatre is located. Signs within the building will direct you to the theatre.

The MTCC and John W. H. Bassett Theatre are fully accessible, with ramps and elevators for those requiring assistance.

FIND US HERE

5:45pm

–

7:00pm

Ticket Sales Are Closed

presenting sponsor

founding and presenting sponsor

presenting sponsor

With Support From

The AI Safety Foundation is a registered Canadian charity (#78867 2822 RR0001)
Manulife, Stylized M Design, and Manulife & Stylized M Design are registered trademarks of The Manufacturers Life Insurance Company. Used under license.