@RobertMilesAI - 51 本の動画
チャンネル登録者数 16.2万人
Videos about Artificial Intelligence Safety Research, for everyone. AI is leaping forward right now, it's only a matter of time before we develop true Artifi...
AI Safety Career Advice! (And So Can You!)
Using Dangerous AI, But Safely?
Learn AI Safety at MATS #shorts
AI Ruined My Year
Apply to Study AI Safety Now! #shorts
Why Does AI Lie, and What Can We Do About It?
Apply Now for a Paid Residency on Interpretability #short
$100,000 for Tasks Where Bigger AIs Do Worse Than Smaller Ones #short
Free ML Bootcamp for Alignment #shorts
Win $50k for Solving a Single AI Problem? #Shorts
Apply to AI Safety Camp! #shorts
We Were Right! Real Inner Misalignment
Intro to AI Safety, Remastered
Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...
The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment
Quantilizers: AI That Doesn't Try Too Hard
Sharing the Benefits of AI: The Windfall Clause
10 Reasons to Ignore AI Safety
9 Examples of Specification Gaming
Training AI Without Writing A Reward Function, with Reward Modelling
AI That Doesn't Try Too Hard - Maximizers and Satisficers
Is AI Safety a Pascal's Mugging?
A Response to Steven Pinker on AI
How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification
Why Not Just: Think of AGI Like a Corporation?
Safe Exploration: Concrete Problems in AI Safety Part 6
Friend or Foe? AI Safety Gridworlds extra bit
AI Safety Gridworlds
Experts' Predictions about the Future of AI
Why Would AI Want to do Bad Things? Instrumental Convergence
Superintelligence Mod for Civilization V
Intelligence and Stupidity: The Orthogonality Thesis
Scalable Supervision: Concrete Problems in AI Safety Part 5
AI Safety at EAGlobal2017 Conference
AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1
What can AGI do? I/O and Speed
What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4
Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5
The other "Killer Robot Arms Race" Elon Musk should worry about
Reward Hacking: Concrete Problems in AI Safety Part 3
Why Not Just: Raise AI Like Kids?
Empowerment: Concrete Problems in AI Safety part 2
Avoiding Positive Side Effects: Concrete Problems in AI Safety part 1.5
Avoiding Negative Side Effects: Concrete Problems in AI Safety part 1
Are AI Risks like Nuclear Risks?
Respectability
Predicting AI: RIP Prof. Hubert Dreyfus
What's the Use of Utility Functions?
Where do we go now?
Status Report
Channel Introduction