
Microsoft Research
チャンネル登録者数 34.4万人
1155 回視聴 ・ 27いいね ・ 2025/03/03
Speakers: Diganta Misra
Host: Sanchit Ahuja
In the fast-evolving world of software libraries, code generation models are struggling to keep pace. Most existing benchmarks focus on static, version-agnostic code predictions, failing to capture the true complexity of adapting to frequent updates and maintaining compatibility with multiple library versions. To address this gap, we introduce GitChameleon, a novel dataset featuring 116 Python code completion tasks, each tied to specific library versions and accompanied by executable unit tests. This dataset is designed to rigorously evaluate the ability of large language models (LLMs) to generate version-specific code that is both syntactically correct and functionally accurate. Our findings are revealing: even state-of-the-art models like GPT-4o achieve a pass@10 of just 39.9% (43.7% with error feedback), highlighting significant limitations in their ability to adapt to versioned code. In this talk, I’ll explore why today’s LLMs, while impressive, still fall short in the dynamic landscape of evolving software libraries. By examining these challenges, we hope to spark a conversation about how to build more adaptable, reliable code generation tools for the future.
コメント
関連動画

New Features in Copilot Studio, Student SOC’s and Purview & Defender Updates for Secure AI | EP11
56 回視聴 - 3 週間前

Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote
875,816 回視聴 - 5 か月前

You Don’t Need a Rewrite, You Need React Native Brownfield | React Universe On Air Coffee Talk #27
423 回視聴 - 2 日前
![Upbeat Lofi - Deep Focus & Energy for Work [R&B, Neo Soul, Lofi Hiphop]](/wkt/back/vi/THh4fT0O7IY/mqdefault.jpg)
Upbeat Lofi - Deep Focus & Energy for Work [R&B, Neo Soul, Lofi Hiphop]
1,023,742 回視聴 - 3 か月前

LLM Course – Build a Semantic Book Recommender (Python, OpenAI, LangChain, Gradio)
177,275 回視聴 - 3 か月前

Tranquil Day with Coffee for Relax & De-stress ☕🌷Lofi coffee - Lofi morning
96,130 回視聴 - 3 か月前

Lo-fi Café ☕️ 2 Hours Lo-fi Hip Hop Beats 🎶 Music That Brings the Cafe Vibes to You | BGM | Ver.2
183,371 回視聴 - 3 か月前

How to Pass PRINCE2® Foundation & Practitioner in 2025 | Free Masterclass with Expert Tips
13 回視聴 - 3 日前
使用したサーバー: wata27
再生方法の変更
動画のデフォルトの再生方法を設定できます。埋め込みで見れるなら埋め込みで見た方が良いですよ。
現在の再生方法: 通常
コメントを取得中...
関連動画

New Features in Copilot Studio, Student SOC’s and Purview & Defender Updates for Secure AI | EP11
56 回視聴

You Don’t Need a Rewrite, You Need React Native Brownfield | React Universe On Air Coffee Talk #27
423 回視聴

Lo-fi Café ☕️ 2 Hours Lo-fi Hip Hop Beats 🎶 Music That Brings the Cafe Vibes to You | BGM | Ver.2
18万 回視聴
コメントを取得中...