Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum S Anderson, Yaron Singer, and Amin Karbasi. Tree of Attacks: Jailbreaking Black-Box LLMs Automatically. In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. URL https://openreview.net/forum?id=SoM3vngOH5.