Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie, Jiayang Song, Zhehua Zhou, Yuheng Huang, Da Song, Lei Ma
IEEE Transactions on Artificial Intelligence, August 2025
In this work, we conduct a comprehensive evaluation of the effectiveness of existing online safety analysis methods on LLMs.