Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity

Authors: Terry Yue Zhuo, Yujin Huang, Chunyang Chen
Year: 2023
Journal: arXiv (Cornell University)
Type: Scientific Paper
Citations: 169
DOI: 10.48550/arxiv.2301.12867