๐
Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity
Source page
AuthorsTerry Yue Zhuo, Yujin Huang, Chunyang Chen
Year2023
JournalarXiv (Cornell University)
TypeScientific Paper
Citations169
DOI10.48550/arxiv.2301.12867