Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
[Submitted on 26 Sep 2024 (v1), last revised 27 Sep 2024 (this version, v2)] View a PDF of the paper titled MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks, by Giandomenico Cornacchia and 5 other…