···186186187187* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
188188 * dataset created by NVIDIA to aid in content moderation and toxicity detection
189189+* [badwords by Richard Hughes](https://github.com/hughsie/badwords)
190190+ * simple list of bad words in different locales that can be used to flag suspicious user-submitted content
189191* [Toxic Chat by LMSYS](https://huggingface.co/datasets/lmsys/toxic-chat)
190192 * dataset of toxic conversations collected from interactions with Vicuna
191193* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)