* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
 * dataset created by NVIDIA to aid in content moderation and toxicity detection
+* [badwords by Richard Hughes](https://github.com/hughsie/badwords)
+ * simple list of bad words in different locales that can be used to flag suspicious user-submitted content
* [Toxic Chat by LMSYS](https://huggingface.co/datasets/lmsys/toxic-chat)
 * dataset of toxic conversations collected from interactions with Vicuna
* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)