···4343* [ShieldGemma by Google DeepMind](https://www.kaggle.com/code/fernandosr85/shieldgemma-web-content-safety-analyzer?scriptVersionId=198456916)
4444 * AI safety toolkit by Google DeepMind designed to help detect and mitigate harmful or unsafe outputs in LLM applications
4545* [Roblox Voice Safety Classifier](https://github.com/Roblox/voice-safety-classifier)
4646- * machine learnign model that detects and moderates harmful content in real-time voice chat on Roblox. Focuses on spoken language detection.
4646+ * machine learning model that detects and moderates harmful content in real-time voice chat on Roblox. Focuses on spoken language detection.
4747* [Detoxify by Unitary AI](https://github.com/unitaryai/detoxify)
4848 * detects and mitigates generalized toxic language (including hate speech, harassment, bullying) in text
4949* [NSFW Filtering](https://github.com/nsfw-filter/nsfw-filter)
···107107* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
108108 * a dataset created by NVIDIA to aid in content moderation and toxicity detection
109109* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)
110110- * a large number of Wikipedia comments which have been labeled by human raters for toxic behavior
110110+ * a large number of Wikipedia comments which have been labeled by human raters for toxic behavior