···123123* [NCMEC Reporting by ello](https://github.com/ello/ncmec_reporting)
124124 * a Ruby client library for reporting incidents to the National Center for Missing & Exploited Children (NCMEC) CyberTipline
125125126126+126127## Investigation
127128* [ThreatExchange by Meta](https://github.com/facebook/ThreatExchange )
128129 * a platform that enables organizations to share information about threats, such as malware, phishing attacks, and online safety harms in a structured and privacy-compliant manner
···146147 * an open source platform for investigating internet trends
147148148149149149-## Safety Datasets
150150+## Datasets
150151* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
151152 * a dataset created by NVIDIA to aid in content moderation and toxicity detection
152153* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)
···155156 * a dataset of toxic conversations collected from interactions with Vicuna
156157* [Uli Dataset by Tattle](https://github.com/tattle-made/uli_dataset)
157158 * A dataset of gendered abuse, created for Uli ML redaction.
159159+* [VTC by Unitary AI](https://github.com/unitaryai/VTC)
160160+ * an implementation of video-text retrieval with comments including a dataset, method of identifying relevant auxiliary information that adds context to videos, and quantification of the value comment-modality bring to video.
158161159162160163## Red Teaming Datasets