···
122122 * an OSINT (Open-Source Intelligence) toolkit and case management platform
123123* [NCMEC Reporting by ello](https://github.com/ello/ncmec_reporting)
124124 * a Ruby client library for reporting incidents to the National Center for Missing & Exploited Children (NCMEC) CyberTipline
125125-* [VTC by Unitary AI](https://github.com/unitaryai/VTC)
126126- * an implementation of video-text retrieval with comments including a dataset, method of identifying relevant auxiliary information that adds context to videos, and quantification of the value comment-modality bring to video. Possibly helpful for review/moderation.
125125+
127126
128127## Investigation
129128* [ThreatExchange by Meta](https://github.com/facebook/ThreatExchange )
···
148147 * an open source platform for investigating internet trends
149148
150149
151151-## Safety Datasets
150150+## Datasets
152151* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
153152 * a dataset created by NVIDIA to aid in content moderation and toxicity detection
154153* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)
···
157156 * a dataset of toxic conversations collected from interactions with Vicuna
158157* [Uli Dataset by Tattle](https://github.com/tattle-made/uli_dataset)
159158 * A dataset of gendered abuse, created for Uli ML redaction.
159159+* [VTC by Unitary AI](https://github.com/unitaryai/VTC)
160160+ * an implementation of video-text retrieval with comments, including a dataset, a method for identifying relevant auxiliary information that adds context to videos, and a quantification of the value the comment modality brings to video.
160161
161162
162163## Red Teaming Datasets