···11+<!-- Thank you for opening a pull request! Please ensure your addition is in the correct section, follows existing formatting, and is in alphabetical order. If you have more information or context about your addition, please share it below: -->
22+
+141-113
README.md
···11# awesome-safety-tools
22+23A collection of open source tools for online safety
3455+Inspired by prior work like [Awesome Redteaming](https://github.com/yeyintminthuhtut/Awesome-Red-Teaming/) and [Awesome Phishing](https://github.com/PhishyAlice/awesome-phishing). This list is not an endorsement, but rather an attempt to organize and map the available technology. ❤️
4655-Inspired by prior work like [Awesome Redteaming](https://github.com/yeyintminthuhtut/Awesome-Red-Teaming/) and [Awesome Phishing](https://github.com/PhishyAlice/awesome-phishing). This list is not an endorsement, but rather an attempt to organize and map the available technology ❤️
77+Contribute by opening a pull request to add more resources and tools!
6877-Help and contribute by adding a pull request to add more resources and tools!
891010+## Hash Matching
9111010-## Hash Matching
1212+* [Altitude by Jigsaw](https://github.com/jigsaw-code/altitude)
1313+ * web UI and hash matching for violent extremism and terrorism content
1114* [Hasher Matcher Action (HMA) by Meta](https://github.com/facebook/ThreatExchange/tree/main/hasher-matcher-actioner)
1215 * hashing algorithm, matching function, and ability to hook into actions
1616+* [Hasher-Matcher-Actioner (CLIP demo)](https://github.com/juanmrad/HMA-CLIP-demo)
1717+ * HMA extension for CLIP as reference for adding other format extensions
1818+* [Lattice Extract by Adobe](https://github.com/adobe/lattice_extract)
1919+ * grid and lattice detection to guard against false positives (FP) in hash matching
2020+* [MediaModeration (Wiki Extension)](https://github.com/wikimedia/mediawiki-extensions-MediaModeration?tab=readme-ov-file)
2121+ * CSAM hash matching for Wikimedia
1322* [PDQ by Meta](https://github.com/facebook/ThreatExchange/tree/main/pdq)
1423 * perceptual hash algorithm for images
1515-* [TMK by Meta](https://github.com/facebook/ThreatExchange/tree/main/tmk)
1616- * visual similarity match for videos
1717-* [VPDQ by Meta](https://github.com/facebook/ThreatExchange/tree/main/vpdq)
1818- * visual similarity match for videos using PDQ algorithm
1919-* [Hasher-Matcher-Actioner (CLIP demo)](https://github.com/juanmrad/HMA-CLIP-demo)
2020- * HMA extension for CLIP as reference for adding other format extensins
2124* [Perception by Thorn](https://github.com/thorn-oss/perception)
2225 * provides a common wrapper around existing, popular perceptual hashes (such as those implemented by ImageHash)
2323-* [Altitude by Jigsaw](https://github.com/jigsaw-code/altitude)
2424- * web UI and hash matching for violent extremism and terrorism content
2525-* [Lattice Extract by Adobe](https://github.com/adobe/lattice_extract)
2626- * grid and lattice detection to guard against FP in hash matching
2726* [RocketChat CSAM](https://github.com/prostasia/rocketchatcsam)
2827 * CSAM hash matching for RocketChat
2929-* [MediaModeration (Wiki Extension)](https://github.com/wikimedia/mediawiki-extensions-MediaModeration?tab=readme-ov-file)
3030- * CSAM hash matching for Wikimedia
2828+* [TMK by Meta](https://github.com/facebook/ThreatExchange/tree/main/tmk)
2929+ * visual similarity match for videos
3030+* [VPDQ by Meta](https://github.com/facebook/ThreatExchange/tree/main/vpdq)
3131+ * visual similarity match for videos using PDQ algorithm
3232+31333234## Classification
3535+3636+* [CoPE by Zentropi](https://huggingface.co/zentropi-ai/cope-a-9b)
3737+ * small language model trained for accurate, fast, steerable content classification based on developer-defined content policies
3838+* [Detoxify by Unitary AI](https://github.com/unitaryai/detoxify)
3939+ * detects and mitigates generalized toxic language (including hate speech, harassment, bullying) in text
4040+* [Content Safety API by Google](https://cloud.google.com/safesearch/docs/content-safety)
4141+ * uses machine learning to detect child sexual abuse material (CSAM), nudity, and sexually explicit content in images and videos
4242+ * free service, but not open source
4343+* [gpt-oss-safeguard by OpenAI](https://github.com/openai/gpt-oss-safeguard)
4444+ * open-weight reasoning model to classify text content based on provided safety policies
4545+* [NSFW Keras Model](https://github.com/GantMan/nsfw_model)
4646+ * convolutional neural network (CNN) based explicit image ML model
4747+* [NSFW Filtering](https://github.com/nsfw-filter/nsfw-filter)
4848+ * browser extension to block explicit images from online platforms; user facing
3349* [OSmod by Jigsaw](https://github.com/conversationai/conversationai-moderator)
3450 * toolkit of machine learning (ML) tools, models, and APIs that platforms can use to moderate content
3551* [Perspective API by Jigsaw](https://github.com/conversationai/perspectiveapi)
3652 * machine learning-powered tool that helps platforms detect and assess the toxicity of online conversations
5353+* [Private Detector by Bumble](https://github.com/bumble-tech/private-detector)
5454+ * pretrained model for detecting lewd images
3755* [Roblox Voice Safety Classifier](https://github.com/Roblox/voice-safety-classifier)
3838- * machine learning model that detects and moderates harmful content in real-time voice chat on Roblox. Focuses on spoken language detection.
3939-* [Detoxify by Unitary AI](https://github.com/unitaryai/detoxify)
4040- * detects and mitigates generalized toxic language (including hate speech, harassment, bullying) in text
5656+ * machine learning model that detects and moderates harmful content in real-time voice chat on Roblox; focuses on spoken language detection
5757+* [Sentinel by Roblox](https://github.com/Roblox/Sentinel/tree/main)
5858+ * Python library designed specifically for realtime detection of extremely rare classes of text by using contrastive learning principles
4159* [Toxic Prompt RoBERTa by Intel](https://huggingface.co/Intel/toxic-prompt-roberta)
4242- * a BERT-based model for detecting toxic content in prompts to language models
4343-* [NSFW Filtering](https://github.com/nsfw-filter/nsfw-filter)
4444- * browser extension to block explicit images from online platforms. User facing.
4545-* [NSFW Keras Model](https://github.com/GantMan/nsfw_model)
4646- * convoluted neural network (CNN) based explicit image ML model
4747-* [Private Detector by Bumble](https://github.com/bumble-tech/private-detector)
4848- * a pretrained model for detecting lewd images
4949-* [Google Content Safety API](https://cloud.google.com/safesearch/docs/content-safety)
5050- * a free service by Google that uses machine learning to detect child sexual abuse material (CSAM), nudity, and sexually explicit content in images and videos. Widely used by NGOs, platforms, and law enforcement partners to support online child safety initiatives. (industry service not open source)
5151-* [Sentinel by Roblox](https://github.com/Roblox/Sentinel/tree/main)
5252- * a Python library designed specifically for realtime detection of extremely rare classes of text by using contrastive learning principles
6060+ * BERT-based model for detecting toxic content in prompts to language models
6161+53625463## AI-powered Guardrails
6464+6565+* [Guardrails AI](https://github.com/guardrails-ai/guardrails)
6666+ * Python framework that helps build safe AI applications by checking input/output for predefined risks
6767+* [Kanana Safeguard by Kakao](https://huggingface.co/kakaocorp/kanana-safeguard-8b)
6868+ * harmful content detection model based on Kanana 8B
5569* [Llama Guard by Meta](https://github.com/meta-llama/PurpleLlama/tree/main/Llama-Guard3)
5670 * AI-powered content moderation model to detect harm in text-based interactions
5771* [Llama Prompt Guard 2 by Meta](https://github.com/meta-llama/PurpleLlama/blob/main/Llama-Prompt-Guard-2/86M/MODEL_CARD.md)
5858- * Detects prompt injection and jailbreaking attacks in LLM inputs.
7272+ * Detects prompt injection and jailbreaking attacks in LLM inputs
5973* [Purple Llama by Meta](https://github.com/meta-llama/PurpleLlama/tree/main/Llama-Guard3)
6074 * set of tools to assess and improve LLM security. Includes Llama Guard, CyberSec Eval, and Code Shield
7575+* [RoGuard](https://github.com/Roblox/RoGuard-1.0)
7676+ * LLM that helps safeguard unlimited text generation on Roblox
6177* [ShieldGemma by Google DeepMind](https://www.kaggle.com/code/fernandosr85/shieldgemma-web-content-safety-analyzer?scriptVersionId=198456916)
6278 * AI safety toolkit by Google DeepMind designed to help detect and mitigate harmful or unsafe outputs in LLM applications
6363-* [Guardrails AI](https://github.com/guardrails-ai/guardrails)
6464- * a Python framework that helps build safe AI applications checking input/output for predefined risks
6565-* [RoGuard](https://github.com/Roblox/RoGuard-1.0)
6666- * a LLM that helps safeguard unlimited text generation on Roblox
7979+67806881## Privacy Protection
8282+6983* [Fawkes Facial De-Recognition Cloaking](https://github.com/Shawn-Shan/fawkes)
7084 * Code and binaries to confuse AIs when trying to match identity to photos, such as [Clearview](https://www.theverge.com/23919134/kashmir-hill-your-face-belongs-to-us-clearview-ai-facial-recognition-privacy-decoder)
7185 * Many other great tools at github.com/Shawn-Shan, MIT researcher
7286* [Presidio by Microsoft](https://github.com/microsoft/presidio)
7387 * toolset for detecting Personally Identifiable Information (PII) and other sensitive data in images and text
74888989+7590## Core Infrastructure
7676-* [Mjolnir by Matrix](https://github.com/matrix-org/mjolnir)
7777- * moderation bot for the Matrix protocol that automatically enforces content policies
9191+7892* [AbuseIO](https://github.com/AbuseIO/AbuseIO)
7993 * abuse management platform designed to help organizations handle and track abuse complaints related to online content, infrastructure, or services
8080-* [Open Truss by Github](https://github.com/open-truss/open-truss)
8181- * framework designed to help users create internal tools without needing to write code
8294* [Access by Discord](https://github.com/discord/access)
8383- * a centralized portal for managing access to internal systems within any organization
9595+ * centralized portal for managing access to internal systems within any organization
9696+* [Mjolnir by Matrix](https://github.com/matrix-org/mjolnir)
9797+ * moderation bot for the Matrix protocol that automatically enforces content policies
9898+* [Open Truss by GitHub](https://github.com/open-truss/open-truss)
9999+ * framework designed to help users create internal tools without needing to write code
84100101101+85102## Redteaming Tools
86103104104+* [Aymara](https://github.com/aymara-ai/aymara-sdk-python)
105105+ * Automated eval tools for AI safety, accuracy, and jailbreak vulnerability
106106+* [Counterfit by Microsoft](https://github.com/Azure/counterfit/)
107107+ * Automation tool for assessing AI model security and robustness
108108+* [Garak by NVIDIA](https://github.com/NVIDIA/garak)
109109+ * Framework for adversarial testing and model evaluation
110110+* [LLM Canary](https://github.com/LLM-Canary/LLM-Canary)
111111+ * AI benchmarking tool that evaluates models for security vulnerabilities and adversarial robustness
112112+* [Prompt Fuzzer](https://github.com/prompt-security/ps-fuzz)
113113+ * Tool for testing prompt injection vulnerabilities in AI systems
114114+* [Promptfoo](https://github.com/promptfoo/promptfoo)
115115+ * Automated LLM evaluations, report generation, and several ready-to-use attack strategies
87116* [PyRIT Documentation](https://azure.github.io/PyRIT/)
8888- * Microsoft’s Python-based tool for AI red teaming and security testing.
8989-* [AI Benchmarking Tool](https://github.com/LLM-Canary/LLM-Canary)
9090- * Evaluates AI models for security vulnerabilities and adversarial robustness.
9191-* [Prompt Fuzzer Red Teaming Tool](https://github.com/prompt-security/ps-fuzz)
9292- * Tool for testing prompt injection vulnerabilities in AI systems.
9393-* [Open Source Red Teaming Tool – Nvidia](https://github.com/NVIDIA/garak)
9494- * Framework for adversarial testing and model evaluation.
9595-* [Tool that Enables Models to Chat with One Another](https://github.com/socketteer?tab=repositories)
9696- * Allows AI models to interact, helping test conversational weaknesses.
9797-* [Microsoft AI Tool – Counterfit](https://github.com/Azure/counterfit/)
9898- * Automation tool for assessing AI model security and robustness.
9999-* [Automated AI Alignment Evals - Aymara](https://github.com/aymara-ai/aymara-sdk-python)
100100- * Automated eval tools for AI safety, accuracy, and jailbreak vulnerability.
117117+ * Microsoft’s Python-based tool for AI red teaming and security testing
118118+* [Socketteer](https://github.com/socketteer?tab=repositories)
119119+ * Allows AI models to interact, helping test conversational weaknesses
120120+101121102122## Clustering
103103-* [SpamAssassin by Apache](https://spamassassin.apache.org)
104104- * anti-spam platform that uses a variety of techniques, including text analysis, Bayesian filtering, and DNS blocklists, to classify and block unsolicited email
123123+105124* [scikit-learn](https://github.com/scikit-learn/scikit-learn)
106125 * Python library including clustering through various algorithms, such as K-Means, DBSCAN, and hierarchical clustering
126126+* [SpamAssassin by Apache](https://spamassassin.apache.org)
127127+ * anti-spam platform that uses a variety of techniques, including text analysis, Bayesian filtering, and DNS blocklists, to classify and block unsolicited email
107128108129109130## Rules Engines
131131+132132+* [Druid by Apache](https://github.com/apache/druid)
133133+ * high performance real-time analytics database
134134+* [Marble](https://github.com/checkmarble/marble)
135135+ * real-time fraud detection and compliance engine tailored for fintech companies and financial institutions
110136* [Osprey by ROOST](https://github.com/roostorg/osprey)
111111- * a high-performance rules engine for real-time event processing at scale, designed for Trust & Safety and anti-abuse work
137137+ * high-performance rules engine for real-time event processing at scale, designed for Trust & Safety and anti-abuse work
112138* [RulesEngine by Microsoft](https://microsoft.github.io/RulesEngine/)
113113- * a library for abstracting business logic, rules, and policies from a system via JSON for .NET language families
114114-* [Marble](https://github.com/checkmarble/marble)
115115- * a real-time fraud detection and compliance engine tailored for fintech companies and financial institutions
139139+ * library for abstracting business logic, rules, and policies from a system via JSON for .NET language families
116140* [Wikimedia Smite Spam](https://github.com/wikimedia/mediawiki-extensions-SmiteSpam)
117117- * an extension for MediaWiki that helps identify and manage spam content on a wiki
118118-* [Druid by Apache](https://github.com/apache/druid)
119119- * a high performance real-time analytics database
141141+ * extension for MediaWiki that helps identify and manage spam content on a wiki
120142121143122144## Review
123123-* [RabbitMQ](https://github.com/rabbitmq)
124124- * a message broker that enables applications to communicate with each other by sending messages through queues
145145+125146* [BullMQ](https://github.com/taskforcesh/bullmq)
126147 * message queue and batch processing for Node.js and Python based on Redis
127127-* [Owlculus](https://github.com/be0vlk/owlculus)
128128- * an OSINT (Open-Source Intelligence) toolkit and case management platform
129148* [NCMEC Reporting by ello](https://github.com/ello/ncmec_reporting)
130130- * a Ruby client library for reporting incidents to the National Center for Missing & Exploited Children (NCMEC) CyberTipline
149149+ * Ruby client library for reporting incidents to the National Center for Missing & Exploited Children (NCMEC) CyberTipline
150150+* [Owlculus](https://github.com/be0vlk/owlculus)
151151+ * OSINT (Open-Source Intelligence) toolkit and case management platform
152152+* [RabbitMQ](https://github.com/rabbitmq)
153153+ * message broker that enables applications to communicate with each other by sending messages through queues
131154132155133156## Investigation
134134-* [ThreatExchange by Meta](https://github.com/facebook/ThreatExchange )
135135- * a platform that enables organizations to share information about threats, such as malware, phishing attacks, and online safety harms in a structured and privacy-compliant manner
136136-* [ThreatExchange Client via PHP](https://github.com/certly/threatexchange)
137137- * a PHP client for ThreatExchange
138138-* [ThreatExchange via Python](https://github.com/facebook/ThreatExchange/tree/main/python-threatexchange)
139139- * a Python library for ThreatExchange
140140-* [Feluda by Tattle](https://github.com/tattle-made/feluda)
141141- * A configurable engine for analysing multi-lingual and multi-modal content
157157+158158+* [CIB MangoTree](https://github.com/CIB-Mango-Tree/CIB-Mango-Tree-Website)
159159+ * collection of tools to aid researchers in coordinated inauthentic behavior (CIB) analysis
160160+* [Crossover](https://crossover.social/)
161161+ * open-source project that builds dashboards for monitoring and analyzing the recommendation algorithms of social networks, with a focus on disinformation and election monitoring
142162* [DAU Dashboard by Tattle](https://github.com/tattle-made/dau-dashboard)
143163 * Deepfake Analysis Unit (DAU) is a collaborative space for analyzing deepfakes
144144-* [CIB MangoTree](https://github.com/CIB-Mango-Tree/CIB-Mango-Tree-Website)
145145- * A collection of tools to aid researchers in coordinated inauthentic behavior (CIB) analysis
164164+* [Feluda by Tattle](https://github.com/tattle-made/feluda)
165165+ * configurable engine for analysing multi-lingual and multi-modal content
146166* [Interference by Digital Forensics Research Lab](https://github.com/DFRLab/interference2024)
147147- * an interactive, open-source database that tracks allegations of foreign interference or foreign malign influence relevant to the 2024 U.S. presidential election
148148-* [Crossover](https://crossover.social/)
149149- * An open-source project that builds dashboards for monitoring and analyzing the recommendation algorithms of social networks, with a focus on disinformation and election monitoring.
150150-* [TikTok Observatory](https://github.com/aiforensics/tkobservatory)
151151- * An open-source project maintained by [AI Forensics](https://aiforensics.org/) that allows researchers to monitor the promotion and demotion of content by the TikTok reccomendation algorithm.
167167+ * interactive, open-source database that tracks allegations of foreign interference or foreign malign influence relevant to the 2024 U.S. presidential election
152168* [OpenMeasures](https://gitlab.com/openmeasures)
153153- * an open source platform for investigating internet trends
169169+ * open source platform for investigating internet trends
170170+* [ThreatExchange by Meta](https://github.com/facebook/ThreatExchange)
171171+ * platform that enables organizations to share information about threats, such as malware, phishing attacks, and online safety harms in a structured and privacy-compliant manner
172172+* [ThreatExchange Client via PHP](https://github.com/certly/threatexchange)
173173+ * PHP client for ThreatExchange
174174+* [ThreatExchange via Python](https://github.com/facebook/ThreatExchange/tree/main/python-threatexchange)
175175+ * Python library for ThreatExchange
176176+* [TikTok Observatory](https://github.com/aiforensics/tkobservatory)
177177+ * open-source project maintained by [AI Forensics](https://aiforensics.org/) that allows researchers to monitor the promotion and demotion of content by the TikTok recommendation algorithm
154178155179156180## Datasets
181181+157182* [Aegis Content Safety by NVIDIA](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0)
158158- * a dataset created by NVIDIA to aid in content moderation and toxicity detection
183183+ * dataset created by NVIDIA to aid in content moderation and toxicity detection
184184+* [Toxic Chat by LMSYS](https://huggingface.co/datasets/lmsys/toxic-chat)
185185+ * dataset of toxic conversations collected from interactions with Vicuna
159186* [Toxicity by Jigsaw](https://huggingface.co/datasets/google/jigsaw_toxicity_pred)
160160- * a large number of Wikipedia comments which have been labeled by human raters for toxic behavior
161161-* [Toxic Chat by LMSYS](https://huggingface.co/datasets/lmsys/toxic-chat)
162162- * a dataset of toxic conversations collected from interactions with Vicuna
187187+ * large number of Wikipedia comments which have been labeled by human raters for toxic behavior
163188* [Uli Dataset by Tattle](https://github.com/tattle-made/uli_dataset)
164164- * A dataset of gendered abuse, created for Uli ML redaction.
189189+ * dataset of gendered abuse, created for Uli ML redaction.
165190* [VTC by Unitary AI](https://github.com/unitaryai/VTC)
166166- * an implementation of video-text retrieval with comments including a dataset, method of identifying relevant auxiliary information that adds context to videos, and quantification of the value comment-modality bring to video.
191191+ * implementation of video-text retrieval with comments including a dataset, method of identifying relevant auxiliary information that adds context to videos, and quantification of the value comment-modality brings to video
167192168193169194## Red Teaming Datasets
170170-* [Red Team Resistance Leaderboard](https://huggingface.co/spaces/HaizeLabs/red-teaming-resistance-benchmark)
171171- * rankings of AI models based on resistance to adversarial attacks.
172172-* [JailbreakHub by WalledAI](https://huggingface.co/datasets/walledai/JailbreakHub)
173173- * a collection of jailbreak prompts and corresponding model responses
174174-* [SidFeel Jailbreak Dataset](https://github.com/sidfeels/PromptsDB)
175175- * a collection of prompts used for jailbreaking AI models.
195195+196196+* [AI Alignment Dataset by Anthropic](https://atlas.nomic.ai/map/anthropic_rlhf)
197197+ * data used for reinforcement learning with human feedback (RLHF) to align AI models.
198198+* [DEF CON Red Teaming Dataset](https://github.com/humane-intelligence/ai_village_defcon_grt_data)
199199+ * dataset from DEF CON’s AI red teaming event.
176200* [HackAPrompt Jailbreak Dataset](https://huggingface.co/datasets/hackaprompt/hackaprompt-dataset/viewer/default/train?p=1&row=137)
177177- * a dataset for testing AI vulnerability to prompt-based jailbreaking.
201201+ * dataset for testing AI vulnerability to prompt-based jailbreaking
178202* [HiroKachi Jailbreak Dataset](https://sizu.me/love)
179179- * adataset focused on adversarial AI prompt attacks.
203203+ * dataset focused on adversarial AI prompt attacks
204204+* [Jailbreak Prompt Generator AI Model](https://huggingface.co/tsq2000/Jailbreak-generator)
205205+ * AI model that generates jailbreak-style prompts
206206+* [JailbreakHub by WalledAI](https://huggingface.co/datasets/walledai/JailbreakHub)
207207+ * collection of jailbreak prompts and corresponding model responses
208208+* [Red Team Resistance Leaderboard](https://huggingface.co/spaces/HaizeLabs/red-teaming-resistance-benchmark)
209209+ * rankings of AI models based on resistance to adversarial attacks
180210* [Rentry Jailbreak Datasets](https://rentry.org/gpt0721)
181181- * collection of datasets related to jailbreak attempts on AI models.
182182-* [DEFCOM Red Teaming Dataset](https://github.com/humane-intelligence/ai_village_defcon_grt_data)
183183- * dataset from DEF CON’s AI red teaming event.
184184-* [Anthropic’s AI Alignment Dataset](https://atlas.nomic.ai/map/anthropic_rlhf)
185185- * data used for reinforcement learning with human feedback (RLHF) to align AI models.
186186-* [Jailbreak Prompt Generator AI Model](https://huggingface.co/tsq2000/Jailbreak-generator)
187187- * AI model that generates jailbreak-style prompts.
211211+ * collection of datasets related to jailbreak attempts on AI models
212212+* [SidFeel Jailbreak Dataset](https://github.com/sidfeels/PromptsDB)
213213+ * collection of prompts used for jailbreaking AI models
188214215215+189216## Decentralized Platforms
217217+218218+* [Automod by Bluesky](https://github.com/bluesky-social/indigo/tree/main/automod)
219219+ * tool for automating content moderation processes for the Bluesky social network and other apps on the AT Protocol
190220* [FediCheck](https://connect.iftas.org/library/iftas-documentation/fedicheck/)
191221 * domain moderation tool to assist ActivityPub service providers, such as Mastodon servers, now open-sourced.
192222* [Fediverse Spam Filtering](https://github.com/MarcT0K/Fediverse-Spam-Filtering/)
193193- * a spam filter for Fediverse social media platforms. For now, the current version is only a proof of concept.
223223+ * spam filter for Fediverse social media platforms. For now, the current version is only a proof of concept.
194224* [FIRES](https://github.com/fedimod/fires)
195225 * reference server + protocol for the exchange of moderation advisories and recommendations
196226* [Ozone by Bluesky](https://github.com/bluesky-social/ozone)
197227 * labeling tool designed for Bluesky. Includes moderation features to action on abuse flags, policy enforcement tools, and investigation features
198198-* [Automod by Bluesky](https://github.com/bluesky-social/indigo/tree/main/automod)
199199- * a tool for automating content moderation processes for the Bluesky social network and other apps on the AT Protocol
228228+200229201230## User Safety Tools
202202-* [Uli by Tattle](https://github.com/tattle-made/Uli)
203203- * Software and Resources for Mitigating Online Gender Based Violence in India
231231+204232* [Frankly by Applied Social Media Lab](https://github.com/berkmancenter/frankly/)
205205- * an online deliberations platform that allows anyone to host video-enabled conversations about any topic
233233+ * online deliberations platform that allows anyone to host video-enabled conversations about any topic
206234* [PolicyKit by UW Social Futures Lab](https://github.com/policykit/policykit)
207207- * a toolkit for building governance in your online community
235235+ * toolkit for building governance in your online community
208236* [SquadBox by UW Social Futures Lab](https://github.com/amyxzhang/squadbox)
209209- * a tool to help people who are being harassed online by having their friends (or “squad”) moderate their messages
210210-211211-237237+ * tool to help people who are being harassed online by having their friends (or “squad”) moderate their messages
238238+* [Uli by Tattle](https://github.com/tattle-made/Uli)
239239+ * Software and Resources for Mitigating Online Gender Based Violence in India
212240