Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
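A minimal evasion-attack sketch with ART follows, assuming a PyTorch classifier wrapped in ART's PyTorchClassifier estimator; the toy model and random inputs are placeholders standing in for a real trained model and test set.

```python
import numpy as np
import torch.nn as nn
from art.attacks.evasion import FastGradientMethod
from art.estimators.classification import PyTorchClassifier

# Tiny stand-in classifier; in practice you would wrap your trained model.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
classifier = PyTorchClassifier(
    model=model,
    loss=nn.CrossEntropyLoss(),
    input_shape=(3, 32, 32),
    nb_classes=10,
)

x_test = np.random.rand(8, 3, 32, 32).astype(np.float32)  # placeholder inputs
attack = FastGradientMethod(estimator=classifier, eps=0.1)  # FGSM evasion attack
x_adv = attack.generate(x=x_test)
print(x_adv.shape)
```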
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
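A hedged sketch of TextAttack's Python API (the same recipes are also exposed through the textattack CLI); the HuggingFace model and dataset identifiers below are illustrative choices, not part of the original description.

```python
import transformers
from textattack import Attacker, AttackArgs
from textattack.attack_recipes import TextFoolerJin2019
from textattack.datasets import HuggingFaceDataset
from textattack.models.wrappers import HuggingFaceModelWrapper

# Illustrative victim model: a BERT sentiment classifier fine-tuned on IMDB.
model = transformers.AutoModelForSequenceClassification.from_pretrained(
    "textattack/bert-base-uncased-imdb"
)
tokenizer = transformers.AutoTokenizer.from_pretrained("textattack/bert-base-uncased-imdb")
model_wrapper = HuggingFaceModelWrapper(model, tokenizer)

dataset = HuggingFaceDataset("imdb", split="test")
attack = TextFoolerJin2019.build(model_wrapper)   # TextFooler word-substitution recipe
attacker = Attacker(attack, dataset, AttackArgs(num_examples=10))
attacker.attack_dataset()
```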
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
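A short Foolbox sketch, assuming a trained PyTorch model that expects inputs in [0, 1]; the toy model and random batch are placeholders.

```python
import torch
import torch.nn as nn
import foolbox as fb

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
fmodel = fb.PyTorchModel(model, bounds=(0, 1))

images = torch.rand(8, 3, 32, 32)
labels = torch.randint(0, 10, (8,))

# L-infinity PGD attack at a single perturbation budget.
attack = fb.attacks.LinfPGD()
raw, clipped, is_adv = attack(fmodel, images, labels, epsilons=0.03)
print(is_adv)
```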
A unified evaluation framework for large language models
PyTorch implementation of adversarial attacks [torchattacks]
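A brief torchattacks sketch; the toy model and random batch stand in for a real classifier and dataset, and images are assumed to lie in [0, 1].

```python
import torch
import torch.nn as nn
import torchattacks

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
images = torch.rand(8, 3, 32, 32)
labels = torch.randint(0, 10, (8,))

# PGD with a standard 8/255 budget; returns the perturbed batch.
atk = torchattacks.PGD(model, eps=8 / 255, alpha=2 / 255, steps=10)
adv_images = atk(images, labels)
```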
Must-read Papers on Textual Adversarial Attack and Defense
A PyTorch adversarial library for attack and defense methods on images and graphs
A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, Bayesian rule-mining, and description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Networks.

An Open-Source Package for Textual Adversarial Attack.
This repository is a compilation of APT simulations targeting many vital sectors, both private and governmental. The simulations include written tools, C2 servers, backdoors, exploitation techniques, stagers, bootloaders, and many other tools that attackers might have used in actual attacks. These tools and TTPs are simulated here.
Code relative to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
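This paper's reference implementation is the AutoAttack ensemble; a sketch under the pip package name autoattack, with a placeholder model and random data standing in for a real robustly trained classifier and test set.

```python
import torch
import torch.nn as nn
from autoattack import AutoAttack

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
x_test = torch.rand(16, 3, 32, 32)
y_test = torch.randint(0, 10, (16,))

# Standard AutoAttack ensemble (APGD-CE, APGD-T, FAB-T, Square) under L-infinity.
adversary = AutoAttack(model, norm='Linf', eps=8 / 255, version='standard')
x_adv = adversary.run_standard_evaluation(x_test, y_test, bs=16)
```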
A Harder ImageNet Test Set (CVPR 2021)
A Model for Natural Language Attack on Text Classification and Inference
A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
Implementation of Papers on Adversarial Examples
🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
TrojanZoo provides a universal PyTorch platform for conducting security research (especially backdoor attacks/defenses) on image classification in deep learning.
Implementation of the KDD 2020 paper "Graph Structure Learning for Robust Graph Neural Networks"