Various Guidance implementations for ComfyUI / SD WebUI (reForge)

Implementation of

Perturbed-Attention Guidance from Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance (D. Ahn et al.)
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention (Susung Hong)
Sliding Window Guidance from The Unreasonable Effectiveness of Guidance for Diffusion Models (Kaiser et al.)
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity (ComfyUI-only)
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models (ComfyUI-only, has a description inside ComfyUI)
Token Perturbation Guidance for Diffusion Models (ComfyUI-only)

as an extension for ComfyUI and SD WebUI (reForge).

Works with SD1.5 and SDXL.

Installation

ComfyUI

You can either:

git clone https://github.com/pamparamm/sd-perturbed-attention.git into ComfyUI/custom-nodes/ folder.
Install it via ComfyUI Manager (search for custom node named "Perturbed-Attention Guidance").
Install it via comfy-cli with comfy node registry-install sd-perturbed-attention

SD WebUI (reForge)

git clone https://github.com/pamparamm/sd-perturbed-attention.git into stable-diffusion-webui-forge/extensions/ folder.

SD WebUI (Auto1111)

As an alternative for A1111 WebUI you can use PAG implementation from sd-webui-incantations extension.

Guidance Nodes/Scripts

ComfyUI

SD WebUI (reForge)

Note

You can override CFG Scale and PAG Scale/SEG Scale for Hires. fix by opening/enabling Override for Hires. fix tab. To disable PAG during Hires. fix, you can set PAG Scale under Override to 0.

Inputs

scale: Guidance scale, higher values can both increase structural coherence of an image and oversaturate/fry it entirely.
adaptive_scale (PAG only): PAG dampening factor, it penalizes PAG during late denoising stages, resulting in overall speedup: 0.0 means no penalty and 1.0 completely removes PAG.
blur_sigma (SEG only): Normal deviation of Gaussian blur, higher values increase "clarity" of an image. Negative values set blur_sigma to infinity.
unet_block: Part of U-Net to which Guidance is applied, original paper suggests to use middle.
unet_block_id: Id of U-Net layer in a selected block to which Guidance is applied. Guidance can be applied only to layers containing Self-attention blocks.
sigma_start / sigma_end: Guidance will be active only between sigma_start and sigma_end. Set both values to negative to disable this feature.
rescale: Acts similar to RescaleCFG node - it prevents over-exposure on high scale values. Based on Algorithm 2 from Common Diffusion Noise Schedules and Sample Steps are Flawed (Lin et al.). Set to 0 to disable this feature.
rescale_mode:
- full - takes into account both CFG and Guidance.
- partial - depends only on Guidance.
- snf - Saliency-adaptive Noise Fusion from High-fidelity Person-centric Subject-to-Image Synthesis (Wang et al.). Should increase image quality on high guidance scales. Ignores rescale value.
unet_block_list: Optional input, replaces both unet_block and unet_block_id and allows you to select multiple U-Net layers separated with commas. SDXL U-Net has multiple indices for layers, you can specify them by using dot symbol (if not specified, Guidance will be applied to the whole layer). Example value: m0,u0.4 (it applies Guidance to middle block 0 and to output block 0 with index 4)
- In terms of U-Net d means input, m means middle and u means output.
- SD1.5 U-Net has layers d0-d5, m0, u0-u8.
- SDXL U-Net has layers d0-d3, m0, u0-u5. In addition, each block except d0 and d1 has 0-9 index values (like m0.7 or u0.4). d0 and d1 have 0-1 index values.
- Supports block ranges (d0-d3 corresponds to d0,d1,d2,d3) and index value ranges (d2.2-9 corresponds to all index values of d2 with the exclusion of d2.0 and d2.1).

ComfyUI TensorRT PAG (Experimental)

To use PAG together with ComfyUI_TensorRT, you'll need to:

Have 24GB of VRAM.
Build static/dynamic TRT engine of a desired model.
Build static/dynamic TRT engine of the same model with the same TRT parameters, but with fixed PAG injection in selected UNET blocks (TensorRT Attach PAG node).
Use TensorRT Perturbed-Attention Guidance node with two model inputs: one for base engine and one for PAG engine.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.github/workflows		.github/workflows
.vscode		.vscode
example_workflows		example_workflows
res		res
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
guidance_utils.py		guidance_utils.py
nag_nodes.py		nag_nodes.py
pag_nodes.py		pag_nodes.py
pag_trt_nodes.py		pag_trt_nodes.py
pladis_nodes.py		pladis_nodes.py
pladis_utils.py		pladis_utils.py
pyproject.toml		pyproject.toml
tpg_nodes.py		tpg_nodes.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Various Guidance implementations for ComfyUI / SD WebUI (reForge)

Installation

ComfyUI

SD WebUI (reForge)

SD WebUI (Auto1111)

Guidance Nodes/Scripts

ComfyUI

SD WebUI (reForge)

Inputs

ComfyUI TensorRT PAG (Experimental)

About

Uh oh!

Releases

Packages

Languages

License

timkhronos/sd-perturbed-attention-FixedTensorRT

Folders and files

Latest commit

History

Repository files navigation

Various Guidance implementations for ComfyUI / SD WebUI (reForge)

Installation

ComfyUI

SD WebUI (reForge)

SD WebUI (Auto1111)

Guidance Nodes/Scripts

ComfyUI

SD WebUI (reForge)

Inputs

ComfyUI TensorRT PAG (Experimental)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages