
@TDulka commented Nov 29, 2024

Implements a BatchTopKToJump class which trains like a BatchTopK SAE but switches to JumpReLU at inference time.

The JumpReLU thresholds are taken from the minimum nonzero latent activations observed during the last part of training (the last 10% by default). This results in slightly more active latents at inference time (around 106 with k=100 in my experiments).
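
To make the mechanism concrete, here is a minimal sketch of the idea as described above. The class and method names are mine, not the PR's, and details (encoder form, update schedule) are assumptions; the actual implementation is in the linked notebook.

```python
import torch
import torch.nn as nn

class BatchTopKToJumpSketch(nn.Module):
    """Sketch: trains with BatchTopK, infers with JumpReLU (names are illustrative)."""

    def __init__(self, d_model: int, d_hidden: int, k: int):
        super().__init__()
        self.W_enc = nn.Parameter(torch.randn(d_model, d_hidden) * 0.01)
        self.b_enc = nn.Parameter(torch.zeros(d_hidden))
        self.W_dec = nn.Parameter(torch.randn(d_hidden, d_model) * 0.01)
        self.b_dec = nn.Parameter(torch.zeros(d_model))
        self.k = k
        # Per-latent JumpReLU thresholds, estimated near the end of training.
        self.register_buffer("thresholds", torch.full((d_hidden,), float("inf")))
        self.use_jumprelu = False  # flip to True at inference time

    def encode(self, x: torch.Tensor) -> torch.Tensor:
        acts = torch.relu((x - self.b_dec) @ self.W_enc + self.b_enc)
        if self.use_jumprelu:
            # JumpReLU inference: keep activations above the per-latent threshold.
            return acts * (acts > self.thresholds)
        # BatchTopK training: keep the k * batch_size largest activations
        # across the whole batch, zero out the rest.
        flat = acts.flatten()
        topk = torch.topk(flat, self.k * acts.shape[0])
        mask = torch.zeros_like(flat)
        mask[topk.indices] = 1.0
        return (flat * mask).view_as(acts)

    @torch.no_grad()
    def update_thresholds(self, acts: torch.Tensor) -> None:
        # Call on the post-TopK activations during the last ~10% of training
        # steps: track the minimum nonzero activation seen for each latent.
        masked = torch.where(acts > 0, acts, torch.full_like(acts, float("inf")))
        self.thresholds = torch.minimum(self.thresholds, masked.min(dim=0).values)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.encode(x) @ self.W_dec + self.b_dec
```

Since each threshold is the smallest activation that survived the BatchTopK selection late in training, any activation above it fires at inference, which slightly loosens sparsity and is consistent with the ~106 active latents at k=100 reported above.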

Here is a notebook that runs the training and some simple evaluation, comparing BatchTopKToJump run in BatchTopK mode and in JumpReLU mode against a classical JumpReLU and a classical TopK approach (the evaluation could probably be done better).
https://colab.research.google.com/drive/1GuFaBmbVvM-rQoWjgMTZAHDxl76xaE1G?usp=sharing

Here are the three wandb runs (I ran more while iterating, but kept just these three cleaner runs for comparison).
https://wandb.ai/tomasdulka/batchtopk_jumprelu

@adamkarvonen (Collaborator) commented

This was implemented with a single global threshold in this PR: #31
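
For contrast, a global threshold replaces the per-latent vector with one scalar shared across all latents. A minimal sketch of that variant under the same tracking idea as above; this is an assumption for illustration, not necessarily how #31 computes it.

```python
import torch

@torch.no_grad()
def update_global_threshold(threshold: torch.Tensor, acts: torch.Tensor) -> torch.Tensor:
    # Track a single scalar over all latents rather than one value per latent.
    nonzero = acts[acts > 0]
    if nonzero.numel() == 0:
        return threshold
    return torch.minimum(threshold, nonzero.min())
```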
