[WIP] started adding support for evo2 #42210

McClain-Thiel · 2025-11-14T12:33:32Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Rocketknight1 · 2025-11-14T13:45:54Z

Hi @McClain-Thiel, I'm the ex-postdoc who handles a lot of bio models at HF so feel free to ping me if you have any questions, or whenever this is ready for review!

McClain-Thiel · 2025-11-14T14:08:15Z

Awesome thanks @Rocketknight1. One quick question: not sure if youve read through the evo repo but they use a custom inference engine + library called vortex. What is the huggingface philosophy on including this as a dependency? Am i going to have to reimplement all the forward stuff in that in base torch? Or can I add it as a dep?

…n-hugging-face-transformers Align Evo2 rotary handling with HF generation

Rocketknight1 · 2025-11-17T12:23:46Z

Hi @McClain-Thiel, good question! We generally prefer not to add whole external inference engines. The right approach is probably:

Try to port the core model architecture to a modular file
If the model has weird kernels in vortex, then firstly see if we have an existing similar model you can copy from for the modular (we definitely have other SSM/Hyena-like models - there might be one with equivalent architecture!)
As a last resort, we can try to implement missing kernels via something like Kernel Hub

I haven't investigated too deeply, but if the inference lib also does other stuff like searching for multiple sequence alignments, we probably don't want to put all that directly in Transformers, but we could just add info to the model card. It's fine to show examples in the model card using vortex as long as we can run the core model without it.

github-actions · 2025-11-18T22:08:27Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, evo2

McClain-Thiel · 2025-11-18T23:10:41Z

@Rocketknight1 So I think I have it mostly working. Gemini 3 did a lot of the heavy lifting but if you look at the tests I actually got the logits from the vortex implementation then use those to verify that the implementation here is close. I can remove that file ofc but thought you might want to take a look.

It also passes a simple smoke test (tokenizer is single letter (not constrained to ATGC) but when sampled it seems to give reasonable results (ATCG). Open to more rigorous testing.

Any other suggestions?

Rocketknight1 · 2025-11-19T14:29:08Z

The main thing I'd suggest now before I do a full review is to convert to modular. Basically, we want each modeling file in Transformers to be standalone, to make it easy for users to hack on them, and so that users don't need to understand ten layers of inheritance and abstraction to understand the code.

However, there's a lot of duplicated code this way, and the way we handle that is "modular" files. They're like scaffolds for the actual modeling files - unlike modeling files, imports and inheritance from other models is allowed there. Ideally, when a model is similar to existing models in the codebase, the modular file is very short and mostly just imports/inheritance, but it still can be used to auto-generate a full modeling file (which is the code that actually gets called)

Take a look at other modular files in the codebase to get an idea of how it works. In general, "# Copied from" syntax is deprecated now, and we prefer to import in the modular file instead. Once you've made the modular file, running make fix-copies will autogenerate a modeling file for you. You shouldn't edit that file directly, just tweak the modular file and rerun make fix-copies.

started adding support for evo2

d62163a

Rocketknight1 mentioned this pull request Nov 14, 2025

Add Evo2 Genomic foundation model #42209

Open

2 tasks

McClain-Thiel added 3 commits November 14, 2025 14:24

lets get a model involved

c363816

Align Evo2 rotary handling with HF generation

bafc96e

Merge pull request #1 from McClain-Thiel/codex/implement-evo2-model-i…

963e262

…n-hugging-face-transformers Align Evo2 rotary handling with HF generation

McClain-Thiel added 3 commits November 18, 2025 18:48

I think the logits match now but would like to verify

29ddb21

fixed the weird tests that allowed it to pass

857dc71

removing extra file

de25153

McClain-Thiel marked this pull request as ready for review November 18, 2025 23:07

github-actions bot requested review from ArthurZucker and Rocketknight1 November 18, 2025 23:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] started adding support for evo2 #42210

[WIP] started adding support for evo2 #42210

Uh oh!

McClain-Thiel commented Nov 14, 2025 •

edited

Loading

Uh oh!

Rocketknight1 commented Nov 14, 2025

Uh oh!

McClain-Thiel commented Nov 14, 2025

Uh oh!

Rocketknight1 commented Nov 17, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Nov 18, 2025

Uh oh!

McClain-Thiel commented Nov 18, 2025

Uh oh!

Rocketknight1 commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[WIP] started adding support for evo2 #42210

Are you sure you want to change the base?

[WIP] started adding support for evo2 #42210

Uh oh!

Conversation

McClain-Thiel commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Rocketknight1 commented Nov 14, 2025

Uh oh!

McClain-Thiel commented Nov 14, 2025

Uh oh!

Rocketknight1 commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Nov 18, 2025

Uh oh!

McClain-Thiel commented Nov 18, 2025

Uh oh!

Rocketknight1 commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

McClain-Thiel commented Nov 14, 2025 •

edited

Loading

Rocketknight1 commented Nov 17, 2025 •

edited

Loading