Refactor llama-model.cpp #16252

pwilkin · 2025-09-25T13:42:08Z

@ggerganov I know you said you were planning to do it, but honestly it's been a nightmare working on all the model implementations with the huge llama-model.cpp, so I wanted to just get the "easy" albeit tedious part out of the way. Moved all llm_build_* definitions to their separate class files in src/models/

CISC · 2025-09-25T14:02:19Z

This is a nightmare to review and rebase until merged though, also you seem to have pushed the same changes to your Qwen3-Next PR?

pwilkin · 2025-09-25T14:48:32Z

@CISC Ye, need them for working there, but I'll revert once I'm done (unless this is done first and I can merge on top).

I know it's a nightmare, I already kind of went through it when I asked an LLM to automate some tasks and it proceeded merrily ripping out methods just because some classes didn't inherit from llm_graph_context :>

If you want, I can write a script that runs tree-sitter on the original definitions in llama-model.cpp vs the new classes and shows any differences to verify that nothing was accidentally lost.

jacekpoplawski · 2025-09-25T15:23:26Z

Maybe it would be easier to refactor that partially, just a subset of the models?

ngxson · 2025-09-29T03:50:49Z

This seems to be a good change, just have some other ideas:

I think for now all models can share one single .h file. The main .cpp implementation can be split into smaller files as proposed here
Our naming convention uses - instead of _. Maybe just src/models/(model-name).cpp is enough, no need the llm_build prefix. For example: src/models/gpt2.cpp

Refactor llama-model.cpp

920f0bc

pwilkin requested review from CISC and ggerganov as code owners September 25, 2025 13:42

pwilkin added 2 commits September 25, 2025 15:51

Add missing LFM2 code

380ec87

Fix whitespace / end-of-line newline issues.

b44da78

pwilkin added 2 commits September 25, 2025 18:33

Fix extra semicolons

bcd866a

Merge branch 'ggml-org:master' into llama-cpp-refactor

b73ee0a

ngxson mentioned this pull request Sep 29, 2025

Model: Qwen3 Next #16095

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor llama-model.cpp #16252

Refactor llama-model.cpp #16252

pwilkin commented Sep 25, 2025

Uh oh!

CISC commented Sep 25, 2025

Uh oh!

pwilkin commented Sep 25, 2025

Uh oh!

jacekpoplawski commented Sep 25, 2025

Uh oh!

ngxson commented Sep 29, 2025

Uh oh!

Uh oh!

Refactor llama-model.cpp #16252

Are you sure you want to change the base?

Refactor llama-model.cpp #16252

Conversation

pwilkin commented Sep 25, 2025

Uh oh!

CISC commented Sep 25, 2025

Uh oh!

pwilkin commented Sep 25, 2025

Uh oh!

jacekpoplawski commented Sep 25, 2025

Uh oh!

ngxson commented Sep 29, 2025

Uh oh!

Uh oh!