
Conversation

@ochafik (Collaborator) commented Aug 13, 2023

I've played around with Python bindings generation and thought it was worth adding as an example. This PR features:

  • Access to the full C API (incl. CUDA, MPI, OpenCL, Metal, alloc... and any local API changes)
  • Instant regeneration with `python regenerate.py` (a cffi wrapper; uses llama.cpp headers by default, see README.md for options)
  • Lightweight utils to copy between tensors (ggml & numpy.ndarray alike) with automatic (de/re)quantization, or to view a tensor as a numpy ndarray
  • Full stubs preserving the original C signatures as docstrings (for neat autocomplete in IDEs)

You can play with it directly in this Colab notebook.
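To give some intuition for the automatic (de/re)quantization mentioned above, here is a minimal, pure-NumPy sketch of Q8_0-style block quantization. This is an illustrative approximation only, not ggml's actual kernels or this example's API:

```python
import numpy as np

def quantize_q8_0(x: np.ndarray, block: int = 32):
    """Quantize float32 values to int8 blocks, each block with its own scale."""
    x = x.reshape(-1, block)
    scale = np.abs(x).max(axis=1, keepdims=True) / 127.0
    scale[scale == 0] = 1.0  # avoid division by zero for all-zero blocks
    q = np.round(x / scale).astype(np.int8)
    return q, scale.astype(np.float32)

def dequantize_q8_0(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    """Reconstruct float32 values from int8 blocks and per-block scales."""
    return (q.astype(np.float32) * scale).reshape(-1)

# Round-trip demo: the reconstruction error stays below half a quantization step.
x = np.linspace(-1.0, 1.0, 64, dtype=np.float32)
q, s = quantize_q8_0(x)
x2 = dequantize_q8_0(q, s)
assert np.abs(x - x2).max() < 0.01
```

The real formats in ggml (Q4_0, Q8_0, ...) follow the same block-plus-scale idea but with packed layouts and additional variants.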

(Screenshots: the bindings in use in a Colab notebook, 2023-08-13.)

Some notes:

  • I've committed the generated bindings (ggml/cffi.py) and stubs (ggml/__init__.pyi) mostly for show (to ease the review), but we probably won't want to keep them in the repo (let me know what you think; happy to add a cmake command to generate them during the build)
  • There are already bindings with high-quality docs (https://github.com/abetlen/ggml-python), but I wanted something that's easy to autogenerate and keep up to date with local changes, and I thought having it in the repo's examples made sense.
  • I've tried cffi's native extension generation (see the complex branch) and decided not to go with it, as it's quite a bit more fiddly than this "little" example, with unclear performance benefits.
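As a rough illustration of how stubs that preserve the original C signatures as docstrings might be produced, here is a hypothetical, stdlib-only sketch. The actual regenerate.py uses cffi and the real ggml headers; `stub_for` and its naive regex are inventions for this example:

```python
import re

def stub_for(c_decl: str) -> str:
    """Turn a simple C function declaration into a Python stub
    whose docstring preserves the original C signature."""
    m = re.match(r"\s*[\w\s\*]+?(\w+)\s*\(([^)]*)\)\s*;?\s*$", c_decl)
    if not m:
        raise ValueError(f"unsupported declaration: {c_decl!r}")
    name, params = m.groups()
    args = []
    for p in params.split(","):
        p = p.strip()
        if p and p != "void":
            # keep only the parameter name (last token, pointer stars stripped)
            args.append(p.split()[-1].lstrip("*"))
    return (f"def {name}({', '.join(args)}):\n"
            f'    """{c_decl.strip()}"""\n'
            f"    ...\n")

stub = stub_for(
    "struct ggml_tensor * ggml_new_tensor_1d("
    "struct ggml_context * ctx, enum ggml_type type, int64_t ne0);")
assert stub.splitlines()[0] == "def ggml_new_tensor_1d(ctx, type, ne0):"
```

A real generator would lean on cffi's parsed declarations rather than regexes, but the output shape (a `def` per C function, with the C signature in the docstring) is the same idea.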

Apologies if my Python style is questionable, I'm a bit new 😅

ochafik added 2 commits August 13, 2023 20:05
Add python example w/ cffi-generated bindings

Features:

- Seamless copies between tensors (ggml & numpy alike) with automatic (de/re)quantization
- Access to full C API (incl. CUDA, MPI, OpenCL, Metal, alloc... and any local API changes)
- Trivial regeneration with `python regenerate.py` (uses llama.cpp headers by default; see README.md for options)
@ggerganov (Member) left a comment

Thanks for the contribution!

Looks interesting. I'm not a big Python user, so I cannot tell how useful this is, but it looks well done. As long as it does not interfere with the C library, we can add it to the repo.

Feel free to merge it

@ochafik (Collaborator, Author) commented Aug 22, 2023

@ggerganov thanks a lot, will merge now (pushed some cosmetic changes and regenerated the bindings; so much new gguf stuff 🤗)

Re: usefulness, I'm working on a minimalist Python hybrid of llama2.c + llama.cpp, which I hope could be an easy platform to experiment with various modifications (e.g. on-the-fly matrix decomposition / pruning at loading time w/ numpy).

Debugging GGML_ASSERT failures in Python can be fiddly, but I'll clean up a util that catches SIGABRT and enters an interpreter for easy debugging.

@ochafik ochafik merged commit ffab9c3 into ggml-org:master Aug 22, 2023