forked from ggml-org/llama.cpp
-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is neededsplit
Description
Note: This issue was copied from ggml-org#6259
Original Author: @phymbert
Original Issue Number: ggml-org#6259
Created: 2024-03-23T15:29:25Z
Motivation
we support --split-max-tensors since:
As mentionned by @Artefact2 in this comment:
allowing to split by file size would be more intuitive (and usually more appropriate since file size is usually the limiting factor, eg 4G for FAT or 50G for HF)
Proposition:
Introduce --split-max-size N(M|G) split strategy to split files in file with a max size of N Megabytes or Gigabytes.
As it is not possible to have less than 1 tensor per GGUF, this size is a soft limit.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomershelp wantedExtra attention is neededExtra attention is neededsplit