Commit 98a70f0
merge (#20)
* Master1 (#17)
* Merge PR (#10) (#11) (#13)
Merge
---------
Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dennyxbox890 <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump requests from 2.31.0 to 2.32.2 in the pip group across 1 directory
Bumps the pip group with 1 update in the / directory: [requests](https://github.com/psf/requests).
Updates `requests` from 2.31.0 to 2.32.2
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](psf/requests@v2.31.0...v2.32.2)
---
updated-dependencies:
- dependency-name: requests
dependency-type: direct:production
dependency-group: pip
...
Signed-off-by: dependabot[bot] <[email protected]>
* Temp (#15)
* metal : fix minor string leaks (ggml/1004)
* cmake : make it possible linking ggml as external lib (ggml/1003)
* sync : ggml
* CANN: adjust backend registry refactor. (ggml-org#10158)
remove buffer->iface.get_name that used in cann as it was removed in backend registry refactor PR.
* metal : move dequantize templates to beginning of MSL source (#0)
* metal : simplify f16 and f32 dequant kernels (#0)
* cuda : clear error after changing peer access (ggml-org#10153)
* fix build break on arm64 linux (ggml-org#10166)
This fixes the build break from the recent changes
to move the CPU backend to separate files
ggml-org#10144
* server : clarify /slots endpoint, add is_processing (ggml-org#10162)
* server : clarify /slots endpoint, add is_processing
* fix tests
* ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (ggml-org#10167)
* ggml : fix gelu tables initialization (ggml-org#10172)
* Q6_K AVX improvements (ggml-org#10118)
* q6_k instruction reordering attempt
* better subtract method
* should be theoretically faster
small improvement with shuffle lut, likely because all loads are already done at that stage
* optimize bit fiddling
* handle -32 offset separately. bsums exists for a reason!
* use shift
* Update ggml-quants.c
* have to update ci macos version to 13 as 12 doesnt work now. 13 is still x86
* ggml : fix arch check in bf16_to_fp32 (ggml-org#10164)
* llama : add <|tool_call|> formatting to Granite template (ggml-org#10177)
Branch: GraniteToolCallTemplate
Signed-off-by: Gabe Goodhart <[email protected]>
* metal : add quantized FA support (ggml-org#10149)
* metal : add quantized FA (vec) support
ggml-ci
* metal : add quantized FA (non-vec) support
* metal : fix support check
ggml-ci
* metal : clean-up
* metal : clean-up (cont)
* metal : fix shared memory calc + reduce smem + comments
* metal : float-correctness
* metal : minor [no ci]
* ggml : adjust is_first_call init value (ggml-org#10193)
ggml-ci
* metal : fix from ptr buffer name (ggml-org#10189)
* server : remove hack for extra parallel slot (ggml-org#10187)
ggml-ci
* metal : add BF16 support (ggml-org#8439)
* ggml : add initial BF16 support
ggml-ci
* metal : add mul_mat_id BF16 support
ggml-ci
* metal : check for bfloat support on the Metal device
ggml-ci
* metal : better var names [no ci]
* metal : do not build bfloat kernels when not supported
ggml-ci
* metal : try to fix BF16 support check
ggml-ci
* metal : this should correctly check bfloat support
---------
Signed-off-by: Gabe Goodhart <[email protected]>
Co-authored-by: Plamen Minev <[email protected]>
Co-authored-by: Yuri Khrustalev <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: leo-pony <[email protected]>
Co-authored-by: Diego Devesa <[email protected]>
Co-authored-by: snadampal <[email protected]>
Co-authored-by: Xuan Son Nguyen <[email protected]>
Co-authored-by: Eve <[email protected]>
Co-authored-by: Gabe Goodhart <[email protected]>
---------
Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: Gabe Goodhart <[email protected]>
Co-authored-by: dennyxbox890 <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Plamen Minev <[email protected]>
Co-authored-by: Yuri Khrustalev <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: leo-pony <[email protected]>
Co-authored-by: Diego Devesa <[email protected]>
Co-authored-by: snadampal <[email protected]>
Co-authored-by: Xuan Son Nguyen <[email protected]>
Co-authored-by: Eve <[email protected]>
Co-authored-by: Gabe Goodhart <[email protected]>
* Rename build.yml to build-ci.yml
* build.yml
* Update build-ci.yml
* Update CMakeLists.txt
* Update CMakeLists.txt
* Update CMakeLists.txt
* Delete ggml/src/vulkan-shaders/CMakeLists.txt
* Update build.yml
* Update build-ci.yml
* Update build-ci.yml
---------
Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: Gabe Goodhart <[email protected]>
Co-authored-by: dennyxbox890 <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Plamen Minev <[email protected]>
Co-authored-by: Yuri Khrustalev <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: leo-pony <[email protected]>
Co-authored-by: Diego Devesa <[email protected]>
Co-authored-by: snadampal <[email protected]>
Co-authored-by: Xuan Son Nguyen <[email protected]>
Co-authored-by: Eve <[email protected]>
Co-authored-by: Gabe Goodhart <[email protected]>1 parent 231f936 commit 98a70f0
File tree
23 files changed
+619
-612
lines changed- .github/workflows
- common
- examples
- llava
- server/tests
- ggml
- src/ggml-cann/kernels
- pocs
- vdot
- scripts
- src
- tests
23 files changed
+619
-612
lines changedLarge diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
10 | | - | |
11 | | - | |
12 | | - | |
13 | | - | |
14 | | - | |
15 | | - | |
16 | | - | |
17 | | - | |
| 10 | + | |
18 | 11 | | |
19 | 12 | | |
20 | 13 | | |
| |||
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
0 commit comments