You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+10-36Lines changed: 10 additions & 36 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -712,6 +712,8 @@ Building the program with BLAS support may lead to some performance improvements
712
712
713
713
To obtain the official LLaMA 2 weights please see the <a href="#obtaining-and-using-the-facebook-llama-2-model">Obtaining and using the Facebook LLaMA 2 model</a> section. There is also a large selection of pre-quantized `gguf` models available on Hugging Face.
714
714
715
+
Note: `convert.py` does not support LLaMA 3, you can use `convert-hf-to-gguf.py` with LLaMA 3 downloaded from Hugging Face.
716
+
715
717
```bash
716
718
# obtain the official LLaMA model weights and place them in ./models
717
719
ls ./models
@@ -977,48 +979,20 @@ Here is a demo of an interactive session running on Pixel 5 phone:
Termux from F-Droid offers an alternative route to execute the project on an Android device. This method empowers you to construct the project right from within the terminal, negating the requirement for a rooted device or SD Card.
982
-
983
-
Outlined below are the directives for installing the project using OpenBLAS and CLBlast. This combination is specifically designed to deliver peak performance on recent devices that feature a GPU.
984
-
985
-
If you opt to utilize OpenBLAS, you'll need to install the corresponding package.
986
-
```
987
-
apt install libopenblas
988
-
```
989
-
990
-
Subsequently, if you decide to incorporate CLBlast, you'll first need to install the requisite OpenCL packages:
In order to compile CLBlast, you'll need to first clone the respective Git repository, which can be found at this URL: https://github.com/CNugteren/CLBlast. Alongside this, clone this repository into your home directory. Once this is done, navigate to the CLBlast folder and execute the commands detailed below:
982
+
#### Build on Android using Termux
983
+
[Termux](https://github.com/termux/termux-app#installation) is an alternative to execute `llama.cpp` on an Android device (no root required).
996
984
```
997
-
cmake .
998
-
make
999
-
cp libclblast.so*$PREFIX/lib
1000
-
cp ./include/clblast.h ../llama.cpp
985
+
apt update && apt upgrade -y
986
+
apt install git
1001
987
```
1002
988
1003
-
Following the previous steps, navigate to the LlamaCpp directory. To compile it with OpenBLAS and CLBlast, execute the command provided below:
989
+
It's recommended to move your model inside the `~/` directoryfor best performance:
make LLAMA_CLBLAST=1 //(sometimes you need to run this command twice)
1008
-
```
1009
-
1010
-
Upon completion of the aforementioned steps, you will have successfully compiled the project. To run it using CLBlast, a slight adjustment is required: a command must be issued to direct the operations towards your device's physical GPU, rather than the virtual one. The necessary command is detailed below:
(Note: some Android devices, like the Zenfone 8, need the following command instead - "export LD_LIBRARY_PATH=/system/vendor/lib64:$LD_LIBRARY_PATH". Source: https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/ )
1018
-
1019
-
For easy and swift re-execution, consider documenting this final part in a .sh script file. This will enable you to rerun the process with minimal hassle.
1020
994
1021
-
Place your desired model into the `~/llama.cpp/models/` directory and execute the `./main (...)` script.
995
+
[Follow the Linux build instructions](https://github.com/ggerganov/llama.cpp#build) to build `llama.cpp`.
0 commit comments