This repository was archived by the owner on Sep 10, 2025. It is now read-only.
````diff
 python3 torchchat.py export llama3 --output-aoti-package-path exportedModels/llama3_artifacts --device cpu
 
 # Execute the exported model using Python
 
-python3 torchchat.py generate llama3 --dso-path exportedModels/llama3.so --prompt "Hello my name is"
+python3 torchchat.py generate llama3 --aoti-package-path exportedModels/llama3_artifacts --prompt "Hello my name is" --device cpu
 ```
 
 NOTE: If your machine has cuda add this flag for performance
````
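The hunk above switches the Python flow from a single `.so` file to an AOTI package directory: the path written by `export` is the same path read by `generate`. A minimal sketch of that pairing follows. The helper name `build_python_flow` and its parameters are hypothetical; the flags are copied from the commands in the hunk, and the function only assembles argument lists without running anything.

```python
def build_python_flow(model, package_path, prompt, device="cpu"):
    """Return (export_cmd, generate_cmd) as argv lists.

    Hypothetical helper, not part of torchchat: it mirrors the two
    commands shown in the hunk and reuses one package path for both.
    """
    export_cmd = [
        "python3", "torchchat.py", "export", model,
        "--output-aoti-package-path", package_path,
        "--device", device,
    ]
    generate_cmd = [
        "python3", "torchchat.py", "generate", model,
        "--aoti-package-path", package_path,
        "--prompt", prompt,
        "--device", device,
    ]
    return export_cmd, generate_cmd
```

Each list could be passed to `subprocess.run(cmd, check=True)` from the repository root. Building both commands from one `package_path` argument keeps the export target and the generate input from drifting apart.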
````diff
@@ -172,9 +172,14 @@ To build the runner binary on your Mac or Linux:
 scripts/build_native.sh aoti
 ```
 
+To compile the AOTI generated artifacts into a `.so`:
+```bash
+make -C exportedModels/llama3_artifacts
+```
+
 Execute
 ```bash
-cmake-out/aoti_run exportedModels/llama3.so -z `python3 torchchat.py where llama3`/tokenizer.model -l 3 -i "Once upon a time"
+cmake-out/aoti_run exportedModels/llama3_artifacts/llama3_artifacts.so -z `python3 torchchat.py where llama3`/tokenizer.model -l 3 -i "Once upon a time" -d cpu
````

````diff
-"code generation needs to choose different implementations for DSO and PTE path. Please only use one export option, and call export twice if necessary!"
+"code generation needs to choose different implementations for AOTI and PTE path. Please only use one export option, and call export twice if necessary!"
-"code generation needs to choose different implementations for DSO and PTE path. Please only use one export option, and call export twice if necessary!"
+"code generation needs to choose different implementations for AOTI and PTE path. Please only use one export option, and call export twice if necessary!"
````
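The reworded message implies that a single export call accepts either the AOTI target or the PTE target, not both. A minimal sketch of such a guard follows. The function name `check_export_options`, its parameter names, and the choice of `ValueError` are assumptions made for illustration; only the message text is taken from the change.

```python
def check_export_options(aoti_package_path=None, pte_path=None):
    """Reject a request that sets both export targets at once.

    Hypothetical guard, not torchchat's actual code: the parameter
    names and the exception type are assumptions.
    """
    if aoti_package_path is not None and pte_path is not None:
        raise ValueError(
            "code generation needs to choose different implementations for "
            "AOTI and PTE path. Please only use one export option, and call "
            "export twice if necessary!"
        )
```

Under this sketch, a caller that wants both artifacts would run the export step twice, once per target, as the message suggests.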