christoph/wamr - wamr

Author	SHA1	Message	Date
YAMAMOTO Takashi	f449b79a31	wasi_nn_openvino.c: implement multiple models per instance (#4380 ) tested with two models: ``` --load-graph=id=graph1,file=public/license-plate-recognition-barrier-0007/FP32/license-plate-recognition-barrier-0007.xml,file=public/license-plate-recognition-barrier-0007/FP32/license-plate-recognition-barrier-0007.bin \ --load-graph=id=graph2,file=classify/model.xml,file=classify/model.bin \ --init-execution-context=id=exec1,graph-id=graph1 \ --init-execution-context=id=exec2,graph-id=graph2 \ --set-input=context-id=exec1,dim=1,dim=24,dim=94,dim=3,file=out.bin \ --set-input=context-id=exec2,file=classify/banana-3x224x224-bgr.bin,dim=1,dim=3,dim=224,dim=224 \ --compute=context-id=exec1 \ --compute=context-id=exec2 \ --get-output=context-id=exec1,file=exec1-result.bin \ --get-output=context-id=exec2,file=exec2-result.bin ``` a detailed HOWTO: https://github.com/bytecodealliance/wasm-micro-runtime/pull/4380#issuecomment-2986882718	2025-06-20 15:50:29 +08:00
YAMAMOTO Takashi	ea408ab6c0	wasi-nn: add minimum serialization on WASINNContext (#4387 ) currently this is not necessary because context (WASINNContext) is local to instance. (wasm_module_instance_t) i plan to make a context shared among instances in a cluster when fixing https://github.com/bytecodealliance/wasm-micro-runtime/issues/4313. this is a preparation for that direction. an obvious alternative is to tweak the module instance context APIs to allow declaring some kind of contexts instance-local. but i feel, in this particular case, it's more natural to make "wasi-nn handles" shared among threads within a "process". note that, spec-wise, how wasi-nn behaves wrt threads is not defined at all because wasi officially doesn't have threads yet. i suppose, at this point, that how wasi-nn interacts with wasi-threads is something we need to define by ourselves, especially when we are using an outdated wasi-nn version. with this change, if a thread attempts to access a context while another thread is using it, we simply make the operation fail with the "busy" error. this is intended for the minimum serialization to avoid problems like crashes/leaks/etc. this is not intended to allow parallelism or such. no functional changes are intended at this point yet. cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4313 https://github.com/bytecodealliance/wasm-micro-runtime/issues/2430	2025-06-20 09:48:55 +08:00
YAMAMOTO Takashi	71c07f3e4e	deprecate legacy WAMR-specific "wasi_nn" module (#4382 ) wasi_nn.h: deprecate legacy "wasi_nn" cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4326	2025-06-19 14:32:26 +08:00
YAMAMOTO Takashi	db7714f0f5	wasi_nn_tensorflowlite.cpp: reject non-fp32 input earlier (#4388 ) this backend assumes fp32 here and there. it's safer to reject unexpected inputs explicitly.	2025-06-18 19:08:57 +08:00
YAMAMOTO Takashi	cba9001749	wasi-nn: don't try to deinit uninitialized backend (#4375 ) cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4339	2025-06-17 17:40:53 +08:00
YAMAMOTO Takashi	c9b8c16088	wasi_nn_openvino.c: remove pre/postprocessing and layout assumptions (#4361 ) as wasi-nn doesn't have these concepts, the best we can do without risking breaking certain applications here is to pass through tensors as they are. this matches wasmtime's behavior. tested with: * wasmtime classification-example (with this change, this example fails on tensor size mismatch instead of implicitly resizing it.) * license-plate-recognition-barrier-0007, a converted version with non-fp32 output. [1] (with this change, this model outputs integers as expected.) [1] `cd7ebe313b/models/public/license-plate-recognition-barrier-0007`	2025-06-17 13:01:46 +08:00
YAMAMOTO Takashi	2f0750a6fe	wasi_nn_openvino.c: add a missing buffer overflow check in get_output (#4353 ) cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4351	2025-06-17 11:17:00 +08:00
YAMAMOTO Takashi	745da82cd6	wasi_nn_openvino.c: remove broken xml check (#4365 ) `xml.buf[xml.size]` check is broken because it accesses past the end of the buffer. anyway, openvino doesn't seem to care about the NUL termination.	2025-06-17 11:02:36 +08:00
YAMAMOTO Takashi	0d001c4c38	wasi-nn: fix backend leak on multiple loads (#4366 ) cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4340	2025-06-17 11:01:07 +08:00
YAMAMOTO Takashi	5478d267f4	wasi_nn_openvino.c: remove the tensor layout adjustment logic (#4308 ) the logic in question seems like an attempt to work around some application bugs. my wild guess is that it was for classification-example. cf. https://github.com/bytecodealliance/wasmtime/issues/10867	2025-06-12 09:34:14 +08:00
YAMAMOTO Takashi	ea5757f1d7	wasi-nn: do not assign wasi_nn_ctx->backend multiple times (#4329 )	2025-06-09 11:36:31 +08:00
YAMAMOTO Takashi	769d16eaab	wasi-nn: move some host-only things out of wasi_nn_types.h (#4334 ) cf. https://github.com/bytecodealliance/wasm-micro-runtime/issues/4324	2025-06-06 15:07:29 +08:00
YAMAMOTO Takashi	85efe08431	wasi-nn: protect the backend lookup table with a lock (#4319 ) this would avoid potential issues when multiple instances happen to make an attempt to load a backend at the same time. Fixes: https://github.com/bytecodealliance/wasm-micro-runtime/issues/4314	2025-06-05 09:19:46 +08:00
YAMAMOTO Takashi	6a00874f2f	wasi_nn_openvino.c: make this buildable (#4305 )	2025-06-03 13:28:13 +08:00
YAMAMOTO Takashi	61cb97221e	wasi-nn: fix shared library filenames for macOS (#4306 ) tested with openvino	2025-06-03 13:23:19 +08:00
YAMAMOTO Takashi	ae6e490ad5	fix wasi-nn abi definitions (#4307 ) sync with a more appropriate version of the definitions. as we use the "wasi_ephemeral_nn", which is p1-based, it seems more appropriate to use definitions from witx, not wit. it's a bit unfortunate p2-based wasi-nn made gratuitous changes like this from p1. note: this is an ABI change.	2025-06-03 13:22:48 +08:00
YAMAMOTO Takashi	16c46751ac	wasi-nn: remove "backends" argument from detect_and_load_backend() (#4309 ) it seems meaningless and quite confusing to access a table with two aliases ("lookup" and "backends") within a function. no functional changes are intended.	2025-06-03 13:22:27 +08:00
YAMAMOTO Takashi	1c12a32066	wasi_nn_openvino.c: fix a few printf formats (#4310 )	2025-06-03 13:21:32 +08:00
hongxia	aa1ff778b9	add load_by_name in wasi-nn (#4298 )	2025-06-03 06:26:58 +08:00
Huang Qi	412631ac13	fix: correct typos and improve comments across multiple files by codespell (#4116 ) Signed-off-by: Huang Qi <huangqi3@xiaomi.com>	2025-03-07 08:21:54 +08:00
liang.he	0599351262	wasi-nn: Add a new target for llama.cpp as a wasi-nn backend (#3709 ) Minimum support: - [x] accept (WasmEdge) customized model parameters. metadata. - [x] Target [wasmedge-ggml examples](https://github.com/second-state/WasmEdge-WASINN-examples/tree/master/wasmedge-ggml) - [x] basic - [x] chatml - [x] gemma - [x] llama - [x] qwen --- In the future, to support if required: - [ ] Target [wasmedge-ggml examples](https://github.com/second-state/WasmEdge-WASINN-examples/tree/master/wasmedge-ggml) - [ ] command-r. (>70G memory requirement) - [ ] embedding. (embedding mode) - [ ] grammar. (use the grammar option to constrain the model to generate the JSON output) - [ ] llama-stream. (new APIS `compute_single`, `get_output_single`, `fini_single`) - [ ] llava. (image representation) - [ ] llava-base64-stream. (image representation) - [ ] multimodel. (image representation) - [ ] Target [llamaedge](https://github.com/LlamaEdge/LlamaEdge)	2024-09-10 08:45:18 +08:00
liang.he	140ff25d46	wasi-nn: Apply new architecture (#3692 ) ps. https://github.com/bytecodealliance/wasm-micro-runtime/issues/3677	2024-08-13 09:14:52 +08:00
liang.he	058bc47102	[wasi-nn] Add a new wasi-nn backend openvino (#3603 )	2024-07-22 17:16:41 +08:00
liang.he	db025e457a	sync up with latest wasi-nn spec (#3530 )	2024-06-17 14:58:09 +08:00
liang.he	f844b33b2d	Make wasi-nn backends as separated shared libraries (#3509 ) - All files under core/iwasm/libraries/wasi-nn are compiled as shared libraries - wasi-nn.c is shared between backends - Every backend has a separated shared library - If wasi-nn feature is enabled, iwasm will depend on shared library libiwasm.so instead of linking static library libvmlib.a	2024-06-14 12:06:56 +08:00
liang.he	028f43bc18	Fix compilation warnings of wasi-nn (#3497 )	2024-06-07 10:49:44 +08:00
Xu Jinyang	cef88deedb	Add `wasi_ephemeral_nn` module support (#3241 ) Add `wasi_ephemeral_nn` module support with optional cmake variable, which was mentioned in #3229.	2024-03-21 21:05:34 +08:00
Wenyong Huang	0ee5ffce85	Refactor APIs and data structures as preliminary work for Memory64 (#3209 ) # Change the data type representing linear memory address from u32 to u64 ## APIs signature changes - (Export)wasm_runtime_module_malloc - wasm_module_malloc - wasm_module_malloc_internal - aot_module_malloc - aot_module_malloc_internal - wasm_runtime_module_realloc - wasm_module_realloc - wasm_module_realloc_internal - aot_module_realloc - aot_module_realloc_internal - (Export)wasm_runtime_module_free - wasm_module_free - wasm_module_free_internal - aot_module_free - aot_module_free_internal - (Export)wasm_runtime_module_dup_data - wasm_module_dup_data - aot_module_dup_data - (Export)wasm_runtime_validate_app_addr - (Export)wasm_runtime_validate_app_str_addr - (Export)wasm_runtime_validate_native_addr - (Export)wasm_runtime_addr_app_to_native - (Export)wasm_runtime_addr_native_to_app - (Export)wasm_runtime_get_app_addr_range - aot_set_aux_stack - aot_get_aux_stack - wasm_set_aux_stack - wasm_get_aux_stack - aot_check_app_addr_and_convert, wasm_check_app_addr_and_convert and jit_check_app_addr_and_convert - wasm_exec_env_set_aux_stack - wasm_exec_env_get_aux_stack - wasm_cluster_create_thread - wasm_cluster_allocate_aux_stack - wasm_cluster_free_aux_stack ## Data structure changes - WASMModule and AOTModule - field aux_data_end, aux_heap_base and aux_stack_bottom - WASMExecEnv - field aux_stack_boundary and aux_stack_bottom - AOTCompData - field aux_data_end, aux_heap_base and aux_stack_bottom - WASMMemoryInstance(AOTMemoryInstance) - field memory_data_size and change __padding to is_memory64 - WASMModuleInstMemConsumption - field total_size and memories_size - WASMDebugExecutionMemory - field start_offset and current_pos - WASMCluster - field stack_tops ## Components that are affected by the APIs and data structure changes - libc-builtin - libc-emcc - libc-uvwasi - libc-wasi - Python and Go Language Embedding - Interpreter Debug engine - Multi-thread: lib-pthread, wasi-threads and thread manager	2024-03-12 11:38:50 +08:00
Zhen Kong	1a88104160	Remove module instance from hashmap in wasi_nn_destroy (#2613 ) When destroying wasi-nn context, module instance should be also removed from hashmap to avoid memory leak.	2023-10-03 08:33:11 +08:00
tonibofarull	b45d014112	wasi-nn: Improve TPU support (#2447 ) 1. Allow TPU and GPU support at the same time. 2. Add Dockerfile to run example with [Coral USB](https://coral.ai/products/accelerator/).	2023-08-14 20:03:56 +08:00
tonibofarull	0b0af1b3df	wasi-nn: Support uint8 quantized networks (#2433 ) Support (non-full) uint8 quantized networks. Inputs and outputs are still required to be `float`. The (de)quantization is done internally by wasi-nn. Example generated from `quantized_model.py`: ![Screenshot from 2023-08-07 17-57-05](https://github.com/bytecodealliance/wasm-micro-runtime/assets/80318361/91f12ff6-870c-427a-b1dc-e307f7d1f5ee) Visualization with [netron](https://netron.app/).	2023-08-11 07:55:40 +08:00
tonibofarull	ab96e01f5e	wasi-nn: Add support of wasi-nn as shared lib (#2310 ) ## Context Currently, WAMR supports compiling iwasm with flag `WAMR_BUILD_WASI_NN`. However, there are scenarios where the user might prefer having it as a shared library. ## Proposed Changes Decouple wasi-nn context management by internally managing the context given a module instance reference.	2023-06-27 18:18:26 +08:00
ayakoakasaka	89be5622a5	wasi-nn: Add external delegation to support several NPU/GPU (#2162 ) Add VX delegation as an external delegation of TFLite, so that several NPU/GPU (from VeriSilicon, NXP, Amlogic) can be controlled via WASI-NN. Test Code can work with the X86 simulator.	2023-05-05 16:29:36 +08:00
tonibofarull	a15a731e12	wasi-nn: Support multiple TFLite models (#2002 ) Remove restrictions: - Only 1 WASM app at a time - Only 1 model at a time - `graph` and `graph-execution-context` are ignored Refer to previous document: `e8d718096d/core/iwasm/libraries/wasi-nn/README.md`	2023-03-08 15:54:06 +08:00
tonibofarull	1614ce12fa	wasi-nn: Enable GPU support (#1922 ) - Split logic in several dockers - runtime: wasi-nn-cpu and wasi-nn-Nvidia-gpu. - compilation: wasi-nn-compile. Prepares the testing wasm and generates the TFLites. - Implement GPU support for TFLite with OpenCL.	2023-02-02 08:09:46 +08:00
tonibofarull	9eed6686df	Refactor WASI-NN to simplify the support for multiple frameworks (#1834 ) - Reorganize the library structure - Use the latest version of `wasi-nn` wit (Oct 25, 2022): `0f77c48ec1/wasi-nn.wit.md` - Split logic that converts WASM structs to native structs in a separate file - Simplify addition of new frameworks	2023-01-25 18:32:40 +08:00
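
The "busy" behaviour described in ea408ab6c0 (fail immediately when another thread is already using a context, rather than wait) can be illustrated with a small sketch. This is not WAMR's code: `demo_ctx`, `demo_ctx_acquire`, `demo_ctx_release` and `DEMO_ERR_BUSY` are hypothetical names, and the use of a pthread mutex around a flag is an assumption made for illustration only.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical names throughout; this is not WAMR's actual code. */
typedef enum { DEMO_OK = 0, DEMO_ERR_BUSY = 1 } demo_error;

typedef struct {
    pthread_mutex_t lock; /* protects the busy flag only */
    bool busy;            /* true while some thread is using the context */
} demo_ctx;

static void
demo_ctx_init(demo_ctx *ctx)
{
    pthread_mutex_init(&ctx->lock, NULL);
    ctx->busy = false;
}

/* Try to claim the context. Fails right away instead of blocking,
 * so the goal is safety (no concurrent use), not parallelism. */
static demo_error
demo_ctx_acquire(demo_ctx *ctx)
{
    demo_error err = DEMO_OK;

    pthread_mutex_lock(&ctx->lock);
    if (ctx->busy)
        err = DEMO_ERR_BUSY;
    else
        ctx->busy = true;
    pthread_mutex_unlock(&ctx->lock);
    return err;
}

/* Give the context back once the operation has finished. */
static void
demo_ctx_release(demo_ctx *ctx)
{
    pthread_mutex_lock(&ctx->lock);
    ctx->busy = false;
    pthread_mutex_unlock(&ctx->lock);
}
```

In this sketch every operation on a context would call `demo_ctx_acquire` first, return the busy error to the caller if it fails, and call `demo_ctx_release` on every exit path otherwise.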

36 Commits
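
Two of the commits above (2f0750a6fe and 745da82cd6) concern buffer bounds: a missing size check before copying an output, and a check that read `buf[size]`, one element past the end. A minimal sketch of both ideas follows. The function and type names are hypothetical and the code is not taken from `wasi_nn_openvino.c`; it only assumes the usual convention that a buffer of `size` bytes has valid indices `0` to `size - 1`.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical names; an illustration, not the WAMR implementation. */
typedef enum { DEMO_SUCCESS = 0, DEMO_TOO_LARGE = 1 } demo_status;

/* Copy an output tensor into a caller-supplied buffer, refusing to
 * write more bytes than the destination can hold. */
static demo_status
demo_copy_output(const uint8_t *src, size_t src_size, uint8_t *dst,
                 size_t dst_capacity, size_t *written)
{
    if (src_size > dst_capacity)
        return DEMO_TOO_LARGE; /* would overflow dst */
    memcpy(dst, src, src_size);
    *written = src_size;
    return DEMO_SUCCESS;
}

/* Valid indices are 0 .. size - 1, so a NUL-termination test must look
 * at buf[size - 1]; reading buf[size] is already out of bounds. */
static int
demo_last_byte_is_nul(const uint8_t *buf, size_t size)
{
    return size > 0 && buf[size - 1] == 0;
}
```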