ggml : x2 speed for WASM by optimizing SIMD by ngxson · Pull Request #11453 · ggml-org/llama.cpp

1 min read Original article ↗

@ngxson

@github-actions github-actions bot added the ggml

changes relating to the ggml tensor library for machine learning

label

Jan 27, 2025

@ngxson

slaren

@ngxson

ggerganov

@ngxson

epapilin

ngxson

funny-falcon

funny-falcon

funny-falcon

idevangsharma

giraygokirmak

camel-cdr

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

@ngxson

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

ggerganov

@ngxson ngxson added the merge ready

indicates that this may be ready to merge soon and is just holding out in case of objections

label

Feb 9, 2025

@ngxson

orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request

Feb 26, 2025
* ggml : x2 speed for WASM by optimizing SIMD

* fix bad merging

* rm trailing spaces

* rm redundant clamp

* better quantize_row_q8_K

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

* remove memset that causes buffer overflow
Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

---------

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

arthw pushed a commit to arthw/llama.cpp that referenced this pull request

Feb 26, 2025
* ggml : x2 speed for WASM by optimizing SIMD

* fix bad merging

* rm trailing spaces

* rm redundant clamp

* better quantize_row_q8_K

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

* remove memset that causes buffer overflow
Co-authored-by: camel-cdr <camel-cdr@protonmail.com>

---------

Co-authored-by: camel-cdr <camel-cdr@protonmail.com>