model: (qwen3next) correct vectorized key_gdiff calculation by ngxson · Pull Request #19324 · ggml-org/llama.cpp

@ngxson

CISC approved these changes Feb 4, 2026

liparetejas pushed a commit to liparetejas/llama.cpp that referenced this pull request

Feb 23, 2026
…#19324)

* model: (qwen3next) correct vectorized key_gdiff calculation

* move transpose to outside of loop

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request

Apr 26, 2026
…#19324)

* model: (qwen3next) correct vectorized key_gdiff calculation

* move transpose to outside of loop

ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request

May 6, 2026
…#19324)

* model: (qwen3next) correct vectorized key_gdiff calculation

* move transpose to outside of loop

my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request

May 15, 2026
…#19324)

* model: (qwen3next) correct vectorized key_gdiff calculation

* move transpose to outside of loop

fewtarius pushed a commit to fewtarius/CachyLLama that referenced this pull request

May 30, 2026
…#19324)

* model: (qwen3next) correct vectorized key_gdiff calculation

* move transpose to outside of loop