Language models transmit behavioural traits through hidden signals in data

nature.com

4 points by armcat 25 days ago · 4 comments

zahra_lahrsson 25 days ago

Related to this: https://www.nature.com/articles/d41586-026-00906-0 (LLMs can subliminally learn malicious behavior through distillation)

pop_mccoy 25 days ago

Explains the high performance of distilled models then (e.g. Chinese ones).
