bearseascape
- Karma
- 50
- Created
- 2 years ago
Recent Submissions
- 1. ▲ Why one small American town won't stop stoning its residents to death (archiveofourown.org)
- 2. ▲ The most complex model we understand [video] (youtube.com)
- 3. ▲ Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs (arxiv.org)
- 4. ▲ MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation (arxiv.org)
- 5. ▲ Automated Researchers Can Subtly Sandbag (alignment.anthropic.com)
- 6. ▲ Auditing Language Models for Hidden Objectives (anthropic.com)
- 7. ▲ Policy for LLM Writing on LessWrong (lesswrong.com)
- 8. ▲ Towards Understanding Distilled Reasoning Models: A Representational Approach (arxiv.org)
- 9. ▲ Transformers Learn to Implement Multistep Gradient Descent with Chain of Thought (arxiv.org)
- 10. ▲ (Mis)Fitting: A Survey of Scaling Laws (arxiv.org)