Skills are quietly becoming the unit of agent knowledge

9 points by latand6 2 months ago · 15 comments · 2 min read

In the last few months agent skills went from a niche Claude Code feature to something every major runtime supports. Anthropic has an official skills repo. OpenAI shipped skills in Codex with a built-in skill-creator. Karpathy talks about "everything is skill issue" and describes writing skills as curricula for agents [1]. The format is converging: a folder with a SKILL.md, optional scripts, optional reference files.

What changed is that the models got good enough to follow written instructions reliably. A skill is just a tested workflow in markdown that the agent reads and follows instead of improvising. You can also bundle scripts that the agent runs during the workflow, which covers what most people use lightweight MCP servers for, except the agent can read the script source and extend it.

Karpathy talks about an "economy of agents" and says we should stop writing HTML docs for humans and start writing markdown docs for agents [1]. Anthropic just shipped a skill-creator that benchmarks whether a skill still works after model updates. There are already tens of thousands of community skills on GitHub.

Distribution still feels early. Most useful skills are tiny. A markdown file, maybe one script. Useful enough to keep reusing, but not something anyone turns into a proper GitHub repo with a README and install instructions. So they stay on one machine.

I have been writing skills for my own agents for a while and I keep running into this. The format works. Moving them between machines or handing one to someone else does not.

Curious if others are hitting the same wall or if there are approaches I am missing.

[1] https://www.youtube.com/watch?v=kwSVtQ7dziU (Karpathy on No Briars podcast, skills discussion around 1:03:40)

SirensOfTitan 2 months ago

Most of this discourse feels like some kind of religious ritual built on the foundation of authority bias. Where is the evidence that skills improves performance over any other methodology outside of the fact of its nascent popularity?

I do agree with Jacques Ellul in Technological Society that technique precedes science, and that's certainly the case with LLMs; however, this whole industry waves off rigorous validation in favor of personal anecdotes ("it feels more productive to me!" "they didn't study after Opus 4.5 was released").

latand6OP 2 months ago

The difference I'm noticing is in that with a proper skill you can skip the process of LLM wandering about and trying to guess how to interact with an API or else.
so they basically just save you time, even if they are 50% efficient of what it COULD be

MarcelinoGMX3C 2 months ago

Thinking about what manzanarama mentioned regarding checking skills into a repo, that's exactly how I look at it. To me, these skills are just specialized configuration. We've been versioning configuration for decades because it's critical to reproducibility and maintainability.

The "cognitive load" problem latand6 raised for reviewing every skill is real. That's where you integrate security and quality gates – treat these skills like any other software artifact. You wouldn't manually review every line of every dependency, so automate the validation here too.

dmppch 2 months ago

The distribution problem is harder than it looks because it's actually a composition problem in disguise. A single skill is trivially shareable — zip it, gist it, whatever.

But in practice you end up with skills that depend on other skills, or a skill that assumes specific instructions are already loaded, conflicting skills, versioning and supply chain issues - and suddenly you need dependency resolution.

I've built a package-manager approach for this (APM - github.com/microsoft/apm) and the thing that surprised me most was how quickly even small teams end up with config sprawl - and how much a manifest that travels with the project helps.

The "too small for a repo" thing is real, but one pattern that works is having a monorepo per dev team or org with all skills and building jointly over there.

latand6OP 2 months ago

Yeah, I've built my own skill-package manager as well btw! Then it clicked and I hyperfocused for a whole week and vibecoded a skill marketplace haha. Because the selling of a skill seems to sound like a new idea. going to write a show HN about that maybe

moomoo11 2 months ago

How are skills different from Having the agent create todo-the-ticket.md (with me) that contain the work scope and steps.

That’s what I do for each ticket.

latand6OP 2 months ago

Skills are repeatable workflows, that could be extracted as unit of work I guess
- moomoo11 2 months ago
  
  Yeah but each ticket is different I guess.
  Idk I just never used skills. I have an architecture md (kind of like a skill in that it’s static? It always reads this first), and todos for every single ticket the AI goes through and does it. I have like 90% plus success.

rbitr 2 months ago

Will skills replace MCP servers eventually?

brazukadev 2 months ago

How would it replace something that is not used widespread yet? I think both skills and MCP already have similar levels of adoption.
- latand6OP 2 months ago
  
  It's more - a SKILL can contain MCP :)

manzanarama 2 months ago

What do you think about checking the skills directly into the repo where they are useful?

latand6OP 2 months ago

Yeah, that's the default way to approach it, but the cognitive load is still a problem. Manually reviewing every single skill I might need just for one task is tedious. GitHub stars won't give you any useful signal. I actually started building a product that solves this.

Settings

Skills are quietly becoming the unit of agent knowledge

Keyboard Shortcuts