Claude Code Skills 2.0 adds evals plus benchmark test sets; changes target skill reliability as models update over time.
Agent skills shift AI agents toward procedural tasks with skill.md steps; progressive disclosure reduces context window bloat in real use.