Methodology · Curated marketplace
exploring-llm-evaluations
Investigate AI observability evaluations of both types: hog (deterministic, code-based) and llmjudge (LLM-prompt-based).
Composite
3.7
C 3.7 · A 0.0
How we got there
1 source verified
- Best source skillsmp.com
- Authority tier Tier 2: Curated marketplace
- Stars ★ 34,779
- Source link https://skillsmp.com/skills/posthog-posthog-products-ai-observability-skills-exploring-llm-evaluations-skill-md
- First published 2026-05-22
Use this skill
/plugin install exploring-llm-evaluations
More in Methodology
claude-api
Reference for the Claude API / Anthropic SDK — model ids, pricing, params, streaming, tool use, MCP, agents, caching, token counting, model migration.
prompt-engineering
Universal prompt engineering techniques for any LLM.
github-swyxio-ai-notes
Notes for software engineers getting up to speed on new AI developments.
mcp-builder
Builds production MCP servers via 4-phase methodology: research, implement, test, evaluate. Triggers: build MCP, new MCP, MCP integration, MCP server scaffold.
Auto-indexed. Editorial review pending — score is based on the rubric only.