Methodology · Curated marketplace

exploring-llm-evaluations

Name: exploring-llm-evaluations
Rating: 3.7 (1 reviews)
Author: unknown

Investigate AI observability evaluations of both types — hog (deterministic code-based) and llmjudge (LLM-prompt-based).

Composite

3.7

C 3.7 · A 0.0

How we got there

Craft · D1–D5

D1 · Trigger clarity 4.5

D2 · Output specificity 3.5

D3 · Scope precision 4.0

D4 · Self-containment 3.5

D5 · Reusability 2.5

02 — Cross-validation

1 source verified

Best source skillsmp.com
Authority tier Tier 2 — Curated marketplace
Stars ★ 34,779
Source link https://skillsmp.com/skills/posthog-posthog-products-ai-observability-skills-exploring-llm-evaluations-skill-md ↗
First published 2026-05-22

Install

/plugin install exploring-llm-evaluations