Opik Archives

All Categories
Tutorials
Machine Learning
Academic Research
Integrations
Office Hours
Thought Leadership
LLMOps
Comet Community Hub
Uncategorized
Partners & Integrations
Product
Industry

Product Tutorials Machine Learning LLMOps Comet Community Hub

January 28, 2025

Abby Morgan

G-Eval for LLM Evaluation

LLM-as-a-judge evaluators have gained widespread adoption due to their flexibility, scalability, and close alignment with human judgment. They excel at…

Read

Tutorials LLMOps Comet Community Hub

December 19, 2024

Abby Morgan

BERTScore For LLM Evaluation

Introduction BERTScore represents a pivotal shift in LLM evaluation, moving beyond traditional heuristic-based metrics like BLEU and ROUGE to a…

Read

Tutorials LLMOps Comet Community Hub

November 21, 2024

Abby Morgan

Perplexity for LLM Evaluation

Perplexity is, historically speaking, one of the "standard" evaluation metrics for language models. And while recent years have seen a…

Read

Announcing Opik, our open source LLM evaluation platform!

G-Eval for LLM Evaluation

BERTScore For LLM Evaluation

Perplexity for LLM Evaluation

Products

Learn

Company

Pricing