Comet Blog

Meet Opik: Your New Tool to Evaluate, Test, and Monitor LLM Applications

September 16, 2024

Gideon MendelsJacques Verre

Today, we’re thrilled to introduce Opik – an open-source, end-to-end LLM development platform that provides the observability tools you need…

Read

Academic Research Comet Community Hub

March 27, 2025

Vincent Koc

LLM Evaluation Complexities for Non-Latin Languages

Large language models (LLMs) have revolutionized natural language processing, yet most development and evaluation efforts have historically centered around Latin-script…

Read

Tutorials LLMOps Comet Community Hub

March 26, 2025

Abby Morgan

SelfCheckGPT for LLM Evaluation

Detecting hallucinations in language models is challenging. There are three general approaches: Measuring token-level probability distributions for indications that a…

Read

LLMOps

March 26, 2025

Kelsey Kinzer

LLM Hallucination Detection in App Development

Even ChatGPT knows it’s not always right. When prompted, “Are large language models (LLMs) always accurate?” ChatGPT says no and…

Read

graphic showing example llm hallucination responses from an AI chatbot that incorrectly counts the number of times the letter A appears in the word hallucination

Product

March 10, 2025

Caroline Borders

Major Releases: TypeScript for LLM Evals, Total Fidelity ML Metrics, & More

Spring is in the air, and we’re excited to bring you four fresh releases in the Comet platform to make…

Read

LLMOps

March 3, 2025

Leonardo Gonzalez

LLM Evaluation Frameworks: Head-to-Head Comparison

As teams work on complex AI agents and expand what LLM-powered applications can achieve, a variety of LLM evaluation frameworks…

Read

Tutorials LLMOps Comet Community Hub

February 24, 2025

Abby Morgan

LLM Juries for Evaluation

Evaluating the correctness of generated responses is an inherently challenging task. LLM-as-a-Judge evaluators have gained popularity for their ability to…

Read

LLM Juries for Evaluation featured image

Tutorials Machine Learning LLMOps

February 19, 2025

Claire Longo

A Simple Recipe for LLM Observability

So, you’re building an AI application on top of an LLM, and you’re planning on setting it live in production.…

Read

Page 1
Page 2
Page 3
Page 4
…
Page 64
Next

Run open source LLM evaluations with Opik!

Comet Blog

Meet Opik: Your New Tool to Evaluate, Test, and Monitor LLM Applications

LLM Evaluation Complexities for Non-Latin Languages

SelfCheckGPT for LLM Evaluation

LLM Hallucination Detection in App Development

Major Releases: TypeScript for LLM Evals, Total Fidelity ML Metrics, & More

LLM Evaluation Frameworks: Head-to-Head Comparison

LLM Juries for Evaluation

A Simple Recipe for LLM Observability

Products

Learn

Company

Pricing