Tag: llms

  • On Preference Optimization and DPO

    Introduction: Training with preference data has allowed large language models (LLMs) to be optimized for specific qualities such as trust, safety, and harmlessness. Preference optimization is the process of using this data to improve LLMs. This method is particularly useful for tuning a model to emphasize certain qualities, or for training scenarios where relative feedback…

  • Writing Better Prompts

    In a world where everyone can be a programmer through natural language, the art of communicating effectively with Large Language Models (LLMs) becomes crucial. While these models understand plain English, there are nuances to crafting prompts that suit a model’s interpretive abilities. This blog explores the emerging field of “Prompt Engineering,” delving into key methods for designing…