Evaluating data contamination in LLMs
Introduction
In recent years, large language models (LLMs) such as GPT and Llama have driven steady gains on popular benchmarks. However, one issue has become increasingly important: data contamination, the overlap between a model's training data and the evaluation benchmarks used to assess its performance. This