Humanloop

Manage your LLM experiments through AI agents that understand Humanloop

Your team runs dozens of prompt experiments and evaluation sessions in Humanloop. Now your AI agent can create projects, review experiment results, and surface session data on demand. LLM operations become conversational, and your product team stays focused on shipping.

Chosen by 800+ global brands across industries

LLM operations made conversational

Your AI agent interfaces directly with Humanloop's project and experiment APIs, turning complex LLM management tasks into simple conversational commands.

Create New Projects

A team member asks to spin up a new evaluation workspace. Your AI agent calls Humanloop's Create Project endpoint with the specified name and description, returning the project ID and details. New projects ready in seconds, no dashboard navigation needed.

Delete Projects

An engineer wants to clean up an obsolete project. The agent confirms the project ID, calls the Delete Project endpoint, and permanently removes the project along with all associated sessions and datapoints. Clean workspace, no leftover clutter.

List Experiments

A product manager needs the latest experiment results. The agent queries the List Experiments endpoint for a given project, retrieves experiment names, statuses, configurations, and metrics, and presents a clear summary of ongoing and completed tests.

Browse Session History

Someone wants to review recent user interactions with the AI. The agent calls List Sessions for the project, fetches paginated session data including inputs, outputs, and timestamps, and delivers a structured recap of conversation patterns.

Check Experiment Status

Before a model rollout, your team needs to verify that all experiments passed. The agent retrieves experiment details, checks statuses, and flags any that are still running or have failed, giving your team confidence before going live.

Organize by Directory

Teams managing multiple AI features want projects grouped logically. The agent creates new projects with a specified directory ID, ensuring evaluation workspaces stay organized by feature area, product line, or team without manual folder management.

Humanloop

Use Cases

LLM workflows simplified

See how AI agents streamline Humanloop project management, experiment tracking, and session analysis for product teams building with language models.

Experiment Status Checks Before Production Deploys

An engineer messages 'Are all experiments passing for the onboarding project?' Your AI Agent calls Humanloop's List Experiments endpoint, scans each experiment's status and metrics, and responds with a summary showing three passed, one still running, and none failed. The engineer knows exactly where things stand before triggering a deploy.

Session Analysis for Prompt Debugging

A prompt engineer notices a regression in output quality. They ask the agent for recent sessions. Your AI Agent retrieves the latest session data from Humanloop, including inputs, outputs, and timestamps, and surfaces the last 10 interactions. The engineer spots the problematic pattern in minutes instead of digging through dashboards.

Quick Project Scaffolding for New Features

A product manager kicks off a new AI feature and needs a Humanloop project ready. They tell the agent 'Create a project called Customer Intent Classifier in the Support directory.' The agent creates the project with the right name, description, and directory assignment. The PM shares the project link with the team and evaluation work begins immediately.

Try
Humanloop

Humanloop

FAQs

Frequently Asked Questions

How does the AI agent access my Humanloop projects and experiments?

The agent uses Humanloop's REST API with your API key. It calls endpoints like Create Project, List Experiments, and List Sessions to retrieve or modify data. All requests are authenticated and scoped to your Humanloop organization. No data is stored by Tars between conversations.

Can the agent accidentally delete important projects?

You control this through agent configuration. Set up confirmation steps that require explicit approval before any delete operation. You can also restrict the agent to read-only operations like listing experiments and sessions, removing write access entirely if preferred.

What Humanloop API permissions does Tars require?

Tars needs a Humanloop API key with access to the Projects and Experiments endpoints. For read-only setups, only list and retrieve permissions are needed. For full functionality including project creation and deletion, the key needs write access. You generate this key in your Humanloop account settings.

Does Tars store my experiment results or session data?

No. Tars queries Humanloop in real time during each interaction. Experiment statuses, session logs, and project details are fetched live. Nothing is cached or stored separately. Your evaluation data stays entirely within your Humanloop account.

Can the agent work across multiple Humanloop projects simultaneously?

Yes. The agent can query different project IDs within the same conversation. Ask about experiments in one project, then switch to session data from another. Each API call is scoped to the project ID you specify, keeping results clean and organized.

How is this different from using the Humanloop dashboard directly?

The dashboard requires you to navigate between projects, experiments, and sessions manually. A Tars AI Agent lets your team get answers through natural language, whether on Slack, WhatsApp, or your internal tools. Quick status checks take seconds instead of multiple clicks.

What happens if I request an experiment list for a project that does not exist?

The agent receives an error response from Humanloop and handles it conversationally. It informs you that the project ID was not found, suggests verifying the ID format (must start with 'pr_'), and offers to list your available projects instead.

Can I use the agent to monitor experiment results over time?

The agent can retrieve current experiment statuses and metrics on demand. For ongoing monitoring, configure it to check experiment results at regular intervals and alert your team when an experiment completes or fails. This turns Humanloop into a proactive notification system.

How to add Tools to your AI Agent

Supercharge your AI Agent with Tool Integrations

Don't limit your AI Agent to basic conversations. Watch how to configure and add powerful tools making your agent smarter and more functional.

Privacy & Security

We’ll never let you lose sleep over privacy and security concerns

At Tars, we take privacy and security very seriously. We are compliant with GDPR, ISO, SOC 2, and HIPAA.

GDPR
ISO
SOC 2
HIPAA

Still scrolling? We both know you're interested.

Let's chat about AI Agents the old-fashioned way. Get a demo tailored to your requirements.

Schedule a Demo