# Set up Monitoring

## Diagnostics

First things first, you're not alone! The WarpStream team constantly monitors your cluster, and when we find anomalies we create [Diagnostics](/warpstream/agent-setup/monitor-the-warpstream-agents/diagnostics.md) that alert you proactively in our UI or through our [Hosted Prometheus Endpoint](/warpstream/agent-setup/monitor-the-warpstream-agents/hosted-prometheus-endpoint.md). Diagnostics cover the most common problems we have seen in operation, and we are constantly adding to the catalog of available diagnostics. If you need more granular detail, keep reading.

## Logging

By default, the WarpStream Agent runs with log level `info`. You can change this with the `WARPSTREAM_LOG_LEVEL` environment variable. For example, if the `info` level logs are too noisy for you, set `WARPSTREAM_LOG_LEVEL=warn`.

The WarpStream Agents have an additional special log level called `analytics` that can be enabled by setting `WARPSTREAM_LOG_LEVEL=analytics`. This enables extremely detailed JSON logging that can be loaded into a logging system that supports analytics to slice and dice Agent log events and obtain a deep understanding of the workload. However, this feature emits a lot of logs, so keep that in mind before enabling it.
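As a rough illustration of the "slice and dice" idea, the sketch below groups JSON log lines by the value of one field. It assumes only that each analytics event is a single JSON object per line. The helper name `count_by_field` and the `event` and `topic` fields in the sample are hypothetical, so check the real field names in your own Agent output before relying on them.

```python
import json
from collections import Counter
from typing import Iterable


def count_by_field(lines: Iterable[str], field: str) -> Counter:
    """Count JSON log events grouped by the value of one field."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not JSON (for example plain-text logs)
        if isinstance(event, dict) and field in event:
            counts[str(event[field])] += 1
    return counts


# Hypothetical sample lines; real analytics events will have different fields.
sample_lines = [
    '{"event": "produce", "topic": "orders"}',
    '{"event": "fetch", "topic": "orders"}',
    '{"event": "produce", "topic": "payments"}',
    "not json",
]
per_topic = count_by_field(sample_lines, "topic")
```

In practice you would more likely run this kind of aggregation inside your logging system. The sketch is only meant to show the shape of the analysis.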

## Metrics

The WarpStream Agents expose a traditional Prometheus metrics endpoint that can be scraped by most popular tools. Prometheus metrics are automatically exposed on the Agent "internal port", which by default is port `8080`. If you set an explicit port override, you'll need to update the port in your Prometheus scrape configuration as well.

All WarpStream Agent metrics begin with the `warpstream_` prefix.
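To check quickly which metrics an Agent exposes without a full Prometheus setup, a small script can fetch the endpoint and keep only the names with the `warpstream_` prefix. The sketch below assumes the endpoint is served over plain HTTP at `/metrics` on the internal port. The helper names `fetch_metrics` and `warpstream_metric_names`, and the metric names in the sample text, are made up for illustration.

```python
import urllib.request
from typing import List


def fetch_metrics(host: str, port: int = 8080, timeout: float = 5.0) -> str:
    """Fetch the raw Prometheus text exposition from an Agent (assumes plain HTTP)."""
    url = f"http://{host}:{port}/metrics"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def warpstream_metric_names(exposition_text: str, prefix: str = "warpstream_") -> List[str]:
    """Return the sorted, unique metric names that start with the given prefix."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        # The metric name ends at the first '{' (labels) or whitespace (value).
        parts = line.split("{", 1)[0].split()
        if parts and parts[0].startswith(prefix):
            names.add(parts[0])
    return sorted(names)


# Illustrative exposition text; the metric names here are not real Agent metrics.
sample_text = """\
# HELP warpstream_example_requests_total An example counter.
# TYPE warpstream_example_requests_total counter
warpstream_example_requests_total{kind="produce"} 12
warpstream_example_requests_total{kind="fetch"} 7
warpstream_example_up 1
go_goroutines 42
"""
names = warpstream_metric_names(sample_text)
```

Against a running Agent, `warpstream_metric_names(fetch_metrics("127.0.0.1"))` would list the names it currently exposes, assuming the default port.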

### Recommended Metrics & Alerting

The WarpStream system is simple by design, so there is less to monitor. If you are coming from open source Kafka, this should be a breath of fresh air.

In [Important Metrics and Logs](/warpstream/agent-setup/monitor-the-warpstream-agents/important-metrics-and-logs.md) you will find everything you need to monitor the Agent and confirm that your cluster is operational. For the complete list of metrics, query the Agent directly at `$IP:8080/metrics`.

While few alerts are needed, we also provide a [Recommended List of Alerts](/warpstream/agent-setup/monitor-the-warpstream-agents/recommended-list-of-alerts.md) covering the key metrics you should alert on to detect issues in your Agents effectively.

{% hint style="warning" %}
Some metrics, particularly the consumer group metrics, can become very high cardinality if the cluster contains a lot of topics or partitions. See [Monitoring Consumer Groups](/warpstream/agent-setup/monitor-the-warpstream-agents/monitoring-consumer-groups.md) if you need to reduce their cardinality or disable them entirely.
{% endhint %}
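One way to see which metrics contribute the most series is to count the sample lines per metric name in the text returned by the metrics endpoint. The sketch below does that with the standard library only. The function name `series_per_metric` and the metric names in the sample are hypothetical, and each non-comment line is treated as one series, which is an approximation (histogram buckets, for example, count individually).

```python
from collections import Counter


def series_per_metric(exposition_text: str) -> Counter:
    """Count how many sample lines (series) each metric name has."""
    counts: Counter = Counter()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        # The metric name ends at the first '{' (labels) or whitespace (value).
        parts = line.split("{", 1)[0].split()
        if parts:
            counts[parts[0]] += 1
    return counts


# Illustrative text; these metric names are placeholders, not real Agent metrics.
sample_exposition = """\
warpstream_example_group_metric{group="a",topic="t1",partition="0"} 1
warpstream_example_group_metric{group="a",topic="t1",partition="1"} 2
warpstream_example_group_metric{group="b",topic="t2",partition="0"} 3
warpstream_example_simple_total 10
"""
top = series_per_metric(sample_exposition).most_common(1)
```

A metric whose count grows with the number of topics or partitions is a likely candidate for the cardinality reduction described in the linked page.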

## Health Check

The Agent exposes an HTTP health check endpoint at `$IP:8080/v1/status`. A successful response is the string `OK` with a `200` status code.
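A probe for this endpoint can be sketched in a few lines. The example below assumes plain HTTP on the default port, and it strips surrounding whitespace from the body before comparing it to `OK`, which is a guess about possible trailing newlines. The function names `is_ok_response` and `agent_is_healthy` are hypothetical.

```python
import http.client
import urllib.request


def is_ok_response(status: int, body: str) -> bool:
    """Return True for a 200 status whose body is the string OK."""
    return status == 200 and body.strip() == "OK"


def agent_is_healthy(host: str, port: int = 8080, timeout: float = 2.0) -> bool:
    """Probe the Agent's /v1/status endpoint; any error counts as unhealthy."""
    url = f"http://{host}:{port}/v1/status"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            body = resp.read().decode("utf-8", errors="replace")
            return is_ok_response(resp.status, body)
    except (OSError, http.client.HTTPException):
        # Covers refused connections, timeouts, and non-2xx responses
        # (urllib raises HTTPError, a subclass of OSError, for those).
        return False
```

Calling `agent_is_healthy("127.0.0.1")` from a script or sidecar returns a boolean that can feed a readiness check. Treating every failure as unhealthy keeps the probe simple, at the cost of not distinguishing a timeout from a bad response.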

## MCP Server

The WarpStream Console exposes an [MCP (Model Context Protocol)](https://modelcontextprotocol.io/) server that you can connect to from AI-powered IDEs like Cursor, Claude Code, or Windsurf. Once connected, you can troubleshoot your cluster using natural language — ask questions like "are there any errors in my cluster in the last hour?" or "what diagnostics are failing?" and your AI assistant will query the cluster for you.

See [MCP Server](/warpstream/reference/mcp-server.md) for setup instructions.

