Feature flags

Here is a list of features that are disabled by default since they are breaking changes or are considered experimental. Their behaviour can change in future releases which will be communicated via the release changelog .

You can enable them using the --enable-feature flag with a comma separated list of features. They may be enabled by default in future versions.

Exemplars storage

--enable-feature=exemplar-storage

OpenMetrics introduces the ability for scrape targets to add exemplars to certain metrics. Exemplars are references to data outside of the MetricSet. A common use case are IDs of program traces.

Exemplar storage is implemented as a fixed size circular buffer that stores exemplars in memory for all series. Enabling this feature will enable the storage of exemplars scraped by Prometheus. The config file block storage/exemplars can be used to control the size of circular buffer by # of exemplars. An exemplar with just a trace_id=<jaeger-trace-id> uses roughly 100 bytes of memory via the in-memory exemplar storage. If the exemplar storage is enabled, we will also append the exemplars to WAL for local persistence (for WAL duration).

Memory snapshot on shutdown

--enable-feature=memory-snapshot-on-shutdown

This takes a snapshot of the chunks that are in memory along with the series information when shutting down and stores it on disk. This will reduce the startup time since the memory state can now be restored with this snapshot and m-mapped chunks, while a WAL replay from disk is only needed for the parts of the WAL that are not part of the snapshot.

Extra scrape metrics

--enable-feature=extra-scrape-metrics

Note: This feature flag is deprecated. Please use the extra_scrape_metrics configuration option instead (available at both global and scrape-config level). The feature flag will be removed in a future major version. See the configuration documentation for more details.

When enabled, for each instance scrape, Prometheus stores a sample in the following additional time series:

scrape_timeout_seconds. The configured scrape_timeout for a target. This allows you to measure each target to find out how close they are to timing out with scrape_duration_seconds / scrape_timeout_seconds.
scrape_sample_limit. The configured sample_limit for a target. This allows you to measure each target to find out how close they are to reaching the limit with scrape_samples_post_metric_relabeling / scrape_sample_limit. Note that scrape_sample_limit can be zero if there is no limit configured, which means that the query above can return +Inf for targets with no limit (as we divide by zero). If you want to query only for targets that do have a sample limit use this query: scrape_samples_post_metric_relabeling / (scrape_sample_limit > 0).
scrape_body_size_bytes. The uncompressed size of the most recent scrape response, if successful. Scrapes failing because body_size_limit is exceeded report -1, other scrape failures report 0.

Per-step stats

--enable-feature=promql-per-step-stats

When enabled, passing stats=all in a query request returns per-step statistics. The following sample statistics are included:

totalQueryableSamples / totalQueryableSamplesPerStep: Total number of samples loaded during the query. For range-vector functions (e.g. rate, sum_over_time) evaluated over multiple steps, each step counts the full window (the same point may be counted in multiple steps).
samplesRead / samplesReadPerStep: Total number of samples read (I/O). For range-vector functions in range queries, only new points per step are counted (step 0: full window; later steps: points not seen in the previous step). For other query types, this equals totalQueryableSamples.
peakSamples: Peak number of samples in memory during evaluation (used for the query.max-samples limit).

The Prometheus server exposes two counters for observability: prometheus_engine_query_samples_total (samples loaded, full window per step) and prometheus_engine_query_samples_read_total (samples read, delta per step for range-vector).

When disabled in either the engine or the query, per-step statistics are not computed at all.

Experimental PromQL functions

--enable-feature=promql-experimental-functions

Enables PromQL functions that are considered experimental. These functions might change their name, syntax, or semantics. They might also get removed entirely.

Start (Created) Timestamps Zero Injection

--enable-feature=created-timestamp-zero-ingestion

NOTE: CreatedTimestamp feature was renamed to StartTimestamp for consistency. The above flag uses old name for stability.

Enables ingestion of start timestamp. Start timestamps are injected as 0 valued samples when appropriate. See PromCon talk for details.

Currently, Prometheus supports start timestamps on the

PrometheusProto
OpenMetrics1.0.0

From the above, Prometheus recommends PrometheusProto. This is because OpenMetrics 1.0 Start Timestamp information is shared as a <metric>_created metric and parsing those are prone to errors and expensive (thus, adding an overhead). You also need to be careful to not pollute your Prometheus with extra _created metrics.

Therefore, when created-timestamp-zero-ingestion is enabled Prometheus changes the global scrape_protocols default configuration option to [ PrometheusProto, OpenMetricsText1.0.0, OpenMetricsText0.0.1, PrometheusText0.0.4 ], resulting in negotiating the Prometheus Protobuf protocol first (unless the scrape_protocols option is set to a different value explicitly).

Besides enabling this feature in Prometheus, start timestamps need to be exposed by the application being scraped.

Start timestamp (ST) native storage

--enable-feature=st-storage

Enables the storage of start timestamps (ST) per sample, through WAL, TSDB/Agent and Remote-Write 2.0. This option allows preserving the exact ST value as it was presented from scrape and receive protocols. In the future this feature is meant to be a replacement of created-timestamp-zero-ingestion which injects synthetic 0 samples.

Currently, Prometheus supports start timestamps on:

PrometheusProto
OpenMetrics1.0.0

PrometheusProto is recommended, due to efficiency of ST passing.

Besides enabling this feature in Prometheus, start timestamps need to be exposed by the application being scraped.

NOTE: This is an experimental feature with known limitations until fully implemented.

It introduces new WAL record type (SamplesV2) that can only be replayed with Prometheus 3.11 or later versions.

For persistent storage support (TSDB blocks), you need to manually opt-in for XOR2 chunk format (xor2-encoding flag). The float chunk encoding must resolve to XOR2 when st-storage is active, because XOR chunks do not store start timestamps. If the resolved encoding is XOR (that is, --enable-feature=xor2-encoding is not set and chunk_encoding.floats: xor2 is not configured), Prometheus refuses to start and fails the configuration validation with an error rather than continuing to run. Likewise, explicitly setting chunk_encoding.floats: xor in the config file while st-storage is active is rejected at config reload. This might change later once we finish experimentation phase with XOR2.

ST for native histograms and NHCBs are not yet implemented (see #18315 ).

PromQL use of ST is out of scope of this feature.

Start timestamp (ST) usage in PromQL functions

--enable-feature=use-start-timestamps

Enables the use of start timestamps (ST) in PromQL functions such as rate(), irate(), and increase(). This feature doesn't currently work with extended range selectors (promql-extended-range-selectors).

Start timestamp (ST) synthesis

--enable-feature=st-synthesis

Enables the synthesis of start timestamps (ST) for cumulative metrics (Counters, Classic Histograms, Native Histograms) when they are not provided by the source. Similar to the official OpenTelemetry metricstarttimeprocessor , it tracks previous values to detect resets and subtracts the initial reference point to synthesize a zero-based timeline from the first sample.

NOTE: This is an experimental feature.

The first sample is dropped when this feature is turned on to establish the start timestamp reference point. As a result, if a series has only a single point reported, turning this feature on may result in no points being ingested for that series.

Synthesis yields accurate Start Timestamp while maintaining accurate counter rates. However, the raw counter values will be different that what's scraped. This is because the first point is dropped and its timestamp is used as the start timestamp for all subsequent points. All subsequent points are normalized against that dropped point (i.e. subtracted by it). Effectively, synthesis create new counter streams with the known start timestamp from the original data.

Synthesis works only with scraped data (RW and Otel receiver are not implemented).

Synthesis requires ordered samples. As a result, cumulative samples without ST that are out of order will be rejected despite the tsdb. out_of_order_time_window setting.

If an append fails for a series (e.g., due to out-of-order samples being rejected), the synthesis state for that series is cleared. As a result, the next sample received after the failure will be treated as the first sample again and will be dropped to establish a new reference point.

Concurrent evaluation of independent rules

--enable-feature=concurrent-rule-eval

By default, rule groups execute concurrently, but the rules within a group execute sequentially; this is because rules can use the output of a preceding rule as its input. However, if there is no detectable relationship between rules then there is no reason to run them sequentially. When the concurrent-rule-eval feature flag is enabled, rules without any dependency on other rules within a rule group will be evaluated concurrently. This has the potential to improve rule group evaluation latency and resource utilization at the expense of adding more concurrent query load.

The number of concurrent rule evaluations can be configured with --rules.max-concurrent-rule-evals, which is set to 4 by default.

Serve old Prometheus UI

Fall back to serving the old (Prometheus 2.x) web UI instead of the new UI. The new UI that was released as part of Prometheus 3.0 is a complete rewrite and aims to be cleaner, less cluttered, and more modern under the hood. However, it is not fully feature complete and battle-tested yet, so some users may still prefer using the old UI.

--enable-feature=old-ui

Metadata WAL Records

--enable-feature=metadata-wal-records

When enabled, Prometheus will store metadata in-memory and keep track of metadata changes as WAL records on a per-series basis.

This must be used if you would like to send metadata using the new remote write 2.0.

Delay compaction start time

--enable-feature=delayed-compaction

A random offset, up to 10% of the chunk range, is added to the Head compaction start time. This assists Prometheus instances in avoiding simultaneous compactions and reduces the load on shared resources.

Only auto Head compactions and the operations directly resulting from them are subject to this delay.

In the event of multiple consecutive Head compactions being possible, only the first compaction experiences this delay.

Note that during this delay, the Head continues its usual operations, which include serving and appending series.

Despite the delay in compaction, the blocks produced are time-aligned in the same manner as they would be if the delay was not in place.

Delay `name` label removal for PromQL engine

--enable-feature=promql-delayed-name-removal

When enabled, Prometheus will change the way in which the __name__ label is removed from PromQL query results (for functions and expressions for which this is necessary). Specifically, it will delay the removal to the last step of the query evaluation, instead of every time an expression or function creating derived metrics is evaluated.

This allows optionally preserving the __name__ label via the label_replace and label_join functions, and helps prevent the "vector cannot contain metrics with the same labelset" error, which can happen when applying a regex-matcher to the __name__ label.

Note that evaluating parts of the query separately will still trigger the labelset collision. This commonly happens when analyzing intermediate results of a query manually or with a tool like PromLens.

If a query refers to the already removed __name__ label, its behavior may change while this feature flag is set. (Example: sum by (__name__) (rate({foo="bar"}[5m])), see details on GitHub .) These queries are rare to occur and easy to fix. (In the above example, removing by (__name__) doesn't change anything without the feature flag and fixes the possible problem with the feature flag.)

It is possible to craft a query that aggregates by __name__ and puts samples with and without delayed name removal into the same group. In that case, the name is removed from the affected group. Note that this case hardly occurs in queries that fulfill a practical purpose.

OTLP Delta Conversion

--enable-feature=otlp-deltatocumulative

When enabled, Prometheus will convert OTLP metrics from delta temporality to their cumulative equivalent, instead of dropping them. This cannot be enabled in conjunction with otlp-native-delta-ingestion.

This uses deltatocumulative from the OTel collector, using its default settings.

Delta conversion keeps in-memory state to aggregate delta changes per-series over time. When Prometheus restarts, this state is lost, starting the aggregation from zero again. This results in a counter reset in the cumulative series.

This state is periodically (max_stale) cleared of inactive series.

Enabling this can have negative impact on performance, because the in-memory state is mutex guarded. Cumulative-only OTLP requests are not affected.

PromQL arithmetic expressions in time durations

--enable-feature=promql-duration-expr

With this flag, arithmetic expressions can be used in time durations in range queries and offset durations.

In range queries:

rate(http_requests_total[5m * 2])  # 10 minute range
rate(http_requests_total[(5+2) * 1m])  # 7 minute range

In offset durations:

http_requests_total offset (1h / 2)  # 30 minute offset
http_requests_total offset ((2 ^ 3) * 1m)  # 8 minute offset

When using offset with duration expressions, you must wrap the expression in parentheses. Without parentheses, only the first duration value will be used in the offset calculation.

step() can be used in duration expressions. For a range query, it resolves to the step width of the range query. For an instant query, it resolves to 0s.

range() can be used in duration expressions. For a range query, it resolves to the full range of the query (end time - start time). For an instant query, it resolves to 0s. This is particularly useful in combination with @end() to look back over the entire query range, e.g., max_over_time(metric[range()] @ end()).

min_of(<duration>, <duration>) and max_of(<duration>, <duration>) select between two duration expressions. min_of returns the smaller of the two, which is useful for capping a duration at a maximum value. max_of returns the larger of the two, which is useful for enforcing a minimum value. For example, max_of(step(), 5s) ensures the duration is never shorter than 5s, while min_of(range(), 1h) caps the duration at 1h.

Note: Duration expressions are not supported in the @ timestamp operator.

The following operators are supported:

+ - addition
- - subtraction
* - multiplication
/ - division
% - modulo
^ - exponentiation

Examples of equivalent durations:

5m * 2 is equivalent to 10m or 600s
10m - 1m is equivalent to 9m or 540s
(5+2) * 1m is equivalent to 7m or 420s
1h / 2 is equivalent to 30m or 1800s
4h % 3h is equivalent to 1h or 3600s
(2 ^ 3) * 1m is equivalent to 8m or 480s
step() + 1 is equivalent to the query step width increased by 1s.
max_of(step(), 5s) is equivalent to the larger of the query step width and 5s.
min_of(2 * step() + 5s, 5m) is equivalent to the smaller of twice the query step increased by 5s and 5m.

OTLP Native Delta Support

--enable-feature=otlp-native-delta-ingestion

When enabled, allows for the native ingestion of delta OTLP metrics, storing the raw sample values without conversion. This cannot be enabled in conjunction with otlp-deltatocumulative.

Currently, the StartTimeUnixNano field is ignored, and deltas are given the unknown metric metadata type.

Delta support is in a very early stage of development and the ingestion and querying process my change over time. For the open proposal see prometheus/proposals#48 .

Querying

We encourage users to experiment with deltas and existing PromQL functions; we will collect feedback and likely build features to improve the experience around querying deltas.

Note that standard PromQL counter functions like rate() and increase() are designed for cumulative metrics and will produce incorrect results when used with delta metrics. This may change in the future, but for now, to get similar results for delta metrics, you need sum_over_time():

sum_over_time(delta_metric[<range>]): Calculates the sum of delta values over the specified time range.
sum_over_time(delta_metric[<range>]) / <range>: Calculates the per-second rate of the delta metric.

These may not work well if the <range> is not a multiple of the collection interval of the metric. For example, if you do sum_over_time(delta_metric[1m]) / 1m range query (with a 1m step), but the collection interval of a metric is 10m, the graph will show a single point every 10 minutes with a high rate value, rather than 10 points with a lower, constant value.

Current gotchas

If delta metrics are exposed via federation, data can be incorrectly collected if the ingestion interval is not the same as the scrape interval for the federated endpoint.
It is difficult to figure out whether a metric has delta or cumulative temporality, since there's no indication of temporality in metric names or labels. For now, if you are ingesting a mix of delta and cumulative metrics we advise you to explicitly add your own labels to distinguish them. In the future, we plan to introduce type labels to consistently distinguish metric types and potentially make PromQL functions type-aware (e.g. providing warnings when cumulative-only functions are used with delta metrics).
If there are multiple samples being ingested at the same timestamp, only one of the points is kept - the samples are not summed together (this is how Prometheus works in general - duplicate timestamp samples are rejected). Any aggregation will have to be done before sending samples to Prometheus.

Type and Unit Labels

--enable-feature=type-and-unit-labels

When enabled, Prometheus will start injecting additional, reserved __type__ and __unit__ labels as designed in the PROM-39 proposal .

Those labels are sourced from the metadata structures of the existing scrape and ingestion formats like OpenMetrics Text, Prometheus Text, Prometheus Proto, Remote Write 2 and OTLP. All the user provided labels with __type__ and __unit__ will be overridden.

PromQL layer will handle those labels the same way __name__ is handled, e.g. dropped on certain operations like - or + and affected by promql-delayed-name-removal feature.

This feature enables important metadata information to be accessible directly with samples and PromQL layer.

It's especially useful for users who:

Want to be able to select metrics based on type or unit.
Want to handle cases of series with the same metric name and different type and units. e.g. native histogram migrations or OpenTelemetry metrics from OTLP endpoint, without translation.

In future more work is planned that will depend on this e.g. rich PromQL UX that helps when wrong types are used on wrong functions, automatic renames, delta types and more.

Behavior with metadata records

When this feature is enabled and the metadata WAL records exists, in an unlikely situation when type or unit are different across those, the Prometheus outputs intends to prefer the __type__ and __unit__ labels values. For example on Remote Write 2.0, if the metadata record somehow (e.g. due to bug) says "counter", but __type__="gauge" the remote time series will be set to a gauge.

Use Uncached IO

--enable-feature=use-uncached-io

Experimental and only available on Linux.

When enabled, it makes chunks writing bypass the page cache. Its primary goal is to reduce confusion around page‐cache behavior and to prevent over‑allocation of memory in response to misleading cache growth.

This is currently implemented using direct I/O.

For more details, see the proposal .

XOR2 chunk encoding

--enable-feature=xor2-encoding

WARNING: This is highly experimental and risky setting:

Chunks encoded with XOR2 cannot be read by older Prometheus versions that do not support the encoding. Once enabled and data is written, you need to manually delete blocks from the disk, otherwise Prometheus will return error on all queries.

We are still experimenting on the final encoding. As of now this encoding can change in any Prometheus version. All your persistent block data will be lost between versions.

This encoding is new, meaning downstream tools and LTS systems might not support it yet (e.g. Thanos sidecar uploaded blocks).

This setting enables the new XOR2 chunk encoding for float samples, which provides better disk compression than the default XOR encoding for typical Prometheus workloads. This format also allows storing Start Timestamp (ST).

The encoding can also be controlled at each configuration reload via the chunk_encoding.floats field in the storage.tsdb section of the configuration file. Setting chunk_encoding.floats: xor forces standard XOR encoding even when --enable-feature=xor2-encoding is set; setting chunk_encoding.floats: xor2 requires --enable-feature=xor2-encoding to be enabled.

Without st-storage, XOR and XOR2 are compatible encodings, so an encoding change via chunk_encoding.floats does not cut the current chunk; the new encoding takes effect when the current chunk is next cut for any reason (size, time range, or sample count). When st-storage is also enabled, XOR and XOR2 are not compatible because XOR chunks do not store start timestamps, so the in-progress chunk is cut on the next append after the encoding changes.

Note that --enable-feature=st-storage does not automatically enable XOR2 encoding. However, setting chunk_encoding.floats: xor while st-storage is active is rejected at config reload, because XOR chunks do not store start timestamps.

Extended Range Selectors

--enable-feature=promql-extended-range-selectors

Enables experimental anchored and smoothed modifiers for PromQL range and instant selectors. These modifiers provide more control over how range boundaries are handled in functions like rate and increase, especially with missing or irregular data.

Native Histograms are not yet supported by the extended range selectors.

`anchored`

Uses the most recent sample (within the lookback delta) at the beginning of the range, or alternatively the first sample within the range if there is no sample within the lookback delta. The last sample within the range is also used at the end of the range. No extrapolation or interpolation is applied, so this is useful to get the direct difference between sample values.

Anchored range selector work with: resets, changes, rate, increase, and delta.

Example query: increase(http_requests_total[5m] anchored)

Note: When using the anchored modifier with the increase function, the results returned are integers.

`smoothed`

In range selectors, linearly interpolates values at the range boundaries, using the sample values before and after the boundaries for an improved estimation that is robust against irregular scrapes and missing samples. However, it requires a sample after the evaluation interval to work properly, see note below.

For instant selectors, values are linearly interpolated at the evaluation timestamp using the samples immediately before and after that point.

Smoothed range selectors work with: rate, increase, and delta.

Example query: rate(http_requests_total[step()] smoothed)

Note for alerting and recording rules: The smoothed modifier requires samples after the evaluation interval, so using it directly in alerting or recording rules will typically under-estimate the result, as future samples are not available at evaluation time. To use smoothed safely in rules, you must apply a query_offset to the rule group (see documentation) to ensure the calculation window is fully in the past and all needed samples are available. For critical alerting, set the offset to at least one scrape interval; for less critical or more resilient use cases, consider a larger offset (multiple scrape intervals) to tolerate missed scrapes.

For more details, see the design doc .

Note: Extended Range Selectors are not supported for subqueries.

Binary operator fill modifiers

--enable-feature=promql-binop-fill-modifiers

Enables experimental fill(), fill_left(), and fill_right() modifiers for PromQL binary operators. These modifiers allow filling in missing matches on either side of a binary operation with a provided default sample value.

Example query:

  rate(successful_requests[5m])
+ fill(0)
  rate(failed_requests[5m])

See the fill modifiers documentation for more details and examples.

Search API

--enable-feature=search-api

Enables the experimental search API endpoints for discovering metric names, label names, and label values with fuzzy matching and filtering support. See the search API documentation for details.

The --web.search.max-limit flag (default 10000) bounds the limit query parameter accepted by the search endpoints. Requests with a higher limit are rejected with HTTP 400. The default response limit (100) is silently clamped to this maximum, so an operator setting a smaller cap does not break no-limit requests. Setting the flag to 0 disables the cap entirely; this is not recommended for endpoints exposed beyond a trusted network because a single client can then request the entire index in one response.

On this page