SQL query metricset

The sql query metricset collects rows returned by a query.

Field names (columns) are returned as lowercase strings. Values are returned as numbers or strings.

Cursor-based incremental data fetching

The cursor feature enables incremental data fetching by tracking the last fetched row value and using it to retrieve only new data on subsequent collection cycles. This is particularly useful for:

- Fetching audit logs or events that are continuously appended
- Reducing database load by avoiding full table scans
- Preventing duplicate data ingestion

Configuration

To enable cursor-based fetching, add a cursor configuration block to your metricset:

- module: sql
  metricsets: [query]
  hosts: ["postgres://user:pass@localhost:5432/mydb"]
  driver: postgres
  sql_query: "SELECT id, event_data, created_at FROM events WHERE id > :cursor ORDER BY id ASC LIMIT 1000"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: id
    type: integer
    default: "0"

Note

In the examples on this page, raw_data.enabled: true is optional. It controls the event output format, not the cursor, and is shown because raw mode is commonly used with cursor-based fetching.

Option	Required	Description
`cursor.enabled`	No	Set to `true` to enable cursor-based fetching. Default: `false`
`cursor.column`	Yes (when enabled)	The column name to track for cursor state. Must be present in query results.
`cursor.type`	No	Optional cursor type. If omitted, it is inferred from `cursor.default` and refined from result rows. Allowed values: `integer`, `timestamp`, `date`, `float`, `decimal`
`cursor.state_id`	No	Optional stable state identity. When set, cursor state keys use this value instead of DSN, allowing continuity across DSN credential/parameter changes. Use a unique value per logical source.
`cursor.default`	Yes (when enabled)	Initial cursor value used on first run (before any state is persisted)
`cursor.direction`	No	Scan direction: `asc` (default, tracks max value) or `desc` (tracks min value)

For best performance, ensure the cursor.column has a database index. Without an index, the WHERE column > :cursor ORDER BY column clause will trigger a full table scan on every collection cycle, which can be slow on large tables.

Supported cursor types

Type	Description	Example `cursor.default` value
`integer`	Integer values (auto-incrementing IDs, sequence numbers)	`"0"`
`timestamp`	Timestamp values (TIMESTAMP, DATETIME). Accepts RFC3339, `YYYY-MM-DD HH:MM:SS[.nnnnnnnnn]`, and date-only formats. Stored internally as nanoseconds in UTC.	`"2024-01-01T00:00:00Z"`
`date`	Date values (YYYY-MM-DD format)	`"2024-01-01"`
`float`	Floating-point values (FLOAT, DOUBLE, REAL). IEEE 754 precision limits apply.	`"0.0"`
`decimal`	Exact decimal values (DECIMAL, NUMERIC). Arbitrary precision, no data loss.	`"0.00"`

Scan direction

By default, the cursor tracks the maximum value from each batch (ascending scan). For descending scans, set cursor.direction to desc. The table below summarizes each direction:

Direction	Operator	ORDER BY	Cursor Tracks
`asc` (default)	`>`	`ASC`	Maximum value
`desc`	`<`	`DESC`	Minimum value
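
The direction table can be read as a rule for choosing the next cursor value from a batch. The Python sketch below illustrates that rule only. `next_cursor` is a hypothetical helper, not Metricbeat's implementation; it assumes NULL values arrive as `None` and that an empty batch leaves the cursor unchanged.

```python
from typing import Iterable, Optional


def next_cursor(values: Iterable[Optional[int]], current: int, direction: str = "asc") -> int:
    """Return the cursor to persist after a batch (hypothetical helper)."""
    # NULL cursor values do not contribute to cursor progression.
    seen = [v for v in values if v is not None]
    if not seen:
        # Assumption: an empty batch keeps the cursor where it was.
        return current
    # asc tracks the maximum value, desc tracks the minimum value.
    return max(seen) if direction == "asc" else min(seen)


print(next_cursor([101, 102, None, 105], current=100))          # 105
print(next_cursor([98, 97, 95], current=99, direction="desc"))  # 95
```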

Query requirements

When cursor is enabled, your SQL query must:

- Include the :cursor placeholder exactly once in the query WHERE clause
- Include an ORDER BY clause on the cursor column matching the configured direction
- Use sql_response_format: table (cursor requires table mode)
- Use sql_query (single query mode); cursor is not supported with sql_queries (multiple queries)

Cursor is also not compatible with fetch_from_all_databases. Use a separate module block for each database if you need both features.
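
The first two query requirements can be checked before deploying a configuration. The following Python sketch is a deliberately naive pre-flight check: `check_cursor_query` is a hypothetical helper, it does not skip quoted strings or comments the way the metricset's own placeholder detection does, it does not verify that the placeholder sits in the WHERE clause, and it only understands an unqualified column name in ORDER BY.

```python
import re
from typing import List


def check_cursor_query(query: str, column: str, direction: str = "asc") -> List[str]:
    """Return a list of problems found in a cursor query (naive, hypothetical check)."""
    problems: List[str] = []
    # Requirement: the :cursor placeholder appears exactly once.
    count = len(re.findall(r":cursor\b", query))
    if count != 1:
        problems.append(f"expected exactly one :cursor placeholder, found {count}")
    # Requirement: ORDER BY on the cursor column, matching the direction.
    order = re.search(r"\border\s+by\s+(\w+)(?:\s+(asc|desc))?", query, re.IGNORECASE)
    if order is None:
        problems.append("missing ORDER BY clause")
    else:
        # SQL sorts ascending when no direction keyword is given.
        found_dir = (order.group(2) or "asc").lower()
        if order.group(1).lower() != column.lower() or found_dir != direction:
            problems.append(f"ORDER BY should be on {column} {direction.upper()}")
    return problems


ok = "SELECT id FROM events WHERE id > :cursor ORDER BY id ASC LIMIT 1000"
bad = "SELECT id FROM events WHERE id > :cursor AND id < :cursor"
print(check_cursor_query(ok, "id"))   # []
print(check_cursor_query(bad, "id"))  # two problems: placeholder count and missing ORDER BY
```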

Example configurations

Integer cursor (auto-increment ID)

- module: sql
  metricsets: [query]
  hosts: ["mysql://user:pass@localhost:3306/mydb"]
  driver: mysql
  sql_query: "SELECT id, event_type, payload FROM audit_log WHERE id > :cursor ORDER BY id ASC LIMIT 500"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: id
    type: integer
    default: "0"

Timestamp cursor (event timestamps)

- module: sql
  metricsets: [query]
  hosts: ["postgres://user:pass@localhost:5432/mydb"]
  driver: postgres
  sql_query: "SELECT id, message, created_at FROM logs WHERE created_at > :cursor ORDER BY created_at ASC LIMIT 500"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: created_at
    type: timestamp
    default: "2024-01-01T00:00:00Z"

Date cursor (daily partitioned data)

- module: sql
  metricsets: [query]
  hosts: ["oracle://user:pass@localhost:1521/MYDB"]
  driver: oracle
  sql_query: "SELECT report_date, metrics FROM daily_reports WHERE report_date > :cursor ORDER BY report_date ASC FETCH FIRST 500 ROWS ONLY"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: report_date
    type: date
    default: "2024-01-01"

Decimal cursor (exact numeric, financial data)

- module: sql
  metricsets: [query]
  hosts: ["postgres://user:pass@localhost:5432/mydb"]
  driver: postgres
  sql_query: "SELECT id, amount, description FROM transactions WHERE amount > :cursor ORDER BY amount ASC LIMIT 500"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: amount
    type: decimal
    default: "0.00"

Float cursor (approximate numeric)

- module: sql
  metricsets: [query]
  hosts: ["mysql://user:pass@localhost:3306/mydb"]
  driver: mysql
  sql_query: "SELECT id, score FROM scores WHERE score > :cursor ORDER BY score ASC LIMIT 500"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: score
    type: float
    default: "0.0"

Note

Float cursors use IEEE 754 float64 representation. For exact precision at boundaries (for example, financial data), use the decimal type instead.
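
To make the float64 limit concrete, the Python sketch below uses only the standard library and is independent of Metricbeat. It compares two distinct decimal values that collapse to the same float64. The values are made up for illustration. A `>` comparison against a float cursor cannot tell such values apart, whereas a decimal comparison can.

```python
from decimal import Decimal

low = "12345678901234567.10"
high = "12345678901234567.89"

# Above 2**53, adjacent float64 values are 2 apart, so both strings round to the same float.
print(float(low) == float(high))      # True: the boundary between them is lost
# Decimal keeps every digit, so the ordering is preserved.
print(Decimal(low) < Decimal(high))   # True
```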

MSSQL cursor (with TOP instead of LIMIT)

MSSQL does not support LIMIT. Use TOP to restrict the number of rows per cycle:

- module: sql
  metricsets: [query]
  hosts: ["sqlserver://sa:YourPassword@localhost:1433?database=mydb"]
  driver: mssql
  sql_query: "SELECT TOP 500 id, event_type, payload FROM audit_log WHERE id > :cursor ORDER BY id ASC"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: id
    type: integer
    default: "0"

Descending scan (processing historical data backwards)

- module: sql
  metricsets: [query]
  hosts: ["postgres://user:pass@localhost:5432/mydb"]
  driver: postgres
  sql_query: "SELECT id, event_data FROM events WHERE id < :cursor ORDER BY id DESC LIMIT 500"
  sql_response_format: table
  raw_data.enabled: true
  cursor:
    enabled: true
    column: id
    type: integer
    default: "999999999"
    direction: desc

With direction: desc, the cursor tracks the minimum value from each batch, suitable for scanning data in reverse chronological order.

State persistence

Cursor state is persisted to disk using Metricbeat's statestore at: {data.path}/sql-cursor/

The state persists across Metricbeat restarts, allowing incremental fetching to continue from where it left off. State is keyed by a hash of:

- State identity, one of:
  - Full database URI/DSN (default behavior)
  - cursor.state_id when configured
- Query string
- Cursor column name
- Cursor direction (asc or desc)

This ensures that different query configurations maintain separate cursor states, including different databases on the same server.

Important: Changing any of these components (DSN, query, cursor column, or direction) produces a different state key, which effectively resets the cursor to its default value. This is by design — if you modify the query, the old cursor position might no longer be valid for the new query.
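
The effect of the state key can be sketched in a few lines of Python. `state_key` is a hypothetical function, and the choice of SHA-256 over NUL-joined parts is an assumption made for illustration; the actual hash and key layout used by Metricbeat may differ. Only the list of components comes from this page.

```python
import hashlib
from typing import Optional


def state_key(dsn: str, query: str, column: str, direction: str,
              state_id: Optional[str] = None) -> str:
    """Build an illustrative state key from the components listed above."""
    # State identity: cursor.state_id when configured, otherwise the full DSN.
    identity = state_id if state_id else dsn
    # Assumption: SHA-256 over NUL-joined parts; the real hash may differ.
    material = "\x00".join([identity, query, column, direction])
    return hashlib.sha256(material.encode("utf-8")).hexdigest()


query = "SELECT id FROM events WHERE id > :cursor ORDER BY id ASC"
old = "postgres://user:old@localhost:5432/mydb"
new = "postgres://user:new@localhost:5432/mydb"

# Without state_id, rotating the password changes the key, so the cursor resets.
print(state_key(old, query, "id", "asc") == state_key(new, query, "id", "asc"))  # False
# With a stable state_id ("events-main" is a made-up value), the key survives the DSN change.
print(state_key(old, query, "id", "asc", "events-main")
      == state_key(new, query, "id", "asc", "events-main"))  # True
```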

Cursor reset scenarios

The cursor falls back to cursor.default when:

- Any state-key component changes: state identity (DSN by default or cursor.state_id), query text, cursor.column, or cursor.direction
- Persisted state is invalid (for example: unsupported state version, corrupted value, unsupported stored type, or state-load failure)
- cursor.type is explicitly configured and does not match the persisted state type

The cursor does not reset when only these settings change:

- cursor.default (used only when state is missing/invalid)
- period or timeout (runtime scheduling/timeout settings)
- DSN credentials/parameters when cursor.state_id is set and unchanged

When cursor.type is omitted (auto mode), a valid persisted state type is reused on restart. If no valid state exists, type is inferred from cursor.default.

Choosing comparison operators for queries

The choice of comparison operator in your WHERE clause affects data completeness:

Use > (greater than) when:

- The cursor column has unique, monotonically increasing values (auto-increment IDs, sequences)
- No two rows can share the same cursor value
- Example: WHERE id > :cursor ORDER BY id ASC

Use >= (greater than or equal) when:

- The cursor column can have duplicate values (timestamps, dates, scores)
- Late-arriving rows might be inserted with the same value as the current cursor
- Example: WHERE created_at >= :cursor ORDER BY created_at ASC

The >= operator re-fetches every row whose value equals the current cursor, so at least the last row of each batch arrives again on the next cycle as a duplicate. In exchange, no data is lost when multiple rows share the same cursor value. If you use >=, configure Elasticsearch document IDs or use an ingest pipeline to deduplicate.

# Safe for timestamps -- accepts duplicates, prevents data loss
sql_query: "SELECT id, data, created_at FROM events WHERE created_at >= :cursor ORDER BY created_at ASC LIMIT 500"

# Safe for unique IDs -- no duplicates possible
sql_query: "SELECT id, data FROM events WHERE id > :cursor ORDER BY id ASC LIMIT 500"
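
The difference between the two operators can be reproduced with an in-memory SQLite database. This Python sketch is not tied to Metricbeat: the table, the three rows, and the `fetch` helper are made up for illustration, and the secondary sort on id is added only to keep the demo deterministic.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER PRIMARY KEY, created_at TEXT)")
conn.executemany(
    "INSERT INTO events (id, created_at) VALUES (?, ?)",
    [
        (1, "2024-01-01 00:00:01"),
        (2, "2024-01-01 00:00:02"),
        (3, "2024-01-01 00:00:02"),  # shares its timestamp with row 2
    ],
)


def fetch(operator: str, cursor: str, limit: int = 2):
    """Run one collection cycle with the given comparison operator."""
    sql = (
        f"SELECT id, created_at FROM events WHERE created_at {operator} :cursor "
        f"ORDER BY created_at ASC, id ASC LIMIT {int(limit)}"
    )
    return conn.execute(sql, {"cursor": cursor}).fetchall()


# First cycle returns rows 1 and 2; the cursor advances to row 2's timestamp.
first = fetch(">", "2024-01-01 00:00:00")
cursor = max(row[1] for row in first)

print(fetch(">", cursor))   # []: row 3 is never fetched
print(fetch(">=", cursor))  # rows 2 and 3: row 2 is a duplicate, row 3 is recovered
```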

Error handling

The cursor feature follows an "at-least-once" delivery model:

- Events are emitted before the cursor state is updated
- If a failure occurs after emitting events but before updating state, those events will be re-fetched on the next cycle
- This ensures no data loss, but can result in occasional duplicates
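
The ordering described above (emit first, persist second) can be simulated in Python. This is a sketch of the delivery model only. `run_cycle`, the in-memory `state` dictionary, and the `fail_before_save` flag are hypothetical stand-ins for the real fetch, statestore, and failure conditions.

```python
def run_cycle(rows, state, emit, fail_before_save=False):
    """One collection cycle: emit events first, then persist the cursor."""
    batch = [r for r in rows if r["id"] > state["cursor"]]
    for row in batch:
        emit(row)
    if fail_before_save:
        # Simulated failure after emitting but before updating state.
        return
    if batch:
        state["cursor"] = max(r["id"] for r in batch)


rows = [{"id": 1}, {"id": 2}]
state = {"cursor": 0}
emitted = []

run_cycle(rows, state, emitted.append, fail_before_save=True)  # cursor stays at 0
run_cycle(rows, state, emitted.append)                         # rows 1 and 2 are emitted again

print([e["id"] for e in emitted])  # [1, 2, 1, 2]
print(state["cursor"])             # 2
```

No row is lost, but rows 1 and 2 appear twice. A deterministic document ID, for example one derived from the row's primary key, is one way to let the repeated events overwrite the earlier copies downstream.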

Driver-specific notes

MySQL: When using timestamp cursors, include parseTime=true in your DSN to ensure the driver correctly handles time.Time parameters:

hosts: ["root:pass@tcp(localhost:3306)/mydb?parseTime=true"]

Oracle: Set the session timezone to UTC for timestamp cursors. The godror driver can convert Go UTC timestamps to the Oracle session timezone, causing incorrect comparisons. Use the alterSession DSN parameter or consult the Oracle integration documentation.

MSSQL: Use TOP instead of LIMIT to restrict results per cycle. The driver uses @p1 as the parameter placeholder. Use driver: mssql in your configuration. It is automatically mapped to the modern sqlserver driver internally.

Decimal columns: The decimal cursor type passes the cursor value as a string to the database driver. Most drivers (PostgreSQL, MySQL, MSSQL) implicitly cast strings to DECIMAL for comparison. If your driver doesn't, use an explicit cast: WHERE price > CAST(:cursor AS DECIMAL(10,2)).

Limitations

- Only one :cursor placeholder is allowed per query
- Placeholder detection skips common quoted strings, quoted identifiers, and SQL comments. One gap remains: in MySQL backslash-escaped strings (for example, 'it\'s :cursor'), a :cursor inside the literal can still be mis-detected as the placeholder
- The cursor column must be included in the SELECT clause. If omitted, the cursor will not advance and an error will be logged on the first fetch
- NULL cursor values are skipped (only non-NULL values contribute to cursor progression)
- String, UUID, and ULID columns are not supported as cursor types. Workaround: add an integer or timestamp column for cursor tracking, or use a database function to convert to a sortable value
- All matching rows are loaded into memory before events are emitted. Use LIMIT to control memory usage (recommended: 500-5000 rows per cycle). For wide rows with large text columns, use a lower LIMIT
- Float cursors are subject to IEEE 754 precision limits. For exact boundary comparisons (for example, financial data), use the decimal type instead
- Each cursor-based fetch is protected by the module's timeout setting (which defaults to period). Hung queries are cancelled after the timeout expires, the cursor remains unchanged, and the next collection cycle can proceed normally
- If a cursor collection cycle takes longer than period but completes within timeout, subsequent cycles are skipped until the current one completes

Fields

For a description of each field in the metricset, see the exported fields section.

Here is an example document generated by this metricset:

{
    "@timestamp": "2017-10-12T08:05:34.853Z",
    "event": {
        "dataset": "sql.query",
        "duration": 115000,
        "module": "sql"
    },
    "metricset": {
        "name": "query",
        "period": 10000
    },
    "service": {
        "address": "localhost:65194",
        "type": "sql"
    },
    "sql": {
        "driver": "mysql",
        "metrics": {
            "engine": "InnoDB",
            "table_name": "db",
            "table_rows": 2,
            "table_schema": "mysql"
        },
        "query": "select table_schema, table_name, engine, table_rows from information_schema.tables where table_rows \u003e 0;"
    }
}