Skip to main content
Version: 3.0.0

Available Metrics

This topic lists all metrics available in Ignite 3.

client.handler

The metrics provided by the client handler and related to active clients.

Metric nameDescription
BytesReceivedThe total number of bytes received.
BytesSentThe total number of bytes sent.
ConnectionsInitiatedThe total number of initiated connections.
CursorsActiveThe number of active cursors.
RequestsActiveThe number of requests in progress.
RequestsProcessedThe total number of processed requests.
RequestsFailedThe total number of failed requests.
SessionsAcceptedThe total number of accepted sessions.
SessionsActiveThe number of currently active sessions.
SessionsRejectedThe total number of sessions rejected due to handshake errors.
SessionsRejectedTlsThe total number of sessions rejected due to TLS handshake errors.
SessionsRejectedTimeoutThe total number of sessions rejected due to a timeout.
TransactionsActiveThe number of active transactions.

clock.service

Metric nameDescription
ClockSkewExceedingMaxClockSkewThe observed clock skew that exceeded the maximum clock skew.

jvm

The metrics for Ignite Java Virtual Machine resource use.

Metric nameDescription
UpTimeThe uptime of the Java virtual machine in milliseconds.
gc.CollectionTimeThe approximate total time spent on garbage collection in milliseconds, summed across all collectors.
memory.heap.CommittedThe committed amount of heap memory.
memory.heap.InitThe initial amount of heap memory.
memory.heap.MaxThe maximum amount of heap memory.
memory.heap.UsedThe currently used amount of heap memory.
memory.non-heap.CommittedThe committed amount of non-heap memory.
memory.non-heap.InitThe initial amount of non-heap memory.
memory.non-heap.MaxThe maximum amount of non-heap memory.
memory.non-heap.UsedThe used amount of non-heap memory.

metastorage

Metric nameDescription
IdempotentCacheSizeThe current size of the cache of idempotent commands' results.
SafeTimeLagThe number of milliseconds the local MetaStorage SafeTime lags behind the local logical clock.

os

Metric nameDescription
CpuLoadThe CPU load. The value is between 0.0 and 1.0, where 0.0 means no CPU load and 1.0 means 100% CPU load. If the CPU load information is not available, a negative value is returned.
LoadAverageThe system load average for the last minute. The system load average is the sum of the number of runnable entities queued to the available processors and the number of runnable entities running on the available processors, averaged over a period of time. The way in which the load average is calculated depends on the operating system. If the load average is not available, a negative value is returned.

placement-driver

Metric nameDescription
ActiveLeasesCountThe number of currently active leases.
CurrentPendingAssignmentsSizeThe current size of pending assignments over all partitions.
CurrentStableAssignmentsSizeThe current size of stable assignments over all partitions.
LeasesCreatedThe total number of created leases.
LeasesProlongedThe total number of prolonged leases.
LeasesPublishedThe total number of published leases.
LeasesWithoutCandidatesThe total number of leases without candidates currently existing.

raft

Metric nameDescription
raft.fsmcaller.disruptor.StripesThe histogram of distribution data by stripes in the state machine for partitions.
raft.fsmcaller.disruptor.BatchThe histogram of the batch size to handle in the state machine for partitions.
raft.logmanager.disruptor.BatchThe histogram of the batch size to handle in the log for partitions.
raft.logmanager.disruptor.StripesThe histogram of distribution data by stripes in the log for partitions.
raft.nodeimpl.disruptor.BatchThe histogram of the batch size to handle node operations for partitions.
raft.nodeimpl.disruptor.StripesThe histogram of distribution data by stripes for node operations for partitions.
raft.readonlyservice.disruptor.StripesThe histogram of distribution data by stripes for read-only operations for partitions.
raft.readonlyservice.disruptor.BatchThe histogram of the batch size to handle read-only operations for partitions.

resource.vacuum

Metric nameDescription
MarkedForVacuumTransactionMetaCountThe count of transaction metas that have been marked for vacuum.
SkippedForFurtherProcessingUnfinishedTransactionCountThe current number of unfinished transactions that are skipped by the vacuumizer for further processing.
VacuumizedPersistentTransactionMetaCountThe count of persistent transaction metas that have been vacuumized.
VacuumizedVolatileTxnMetaCountThe count of volatile transaction metas that have been vacuumized.

storage.aipersist.{profile}

note

Each storage profile with aipersist storage engine has an individual metrics exporter.

Metric nameDescription
CpTotalPagesThe number of pages in the current checkpoint.
CpEvictedPagesThe number of evicted pages in the current checkpoint.
CpWrittenPagesThe number of written pages in the current checkpoint.
CpSyncedPagesThe number of fsynced pages in the current checkpoint.
CpWriteSpeedThe checkpoint write speed, in pages per second. The value is averaged over the last 3 checkpoints plus the current one.
CurrDirtyRatioThe current ratio of dirty pages (dirty vs total), expressed as a fraction. The fraction is computed for each segment in the current region, and the highest value becomes "current."
LastEstimatedSpeedForMarkAllThe last estimated speed of marking all clean pages dirty to the end of a checkpoint, in pages per second.
MaxSizeThe maximum in-memory region size in bytes.
MarkDirtySpeedThe speed of marking pages dirty, in pages per second. The value is averaged over the last 3 fragments, 0.25 sec each, plus the current fragment, 0–0.25 sec (0.75–1.0 sec total).
SpeedBasedThrottlingPercentageThe fraction of throttling time within average marking time (e.g., "quarter" = 0.25).
TargetDirtyRatioThe ratio of dirty pages (dirty vs total), expressed as a fraction. Throttling starts when this ratio is reached.
ThrottleParkTimeThe park (sleep) time for the write operation, in nanoseconds. The value is averaged over the last 3 fragments, 0.25 sec each, plus the current fragment, 0–0.25 sec (0.75–1.0 sec total). It defines park periods for either the checkpoint buffer protection or the clean page pool protection.
TotalAllocatedSizeThe total size of allocated pages on disk in bytes.
TotalUsedSizeThe total size of non-empty allocated pages on disk in bytes.

sql.client

SQL client metrics.

Metric nameDescription
OpenCursorsThe number of currently open cursors.

sql.memory

Metric nameDescription
LimitThe SQL memory limit (bytes).
MaxReservedThe maximum memory usage by SQL so far (bytes).
ReservedThe current memory usage by SQL (bytes).
StatementLimitThe memory limit per SQL statement (bytes).

sql.plan.cache

Metrics for SQL cache planning.

Metric nameDescription
HitsThe total number of cache plan hits.
MissesThe total number of cache plan misses.

sql.queries

Metric nameDescription
CanceledThe total number of canceled queries.
FailedThe total number of failed queries. This metric includes all unsuccessful queries, regardless of reason.
SucceededThe total number of successful queries.
TimedOutThe total number of queries that failed due to a time-out.

tables.{table_name}

Table metrics.

Metric nameDescription
RwReadsThe total number of reads performed within read-write transactions.
RoReadsThe total number of reads performed within read-only transactions.
WritesThe total number of write operations for this table.

thread.pools.{thread-pool-executor-name}

Metric nameDescription
ActiveCountThe approximate number of threads that are actively executing tasks.
CompletedTaskCountThe approximate total number of tasks that have completed execution.
CorePoolSizeThe core number of threads.
KeepAliveTimeThe thread keep-alive time, which is the amount of time threads in excess of the core pool size may remain idle before being terminated.
LargestPoolSizeThe largest number of threads that have ever simultaneously been in the pool.
MaximumPoolSizeThe maximum allowed number of threads.
PoolSizeThe current number of threads in the pool.
TaskCountThe approximate total number of tasks that have been scheduled for execution.
QueueSizeThe current size of the execution queue.

topology.cluster

Metrics for the cluster topology.

Metric nameDescription
ClusterIdThe unique identifier of the cluster.
ClusterNameThe unique name of the cluster.
TotalNodesThe total number of nodes in the logical topology.

topology.local

Metrics with node information.

Metric nameDescription
NodeNameThe unique name of the node.
NodeIdThe unique identifier of the node.
NodeVersionThe Ignite version on the node.

transactions

Transaction metrics.

Metric nameDescription
RwCommitsThe total number of read-write transaction commits.
RoCommitsThe total number of read-only transaction commits.
RwRollbacksThe total number of read-write transaction rollbacks.
RoRollbacksThe total number of read-only transaction rollbacks.
RwDurationThe histogram representation of read-write transaction latency.
RoDurationThe histogram representation of read-only transaction latency.
TotalRollbacksThe total number of transaction rollbacks.
TotalCommitsThe total number of transaction commits.

zones

Metric nameDescription
LocalUnrebalancedPartitionsCountThe number of partitions that should be moved to this node.
TotalUnrebalancedPartitionsCountThe total number of partitions that should be moved to a new owner.