Alert list – Virtuozzo Hybrid Infrastructure

License alerts

License is not loaded

License is not installed.

warning

License expired

The license of cluster “<cluster_name>” has expired. Contact your reseller to update your license immediately!

critical

Cluster alerts

Cluster is out of space

Cluster has just <free_space> TB (<free_space_in_percent>%) of physical storage space left. You may want to free some space or add more storage capacity.

warning

Cluster “<cluster_name>” has run out of storage space allowed by license. No more data can be written. Please contact your reseller to update your license immediately!

warning

Not enough cluster nodes

Cluster “<cluster_name>” has only {1,2} node(s) instead of the recommended minimum of 3. Add {2,1} or more nodes to the cluster.

warning

High availability for the admin panel must be configured

Configure high availability for the admin panel in Settings > Management node. Otherwise the admin panel will be a single point of failure.

critical

Management node backup does not exist

Management node backup is older than <number_of_days> days.

critical

The last management node backup has failed, does not exist, or is too old.

critical

Changes to the management database are not replicated

Changes to the management database are not replicated to the node "<hostname>" because it is offline. Check the node's state and connectivity.

critical

Changes to the management database are not replicated to the node "<hostname>". Please contact the technical support.
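The "Not enough cluster nodes" alert in this section pairs node counts of {1,2} with {2,1} nodes to add, which is simply the shortfall against the recommended minimum of 3. A minimal sketch of that arithmetic follows; the function and constant names are hypothetical and are not part of the product.

```python
# Recommended minimum taken from the alert text above.
RECOMMENDED_MIN_NODES = 3


def nodes_to_add(current_nodes: int) -> int:
    """Return how many nodes are missing to reach the recommended minimum.

    Hypothetical helper for illustration only; returns 0 when the cluster
    already has the recommended number of nodes or more.
    """
    if current_nodes < 0:
        raise ValueError("node count cannot be negative")
    return max(0, RECOMMENDED_MIN_NODES - current_nodes)
```

For example, a one-node cluster yields 2 and a two-node cluster yields 1, matching the {1,2} and {2,1} pairs in the alert message.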

Cluster connectivity alerts

Cluster network connectivity problem

All nodes have network connectivity problems: unstable connectivity via network "<network_name>" due to packet loss.

critical

All nodes have network connectivity problems: no connectivity via network "<network_name>".

critical

Node network connectivity problem

Node "<hostname>" has network connectivity problems: unstable connectivity via network "<network_name>" due to the loss of all MTU-sized packets.

critical

Node "<hostname>" has network connectivity problems: unstable connectivity via network "<network_name>" due to the loss of some MTU-sized packets.

critical

Node "<hostname>" has network connectivity problems: unstable connectivity via network "<network_name>" due to packet loss.

critical

Node "<hostname>" has network connectivity problems: no connectivity to node "<hostname>" with interface "<iface>" via interface "<iface>".

critical

Node "<hostname>" has network connectivity problems: unstable connectivity to node "<hostname>" with interface "<iface>" via interface "<iface>" due to the loss of all MTU-sized packets.

critical

Node "<hostname>" has network connectivity problems: unstable connectivity to node "<hostname>" with interface "<iface>" via interface "<iface>" due to packet loss.

critical

Node "<hostname>" has network connectivity problems: unstable connectivity to node "<hostname>" with interface "<iface>" via interface "<iface>" due to the loss of some MTU-sized packets.

critical

MTU mismatch

Some interfaces have MTU that differs from other interfaces in the same network: network "<network_name>" interface@host "<iface>@<hostname>".

critical

Metadata service alerts

Not enough metadata disks

Cluster “<cluster_name>” has only one MDS. There is only one disk with the metadata role at the moment. Losing this disk will completely destroy all cluster data irrespective of the redundancy schema.

critical

Cluster “<cluster_name>” requires more disks with the metadata role. Losing one more MDS will halt cluster operation.

warning

Configuration warning

Node “<hostname>” has more than one metadata service located on it. It is recommended to have only one metadata service per node. Delete the extra metadata service(s) from this node and create them on other nodes instead.

warning

Cluster “<cluster_name>” has four metadata services. This configuration slows down the cluster performance and does not improve its availability. For a cluster of four nodes, it is enough to configure three MDSes. Delete an extra MDS from one of the cluster nodes.

Cluster “<cluster_name>” has more than five metadata services. This configuration slows down the cluster performance and does not improve its availability. For a large cluster, it is enough to configure five MDSes. Delete extra MDSes from the cluster nodes.

Service failed

Metadata service #<id> is in the “<status>” state. Node: <hostname>. Disk: <disk_name>. Disk serial: <disk_serial>.

warning

Metadata disk is out of space

Metadata disk on node “<hostname>” is running out of space.

warning
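The count-based metadata service messages in this section can be summarized with a small lookup. The sketch below is illustrative only: the function name is hypothetical, and it covers just the three counts stated explicitly above (one, four, more than five). The threshold for the "requires more disks with the metadata role" alert is not stated in this list, so it is deliberately not modeled.

```python
from typing import Optional


def mds_count_note(mds_count: int) -> Optional[str]:
    """Map an MDS count to the matching message fragment from the list above.

    Hypothetical helper; returns None for counts that this list does not
    describe explicitly.
    """
    if mds_count == 1:
        # Losing the only metadata disk destroys all cluster data.
        return "only one MDS"
    if mds_count == 4:
        # Three MDSes are enough for a cluster of four nodes.
        return "four metadata services"
    if mds_count > 5:
        # Five MDSes are enough for a large cluster.
        return "more than five metadata services"
    return None
```

Counts of three and five return None here, which is consistent with the recommendations quoted in the messages.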

Chunk service alerts

Not enough disks with storage role

Cluster “<cluster_name>” has no disks with the storage role.

warning

Cluster “<cluster_name>” has too few available CSes.

warning

Service failed

Storage service #<id> is in the “<status>” state. Node: <hostname>. Disk: <disk_name>. Disk serial: <disk_serial>.

warning

CS configuration is not optimal

CS#<cs_id> on tier <tier> has incorrect journalling settings.

warning

Encryption is disabled for CS#<cs_id> on tier <tier> but is enabled for other CSes on the same tier.

warning

Storage disk is slow

Disk <disk_name> (CS#<cs_id>) on node <hostname> is slow and needs to be replaced.

warning

Disk cache settings are not optimal

Disk <disk_name> (CS#<cs_id>) on node <hostname> has cache settings different from other disks of the same tier.

warning

Node alerts

Node is offline

Node “<hostname>” is offline.

warning

Node got offline too many times

Node “<hostname>” got offline too many times last hour.

warning

Software updates exist

Software updates exist for the node “<hostname>”.

warning

Kernel is outdated

Node “<hostname>” is not running the latest kernel.

warning

OOM killer triggered

OOM killer has been triggered on node “<hostname>”.

warning

Time is not synced

Time on node “<hostname>” differs from time on backend node by more than 5 seconds.

warning

No Internet access

Cluster node <hostname> cannot reach the repository. Make sure that all cluster nodes have Internet access.

warning

Incompatible hardware detected

Incompatible hardware detected on node "<hostname>": <hardware_list>. Using Mellanox and AMD may lead to data loss. Please double check that SR-IOV is properly enabled.

critical

Disk alerts

S.M.A.R.T. warning

Disk “<disk_name>” (<serial>) on node “<hostname>” has failed a S.M.A.R.T. check.

critical

Disk error

Disk “<disk_name>” (<serial>) failed on node “<hostname>”.

critical

Disk is out of space

Root partition on node “<hostname>” is running out of space.

warning

Disk write cache is enabled

Disk write cache is enabled for disk “<disk_name>” on node “<hostname>”. Disable it to avoid potential data loss in case of a power outage.

warning

Disk write cache status unknown

Cannot determine the status of write cache for disk “<disk_name>” on node “<hostname>”.

warning

Network alerts

Network warning

Network interface “<iface_name>” has incorrect settings: <duplex> duplex and <speed> speed.

warning

Network interface “<iface_name>” on node “<hostname>” is missing important features (or has them disabled): “<feature_name>”.

warning

Network interface “<iface_name>” on node “<hostname>” is not in the full duplex mode.

warning

Network interface “<iface_name>” on node “<hostname>” has speed lower than the minimally required 1 Gbps.

warning

Network interface “<iface_name>” on node “<hostname>” has an undefined speed.

warning
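The duplex and speed conditions named in the network interface messages above can be expressed as a simple check. This is a minimal sketch under stated assumptions: the function name, its parameters, and the use of Mbps as the unit are hypothetical, and it does not reflect how the product actually collects interface data.

```python
from typing import List, Optional

# 1 Gbps minimum from the alert text above, expressed in Mbps (assumed unit).
MIN_SPEED_MBPS = 1000


def interface_warnings(full_duplex: bool, speed_mbps: Optional[int]) -> List[str]:
    """Return the warning fragments that would apply to one interface.

    Hypothetical helper; speed_mbps is None when the speed is undefined.
    """
    warnings = []
    if not full_duplex:
        warnings.append("not in the full duplex mode")
    if speed_mbps is None:
        warnings.append("undefined speed")
    elif speed_mbps < MIN_SPEED_MBPS:
        warnings.append("speed lower than the minimally required 1 Gbps")
    return warnings
```

An interface at full duplex and 10000 Mbps yields an empty list, while a half-duplex 100 Mbps interface yields both the duplex and the speed warnings.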

Other alerts

Compute cluster has failed

Compute cluster has failed. Unable to manage virtual machines.

critical

Redundancy warning

iSCSI LUN <lun_id> of target group “<target_group>” is set to failure domain “disk” even though <number_of_nodes> nodes are available. It is recommended to set the failure domain to “host” so that the LUN can survive host failures in addition to disk failures.

warning

S3 is set to failure domain “disk” even though <number_of_nodes> nodes are available. It is recommended to set the failure domain to “host” so that S3 can survive host failures in addition to disk failures.

warning

Certificate expiration

Acronis Backup Gateway certificate has expired. All backup operations have been stopped. Update the certificate on the Backup Gateway screen.

critical

Acronis Backup Gateway certificate will expire soon. Update the certificate on the Backup Gateway screen.

warning

Acronis Backup Gateway certificate will expire on "<expiration_date>". Update the certificate on the Backup Gateway screen.

iSCSI major upgrade failed

iSCSI major upgrade failed. Will be retried…

critical

S3 cluster misconfiguration

The S3 cluster configuration is not highly available. If one S3 node fails, the entire S3 cluster may become non-operational.

warning