Monitor Machines

In Machines, you can view the health status of each individual Logpoint Machine connected to Director, along with:

Machine - the name of the Logpoint Machine.

Pool - the name of the pool to which the machine is connected.

Version - the machine’s version number.

Status - the health status of the machine. Green means healthy, amber is a warning and red means there is an error.

Action - contains two hyperlinks: Health Overview and Snoozed List for that particular machine. See Health Overview for details. Click Snoozed List to view snoozed metrics and time. You can only view the snoozed list if metrics are selected for snooze. See Snoozing the Metrics for details.

Health Monitoring

Health Status

When you select an individual machine in the list, Health Status on the right side displays a health summary of that machine. The summary covers its health metrics, including Services, Disks, Repos and Log Sources.

Health Summary

Snoozing the Metrics

You can snooze any health metric, service, disk, repo or log source that has Error or Warning status for a set period of time. Snoozing disables the threshold rules configured through Dashboard Settings, so the machine’s status shows as Healthy for that period. While a metric is snoozed, an icon appears beside the metric on the summary page and beside the Status of the machine, as a reminder that the machine is Healthy only because one or more threshold rules are snoozed. To snooze a metric:

  1. Go to Health Monitoring >> Machines.

  2. Click on the machine whose status is Error or Warning. A health summary appears on the right.

  3. Click the clock icon next to the metric with a red or amber indicator.

  4. Enter a numerical value for the time and select Minutes, Hours or Days.

  5. Click OK.

Snooze
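The effect of a snooze can be sketched in a few lines of Python. This is only an illustration of the behaviour described above, not Director's implementation: the function names `snooze_until` and `effective_status` and the `UNITS` mapping are hypothetical, and the sketch assumes that a snoozed metric reports Healthy until the chosen period ends.

```python
from datetime import datetime, timedelta

# Hypothetical mapping from the dialog's unit choices to timedelta arguments.
UNITS = {"Minutes": "minutes", "Hours": "hours", "Days": "days"}

def snooze_until(now, value, unit):
    """Return the time at which a snooze started at `now` would expire."""
    if value <= 0:
        raise ValueError("snooze duration must be positive")
    return now + timedelta(**{UNITS[unit]: value})

def effective_status(raw_status, snoozed_until, now):
    """Report Healthy while a snooze is active, otherwise the raw status."""
    if snoozed_until is not None and now < snoozed_until:
        return "Healthy"
    return raw_status

# Snooze an Error metric for 2 hours, then check it during and after the snooze.
start = datetime(2024, 1, 1, 9, 0)
expiry = snooze_until(start, 2, "Hours")
during = effective_status("Error", expiry, start + timedelta(hours=1))
after = effective_status("Error", expiry, start + timedelta(hours=3))
print(during, after)
```

In this sketch the underlying status is never changed. The snooze only masks it, so the Error reappears once the period ends or the metric is removed from the snoozed list.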

Snoozed List

Snoozed List shows the snoozed metrics along with their snoozed time. To unsnooze a metric and remove it from the list, click Remove under ACTION.

Snoozed List

You can also unsnooze a metric by clicking the icon shown beside the snoozed metric.

Health Overview

Health Overview provides detailed health metrics of an individual Logpoint machine. The metrics are displayed as graphs, making the health information easy to understand and monitor.

The metrics are:

Available Memory - the amount of RAM that is available for the system to use right away.

Free Memory - the amount of RAM that is not being used by the operating system or any applications at the moment.

Inactive Memory - the amount of RAM that was recently accessed but is not currently in use.

I/O Wait - the time the CPU waits for input/output operations to complete.

Idle Percent - the percentage of time the CPU is idle.

Average Load - a measure of system activity, indicating the average number of processes waiting for CPU time over a specific period.

Open Files - the number of files currently open or in use by the system.

Device Count - the total number of devices connected or recognized.

Top 5 CPU Usage - a list of the five processes consuming the most CPU resources.

Top 5 Memory Usage - a list of the five processes using the most memory resources.

Each health metric is checked against its thresholds, and its status is marked Error with red, Warning with amber or Healthy with green.
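As a rough illustration of how a threshold rule could map a metric value to one of the three statuses, consider the sketch below. The function `classify`, its parameters and the sample threshold values are hypothetical. The actual rules are whatever is configured through Dashboard Settings. The `higher_is_worse` flag is an assumption added to cover metrics such as Available Memory, where a low value is the problem.

```python
def classify(value, warning, error, higher_is_worse=True):
    """Map a metric value to Error, Warning or Healthy using two thresholds."""
    if not higher_is_worse:
        # Flip the sign so that "exceeds the threshold" works for low-is-bad metrics.
        value, warning, error = -value, -warning, -error
    if value > error:
        return "Error"
    if value > warning:
        return "Warning"
    return "Healthy"

# Made-up thresholds: I/O wait in percent, available memory in percent.
io_wait_status = classify(95, warning=80, error=90)
memory_status = classify(15, warning=20, error=10, higher_is_worse=False)
print(io_wait_status, memory_status)
```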

Health Overview

Ingestion Pipeline

Ingestion Pipeline is the graphical representation of event processing data and a machine’s Queue Status. It gives a central view of log processing across the various layers of Logpoint, which helps in detecting log congestion, latency or service outages. You can view statistical data such as Standard Deviation, Average, Maximum and Minimum values.
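For readers who want to reproduce these four statistics on their own data, the following sketch computes them with Python's standard library. The helper `summarize` and the sample readings are made up for illustration. The sketch uses the population standard deviation, which is an assumption, since the variant used by the product is not stated here.

```python
import statistics

def summarize(samples):
    """Return average, standard deviation, maximum and minimum of the samples."""
    if not samples:
        raise ValueError("at least one sample is required")
    return {
        "average": statistics.fmean(samples),
        # Population standard deviation; the product may use a different variant.
        "standard_deviation": statistics.pstdev(samples),
        "maximum": max(samples),
        "minimum": min(samples),
    }

# Made-up events-per-second readings, for illustration only.
eps_samples = [200, 400, 400, 400, 500, 500, 700, 900]
summary = summarize(eps_samples)
print(summary)
```

A standard deviation that is large relative to the average suggests bursty ingestion, which is one of the patterns the graphs help to spot.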

Ingestion Pipeline Events Per Second
Ingestion Pipeline Queue Status

Services

Services shows the status and the path of a machine’s services. Use the column headings to sort the Services list. You can filter the list by service name, path or status, using a status indicator color (red, amber or green). A service marked red with the status Stopped indicates an error on its machine.

Services Monitoring

Disks

Disks displays information about a machine’s disks, including file system, size, used memory, available memory, used percentage and mount points. Use the column headings to sort the list. Use a status indicator color (red, amber or green) to filter the list.

Each disk’s Used Percentage is checked against its thresholds, and the disk’s status is marked Error with red, Warning with amber or Healthy with green.

Disks Monitoring

Repos

Repos lists a selected machine’s repo details, including name, usage and status. Use the column headings to sort the list. You can enter a repo name or a status (red, amber or green) to filter the list.

Repos whose status is Down are marked red, which indicates an error.

Repos Monitoring

Last Log Received

Last Log Received lists the log sources from which the last logs originated, along with their name, type, last log received timestamp and status. Enter a name or a status indicator to filter the list. Use the column headings to sort the list.

The time since each source’s last log is checked against its thresholds, and the source’s status is marked Error with red, Warning with amber or Healthy with green.

A log source whose Last Log Received value is N/A has the status Inactive and is marked in grey.
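The status logic for log sources, including the Inactive case, might look like the sketch below. It is a hedged illustration only: `last_log_status`, its parameters and the 15-minute and 1-hour limits are hypothetical, and the sketch assumes that N/A corresponds to a source with no recorded last log (`None`).

```python
from datetime import datetime, timedelta

def last_log_status(last_received, now, warning_after, error_after):
    """Classify a log source by the age of its most recent log."""
    if last_received is None:
        # Assumed equivalent of N/A: no last log is recorded for the source.
        return "Inactive"
    age = now - last_received
    if age > error_after:
        return "Error"
    if age > warning_after:
        return "Warning"
    return "Healthy"

# Made-up limits: warn after 15 minutes of silence, raise an error after 1 hour.
now = datetime(2024, 1, 1, 12, 0)
limits = {"warning_after": timedelta(minutes=15), "error_after": timedelta(hours=1)}
recent = last_log_status(datetime(2024, 1, 1, 11, 55), now, **limits)
stale = last_log_status(datetime(2024, 1, 1, 10, 30), now, **limits)
never = last_log_status(None, now, **limits)
print(recent, stale, never)
```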

Last Logs Monitoring
