Back to Opsview FAQ

Collector(s) offline

You may see your Collector or Collector Cluster is offline and not be sure of the cause.

This relates to the cluster-health of your Collector Cluster and the reason why this has happened may be different. Some basic checks for reviewing this offline state are listed below to assist with troubleshooting this topic before contacting Support.

Process Basic first check for a collector(s) cluster being “OFFLINE” Copied

  1. Check if your collectors are indeed down or have a connectivity issue
/opt/opsview/watchdog/bin/opsview-monit summary -B
  1. Check the opsview-messagequeue “cluster-health-queue”
/opt/opsview/messagequeue/sbin/rabbitmqctl list_queues | awk '$2>0'
/opt/opsview/messagequeue/sbin/rabbitmqctl cluster_status

Cluster status of node rabbit@omcloud-cs-testbed.cloud.opsview.com ...
Basics
Cluster name: rabbit@testcollector1.cloud.opsview.com

Disk Nodes
rabbit@testcollector1.cloud.opsview.com
rabbit@testcollector2.cloud.opsview.com
rabbit@testcollector3.cloud.opsview.com

Running Nodes
rabbit@testcollector1.cloud.opsview.com
rabbit@testcollector2.cloud.opsview.com
rabbit@testcollector3.cloud.opsview.com
...
...
  1. Are the opsview version of your orchestrator and collector(s) in sync?

— also the Component Overview page will list all components for an Opsview server

rpm -qa | grep opsview | sort
or
dpkg -l | grep opsview

If your collector(s) version does not match between the UI and server/collector(s) level, then please review “Why does my collector show the wrong Opsview version in the UI?”

  1. Check your collector(s) disk space are not full
df -h /
  1. Use the top command to review the:
top
  1. Ensure the time of all your servers are in sync
timedatectl status
  1. Check the system logs on the orchestrator and on the collector(s) server for any communication refused error messages
/var/log/opsview/opsview.log
/var/log/messages
["Geneos"] ["FAQ"]

Was this topic helpful?