Skip to main content

The Service Watchdog

Fusion File Share Server includes a service watchdog that monitors the health of major services (processes or threads), such as authentication service, server service etc. using a keep-alive request–response mechanism.

If the watchdog detects that a service has failed to respond to a keep-alive request, it will terminate all remaining services.

Configuring the Watchdog

The watchdog is configured using the following parameters:

Global Parameter Watchdog's Request Interval

Value Type: integer

Value Format: <seconds>
Where <seconds> is the time interval (in seconds) between the watchdog's keep-alive requests. The value must be a positive integer.

Default Value: 10

Global Parameter Watchdog's Request Timeout

Value Type: integer

Value Format: <seconds>
Where <seconds> is the amount of time (in seconds) to wait for a keep-alive response. The value must be a positive integer, of a reasonably high value, otherwise, under heavy load some services might be temporarly unresponsive. For example, if a lot of users (e.g. 1000) are trying to authenticate at the same time, then authentication service can take some time to process each authentication request. If the keep-alive timeout is kept very small, then authentication service will fail to respond to a watchdog keep-alive request within the timeout, thus terminating the service. Usually, there's no need to change this value from the default.

Default Value: 200

Configuring the Using the Configuration File

In the global section of the configuration file adjust the watchdog_interval and watchdog_timeout parameters as needed.

For example:

watchdog_interval = 10
watchdog_timeout = 200