The Service Watchdog
Fusion File Share Server includes a service watchdog that monitors the health of critical services (processes or threads), such as the authentication service and the SMB server service, using a keep-alive request–response mechanism.
If the watchdog detects that a service has failed to respond to a keep-alive request, it terminates all remaining services.
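The keep-alive behavior described above can be illustrated with a minimal Python sketch. This is not Fusion File Share Server's implementation: the `Service` class, its `respond` probe, and the `watchdog_round` function are hypothetical names introduced here only to show the idea that one missed keep-alive response leads to all services being stopped.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Service:
    """Hypothetical stand-in for a monitored service."""
    name: str
    # Hypothetical probe: returns the response time in seconds,
    # or None if the service never answered the keep-alive request.
    respond: Callable[[], Optional[float]]
    running: bool = True


def watchdog_round(services: Dict[str, Service], timeout: float) -> Optional[str]:
    """Send one keep-alive request to each service.

    If a service does not answer within `timeout` seconds, mark every
    service as stopped and return the name of the service that failed.
    Return None if all services answered in time.
    """
    for svc in services.values():
        elapsed = svc.respond()
        if elapsed is None or elapsed > timeout:
            # One failed service brings down all the others as well.
            for other in services.values():
                other.running = False
            return svc.name
    return None
```

In this sketch, a real watchdog would call `watchdog_round` once per keep-alive interval; the loop and the actual inter-process messaging are left out to keep the example short.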
Configuring the Watchdog
The watchdog is configured using the following parameters:
- Configuration file's [global] section: watchdog_interval
tsmb-cfg global update: Not supported
tsmb-cfg global add and tsmb-cfg global del: Not supported
Value Type: integer
Value Format: <seconds>
<seconds> specifies the time interval (in seconds) between the watchdog's keep-alive requests. The value must be a positive integer.
Default Value: 10
- Configuration file's [global] section: watchdog_timeout
tsmb-cfg global update: Not supported
tsmb-cfg global add and tsmb-cfg global del: Not supported
Value Type: integer
Value Format: <seconds>
<seconds> specifies the duration in seconds to wait for a keep-alive response. The value must be a positive integer, and should be set to a reasonably high value to avoid unintended service terminations under heavy load. For example, if a large number of users (e.g. 1000) attempt to authenticate at the same time, the authentication service may require additional time to process each request. If the keep-alive timeout is set too low, the authentication service might fail to respond to a watchdog keep-alive request within the timeout, causing the service to be terminated. The default value fits most use cases, and adjustments are usually unnecessary.
Default Value: 200
Configuring the Watchdog Using the Configuration File
In the [global] section of the configuration file, adjust the watchdog_interval and watchdog_timeout parameters as needed.
For example:
watchdog_interval = 10
watchdog_timeout = 200
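Before restarting the server with new values, it can help to check that both settings are positive integers. The following Python sketch shows one way to do that. It assumes the relevant fragment can be read as INI-style text by Python's standard `configparser` module, which may not hold for a complete Fusion File Share Server configuration file; the function name `read_watchdog_settings` is hypothetical and is not part of any Fusion tool.

```python
import configparser

# Default values as documented for the two watchdog parameters.
DEFAULTS = {"watchdog_interval": 10, "watchdog_timeout": 200}


def read_watchdog_settings(text: str) -> dict:
    """Read the watchdog parameters from INI-style text.

    Missing parameters fall back to the documented defaults.
    Raises ValueError if a value is not a positive integer.
    """
    parser = configparser.ConfigParser()
    parser.read_string(text)
    settings = {}
    for key, default in DEFAULTS.items():
        # getint raises ValueError when the value is not an integer.
        value = parser.getint("global", key, fallback=default)
        if value <= 0:
            raise ValueError(f"{key} must be a positive integer, got {value}")
        settings[key] = value
    return settings


example = """
[global]
watchdog_interval = 10
watchdog_timeout = 200
"""
print(read_watchdog_settings(example))
```

The check only covers the rule stated above (positive integers). It does not judge whether a timeout is high enough for a given load.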