The Service Watchdog
Fusion File Share Server includes a service watchdog that monitors the health of major services (processes or threads), such as authentication service, server service etc. using a keep-alive request–response mechanism.
If the watchdog detects that a service has failed to respond to a keep-alive request, it will terminate all remaining services.
Configuring the Watchdog
The watchdog is configured using the following parameters:
- Configuration file's
[global]
section:watchdog_interval
tsmb-cfg global update
: Not supportedtsmb-cfg global add
andtsmb-cfg global del
: Not supported
Value Type: integer
Value Format: <seconds>
Where <seconds>
is the time interval (in seconds) between the watchdog's keep-alive requests. The value must be a positive integer.
Default Value: 10
- Configuration file's
[global]
section:watchdog_timeout
tsmb-cfg global update
: Not supportedtsmb-cfg global add
andtsmb-cfg global del
: Not supported
Value Type: integer
Value Format: <seconds>
Where <seconds>
is the amount of time (in seconds) to wait for a keep-alive response. The value must be a positive integer, of a reasonably high value, otherwise, under heavy load some services might be temporarly unresponsive. For example, if a lot of users (e.g. 1000) are trying to authenticate at the same time, then authentication service can take some time to process each authentication request. If the keep-alive timeout is kept very small, then authentication service will fail to respond to a watchdog keep-alive request within the timeout, thus terminating the service. Usually, there's no need to change this value from the default.
Default Value: 200
Configuring the Using the Configuration File
In the global
section of the configuration file adjust the watchdog_interval
and watchdog_timeout
parameters as needed.
For example:
watchdog_interval = 10
watchdog_timeout = 200