Alerts code reference
Learn about the alerts ThoughtSpot may generate.
This reference identifies the messages that can appear in the panel and in the Alerts dashboard.
Informational alerts
- APPLICATION_INVALID_STATE
  Raised when the application reports an invalid state.
  - Msg: {{.Service}}.{{.Task}} on {{.Machine}} at location {{.Location}}
  - Type: INFO
 
- DISK_ERROR
  Raised when a machine has disk errors.
  - Msg: Machine {{.Machine}} has disk errors
  - Type: INFO
 
- HDFS_CORRUPTION
  Raised when the HDFS root directory is corrupted.
  - Msg: HDFS root directory is in a corrupted state.
  - Type: INFO
 
- MASTER_ELECTION
  Raised when a new Orion Master is elected.
  - Msg: {{.Machine}} elected as Orion Master
  - Type: INFO
 
- PERIODIC_BACKUP
  Raised when a periodic backup fails.
  - Msg: {{.Process}} periodic backup for policy {{.Name}} failed.
  - Type: INFO
 
- PERIODIC_SNAPSHOT
  Raised when a periodic snapshot fails.
  - Msg: {{.Process}} periodic snapshot {{.Name}} failed.
  - Type: INFO
 
- TASK_TERMINATED
  Raised when a task terminates.
  - Msg: Task {{.Service}}.{{.Task}} terminated on machine {{.Machine}}
  - Type: INFO
 
- UPDATE_END
  Raised when an update completes.
  - Msg: Finished update of ThoughtSpot cluster {{.Cluster}} to release {{.Release}}
  - Type: INFO
 
- UPDATE_START
  Raised when an update starts.
  - Msg: Starting update of ThoughtSpot cluster {{.Cluster}}
  - Type: INFO
 
- ZK_AVG_LATENCY
  Raised when the average Zookeeper latency is above a threshold.
  - Msg: Average Zookeeper latency is more than {{.Num}} msec
  - Type: INFO
 
- ZK_MAX_LATENCY
  Raised when the maximum Zookeeper latency is above a threshold.
  - Msg: Max Zookeeper latency is more than {{.Num}} msec
  - Type: INFO
 
- ZK_MIN_LATENCY
  Raised when the minimum Zookeeper latency is above a threshold.
  - Msg: Min Zookeeper latency is more than {{.Num}} msec
  - Type: INFO
 
- ZK_NUM_WATCHERS
  Raised when there are too many Zookeeper watchers.
  - Msg: Number of Zookeeper watchers exceeds {{.Num}}
  - Type: INFO
 
- ZK_OUTSTANDING_REQUESTS
  Raised when there are too many outstanding Zookeeper requests.
  - Msg: Number of outstanding Zookeeper requests exceeds {{.Num}}
  - Type: INFO
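
The Msg values in this reference contain placeholders such as {{.Machine}} and {{.Num}}, which resemble Go text/template fields. The following is a minimal sketch of how such a message could be filled in, assuming the placeholders follow text/template semantics. The renderAlert helper and the sample field values are hypothetical and are not part of ThoughtSpot.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// renderAlert is a hypothetical helper: it fills a Msg template with the
// supplied field values. A map is used so the sketch does not depend on
// any real alert data structure.
func renderAlert(msg string, fields map[string]string) (string, error) {
	// missingkey=error makes a placeholder without a value fail loudly
	// instead of rendering as an empty string.
	tmpl, err := template.New("alert").Option("missingkey=error").Parse(msg)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := tmpl.Execute(&sb, fields); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	// Sample values are made up for illustration.
	out, err := renderAlert(
		"Task {{.Service}}.{{.Task}} terminated on machine {{.Machine}}",
		map[string]string{
			"Service": "example_service",
			"Task":    "example_task",
			"Machine": "node1",
		},
	)
	if err != nil {
		fmt.Println("render failed:", err)
		return
	}
	fmt.Println(out)
}
```

When searching logs or notifications for a given alert, the fixed words of the Msg are the stable part to match on, because the placeholder values vary per occurrence.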
 
Errors
- TIMELY_ERROR
  Raised when a job manager runs into an inconsistent state.
  - Msg: Job manager {{.Message}}
  - Type: ERROR
 
- TIMELY_JOB_RUN_ERROR
  Raised when a job run fails.
  - Msg: Job run {{.Message}}
  - Type: ERROR
 
Warnings
- BOOT_DISK_SPACE
  Raised when a machine is low on available disk space on the boot partition.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free on boot partition
  - Type: WARNING
 
- DISK_ERROR_EXTERNAL
  Raised when more than 2 disk errors occur in a day.
  - Msg: Machine {{.Machine}} has disk errors
  - Type: WARNING
 
- DISK_SPACE
  Raised when a disk is low on available space. Valid only in ThoughtSpot version 3.2.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free
  - Type: WARNING
 
- EXPORT_DISK_SPACE
  Raised when a machine is low on available disk space on the export partition.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free on export partition
  - Type: WARNING
 
- HDFS_NAMENODE_DISK_SPACE
  Raised when a machine is low on available disk space on the HDFS namenode drive.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free on HDFS namenode drive
  - Type: WARNING
 
- HOST_DOWN
  Raised when a host is down.
  - Msg: {{.Machine}} is down
  - Type: WARNING
 
- MEMORY
  Raised when a machine is low on free memory.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% memory free
  - Type: WARNING
 
- OS_PROCS
  Raised when a machine has too many processes.
  - Msg: Machine {{.Machine}} has more than {{.Num}} processes
  - Type: WARNING
 
- OS_USERS
  Raised when a machine has too many users logged in.
  - Msg: Machine {{.Machine}} has more than {{.Num}} logged in users
  - Type: WARNING
 
- ROOT_DISK_SPACE
  Raised when a machine is low on available disk space on the root partition.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free on root partition
  - Type: WARNING
 
- SSH
  Raised when a machine does not have an active SSH server.
  - Msg: Machine {{.Machine}} doesn’t have an active SSH server
  - Type: WARNING
 
- TASK_NOT_RUNNING
  Raised when a service task is not running on any machine in the cluster.
  - Msg: {{.ServiceDesc}} is not running
  - Type: WARNING
 
- TASK_UNREACHABLE
  Raised when a task is unreachable over HTTP.
  - Msg: {{.ServiceDesc}} on {{.Machine}} is unreachable over HTTP
  - Type: WARNING
 
- UPDATE_DISK_SPACE
  Raised when a machine is low on available disk space on the update partition.
  - Msg: Machine {{.Machine}} has less than {{.Perc}}% disk space free on update partition
  - Type: WARNING
 
- ZK_EPHEMERAL_COUNT
  Raised when there are too many Zookeeper ephemeral files.
  - Msg: Zookeeper has more than {{.Num}} ephemeral files
  - Type: WARNING
 
- ZK_FD_COUNT
  Raised when Zookeeper has too many open file descriptors.
  - Msg: Zookeeper has more than {{.Num}} open file descriptors
  - Type: WARNING
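
Several warnings above fire when a free percentage drops below {{.Perc}}. The following is a rough sketch of that kind of check, not ThoughtSpot's actual implementation. The lowSpaceMessage function, its parameters, and the 10% threshold in the example are assumptions made for illustration; this reference does not state the real threshold values.

```go
package main

import "fmt"

// lowSpaceMessage is a hypothetical check. It reports whether the free share
// of a partition is below thresholdPerc and, if so, returns text shaped like
// the disk space Msg entries in this reference.
func lowSpaceMessage(machine, partition string, freeBytes, totalBytes uint64, thresholdPerc float64) (string, bool) {
	if totalBytes == 0 {
		// Without a known capacity there is nothing to compare against.
		return "", false
	}
	freePerc := float64(freeBytes) / float64(totalBytes) * 100
	if freePerc >= thresholdPerc {
		return "", false
	}
	msg := fmt.Sprintf("Machine %s has less than %g%% disk space free on %s partition",
		machine, thresholdPerc, partition)
	return msg, true
}

func main() {
	// Assumed values: 5 GiB free out of 100 GiB, with a made-up 10% threshold.
	const gib = 1 << 30
	if msg, low := lowSpaceMessage("node1", "root", 5*gib, 100*gib, 10); low {
		fmt.Println(msg)
	}
}
```

Note that the rendered message reports the threshold, not the measured free percentage, which matches the "less than {{.Perc}}%" wording of the Msg entries.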
 
Critical alerts
- APPLICATION_INVALID_STATE_EXTERNAL
  Raised when the application reports an invalid state.
  - Msg: {{.Service}}.{{.Task}} on {{.Machine}} at location {{.Location}}
  - Type: CRITICAL
 
- HDFS_DISK_SPACE
  Raised when an HDFS cluster is low on total available disk space.
  - Msg: HDFS has less than {{.Perc}}% space free
  - Type: CRITICAL
 
- OREO_TERMINATED
  Raised when the Oreo daemon on a machine terminates due to an error. This typically results from an error accessing Zookeeper or HDFS, or from a hardware issue.
  - Msg: Oreo terminated on machine {{.Machine}}
  - Type: CRITICAL
 
- PERIODIC_BACKUP_FLAPPING
  Raised when a periodic backup fails repeatedly.
  - Msg: Periodic backup failed {{._actual_num_occurrences}} times in last {{._earliest_duration_str}}
  - Type: CRITICAL
 
- PERIODIC_SNAPSHOT_FLAPPING
  Raised when a periodic snapshot fails repeatedly.
  - Msg: Periodic snapshot failed {{._actual_num_occurrences}} times in last {{._earliest_duration_str}}
  - Type: CRITICAL
 
- TASK_FLAPPING
  Raised when a task crashes repeatedly. Crashes are counted across the whole cluster: if a service crashes 5 times in a day across all nodes in the cluster, this alert is generated.
  - Msg: Task {{.Service}}.{{.Task}} terminated {{._actual_num_occurrences}} times in last {{._earliest_duration_str}}
  - Type: CRITICAL
 
- ZK_INACCESSIBLE
  Raised when Zookeeper is inaccessible.
  - Msg: Zookeeper is not accessible
  - Type: CRITICAL