Commit Graph

51 Commits

Author SHA1 Message Date
Karol Wypchlo 36aa7c8311 improve health check reliability 2021-07-12 14:53:12 +02:00
Karol Wypchlo a2aa850632 improve health check reliability 2021-07-12 14:49:53 +02:00
Karol Wypchlo 7fd97b5824 improve health check reliability 2021-07-12 14:48:13 +02:00
Karol Wypchlo 49bb6dd2e2 fix portal size check reporting zero files 2021-06-15 11:41:01 +02:00
Karol Wypchło b8a6816876 fixed health check blowing up on eu-fin-3 (#838) * request 127.0.0.1 over https - http localhost causes issues * reformat with black 2021-06-07 15:08:18 +02:00
Karol Wypchlo cd7dac5b7e verbose => extended 2021-04-29 13:43:40 +02:00
Karol Wypchlo 1c99da3af8 fix repair string 2021-04-14 12:17:01 +02:00
Karol Wypchlo f48a8d9302 fix health-check 2021-04-13 16:19:42 +02:00
Matthew Sevey c752a17058 Update setup-scripts/health-checker.py 2021-02-03 10:30:27 -07:00
Matthew Sevey 50dff35da8 Update setup-scripts/health-checker.py Co-authored-by: Marcin S. <scatman@bu.edu> 2021-02-03 10:22:24 -07:00
Matthew Sevey ff183beb66 Add repair size information to health checker 2021-02-03 09:42:55 -07:00
Karol Wypchło c0673b3f76 do not ping when server is in maintenance mode (#552) 2020-12-01 13:31:59 +01:00
Matthew Sevey 5f76d1ca52 remove error alert notification, subtract out siafile alerts 2020-11-24 07:49:51 -07:00
Karol Wypchlo 2dfb6d6a56 restore "or" 2020-11-24 15:26:51 +01:00
Karol Wypchlo 383144b7a6 tweak notifications on number of files in a node 2020-11-24 13:16:25 +01:00
Karol Wypchlo 7946f97d58 tweak notifications on error alerts 2020-11-24 13:08:08 +01:00
Ivaylo Novakov 41460f155f Moved the container name var to the global space where it belongs. 2020-11-20 22:08:04 +01:00
Ivaylo Novakov 801597ccde Fixed some typos. Fixed formatting (force of habit...). 2020-11-20 21:45:19 +01:00
Matthew Sevey 05cd1bfb32 fix weird formatting 2020-11-20 11:46:35 -07:00
Matthew Sevey a337b754a8 run python format 2020-11-20 11:33:07 -07:00
Matthew Sevey efc6060924 scripts: update file health check to check siac output. Add total files check 2020-11-20 11:26:20 -07:00
Matthew Sevey 243d084b5d update message for siafile bad health 2020-11-18 11:04:04 -07:00
Matthew Sevey 09a4b646ec srcipts: add alert check to the python scripts 2020-11-18 10:21:06 -07:00
Karol Wypchlo 1922c4cd98 use os.popopen manually 2020-10-06 12:12:19 +02:00
Karol Wypchlo 9b6d61aa7e remove unnecessary time dependency 2020-10-06 11:27:06 +02:00
Karol Wypchlo 60f8371170 stop sia container on critical disk space threshold 2020-10-06 11:24:18 +02:00
Karol Wypchlo 2328e605b7 parse disk size as int before multiplying 2020-10-05 10:03:10 +02:00
Karol Wypchło e58752571e add response content to health check failures (#437) 2020-09-30 16:20:55 +02:00
Karol Wypchło 10a251c081 reimplement health checks (#434) 2020-09-29 12:32:45 +02:00
Karol Wypchlo 20362fe7c5 fix health checks 2020-09-10 15:16:31 +02:00
Ivaylo Novakov 8235d75795 Only announce healthy status once a day. 2020-09-08 18:20:56 +02:00
Ivaylo Novakov ddf72ad850 Make the time comparisons in the health checker timezone-aware. 2020-09-08 18:07:33 +02:00
Ivaylo Novakov 2d032dbf17 Docstrings. 2020-09-07 17:59:39 +02:00
Ivaylo Novakov 0838e4f5e5 Add free disk space check to health-checker.py. Move load-average check to health-checker.py. 2020-09-07 17:56:47 +02:00
Ivaylo Novakov 3f4742a436 Only notify the team if critical checks have failed. 2020-09-04 17:17:26 +02:00
Ivaylo Novakov 5eece67b03 Move parameter parsing to the top of the script. 2020-09-04 17:13:36 +02:00
Ivaylo Novakov 1cc20903c6 Move max discord message len to a constant. Report critical checks failed. Formatting. 2020-09-04 17:07:47 +02:00
Ivaylo Novakov a0a9137ae7 Update setup-scripts/health-checker.py Co-authored-by: Karol Wypchło <kwypchlo@gmail.com> 2020-09-04 16:44:19 +02:00
Ivaylo Novakov 62e27120cd Use localhost. 2020-09-04 16:39:39 +02:00
Ivaylo Novakov 59a77bfaf6 Add a health checker script to Gollum. 2020-09-04 16:12:20 +02:00
Marcin Jachymiak 285d915602 Rename health-checker to funds-checker 2020-03-09 15:59:21 -04:00
Marcin Jachymiak 6d8fe9ca02 Add log checker 2020-03-09 15:55:03 -04:00
Marcin Jachymiak 4b1ac49ef1 Refactor bot_utils out of health-checker 2020-03-09 12:15:13 -04:00
Marcin Jachymiak 5c87cb93dd Add override to load_dotenv call 2020-03-06 14:10:51 -05:00
Marcin Jachymiak 66218fde30 Check allocation not spent. Cleanup msg format 2020-03-06 10:11:42 -05:00
Marcin Jachymiak 29a73e5963 Add health-check setup script 2020-03-05 12:07:37 -05:00
Marcin Jachymiak 2af43ed499 fix api pass env 2020-03-05 10:09:48 -05:00
Marcin Jachymiak d9b27d416c Cleanup Discord msgs and add password var 2020-03-05 10:09:48 -05:00
Marcin Jachymiak 64681bf19d Fixup port env var 2020-03-05 10:09:48 -05:00
Marcin Jachymiak cbe73e14c7 Cleanup health check script 2020-03-05 10:09:48 -05:00