Commit Graph

166 Commits

Author SHA1 Message Date
Karol Wypchlo 7946f97d58 tweak notifications on error alerts 2020-11-24 13:08:08 +01:00
Ivaylo Novakov 41460f155f Moved the container name var to the global space where it belongs. 2020-11-20 22:08:04 +01:00
Ivaylo Novakov 801597ccde
Fixed some typos.
Fixed formatting (force of habit...).
2020-11-20 21:45:19 +01:00
Matthew Sevey 05cd1bfb32 fix weird formatting 2020-11-20 11:46:35 -07:00
Matthew Sevey a337b754a8 run python format 2020-11-20 11:33:07 -07:00
Matthew Sevey efc6060924 scripts: update file health check to check siac output. Add total files check 2020-11-20 11:26:20 -07:00
Matthew Sevey 243d084b5d update message for siafile bad health 2020-11-18 11:04:04 -07:00
Matthew Sevey 09a4b646ec scripts: add alert check to the python scripts 2020-11-18 10:21:06 -07:00
Karol Wypchlo 05e34b5aaa rename blacklist to blocklist 2020-11-13 17:29:40 +01:00
Karol Wypchlo d4d3f6977b change recommended renewal from 8 weeks to 4 weeks 2020-10-30 13:38:17 +01:00
Karol Wypchlo f732df9660 fix crontab setup script 2020-10-27 16:39:00 +01:00
Karol Wypchlo 1922c4cd98 use os.popopen manually 2020-10-06 12:12:19 +02:00
Karol Wypchlo 9b6d61aa7e remove unnecessary time dependency 2020-10-06 11:27:06 +02:00
Karol Wypchlo 60f8371170 stop sia container on critical disk space threshold 2020-10-06 11:24:18 +02:00
Karol Wypchlo 2328e605b7 parse disk size as int before multiplying 2020-10-05 10:03:10 +02:00
Karol Wypchło e58752571e add response content to health check failures (#437) 2020-09-30 16:20:55 +02:00
Karol Wypchlo bc879b51cc update log message 2020-09-29 15:46:40 +02:00
Karol Wypchlo e7e7adc141 get api password from docker container 2020-09-29 15:42:41 +02:00
Karol Wypchło 468527c22d add portal_subdomain to nginx regex and fill documentation (#435) 2020-09-29 12:35:34 +02:00
Karol Wypchło 10a251c081 reimplement health checks (#434) 2020-09-29 12:32:45 +02:00
Matthew Sevey 998bacd93a Add ssh pubkey for Sevey 2020-09-25 11:54:34 -04:00
Peter-Jan Brone f3b2a130fe
Wildcard Subdomains Documentation (#416)
* Add documentation on subdomains

* Update copy

* Fix typo

* Link Karol his repo

* Update copy

* Update setup-scripts/README.md

Co-authored-by: Karol Wypchło <kwypchlo@gmail.com>
2020-09-22 12:23:53 +02:00
Karol Wypchło dc73367a66
Merge pull request #371 from NebulousLabs/safe-restart-scripts
add safe stop and restart sia scripts
2020-09-18 17:08:19 +02:00
Karol Wypchło 08782de656 update ssh keys 2020-09-17 15:50:47 +02:00
Karol Wypchlo a02c27664b adjust scripts 2020-09-14 17:40:18 +02:00
Karol Wypchlo abacfc1ed1 improve docs 2020-09-14 15:57:44 +02:00
Karol Wypchlo 4f849ddf53 change comments 2020-09-14 15:53:27 +02:00
Karol Wypchlo 2c12b9d009 add safe stop and restart sia scripts 2020-09-14 15:53:27 +02:00
Karol Wypchlo 6acd3309f8 fix blacklist command 2020-09-14 14:45:06 +02:00
Karol Wypchlo 20362fe7c5 fix health checks 2020-09-10 15:16:31 +02:00
Ivaylo Novakov 8235d75795 Only announce healthy status once a day. 2020-09-08 18:20:56 +02:00
Ivaylo Novakov ddf72ad850 Make the time comparisons in the health checker timezone-aware. 2020-09-08 18:07:33 +02:00
Ivaylo Novakov 2d032dbf17 Docstrings. 2020-09-07 17:59:39 +02:00
Ivaylo Novakov 0838e4f5e5
Add free disk space check to health-checker.py.
Move load-average check to health-checker.py.
2020-09-07 17:56:47 +02:00
Ivaylo Novakov 3f4742a436 Only notify the team if critical checks have failed. 2020-09-04 17:17:26 +02:00
Ivaylo Novakov 5eece67b03 Move parameter parsing to the top of the script. 2020-09-04 17:13:36 +02:00
Ivaylo Novakov 1cc20903c6
Move max discord message len to a constant.
Report critical checks failed.
Formatting.
2020-09-04 17:07:47 +02:00
Ivaylo Novakov a0a9137ae7
Update setup-scripts/health-checker.py
Co-authored-by: Karol Wypchło <kwypchlo@gmail.com>
2020-09-04 16:44:19 +02:00
Ivaylo Novakov 62e27120cd Use localhost. 2020-09-04 16:39:39 +02:00
Ivaylo Novakov 59a77bfaf6 Add a health checker script to Gollum. 2020-09-04 16:12:20 +02:00
Matthew Sevey a911b36f56 Add severe check to log checker 2020-09-03 10:53:52 -04:00
Ivaylo Novakov 25cf5625b4 3 seconds timeout. 2020-08-31 13:39:07 +02:00
Ivaylo Novakov 2fc6d30c01 Use `os._exit` instead of `sys.exit` - we don't want to raise an exception. 2020-08-31 13:38:45 +02:00
Ivaylo Novakov 8761a77e18 Don't call `await client.close()`. 2020-08-31 13:27:49 +02:00
Ivaylo Novakov ff8c4c2d78
Merge pull request #360 from NebulousLabs/ivo/check_system_load
Check system load
2020-08-31 13:25:50 +02:00
Ivaylo Novakov 59bad59cb7 Make it clear that the uptime data comes from the `uptime` command and not something we're putting together ourselves. 2020-08-28 16:05:24 +02:00
Karol Wypchlo df7decb06c drop Marcin's key 2020-08-28 14:56:01 +02:00
Ivaylo Novakov 58409b8cee Switch from the "1 minute" to "15 minutes" average. 2020-08-28 11:59:27 +02:00
Ivaylo Novakov efb35054f7 When checking logs, check also the system load and report it if it's above 10. 2020-08-28 11:54:47 +02:00
Ivaylo Novakov 2126115df4 Send the last 1MB of error log as a file. 2020-08-26 11:32:58 +03:00