-
Notifications
You must be signed in to change notification settings - Fork 3
Admin Guide: FAQ
- Identify and save logs for failed job in the Nomad UI. Service health check responses are available in the Consul UI.
- Check issues labelled
bug
on the public board. If the bug is not tracked please add it. - Ask on Slack!
Problem: Users have to click the profile icon in the top right to be authenticated, instead of it happening automatically on entering the wiki.
Fix: Remove all "Permissions" and all "Page Rules" from the "Guest" group. The "Guest" group should NOT ever have access to anything; this causes Wiki.js to auto-login the user.
Problem: Traefik doesn't update the stored certificates if the configuration changes.
Fix: Administrator will have to wipe the certificates after a change in any of these:
To wipe the certificates one should delete the Consul KV entries /traefik
and /liquid/traefik/acme
from the Consul UI.
To follow Traefik's progress in getting the HTTPS certificates from LetsEncrypt use the Nomad UI to follow its console output.
Problem: Invocation of ./liquid deploy
fails without starting a good number of Docker containers.
Fix:
- View Nomad errors in the Nomad UI. Nomad errors will be available in the Nomad UI after navigating to the job.
- Check Nomad logs with
docker exec cluster ./cluster.py tail nomad
- report any errors found there. - Try running a clean reset.
- Configure docker daemon GOMAXPROCS if you are running on many cores. For machines with >32 cores, we recommend configuring docker with
GOMAXPROCS
of 8-12. - If running on recent RHEL or similar linux distributon, try to Disable SELinux.
Problem: When processing many collections in parallel, one might run out of RabbitMQ memory. To check if this has happened:
- Proxy port
10.66.60.1:9990
from server onto local machinelocalhost:9990
using SSH LocalForward configuration. - Visit
http://localhost:9990/_snoop_rabbit/
, login with usernameguest
and passwordguest
. - Look at the "Memory" cell on the "Overview" screen. If it's red, then you're out of memory.
Fix: If you're out of RabbitMQ memory, do one of these:
- increase
rabbitmq_memory_limit
to 4+ GB, or - process less collections at the same time (set
process = off
on some of them, turn on again later)
Problem: When the elasticsearch disk exceeds some 90% limit, elasticsearch will lock itself up.
Fix: Be sure to free up some disk space first, then run:
export ES_ADDR=10.66.60.1:9990/_es
curl -XPUT "$ES_ADDR/_cluster/settings" -H 'Content-Type: application/json' -d '{"persistent":{"cluster.blocks.read_only":false}}'
curl -XPUT "$ES_ADDR/_all/_settings" -H 'Content-Type: application/json' -d'{ "index.blocks.read_only_allow_delete" : null } }'
... where 10.66.60.1 is the network address configured in cluster.ini and liquid.ini.
Problem: Docker Hub has been decreasing their free anonymous download limit. When deploying, you might reach this limit on your host IP.
Fix: See the Docker article on download-rate-limit.
Temporary work-around:
- create a free Docker Hub account for each instance
- do a
docker login
with new credentials on the machine running./liquid deploy
.
Report incomplete documentation by opening a new Issue in this repository.