As the title says. When deploying Rasa, using either the one-liner or the full deployment, the nginx pod goes into CrashLoopBackOff after a while. This is the log I was able to get out of it:
16:25:41.03
16:25:41.03 Welcome to the Bitnami nginx container
16:25:41.03 Subscribe to project updates by watching https://github.com/bitnami/bitnami-docker-nginx
16:25:41.03 Submit issues and feature requests at https://github.com/bitnami/bitnami-docker-nginx/issues
16:25:41.03
SSL encryption is not used since no certificates were provided.
>> exec docker CMD
/opt/bitnami/scripts/nginx/run.sh
16:25:41.04 INFO ==> ** Starting NGINX **
2020/05/24 16:25:41 [emerg] 1#0: host not found in upstream "rasa-rasa-x-rasa-production:5005" in /opt/bitnami/nginx/conf/conf.d/rasax.nginx:2
nginx: [emerg] host not found in upstream "rasa-rasa-x-rasa-production:5005" in /opt/bitnami/nginx/conf/conf.d/rasax.nginx:2
This is the output from kubectl describe:
Name: rasa-nginx-5b787758ff-n4xfw
Namespace: rasa
Priority: 0
Node: worker/10.50.13.41
Start Time: Sun, 24 May 2020 09:55:10 -0600
Labels: app.kubernetes.io/component=nginx
app.kubernetes.io/instance=rasa
app.kubernetes.io/name=rasa-x
pod-template-hash=5b787758ff
Annotations: <none>
Status: Running
IP: 10.50.1.6
IPs:
IP: 10.50.1.6
Controlled By: ReplicaSet/rasa-nginx-5b787758ff
Containers:
rasa-x:
Container ID: docker://e41fa4d261703b375eaaead33ae4af53d74498d0e072f5ef8852426c13b8324a
Image: rasa/nginx:0.28.3
Image ID: docker-pullable://rasa/nginx@sha256:b47956b86c7c5e19b7812b667b823e2c6fe3e3bd1932ec42cb2d329507d59c5c
Port: 8080/TCP
Host Port: 0/TCP
State: Waiting
Reason: CrashLoopBackOff
Last State: Terminated
Reason: Error
Exit Code: 1
Started: Sun, 24 May 2020 10:31:12 -0600
Finished: Sun, 24 May 2020 10:31:32 -0600
Ready: False
Restart Count: 10
Liveness: exec [curl localhost:8080/nginx_status] delay=10s timeout=1s period=10s #success=1 #failure=10
Readiness: exec [curl localhost:8080/nginx_status] delay=10s timeout=1s period=10s #success=1 #failure=3
Environment:
RASA_X_HOST: rasa-rasa-x-rasa-x:5002
RASA_PRODUCTION_HOST: rasa-rasa-x-rasa-production:5005
CUSTOM_ACTION_HOST: rasa-rasa-x-app:5055
Mounts:
/opt/bitnami/nginx/conf/bitnami/terms from agreement (rw)
/var/run/secrets/kubernetes.io/serviceaccount from default-token-9wc7j (ro)
Conditions:
Type Status
Initialized True
Ready False
ContainersReady False
PodScheduled True
Volumes:
agreement:
Type: ConfigMap (a volume populated by a ConfigMap)
Name: rasa-agreement
Optional: false
default-token-9wc7j:
Type: Secret (a volume populated by a Secret)
SecretName: default-token-9wc7j
Optional: false
QoS Class: BestEffort
Node-Selectors: <none>
Tolerations: node.kubernetes.io/not-ready:NoExecute for 300s
node.kubernetes.io/unreachable:NoExecute for 300s
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 38m default-scheduler Successfully assigned rasa/rasa-nginx-5b787758ff-n4xfw to worker
Normal Created 26m (x3 over 27m) kubelet, worker Created container rasa-x
Normal Started 26m (x3 over 27m) kubelet, worker Started container rasa-x
Warning Unhealthy 25m (x3 over 26m) kubelet, worker Readiness probe failed: % Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0curl: (7) Failed to connect to localhost port 8080: Connection refused
Normal Pulling 25m (x4 over 38m) kubelet, worker Pulling image "rasa/nginx:0.28.3"
Normal Pulled 25m (x4 over 27m) kubelet, worker Successfully pulled image "rasa/nginx:0.28.3"
Warning BackOff 7m5s (x83 over 26m) kubelet, worker Back-off restarting failed container
Warning Unhealthy 118s (x12 over 26m) kubelet, worker Liveness probe failed: % Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0curl: (7) Failed to connect to localhost port 8080: Connection refused
After running netstat, I found that nothing is listening on port 8080. However, the cluster itself is running fine, as I can see all of the other nodes in it.
The firewall is disabled, by the way.
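Since the [emerg] line in the nginx log complains about a host name rather than about the port, one thing that might help narrow this down is checking whether each upstream from the Environment section resolves from inside the cluster. Below is a minimal Python sketch of that check. It assumes it is run from a pod in the rasa namespace that has Python available, and the helper names `split_host_port` and `check_upstreams` are made up for illustration; they are not part of Rasa or nginx.

```python
import socket

# Upstream addresses copied from the Environment section of the
# kubectl describe output above.
UPSTREAMS = {
    "RASA_X_HOST": "rasa-rasa-x-rasa-x:5002",
    "RASA_PRODUCTION_HOST": "rasa-rasa-x-rasa-production:5005",
    "CUSTOM_ACTION_HOST": "rasa-rasa-x-app:5055",
}


def split_host_port(address):
    """Split a 'host:port' string into a (host, port) tuple."""
    host, _, port = address.rpartition(":")
    return host, int(port)


def check_upstreams(upstreams, resolve=socket.getaddrinfo):
    """Map each variable name to True if its host name resolves, else False.

    The resolve argument is only there so the lookup can be swapped out
    (hypothetical hook, handy for trying the function without a cluster).
    """
    results = {}
    for name, address in upstreams.items():
        host, port = split_host_port(address)
        try:
            resolve(host, port)
            results[name] = True
        except socket.gaierror:
            # Name lookup failed, which is the same condition nginx
            # reports as "host not found in upstream".
            results[name] = False
    return results
```

Calling `check_upstreams(UPSTREAMS)` returns a dict with one entry per variable. If `RASA_PRODUCTION_HOST` comes back as `False` there too, the problem would seem to be with the rasa-production service name not resolving rather than with nginx itself.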