Troubleshooting Guide
Quick solutions to the common and not-so-common problems you are likely to encounter as an admin while using RudderStack
Last updated
Was this helpful?
Quick solutions to the common and not-so-common problems you are likely to encounter as an admin while using RudderStack
Last updated
Was this helpful?
This section contains solutions to some of the commonly faced issues you are likely to encounter as an Admin.
Check if the server is running in normal mode in the file /data/rudderstack/recovery_data.json
or /tmp/recovery_data.json
.
If the server is in "degraded" or "maintenance" mode, RudderStack just stores the events and will not process them.
Check if you have provided the right backend token
Check if the Control Plane is up ()
Check your internal firewall rules and edit if needed. You need access to outbound HTTP.
If the control plane is accessible from your network but is down, we are already working to fix it. Do leave us a note.
Check if those destination are enabled in Control Plane
Verify that the config parameters such as API key, tracking ID, etc. are correct
There is a possibility that a destination service (Google Analytics, S3, etc.) is down.
Check the number of pending gateway tables (tables that start with gw_
), router tables (tables that start with rt_
), and batch router tables (tables that start with batch_rt_
).
If the number for any of the above possibilities is high (> 5), then we have incoming requests at a higher rate than what we can process. Consider adding another RudderStack node if possible.
If you have access to RudderStack Enterprise edition, check out the Grafana dashboards.
When RudderStack enters "degraded" mode, it will only log the event and not process the event. If the issue why the server entered the degraded mode is temporary (Transformer is down), then fix the issue and restart the server in the normal mode.
You can restart the server in the normal mode by updating the /data/rudderstack/recovery_data.json
or /tmp/recovery_data.json
. Set Mode to "normal"
When RudderStack enters "maintenance" mode, we take a back up of the old database and create a new database in the "degraded" mode. RudderStack will only log the event and not process the event in this case. If the issue is fixed, start another instance of RudderStack server in normal mode but in a different port (say 8081) pointing to the old DB. That will drain all the events in the old DB.
Then restart the actual server in the normal mode by updating the /data/rudderstack/recovery_data.json
or /tmp/recovery_data.json
. Set the mode to "normal". It will resume routing pending events and the ordering of the events is guaranteed.
Check if your system is in the degraded or maintenance mode. This could result in only logging the events and not processing them. If needed, increase the storage capacity of your machine till there are no issues in the disk usage.
If there are tables that start with ```predrop``` but if you don't see them being removed, verify the access credentials to your object storage like S3.
If you have multiple instances of Data Plane, each table dump will be inside a specific folder named after the INSTANCE_NAME
.
If you have access to RudderStack Enterprise, you already have a visualization of the RudderStack server metrics at your disposal for tracking the health of your server.
Ensure that the number ofjobsDB
tables is not increasing.
Verify that the server mode is normal.
Enable debug logging by setting the following variable in your .env
file as shown:
We recommend the following configuration for the production deployments. On a Linux machine, add the following lines to /etc/sysctl.conf
:
If your system is hitting TCP limits and returning HTTP errors, the above configuration will help.
Ideally, this should not happen. Restarting the service is recommended in such a scenario.
If you have sessions enabled, RudderStack caches the session information. Please configure sessionThreshold
and sessionEvents
in .
For any other issues that you might encounter, please feel free to .
For any other issues or bugs that you might encounter, please feel free to . You can also start a conversation in our community, and we will be happy to help!