I use a service to monitor the sites on my VPS server for up/downtime and lately have received notifications several times a day that all of the sites feeding off the server are down. A few minutes later (usually 5-20min) I get a new notification saying they are back up. This has been increasing enough that I have been trying to catch it in the act. When I have been able to it appears that users are just getting a basic timeout error when visiting any of the sites. Yet at the same time the sites are down I can log into WHM and see that it is very responsive and fast like normal. The load averages on the server are usually at 2 or less. All of the services appear to be up and running.
What should I do to troubleshoot this? If it was a once in a while thing I would just chalk it up to an automatic update. This has been happening a lot more often lately though. What logs should I be looking at and what entries should I be looking for in those logs? Are there other things in WHM I should be checking?
What should I do to troubleshoot this? If it was a once in a while thing I would just chalk it up to an automatic update. This has been happening a lot more often lately though. What logs should I be looking at and what entries should I be looking for in those logs? Are there other things in WHM I should be checking?