Kubernetes is perhaps the perfect example of something that spreads state everywhere and leaves it up to the people running it to fix it when it all goes to hell.
I just spent an hour trying to figure out why a pod was moved to a new node. I think it was probably routine, but I can’t know, because there is no way to see all the events that have happened, all I can do is trawl through logs.