The concept of a spam filter is one from a more innocent age where even if spam was a majority of documents, it could still be identified and dropped.
I'm not sure it's possible to really identify spam anymore. Even previously well-trusted news publishers are playing games with thinly veiled advertorials / scientific journals are full of generative spam etc.
That problem is just going to get worse.