let's talk about labels, I believe any nextgen ingestion should use the shared vocabulary to take in blocks by label https://about.iftas.org/library/shared-vocabulary-labels/
Maybe you want to block CSAM, but don't care about copyright infringement... (This way we could run one list instead of five and you choose the harms you want to block, as everything is labelled)