Besides fine-tunes for classification, I recently posted about embedding classifiers over here.
Embeddings are bombproof and will accept any non-empty text, garbage or not. Plus data can be added and subtracted on the fly.
Something else to look into, and maybe a better fit for the Spam/Ham detector. You could always run both and do some weighted average between them too.