The results for half of my searches are already all spam. It would be better to just accept that and have google let me engineer my queries so that I avoid spam. More token based searches, less word vector/machine learning based search. Let me query their index like its an SQL database
Are you sure they are spam? Google's engineers consider annoying things like recipe pages with 20 paragraphs of stories before the actual recipe not to be spam.