Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The ML layer is probably getting in the way of the end user getting to the smaller samples.

Used to be you'd get the best matches from the meta data on a page.

Now there's linear algebra both trying to determine what the meta data means and what the question means, so it's going to have grouping biases.

And do things like exclude seemingly random strings of numbers, because in the training data, that's usually trash, but for you, it's a part or serial number that you're looking for



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: