
I hadn't even thought about that. But it should be pretty easy to do in post-processing. I just have to take a list of "porn" keywords. If one of them occurs in a search result but none of them occurs in the query, then that result gets downranked.
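A minimal sketch of that post-processing step, not the actual DeuSu code. The keyword set, the penalty factor, and the `(score, text)` result shape are all assumptions made up for illustration:

```python
import re

# Hypothetical keyword list and penalty factor; neither comes from the thread.
ADULT_KEYWORDS = {"porn", "xxx", "nude"}
PENALTY = 0.1


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def downrank(query, results):
    """Re-sort (score, text) pairs, best first.

    Results containing an adult keyword are penalised, but only when
    the query itself contains none of the keywords.
    """
    if tokenize(query) & ADULT_KEYWORDS:
        # The user asked for it explicitly, so leave the scores alone.
        return sorted(results, key=lambda r: r[0], reverse=True)
    rescored = []
    for score, text in results:
        if tokenize(text) & ADULT_KEYWORDS:
            score *= PENALTY
        rescored.append((score, text))
    return sorted(rescored, key=lambda r: r[0], reverse=True)
```

Multiplying the score instead of dropping the result keeps the page reachable, it just sinks below the clean hits.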


If you want, you can use the list of adult words from the now-defunct web search engine boitho.com. It is available at https://github.com/searchdaimon/adult-words .

We mostly filtered out porn by using a two-word phrase method. There were a lot of edge cases because many potentially dirty concepts are made up of words that are not bad when used alone. For example, a text can have both "girls" and "nude" in it without being vulgar, but if it has the phrase "nude girls", the chance of it being pornographic is much higher.
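A rough sketch of the two-word phrase idea, assuming the phrases are stored as pairs of adjacent words. This is not the boitho.com implementation, and the one-entry phrase set is a placeholder, not the contents of the linked list:

```python
import re

# Hypothetical phrase list for illustration only.
ADULT_PHRASES = {("nude", "girls")}


def words(text):
    """Lowercase the text and return its alphanumeric words in order."""
    return re.findall(r"[a-z0-9]+", text.lower())


def has_adult_phrase(text):
    """Return True if any pair of adjacent words is a listed phrase."""
    tokens = words(text)
    return any(pair in ADULT_PHRASES for pair in zip(tokens, tokens[1:]))
```

With this, "Nude girls gallery" matches, while a text that merely contains both words somewhere ("The girls visited an exhibition of nude paintings") does not.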


Yes, I guess filtering them out would at least make the website SFW, and it would make it easier to show it to people. The issue seems to happen mainly with common words (whose results also appear to be polluted with heavily SEO-ed websites).

I've also searched for less generic things like "xperia z5" and the results looked good.


I have the filter implemented now. It's not perfect yet, but it already filters out a lot of the NSFW stuff, unless you explicitly search for it.

I'm gonna further improve this over the next few days. Right now it's just a quick-and-dirty hack. :)


It would be an interesting data point to see how many of the 2b pages indexed are adult. lol.


I don't know. But I do know that the end-of-year statistics from search engines about what people searched for are complete BS. I have such a list for the German DeuSu page:

https://deusu.de/blog/2015-12-03-alle_jahre_wieder_wonach_de...

Warning! This is definitely NSFW! :)


LOL, it seems that a big part of your users are searching for adult-related things.

So if you filter out all the adult stuff, you might make those users unhappy. Perhaps make it configurable?

E.g. add a checkbox for NSFW results, or something better.
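One way such a checkbox could be wired up, as a sketch only. The `nsfw` query-string parameter and the `(text, is_adult)` result shape are invented here and are not anything DeuSu is known to use:

```python
from urllib.parse import parse_qs


def wants_nsfw(query_string):
    """Return True only if the hypothetical nsfw parameter is set to "1"."""
    return parse_qs(query_string).get("nsfw", ["0"])[0] == "1"


def visible_results(results, query_string):
    """Take (text, is_adult) pairs and drop adult ones unless the user opted in."""
    if wants_nsfw(query_string):
        return [text for text, _ in results]
    return [text for text, is_adult in results if not is_adult]
```

Defaulting to filtered keeps the site SFW for first-time visitors, while anyone who ticks the box still gets everything.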



