Skip to content
  • Stuff
  • Travel
  • Beverages
  • Support Antipaucity
  • Projects
  • About

antipaucity

fighting the lack of good ideas

i know why search is broken

Posted on 14 June 200715 June 2007 By antipaucity 4 Comments on i know why search is broken

Search is broken. Google, Yahoo, Ask, Alta-Vista, and on, and on the list goes.

Hundreds of companies, thousands of individuals. I know why search is broken, and I know what needs to be fixed. Now to figure out the how of fixing.

When you’re looking for information, you search on keywords. Google’s been nice enough to rank results by ‘popularity’ (yeah, it’s called PageRank, and it’s proprietary, but it’s a popularity/relevance ranking). The problem is that you have to know what keywords were used. Some places are nice enough to suggest spelling fixes (it’s not ‘brittany spears’, it’s ‘britney spears’).

But that’s not the issue. The issue is that you don’t know what word, term, or phrase to look for. You have the concept you need to find, like ‘module’. Except you don’t think of that word, you think of ‘chunk’. Bam! You’re out of luck: no author would use the word ‘chunk’ when they mean ‘module’, right?

To fix search, we need to search on not just the keyword, but the concept. In English, you’d use a thesaurus.

So, you’re thinking: “This is easy! I’ll just build a comparator that looks at the keyword and then goes through an index of a thesaurus and finds stuff. And we’ll all be rich!”

Hold it, buster. You missed something. This is a perfectly valid English sentence, and you can figure out what I’m saying, too: “Bring me the cooler cooler cooler from the cooler’s cooler.” Cooler is used five times, with the following meanings (at least): hip, less warm, box to keep things cool, jail cell, big refrigerator.

That’s the problem with trying to fix search. Words can mean far too many things in English. But here’s your big chance to figure out a solution: I’ve told you the problem, and I’ve given you the target.

Now go make it work.

commentary, ideas, insights, technical

Post navigation

Previous Post: it’s sad to see a 17-year-old go so violently
Next Post: the inanity of ‘special’ lanes

More Related Articles

new connexions collection available education
end6 must die commentary
splunk: match a field’s value in another field insights
automatically returning a host to the unprovisioned server pool in hpsa technical
rethinking pi-hole (again) technical
firsts – programming code
June 2007
S M T W T F S
 12
3456789
10111213141516
17181920212223
24252627282930
« May   Oct »
RSS Error: WP HTTP Error: cURL error 28: Connection timed out after 10002 milliseconds

Books

  • Debugging and Supporting Software Systems
  • Storage Series

External

  • Backblaze
  • Cirkul
  • Digital Ocean
  • Fundrise
  • Great Big Purple Sign
  • Password Generator
  • PayPal
  • Tech News Channel on Telegram
  • Vultr
  • Wish List

Other Blogs

  • Abiding in Hesed
  • Chris Agocs
  • Eric Hydrick
  • Jay Loden
  • Paragraph
  • skh:tec
  • Tech News Channel on Telegram
  • Veritas Equitas

Profiles

  • LinkedIn
  • Server Fault
  • Stack Overflow
  • Super User
  • Telegram
  • Twitter

Resume

  • LinkedIn
  • Resume (PDF)

Services

  • Datente
  • IP check
  • Password Generator
  • Tech News Channel on Telegram

Support

  • Backblaze
  • Built Bar
  • Cirkul
  • Digital Ocean
  • Donations
  • Fundrise
  • PayPal
  • Robinhood
  • Vultr
  • Wish List

35-questions 48laws adoption automation blog blogging books business career centos cloud community documentation email encryption facebook google history how-to hpsa ifttt linux money networking politics prediction proxy review scifi security social social-media splunk ssl startup storage sun-tzu tutorial twitter virtualization vmware wordpress work writing zombie

Copyright © 2025 antipaucity.

Powered by PressBook Green WordPress theme