Comment on Why do many search engines seem to ignore operators (e.g. exact phrases, term exclusions, OR, etc.)? Is there a good reason for having a dumb 1997-level search logic that I'm not seeing?

thirteene@lemmy.world 4 months ago

It’s because websites interpret those characters differently, since code is written with the same symbols you type on a physical QWERTY keyboard. Essentially “>” gets used as a comparison operator in programming languages (and as part of tags in HTML), which means that it’s a tool that instructs the computer how to do things. When we need to display the symbol itself, we write “&gt;” as an “escaped character”, which basically means treat it as the symbol, not as an instruction. Often search engines will use a very powerful tool called a regular expression, which looks like this for phone numbers: ^(\d{3})\s\d{3}-\d{4}

Each character represents something: ^ means “starts with”, \d means a digit, and {3} means three of whatever is in front of it. Breaking apart the search parameters is pretty complex and it needs to happen FAST, so at a certain point the developers just throw away anything that can be a security concern, like the special characters &^|`"'*, especially since those can be used to maliciously attack the search engine.
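To make that concrete, here is a rough Python sketch of both ideas: the phone-number pattern from above, and a sanitizer that simply drops the risky characters. The names PHONE, RISKY and strip_risky are made up for illustration, and this is only a guess at the general approach, not how any particular search engine actually does it.

```python
import re

# The phone-number pattern quoted above, unchanged.
# ^      : the match must start at the beginning of the string
# (\d{3}): capture exactly three digits
# \s     : one whitespace character
# \d{3}-\d{4}: three digits, a hyphen, then four digits
PHONE = re.compile(r"^(\d{3})\s\d{3}-\d{4}")

# The characters the comment lists as risky (illustrative set, an assumption).
RISKY = "&^|`\"'*"


def strip_risky(query: str) -> str:
    """Delete every risky character, then collapse leftover whitespace."""
    cleaned = query.translate(str.maketrans("", "", RISKY))
    return " ".join(cleaned.split())


print(bool(PHONE.match("555 123-4567")))           # True
print(strip_risky('"exact phrase" cats | dogs'))   # exact phrase cats dogs
```

Notice what happens to the second query: the quotes and the | are gone, so the exact-phrase and OR operators silently turn into plain keywords. That is the "ignored operators" effect the question is asking about.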

For other characters: www.w3schools.com/html/html_entities.asp
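If you want to see the escaping in action, Python's standard html module does it for you. This is just a small sketch with a made-up input string (raw is my own example, not from any real site):

```python
import html

# A string containing characters that mean something special in HTML.
raw = "a > b && c < d"

# html.escape swaps each special character for its entity,
# so a browser displays the symbol instead of interpreting it.
escaped = html.escape(raw)
print(escaped)  # a &gt; b &amp;&amp; c &lt; d

# html.unescape reverses the process and recovers the original text.
print(html.unescape(escaped) == raw)  # True
```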
