Publishers of technology books, eBooks, and videos for creative people

Home > Articles > Design > Voices That Matter

This chapter is from the book 

Dealing with Stopwords

You may come across the term stopwords at search engine sites, often in the help information or search tips. Stopwords are words that search engines ignore because they are too common, or because they are reserved for some special purpose.

The list varies from one search engine to the next, but it typically includes words like a, an, any, the, to, with, from, for, of, that, who, and the Boolean operators AND, OR, NOT, and NEAR. We're not aware of any major search engine that publishes its complete list. But Google does the next best thingβ€” it tells you if it has ignored one or more of the words in your query ( Figure 3.11 ).

03fig11.gif

Figure 3.11 The Google search engine tells you exactly what words have been ignored from your query: in this case, the, a, and by.

Should you need to use a stopword as part of a search, you can sometimes signal the search engine not to ignore it by setting it off in double quotation marks: Portland NEAR "OR".

Some (though not all) search engines also pay attention to stopwords that are included as part of a phrase: "The Man Who Came to Dinner" or "to be or not to be" ( Figure 3.12 ). Most of the search engines do a pretty good job of recognizing and acting on stopwords that are included in phrases.

03fig12.jpg

Figure 3.12 A Lycos search for "to be or not to be" finds Hamlet's soliloquy, even though the phrase is composed entirely of common stopwords. The secret is to enclose the words in quotation marks.

Peachpit Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from Peachpit and its family of brands. I can unsubscribe at any time.