« Why I'm a Mac user | Main | Mac OS X Security Advantages »

August 09, 2005

Amazon's "Statistically Improbable Phrases"

I think this new feature of Amazon's is a nice idea. Actually I'm not sure how new it is, but I just noticed it this morning due to a list of "Statistically Improbable Phrases" appearing in the information about a book I was considering buying. From their site:

Amazon.com's Statistically Improbable Phrases, or "SIPs", are the most distinctive phrases in the text of books in the Search Inside!™ program. To identify SIPs, our computers scan the text of all books in the Search Inside! program. If they find a phrase that occurs a large number of times in a particular book relative to all Search Inside! books, that phrase is a SIP in that book.

SIPs are not necessarily improbable within a particular book, but they are improbable relative to all books in Search Inside!. For example, most SIPs for a book on taxes are tax related. But because we display SIPs in order of their improbability score, the first SIPs will be on tax topics that this book mentions more often than other tax books. For works of fiction, SIPs tend to be distinctive word combinations that often hint at important plot elements.

Click on a SIP to view a list of books in which the phrase occurs. You can also view a list of references to the phrase in each book. Learn more about the phrase by clicking on the A9.com search link.

August 9, 2005 | Permalink

Comments

Post a comment