Computerworld
Metcalfe's column : Finding the best deals on the 'Net
Bob Metcalfe  09 April, 1998 12:01

This week I've got a great brain-teasing puzzle for you. It's famous on the Internet as the Monty Hall Paradox. Anyway, I get to be Monty Hall and offer you a choice of three doors, behind which are two goats and a car, respectively.

This brain-teaser was passed to me by Dr. Michael Lynch, whose considerable expertise is in adaptive pattern recognition. Lynch is an expert in the application of Bayesian methods, named for the 18th-century reverend Thomas Bayes, who developed them to assess the probabilities of zero or more gods.

Lynch is founder of Neurodynamics and its 1996 spin-off, Autonomy, both out of Cambridge, England. Lynch drove into London from his farm, where he keeps rare otter hounds. We met over dinner at the Ritz to talk about Autonomy, which is now based in the San Francisco's multimedia gulch.

Lynch says Autonomy, with $US15 million in venture capital and early profitability, will be the "Oracle" of unstructured information. Somebody please warn Larry Ellison.

Autonomy offers software to help us sort through all that wonderful content now clogging the Internet. See http://www.agentware.com.

So, imagine I'm a game show host -- not so hard. And imagine you're a game show contestant. I show you three doors behind which are two goats and a car. We assume you want to win the car -- my irresistible all-wheel-drive Volvo station wagon.

Now, I ask you to choose -- but not open -- one of the doors, and you do. Then, aiming to improve your chances of winning, I open one of the doors you didn't choose and show you a goat. Then, I ask if you want to take what's behind the door you've chosen, or switch to take what's behind the other unopened door?

Do you think you should open the door you've chosen, switch to the other unopened door, or it doesn't matter. Pause here to think about this puzzle.

Here's what the puzzle is trying to illustrate. Autonomy's Bayesian-based software reads documents looking for word probabilities. With these it is able to cluster documents on similar subjects. It is able to hyperlink and summarise them. It is able to reject goats.

While you're reading an article, Agentware is able to suggest related reading. All automatically. No human operators tagging contents. And with impressive accuracy.

Agentware does not work just by counting words. It looks for unlikely groupings to deduce what documents are really about. For example, if a sentence read, "down a street comes a car," there's not much going on there. But if instead it read, "down a street comes a goat," that's more likely what the document is about.

Here's another example: Cats have tails. Cats have maybe nine lives. But, a document containing "cat o' nine tails" is probably not about cats, but pirates. This is something Agentware can deduce. Using statistics and conditional probability methods developed by, and since, Reverend Bayes, Agentware captures concepts rather than just raw word frequencies.

OK, ready for my solution to our game show puzzle?

Your chances of winning the car are much higher if, after I show you the goat, you switch to the other unopened door. You have a two-thirds chance of winning if you switch vs. one-third chance if you don't.

Many people think it doesn't matter if you switch. They think there's no difference between the door you chose and the other unopened door, even after seeing that goat. They are wrong.

Hint: Seeing the goat tells you nothing new about the door you've already chosen but quite a lot about the unopened door you did not choose.

Look at the puzzle in a Bayesian way. If you originally chose the car, with probability one third, would it be good to switch? Obviously not. But, if you originally chose a goat, with probability two-thirds, then the unopened door would have the car, so switching wins. Therefore, because it is twice as likely that you first chose a goat, it is twice as likely that you'll win by switching.

Do you agree? Let me know. And please think before firing off a nastygram -- thinking is the point.

I have written about collaborative filtering and channel push technologies for personalising infoglut. Lynch is annoyed at all the fruitless attention these goats have gotten. He's asking us to switch to a powerful statistical approach.

Technology pundit Bob Metcalfe invented Ethernet in 1973 and founded 3Com in 1979, and today he specialises in the Internet. Send e-mail to metcalfe@infoworld.com.

More about Oracle, Metcalfe, 3Com, Volvo

Comments

Post new comment

Login or register to link comments to your user profile, or you may also post a comment without being logged in.
The content of this field is kept private and will not be shown publicly.
Enter the fully qualified URL, eg. http://www.example.com/
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.

More information about formatting options

Zones
Zone logoZones provide focussed content from Computerworld and leading technology partners.
Newsletter Subscription
Newsletter Subscription
Sign up for our Computerworld newsletters!
Syndicate content
 

Computerworld Webinar

Thursday, June 11th, 2009
10:30am EST (Sydney, Australia)
Screening at your PC

Computerworld is hosting a 30 minute live webinar to help you to learn how unified communications can save you money, foster innovation and business agility by making it easier for people to find, reach and collaborate with one another.

Register Now

Computerworld Community Comments
Whitepaper

LANPlanner | Ensuring High Performance WLAN Networks

Learn how the Motorola LANPlanner facilitates prompt and precise planning and the design and measurement of robust 802.11a/b/g/n networks. Download this paper now to discover how to take wireless network performance to the next level.

Enterprise IT Buyer's Guide
Find Technology Vendors Fast
 
Find vendors by name | Find by category
Sponsored Links
 
Send Us E-mail | Privacy Policy
Features List | Media Kit | Advertising | Contact Us

Copyright 2009 IDG Communications. ABN 14 001 592 650. All rights reserved.
Reproduction in whole or in part in any form or medium without express written permission of IDG Communications is prohibited.