Add to Technorati Favorites

The Mahalo-Wikipedia-Google love triangle

Lots of people seem to like dumping on Mahalo and Jason Calacanis. For example, Andrew Baron recently posted about Why Mahalo is Fundamentally Flawed.

Try Googling Mahalo sucks and you’ll get about 232,000 hits. Take your pick of the highly critical coverage.

Some of the negative commentary on Mahalo is probably due to professional and personal jealousy. Some of it is due to the fact that it’s early days yet. And I think some of it may be due to Jason happily telling people to look left while he goes right.

How can Jason raise money for Mahalo at valuations north of $100M? Surely there must be a revenue plan that holds water? If you want to argue that Mahalo is a failure and that Jason is simply a ceaseless self-marketer full of hot air, you’ll need to argue that some of the same things are true of Mahalo’s investors. Or maybe we’re in a bubble and they’ve all simply lost it.

Here’s what I think is going on.

Firstly, I think Jason is using a little smoke and mirrors when he calls Mahalo a search engine and frequently compares Mahalo’s “search” results to Google’s. With few exceptions, everyone seems to be buying it! With few exceptions, people compare Mahalo with Google – presumably because Jason tells them to and because he talks about being a search engine. And, with few exceptions, the technorati tell us that Mahalo is a pretty crappy search engine.

I agree, because Mahalo is not a search engine. Putting a box labeled “Search” on your web site to dig hits out of your own content does not make you a search engine – if it did, millions of sites would qualify. Passing queries off to Google and showing the results does not make you a search engine, either. Telling people to compare your content with Google’s results does not make you a search engine. Nor does putting the words “search engine” in your company’s strapline.

Mahalo will never be a search engine, and almost certainly does not want to be a search engine. That would be suicidal.

I believe their strategy is entirely different and that the relevant comparison is not with Google, but with Wikipedia.

Mahalo is a rapidly growing collection of carefully curated content. Mahalo is Wikipedia with a different model of control, ownership, and content creation. It’s a benevolent dictator with a purchase agreement instead of a loose anarchy with the GNU Free Documentation License.

If you want to compare Mahalo to something, compare it to Wikipedia. Jason is a huge fan of Wikipedia. And here he is begging Jimbo Wales not to leave $100M/yr on the table. Interesting.

Right now Mahalo has roughly 25K pages. Google has information on, let’s say, 10 billion pages. By this simplistic measure, Google is about 400,000 times bigger than Mahalo. You’re not going to catch or compete with Google using people to make content. Yes, you can use Google for things you don’t have static pages for, as Mahalo does. But Mahalo is not a search engine. Never will be.

Now consider Wikipedia. Wikipedia has 1.2M English pages. That means that, in English, Wikipedia is a mere 48 times larger than Mahalo! Now we’re talking. Mahalo are currenly adding something like 1,000 pages a week. Suppose Jason manages to double that quite soon. That would be 100K pages a year, or about 8.3% of Wikipedia annually. So I think it’s conceivable that Mahalo could catch Wikipedia. Even if they keep a steady ship and only gain linearly they could easily be 35-40% the size of Wikipedia in 4 years’ time.

But sheer number of pages is only part of the story. Because the distribution of search requests will follow some kind of power law, you can pick up (say) half of all search requests by only covering a small number of them, and, as always, leave the long tail to Google.

So with a small finite amount of work, you can cover a very large chunk of Wikipedia. And I think that’s exactly what Mahalo are aiming to do.

A few weeks ago I pulled down all of Mahalo’s URIs for another project. Here’s a tiny sample – and I really did pick this out at random:

    http://www.mahalo.com/Valerie_Plame_Affair
    http://www.mahalo.com/Violence_on_Television
    http://www.mahalo.com/Violent_Crime_Rate
    http://www.mahalo.com/Virginia_Tech_Report
    http://www.mahalo.com/Voting_Machine_Controversy
    http://www.mahalo.com/Walter_Reed_Army_Medical_Center
    http://www.mahalo.com/War_Wounded
    http://www.mahalo.com/Washington_D.C._Lobbying_Scandal
    http://www.mahalo.com/Abdullah_Gul
    http://www.mahalo.com/Alan_Garcia
    http://www.mahalo.com/Alex_Salmond
    http://www.mahalo.com/Angela_Merkel

So what? you might ask. Well, let’s replace www.mahalo.com with en.wikipedia.org/wiki in the above. We get:

    http://en.wikipedia.org/wiki/Valerie_Plame_Affair
    http://en.wikipedia.org/wiki/Violence_on_Television
    http://en.wikipedia.org/wiki/Violent_Crime_Rate
    http://en.wikipedia.org/wiki/Virginia_Tech_Report
    http://en.wikipedia.org/wiki/Voting_Machine_Controversy
    http://en.wikipedia.org/wiki/Walter_Reed_Army_Medical_Center
    http://en.wikipedia.org/wiki/War_Wounded
    http://en.wikipedia.org/wiki/Washington_D.C._Lobbying_Scandal
    http://en.wikipedia.org/wiki/Abdullah_Gul
    http://en.wikipedia.org/wiki/Alan_Garcia
    http://en.wikipedia.org/wiki/Alex_Salmond
    http://en.wikipedia.org/wiki/Angela_Merkel

And guess what? All those URIs actually work! See below for a possible reason for this uncanny coincidence.

Research question: what percentage of Mahalo URIs work as Wikipedia URIs with the above simple substitution? I may do this test when I get a little more time. I bet the answer is high.

Ask yourself again: does Mahalo look more like Google or more like Wikipedia?

The idea of Mahalo-as-search-alternative-to-Google is just Jason operating Mahalo in stealth mode in broad daylight. “Hey, Rocky, watch me pull a search engine out of my hat! Oops! That’s not a search engine. I swear there was a search engine in there somewhere.”

How is Mahalo different from Wikipedia?

A big one is that Mahalo owns all its content. If Mahalo puts one of your pages on its site, you’ll first sign a purchase agreement in which the

Seller hereby irrevocably sells, grants, assigns, conveys and transfers to Mahalo, exclusively and forever, Seller’s entire right, title and interest in and to the SeRPs

and in which you warrant that the content is legit, in which you fully indemnify Mahalo, and in which you agree to let them be your agent and attorney should they need to take some action to obtain or protect the content.

In consideration you get $10-$15 which you can have in cash. Or, in a wonderfully ironic and masterful gesture, you can have your earnings donated to the Wikimedia Foundation! That’s just brilliant, I love it. How can you not be in awe of that? The guy’s a genius.

Talking of genius, just look at the language on the payment details page at Mahalo: “A Greenhouse Guide begins their career in the Greenhouse…”. You see? Writing articles for Mahalo is the beginning of a career. George Lakoff would probably count that as a classic example of framing (also see here).

Unlike Wikipedia, Mahalo owns every word of its content. That means they can sell it. That they can be acquired. But who would want to acquire Mahalo? Wait.

What other differences are there between Wikipedia and Mahalo?

Another big one is the millions of links on the internet that point to Wikipedia pages. Those little tubules that make up the internets, with Google’s PageRank worming its way down each and every one, assigning and passing on credit.

There are two things here: 1) the links themselves and 2) the high consequent position Wikipedia’s pages have on Google.

Can Mahalo get large numbers of people to link to their pages? If the pages are any good (and they are), then why not? Plus, it may be that Mahalo can catch Wikipedia in terms of how many people link to them.

According to the Netcraft October 2007 Web Server Survey, the number of servers on the net has been growing at an amazing 5% per month!

That’s just the rate of increase of new servers, not the rate of new pages being put onto existing sites. Let’s assume the Netcraft server number isn’t too far from the overall growth, and that the web roughly doubles in size every two years. That means if the size today is X, then in 4 years, towards the end of Jason’s horizon, it will be size 4X. If so, there are 3X pages yet to come into existence. The creators of these will have a choice to point links at Wikipedia or Mahalo. If popular momentum can be shifted to Mahalo, it can grab a large chunk of the link pie graph.

All of which brings us, inevitably, to Google.

Quick survey question: when you need to find something that you know you’d be happy to read in Wikipedia, do you first go to Wikipedia, find English (or your language), find their search box, enter your query, and click on the link? Or do you go to Google and take its Wikipedia link?

I thought so – you use Google. It’s a uniform way to get to things, it’s likely integrated into your browser, and they generally do a better and faster job of indexing sites’ content than the sites do themselves. So the existence and massive popularity of Wikipedia drives traffic to Google. And Google of course drives traffic to Wikipedia. The two of them are dating. But Wikipedia is not the perfect lover: they stubbornly refuse to put ads on their pages, to share the love. Along comes Jason Calacanis, then at AOL, to whom this is all very clear. He tells Wikipedia in no uncertain terms that with all that traffic they could make $100M per year from ads on just the home page. He points to a conservative estimate of the worth of Wikipedia at $600M, and his own estimate is $5B. Hmmmmm. What’s an entrepreneur to do when he sees someone leaving that much value on the table?

Back to Google. They would like to have more content. Traditionally, when you got back a page of their search results, you wouldn’t see links to pages on Google – that wouldn’t make sense: there were no pages on Google, after all. Google was supposed to point you to other pages. It was an index to help you find the things you actually wanted to look at. That was the old model. These days, Google is buying content (e.g., YouTube) and pointing their search results at their content, neatly taking the ad revenue in both places. All the better if the content comes with indemnification.

You can see where I’m going. Mahalo already does advertising with Google. In fact, they’re already a premium adsense publisher to the surprise of some. If ads on the single front page of Wikipedia could generate $100M annually, what could ads on all Mahalo pages generate if Mahalo grows to rival Wikipedia?

And… who weighs the importance of links (and other unknown factors) in Google’s results page? Yes, of course, Google does. According to this Fast Company article, Mahalo gets 65% of revenue Google makes when it sends its users into Google. And Google makes money when it sends its users into Mahalo.

If there’s really (say) $1B of value to be had by building a successful commercial version of Wikipedia, you can see why Google might have some interest in nudging links to Mahalo a little higher in its results. Maybe even higher than the equivalent page for Wikipedia. Now would be a good moment to remember that I illustrated above just how trivial it can be to match up equivalent Mahalo and Wikipedia pages… Got it? User enters a query, Google does the search and finds a highly-linked Wikipedia page, then in an instant they can make and instead display a link to the equivalent Mahalo page, optionally displaying the Wikipedia page below the fold. Would that qualify as evil?

All of which leads to a very clear answer to my “who would want to acquire Mahalo?” question. Interestingly, Google will want to wait until Mahalo is big (they will know exactly when, supposing Mahalo keeps using adsense). They want Mahalo to be independent and with strong momentum before they turn the corporate intake valve in the Mahalo direction.

Can Jason build a viable alternative to Wikipedia? I bet he can. He has the lessons of Wikipedia. He doesn’t have the anarchy factor. He has no spam. He knows what he’s doing, and he’s in control. It’s a content play, and Jason is a content guy. An editor with a track record of building valuable content in this way. He’s playing to his strengths. The engineering is not nearly as daunting as building a Google. He has the money. As he ramps it up he’s going to have more money.

Who’s going to stop him? Certainly not Google – that’s not in their interest at all. Almost certainly not Wikipedia – unless they start putting up ads and funneling large amounts of money back to Google. And Jason is unlikely to shoot himself in the foot either – quite the reverse.

So if that’s the strategy, and if he’s on track with content (as he seems to be), and if the content is passably good, or better (which it is), and if he has a good understanding with his “friends at Google” (which you can bet he does — let’s not forget the Sequoia factor either), and if the revenue numbers are about right, then a $175M valuation for an upcoming round to accelerate things might look like a steal.

Along the way, Jason gets to have a quiet inner smile at all the people whining about how Mahalo is a crap search engine. He feeds the fire all the while, telling them to go ahead, make his day, and compare Mahalo’s results to Google’s (but not Wikipedia’s). Misdirecting attention towards Google and having people write him off probably suits him just fine. Meanwhile, they’re getting on with the real mission.


You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

28 Responses to “The Mahalo-Wikipedia-Google love triangle”

  1. No comment. ;-)

  2. No comment. ;-)

  3. […] just posted on mahalo. it’s so good it hurts. Ask yourself again: does Mahalo look more like Google or more like […]

  4. Wow. Brilliant analysis. And brilliant “first post” too. ;-)

  5. Wow. Brilliant analysis. And brilliant “first post” too. ;-)

  6. It’s clear that Jason is going after the high-value ads, with hand-crafted pages.

    The mystery to me is how he plans to get the Google-ranking to drive the audience to it.

    Wikipedia has a very weird and complex way of doing that, I don’t think the phenomena can be reproduced readily.

  7. It’s clear that Jason is going after the high-value ads, with hand-crafted pages.

    The mystery to me is how he plans to get the Google-ranking to drive the audience to it.

    Wikipedia has a very weird and complex way of doing that, I don’t think the phenomena can be reproduced readily.

  8. Hmm … what do you mean by “All those URIs actually work!”

    I’m getting “Wikipedia does not have an article with this exact name.” for quite a few

    http://en.wikipedia.org/wiki/Anythinghere will return a valid page.

  9. Hmm … what do you mean by “All those URIs actually work!”

    I’m getting “Wikipedia does not have an article with this exact name.” for quite a few

    http://en.wikipedia.org/wiki/Anythinghere will return a valid page.

  10. Hi Seth

    Oops, yes, you’re right. I should have looked more closely. About half of them return an equivalent page. Half return the generic empty wiki page, as you say.

    Terry

  11. Hi Seth

    Oops, yes, you’re right. I should have looked more closely. About half of them return an equivalent page. Half return the generic empty wiki page, as you say.

    Terry

  12. Wow, excellent analysis!! The best I’ve seen so far on the Mahalo-Google affair.

  13. Wow, excellent analysis!! The best I’ve seen so far on the Mahalo-Google affair.

  14. There is a huge flaw in your argument, as you are comparing Wikipedia’s content (which is actual articles, things people can read) with Mahalo’s, which is 99% links.

    There may be paralels in some ways with Wikipedia, but the actual end-user product is very different. The actual output is far more similar to Google– a list of links.

    I do think there’s value in lists of links, but not nearly the type of value which is being discussed here.

  15. There is a huge flaw in your argument, as you are comparing Wikipedia’s content (which is actual articles, things people can read) with Mahalo’s, which is 99% links.

    There may be paralels in some ways with Wikipedia, but the actual end-user product is very different. The actual output is far more similar to Google– a list of links.

    I do think there’s value in lists of links, but not nearly the type of value which is being discussed here.

  16. Hi Harold

    I agree with your comment, though not that it’s a huge flaw :-)

    If I were trying the above strategy, I’d go for breadth first – get many pages out there, of reasonable quality, make a noise, and then come back to flesh them out more.

    I don’t know that that’s the strategy, but it’s what I’d do.

    Terry

  17. Hi Harold

    I agree with your comment, though not that it’s a huge flaw :-)

    If I were trying the above strategy, I’d go for breadth first – get many pages out there, of reasonable quality, make a noise, and then come back to flesh them out more.

    I don’t know that that’s the strategy, but it’s what I’d do.

    Terry

  18. […] to create the page you go to by default. who’s it going to be? terry said mahalo was really angling for wikipedia. i think this shows that there’s more to this. why not tilt at the windmills of wikipedia and […]

  19. […] fluidinfo » Blog Archive » The Mahalo-Wikipedia-Google love triangle because Mahalo is not a search engine. Putting a box labeled “Search” on your web site to dig hits out of your own content does not make you a search engine – if it did, millions of sites would qualify. (tags: mahalo google wikipedia search advertising jason calacans) […]

  20. […] Back in November I wrote some comments on Mahalo in an article titled The Mahalo-Wikipedia-Google love triangle. […]

  21. Welcome to the ugg cardy and ugg boots.http://Www.bootslife.com is recognized as a leader in specialty retailing with UGG around the world.

  22.  
    discount high quality & hot—sale digital camera battery
    discount high quality & hot—sale laptop battery
    discount high quality & hot—sale PDA battery
    discount high quality & hot—sale cell phonebattery
    discount high quality & hot—sale car alarms and security
    discount high quality & hot—sale car audio
    discount high quality & hot—sale car charger
    discount high quality & hot—sale car video
    discount high quality & hot—sale FM transmitter
    discount high quality & hot—sale cell phone watchs
    discount high quality & hot—sale computer & networking
    discount high quality & hot—sale adapter chargers
    discount high quality & hot—sale card reader
    discount high quality & hot—sale converter adaptor
    discount high quality & hot—sale drives and storage
    discount high quality & hot—sale game pad
    discount high quality & hot—sale headset
    discount high quality & hot—sale ip phones
    discount high quality & hot—sale laptop cpu fan
    discount high quality & hot—sale networking
    discount high quality & hot—sale notebook accessories
    discount high quality & hot—sale pc mouse
    discount high quality & hot—sale PDA accessories
    discount high quality & hot—sale projector
    discount high quality & hot—sale skype phone
    discount high quality & hot—sale sound cards
    discount high quality & hot—sale Speaker
    discount high quality & hot—sale USB hub
    discount high quality & hot—sale wireless lan
    discount high quality & hot—sale pc camera
    discount high quality & hot—sale DVB-T
    discount high quality & hot—sale games
    discount high quality & hot—sale nintendo accessories
    discount high quality & hot—sale ps2 accessories
    discount high quality & hot—sale ps3 accessories
    discount high quality & hot—sale psp accessories
    discount high quality & hot—sale Xbox accessories
    discount high quality & hot—sale HDMI and AV cables
    discount high quality & hot—sale TV Turner box
    discount high quality & hot—sale IPOD accessories
    discount high quality & hot—sale aluminum case
    discount high quality & hot—sale earphone
    discount high quality & hot—sale ipod battery
    discount high quality & hot—sale ipod data cable
    discount high quality & hot—sale ipod protective film
    discount high quality & hot—sale ipod silicon case
    discount high quality & hot—sale leather case
    discount high quality & hot—sale Speaker
    discount high quality & hot—sale Apple ipod
    discount high quality & hot—sale digital voice recorder
    discount high quality & hot—sale ipod accessories
    discount high quality & hot—sale MP3 & MP4
    discount high quality & hot—sale MP3 accessories
    discount high quality & hot—sale MP3 player recording pens
    discount high quality & hot—sale MP3 watchs
    discount high quality & hot—sale portable MP3 player
    discount high quality & hot—sale sunglasses MP3 player
    discount high quality & hot—sale video eyewear
    discount high quality & hot—sale printer
    discount high quality & hot—sale For Samsung
    discount high quality & hot—sale For brother
    discount high quality & hot—sale For canon
    discount high quality & hot—sale For HP
    discount high quality & hot—sale For panasonic
    discount high quality & hot—sale surveillance equipment
    discount high quality & hot—sale alarm system
    discount high quality & hot—sale DVR cards & systems
    discount high quality & hot—sale security devices
    discount high quality & hot—sale spy camera & accessories
    discount high quality & hot—sale wired CCTV cameras
    discount high quality & hot—sale security systems
    discount high quality & hot—sale toy and hobbies
    discount high quality & hot—sale radio control toy
    discount high quality & hot—sale light
    discount high quality & hot—sale bike lights & car lights
    discount high quality & hot—sale flashlights
    discount high quality & hot—sale headlights
    discount high quality & hot—sale laser pointers
    discount high quality & hot—sale led lights
    discount high quality & hot—sale Electronics Gadgets
    discount high quality & hot—sale home video and theater
    discount high quality & hot—sale keychain photo viwers
    discount high quality & hot—sale thlephones and pagers
    discount high quality & hot—sale watch
    discount high quality & hot—sale clocks and clock radio
    discount high quality & hot—sale USB desktop
    discount high quality & hot—sale Digital pictures frames
    discount high quality & hot—sale cell phone accessories
    discount high quality & hot—sale dual band cell phone
    discount high quality & hot—sale mobile covers
    discount high quality & hot—sale
    discount high quality & hot—sale quad band cell phones
    discount high quality & hot—sale Motorola
    discount high quality & hot—sale Samsung
    discount high quality & hot—sale blackBerry
    discount high quality & hot—sale camera memory
    discount high quality & hot—sale digital camera tripod
    discount high quality & hot—sale kodka
    discount high quality & hot—sale canon
     

  23. unless they start putting up ads and funneling large amounts of money back to Google. And Jason is unlikely to shoot himself in the foot either – quite the reverse.

  24. Buy designer Hermes handbags online. Hermes bags buy Hermes Wallets for cheap 张亚楠测试 online 1

  25. Buy designer Hermes handbags online. Hermes bags buy Hermes Wallets for cheap 张亚楠测试 online 1

  26. Francisca Ellis Says:

    This is surely a totally useful and aimed content. Though my opinion differs from that of the author, I find that this is certainly excellent site. I will certainly become a member of Feed on this amazing and unique blog site.

  27. That is surely a extremely informative and centered piece of writing. Though my opinion may differ from that of the writer, I realize that this is certainly very good web-site. I’ll definitely become a member of Feed on this terrific as well as distinctive blog website.