Top tweeters as followed by HN readers now in FluidDB

July 29th, 2010 by Terry Jones. Filed under Programming.

Yesterday Jeff Miller posted some interesting data on the Twitter users most followed by readers of Hacker News.

I just took those top 100 Tweeters and added Jeff’s data (their rank and the fraction of HN readers who follow them) to FluidDB. The tags I used in FluidDB are ycombinator.com/top-100 and ycombinator.com/follow-percent. The top-100 tag has values that are the Twitter user’s rank (from 1 to 100), and the follower-percent tag holds the (floating point) percentage of Hacker News readers that follow that Twitter user, as found by Jeff.

What does this all mean?

It means you can now query on Jeff’s data using FluidDB. And because FluidDB contains various other pieces of information about Twitter users, you can combine his data with other data in searches – including searches that Jeff probably never anticipated (and, because of FluidDB, never had to anticipated).

It also mean you can add to the data too. All you need is a FluidDB account (sign up) and then you can take the FluidDB API for a spin (docs).

To see the kinds of things that are possible, you can also do some queries using the advanced tab of Tickery.

For example, Who are more than 20.0 percent of HN readers following that have a TunkRank score of at least 60?

Or, Who is in the HN top 100 that I have met?

Or, Who of the top 100 do I follow?

The possibilities are endless. The main point of FluidDB is that you can play too. You can add your own data (any data) to the exact same objects that I’ve put Jeff’s data onto and which Tickery and TunkRank and We Met At are all using – and you don’t have to ask permission.

We’ve written plenty more on this subject. See also Tickery, for programmers, TunkRank scores added to FluidDB, Putting metadata onto tweets with FluidDB and FluidDB as a universal metadata engine.

You can get all the code I used to put the data into FluidDB from our hackernews repo on GitHub. It was about 90 minutes of work from start to finish.

Have fun, and please comment below!

  • Pingback: Tweets that mention FluidDB » Blog Archive » Top tweeters as followed by HN readers now in FluidDB -- Topsy.com()

  • Are there open formats (standards) we can follow to make that 90minutes a cut and paste job, or an email to fluid?
    <hackernewsinfluencers>
    <twitterids><delim=”,”><percentage_of_users>
    user1,score
    user2,score
    </twitterids></percent_of_users>

    <

  • Hi Mark

    Good question! I even have a semi-decent answer. It turns out that just last Friday Nicholas Tollervey wrote some code to do pretty much what you've suggested. His project is called Flimp (FluidDB Importer) and you can find it at http://github.com/fluidinfo/flimp It's a very early version of something that could become more generally powerful. I didn't use it, but I probably should have. I just happened to have some code hanging around from Tickery that did almost exactly what I wanted. Maybe I'll ask Nicholas if he'd like to make an input file in JSON and pass it to Flimp and then blog about it to show how it's done.

    Thanks!

  • Groovy. Blog by email was a big hit for Posterous. Nifty UI and formats could boost Fluids attractiveness to even non-devs.

  • Hey Mark,

    Terry is right, “flimp” will help automate the process of importing data into FluidDB. As for a show and tell demo… an upcoming screencast (and associated documentation) will feature flimp importing metadata from data.gov.uk.

  • I look forward to it Nicholas.