Skip to content

Monthly Archives: January 2011

Retrieving the US National Debt and Population using Python and BeautifulSoup

30-Jan-11

Update 12-May-2011, I cleaned up the code, added logging of the data to a tab-delimited file, and published it to github. Happy Hacking! Someone suggested I create a bot that tweets the US National Debt.  Here’s how I’m retrieving the National Debt amount from the US Treasury site.  I then retrieve the US Population from [...]

World Cup 2010: Analysis of tweets celebrating goals

10-Jan-11

The 2010 World Cup proved to be one of the most tweeted about events of 2010.  Through the millions of tweets saved to my local Cloudera CDH3 Hadoop cluster, I wrote a quick pig script to discover the ways that people are celebrating(ok, spelling) goals.  Here are the top few variations of Goal/Gol.  The full [...]