Tag Archives: python

An analysis of Steve Jobs tribute messages displayed by Apple

20-Oct-11

Two weeks have passed since Apple’s co-founder and CEO Steve Jobs died. Upon his passing, Apple encouraged people to share their memories, thoughts, and feelings by emailing rememberingsteve@apple.com. Earlier this week, Apple posted a site (http://www.apple.com/stevejobs) in tribute to Steve Jobs. According to the site, over a million people have submitted messages. The site cycles through the submitted [...]

Chuck Norris doesn’t screen-scrape, the data runs scared to his hard drive.

01-Mar-11

Inspired by a tweet from Roger Ehrenberg and my 11-year-old son who’s crazy about Chuck Norris facts, I screen-scraped the contents of http://www.chucknorrisfacts.com. Code and data can be found here. Using Python and BeautifulSoup, it simply loops through all of the pages on http://www.chucknorrisfacts.com and reads the items displayed on the page. Output looks like Visit [...]

Visualizations of Canabalt scores scraped from Twitter

16-Feb-11

Canabalt, a ridiculously addictive web and iOS game, lets players post their high scores, and their not-so-high scores, to Twitter. Each of these tweets contains a bit of information: the score in meters, the method of death (hitting a wall and tumbling to my death), and the device (iPhone). Other useful information can [...]
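As a rough illustration of pulling those three fields out of a tweet, here is a minimal sketch using a regular expression. The tweet wording ("I ran ...m before ... on my ...") is an assumption based on the example in the excerpt, and `parse_canabalt_tweet` is a hypothetical helper name, not code from the original post.

```python
import re

# Assumed tweet shape: "I ran <score>m before <method of death> on my <device>."
# The exact wording is an assumption made for illustration.
CANABALT_PATTERN = re.compile(
    r"ran (?P<score>[\d,]+)m before (?P<death>.+?) on my (?P<device>[\w ]+?)(?=[.!]|$)"
)

def parse_canabalt_tweet(text):
    """Return (score_in_meters, method_of_death, device), or None if no match."""
    match = CANABALT_PATTERN.search(text)
    if match is None:
        return None
    score = int(match.group("score").replace(",", ""))
    return score, match.group("death"), match.group("device")

# Made-up sample tweet, not taken from the scraped data.
sample = "I ran 2514m before hitting a wall and tumbling to my death on my iPhone."
print(parse_canabalt_tweet(sample))
```

Tweets that do not follow the assumed wording simply return None, so they can be skipped or logged for inspection.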

Word Cloud from 6,500 tweets mentioning Kanye West. From this morning

14-Dec-10

After removing a few stopwords and then clearing out a few other words (nowplaying, lastfm, and the like), here’s what’s left. The data represents a half-day’s worth of tweets. I’m sitting on about 90,000 tweets about Kanye and am looking forward to taking the time for some more in-depth analysis. Huge thanks to @jrlevine and [...]

Words mentioned in 23-Jun-2010 Canadian Earthquake tweets

24-Jun-10

Using Twitter gardenhose access, remove stopwords and punctuation, sprinkle in a little bit of mapping, some reducing, and voilà! The most frequently occurring words in tweets that mentioned earthquake from June 23, 2010. I left earthquake out of the image itself because it was in every tweet and overwhelmed the rest of the words. [...]
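The map-and-reduce word count described here might look something like the sketch below. It is not the code from the original post: the stopword list is a tiny stand-in, the sample tweets are invented, and `map_words` and `reduce_counts` are hypothetical names.

```python
import string
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real run would use a fuller one).
STOPWORDS = {"a", "an", "and", "the", "in", "of", "to", "is", "it", "was", "i", "that"}

def map_words(tweet, skip=frozenset()):
    """Map step: lowercase, strip punctuation, drop stopwords and skipped terms."""
    cleaned = tweet.lower().translate(str.maketrans("", "", string.punctuation))
    return [w for w in cleaned.split() if w not in STOPWORDS and w not in skip]

def reduce_counts(tweets, skip=frozenset()):
    """Reduce step: sum word occurrences across all tweets."""
    counts = Counter()
    for tweet in tweets:
        counts.update(map_words(tweet, skip))
    return counts

# Invented sample tweets, standing in for the gardenhose data.
sample_tweets = [
    "Was that an earthquake in Ottawa?",
    "Earthquake! The building shook in Toronto.",
    "I felt the earthquake in Ottawa, too.",
]

# Skip the search term itself, since it appears in every tweet.
word_counts = reduce_counts(sample_tweets, skip={"earthquake"})
print(word_counts.most_common(3))
```

Passing the search term through `skip` keeps it from dominating the counts, which mirrors leaving earthquake out of the image.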

An analysis of Oracle errors in the leaked 9/11 Pager Data

30-Nov-09

Yes, you read that correctly. Here’s how it started: I’m working on some text analysis in Python and was looking for some test data. Someone recommended I use the 9/11 Pager Data from Wikileaks. I downloaded the data, ran my program against it (which is the subject of another post) and all was well. Got [...]

Generating multiple Oracle TKPROF reports using Python

24-Nov-09

Recently, a customer told me that they felt a batch job was taking too long each night, so I gave them a few commands to add to their nightly run. These commands named the tracefile and enabled 10046 logging. Since I’m lazy (the good kind), I figured I’d use Python to build the commands to run TKPROF [...]
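A minimal sketch of that idea, assuming each trace file should get one report with the same base name: the function below builds one tkprof command line per trace file. The paths, the `.prf` report extension, and the `build_tkprof_commands` name are hypothetical; `sys=no` and the `exeela`/`fchela` sort keys are standard TKPROF options, but whether the original script used them is not known.

```python
from pathlib import PurePosixPath

def build_tkprof_commands(trace_files, report_dir, sort_keys="exeela,fchela"):
    """Build one tkprof command line per trace file (commands are returned, not run)."""
    commands = []
    for trace in sorted(PurePosixPath(p) for p in trace_files):
        # Name the report after the trace file, e.g. foo.trc -> foo.prf
        report = PurePosixPath(report_dir) / (trace.stem + ".prf")
        commands.append(f"tkprof {trace} {report} sys=no sort={sort_keys}")
    return commands

# Hypothetical trace file names, for illustration only.
traces = [
    "/u01/trace/orcl_ora_5678_NIGHTLY.trc",
    "/u01/trace/orcl_ora_1234_NIGHTLY.trc",
]
for command in build_tkprof_commands(traces, "/tmp/reports"):
    print(command)
```

Returning the commands as strings makes it easy to review them or write them to a shell script before anything is executed.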