As some of you know, my twitter bot @hellooooonewman responds to every tweet containing the word/hashtag ‘Seinfeld’. Using Python and the Twitter Search REST API, it looks for mentions and then replies to the original author with a random Seinfeld quote. People seem to get a kick out of it, judging by its 2,000 followers [...]
Also filed in
|
|
See that tiny little ‘s-o-r-r-y’? Right under the giant ‘p-e-o-p-l-e’. He’s spoke more about others’ reactions to his infidelities than he did taking responsibility for his own actions. But what did we expect? Word-frequencies: 45 – 1 Accenture – 1 Achievements – 1 And – 1 Buddhist – 1 California – 1 Center – 1 [...]
Also filed in
|
|
To troubleshoot a pesky java-in-the-database issue at one of my sites, I loaded the log4j jar into the Oracle JVM. We configured the log to write to a file on the local filesystem. I then thought it would be nice to be able to monitor the log from the database using SQL and some existing [...]
Also filed in
|
|
A really quick visualization I did while researching data for another project. Census.gov has a link to the most frequently occurring first names and surnames from the 1990 census. Surely more current data must exists; I found this dataset by accident. The original data is tab-delimited in the format: Name Frequency in percent Cumulative Frequency [...]
While doing some research(yeah, right!) for my latest twitter bot @TheBotLebowski, I found a copy of the original script. Sounded like a good candidate for a word cloud, right? The only scrubbing of the data I had to do was to remove the capitalized character’s name that’s printed above every line of dialog. A simple [...]
My cousin @joelkodner (private stream, follow to view) is closing in on 20,000 tweets. Pretty impressive considering he joined not too long ago. That amounts to about 300 a day(!?). Bookmark on Delicious Digg this post Recommend on Facebook share via Reddit Share with Stumblers Tweet about it Subscribe to the comments on this post
Received this gem while trying to enter my hours. Looks like some incomplete market research to me Bookmark on Delicious Digg this post Recommend on Facebook share via Reddit Share with Stumblers Tweet about it Subscribe to the comments on this post
Yes, you read that correctly. Here’s how it started: I’m working on some text analysis in Python and was looking for some test data. Someone recommended I use the 9/11 Pager Data from Wikileaks. I downloaded the data, ran my program against it (which is the subject of another post) and all was well. Got [...]
Recently, a customer told me that they felt a batch job was taking too long each night, I gave them a few commands to add to their nightly run. These commands named the tracefile and enabled 10046 logging. Since I’m lazy(the good kind), I figured I’d use Python to build the commands to run TKPROF [...]
While browsing through Pete Skomoroch’s delicious bookmarks(which is a full-time job in and of itself), I learned that StackOverflow.com makes their underlying q&a data available. Just for fun, I wrote a few quick queries against this dataset, centered around the R tag. Here are a handful of findings – data is through 31-Oct-2009. Some of [...]