Skip to content

Monthly Archives: November 2009

An analysis of Oracle errors in the leaked 9/11 Pager Data

30-Nov-09

Yes, you read that correctly. Here’s how it started: I’m working on some text analysis in Python and was looking for some test data. Someone recommended I use the 9/11 Pager Data from Wikileaks. I downloaded the data, ran my program against it (which is the subject of another post) and all was well. Got [...]

Generating multiple Oracle TKPROF reports using Python

24-Nov-09

Recently, a customer told me that they felt a batch job was taking too long each night, I gave them a few commands to add to their nightly run. These commands named the tracefile and enabled 10046 logging. Since I’m lazy(the good kind), I figured I’d use Python to build the commands to run TKPROF [...]

A few quick observations on StackOverflow questions tagged R

12-Nov-09

While browsing through Pete Skomoroch’s delicious bookmarks(which is a full-time job in and of itself), I learned that StackOverflow.com makes their underlying q&a data available. Just for fun, I wrote a few quick queries against this dataset, centered around the R tag. Here are a handful of findings – data is through 31-Oct-2009. Some of [...]

How to determine holidays and the next/prior business day in PL/SQL

09-Nov-09

Here’s an excerpt of a package I wrote that performs many date-related operations in PL/SQL, as they relate to business days and holidays. It does a few interesting things, such as determine holiday dates dynamically, and then determine when the holidays should be observed. It takes holidays falling on a Saturday/Sunday and determines if we [...]

Process the whole world, or just one item

05-Nov-09

One of my favorite PL/SQL techniques used in batch job development is to add an extra parameter, default it to NULL, so that I can test a single account at a time. example So the real trick is the NVL in the where clause – If I supply a value for in_employer_id that’s the only [...]

Manager: How big is my table? Me: What do you mean?

03-Nov-09

Recently, a data warehouse manager sent me a list of 49 tables; he wants the approximate size of each. For an end-user, this is no easy task. Sure, GUI interfaces such as Toad or OEM will give this information, but not all managers (or developers for that matter) know how to get this information. Furthermore, [...]