Via the Rails Envy podcast I found "Super-fast bulk data imports in Rails with ar-extensions". Basically, it addresses a scaling issue: importing a huge number of records (say, a billion) into Rails through ActiveRecord is painfully slow. ar-extensions lets you do bulk imports instead, which makes them much faster.
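To make the idea concrete, here is a rough sketch of why bulk imports tend to be faster: instead of one INSERT per record, many rows are packed into a single multi-row INSERT statement. This is plain Ruby and does not reproduce the ar-extensions API. The helper names (`sql_literal`, `bulk_insert_sql`, `bulk_insert_statements`), the `books` table, and the default batch size of 1000 are all made up for illustration, and the quoting is deliberately naive.

```ruby
# Hypothetical helper: turn a Ruby value into a SQL literal.
# Naive quoting for illustration only; real code should rely on the
# database adapter's own quoting.
def sql_literal(value)
  case value
  when nil then "NULL"
  when Numeric then value.to_s
  else "'" + value.to_s.gsub("'", "''") + "'"
  end
end

# Hypothetical helper: build ONE multi-row INSERT for a batch of rows,
# rather than one INSERT statement per row.
def bulk_insert_sql(table, columns, rows)
  column_list = columns.join(", ")
  value_list = rows.map { |row|
    "(" + row.map { |v| sql_literal(v) }.join(", ") + ")"
  }.join(", ")
  "INSERT INTO #{table} (#{column_list}) VALUES #{value_list}"
end

# Split a large import into batches so no single statement grows too big.
# The default batch size is an arbitrary assumption.
def bulk_insert_statements(table, columns, rows, batch_size = 1000)
  rows.each_slice(batch_size).map { |batch|
    bulk_insert_sql(table, columns, batch)
  }
end

rows = [["Dune", 412], ["O'Reilly", nil]]
puts bulk_insert_sql("books", ["title", "pages"], rows)
```

With this approach, importing 5,000 rows at a batch size of 1000 means 5 statements sent to the database instead of 5,000, which is roughly where the speedup comes from.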