Channel: joy of data » Kettle/PDI
Browsing latest articles
Browse All 5 View Live

Transforming an XML document into a table structure with Pentaho Kettle/PDI

In the past few months I have been using data sets provided by Eurostat a lot, so I crafted a Kettle job that loads SDMX files (XML documents holding the data), sets up custom-tailored tables in a...

Pentaho Kettle 4.4 database repository under the hood

We use Pentaho Kettle 4.4 to implement ETL processes, and to make the jobs involved centrally accessible they are stored in a database repository. Given that several proxies separate my VPN...

Pentaho Kettle 5 comes with improved database repository performance

Two weeks ago I wrote about Kettle 4.4 and its database repository, and how working with it is truly no fun because of the excessive latency when loading and saving jobs and transformations....

Using the Dimension Lookup/Update Step in Pentaho Kettle

In a traditional star schema the dimensions are located within specialized tables which are referred to by numeric keys from the fact table. A dimension can represent anything from the gender (“male”,...

Mondrian Schema for OLAP Cube Definition ft. Google Analytics and Saiku

In this tutorial I showcase how to load web stats from Google Analytics into a fact table with Pentaho Kettle/PDI, and then how to represent that fact table with a Mondrian 3.6 schema...
