Transforming an XML document into a table structure with Pentaho Kettle/PDI
In the past few months I have been using data sets provided by Eurostat a lot and so I crafted a Kettle job that loads SDMX files (an XML document keeping the data), sets up custom-tailored tables in a...
View ArticlePentaho Kettle 4.4 database repository under the hood
We are using Pentaho Kettle 4.4 for realizing ETL processes and to make the involved jobs centrally accessible those are stored in a database repository. Given that several proxies separate my VPN...
View ArticlePentaho Kettle 5 comes with improved database repository performance
Two weeks ago I wrote about Kettle 4.4 and its database repository and how working with it is truly no fun due to excessive latency connected with loading and saving of jobs and transformations....
View ArticleUsing the Dimension Lookup/Update Step in Pentaho Kettle
In a traditional star schema the dimensions are located within specialized tables which are referred to by numeric keys from the fact table. A dimension can represent anything from the gender (“male”,...
View ArticleMondrian Schema for OLAP Cube Definition ft. Google Analytics and Saiku
What I am going to showcase in this tutorial is how to load web stats from Google Analytics into a fact table with Penthao Kettle/PDI. And then how to represent that fact table with Mondrian 3.6 schema...
View Article