During the last couple of years, Pentaho Data Integration (PDI) a.k.a. Kettle has become one of the leading ETL tools.  Here are a few useful or memorable links to things I wrote on my blog about Kettle…

If you have other interesting Kettle/PDI related links, feel free to comment

19 thoughts on “Kettle / PDI”

  1. Hi,matt,First,thanks to reply my thread on forums.pentaho.org.
    I want to talk to you.Can I?
    I’m chinese,want to lrean kettle from you ^-^

  2. Hello,

    I’ve been using Kettle 2.2 till now. Also, there were plugins developed by others.
    I’ve downloaded PDI 3.0.4. But, most of the existing transforms are not working in PDI 3.
    Is there a way to use/access those plugins in PDI 3.0.4 ?
    How to migrate those plugins into PDI3 ?

    Please help, thnx.

    Virendra

  3. Hi Bambam,

    Unfortunately I’ve never heard about that database. That doesn’t mean much though. There are new databases popping up all the time, usually PostgreSQL, MySQL, etc clones but others as well.
    Ask your vendor for a JDBC or ODBC driver and we can work together to create the driver in Kettle. File a feature request at http://jira.pentaho.org/browse/PDI

    Matt

  4. hi matt,
    Some of the kettle I convert smth into Chinese work , but do not know how to submit this infomation to your organization’s projects. Can you tell me how to do it?

  5. hi matt,
    our project needs the Kettle,and i integrated it in our project.i use the next math to run the Job,but i don’t know how to stop the Job.you cen tell me how to stop this Job?
    this is my run the Job src:
    //???
    EnvUtil.environmentInit();
    JobEntryLoader.init();
    StepLoader.init();
    //??
    LogWriter log=LogWriter.getInstance(“KettleTest.log”, true, LogWriter.LOG_LEVEL_DETAILED);
    //??
    UserInfo userInfo=new UserInfo();
    userInfo.setLogin(“admin”);
    userInfo.setName(“admin”);
    DatabaseMeta connection=new DatabaseMeta(“10.207.6.109-sspa”, “Oracle”, “Native”, “10.207.6.109”, “orcl”, “1521”, “sspa”, “sspa”);
    //??????
    RepositoryMeta repinfo=new RepositoryMeta();
    repinfo.setConnection(connection);
    //???
    Repository rep=new Repository(log, repinfo, userInfo);
    //?????
    rep.connect(“”);
    //???????
    RepositoryDirectory dir=new RepositoryDirectory(rep);
    //??????
    StepLoader steploader=StepLoader.getInstance();
    //Job???
    JobMeta jobMeta=new JobMeta(log, rep,”ceshiJob”, dir);
    //Job
    Job job=new Job(log, steploader, rep, jobMeta);
    //??Job
    //job.execute();
    job.run();
    //??Job????
    job.waitUntilFinished();

  6. Hi Matt,

    I have read Pentaho Kettle Solution book and it was awesome. Is there any chance for a new edition to be released ?

    Thanks,
    Hari

  7. Hello

    I am looking at resources or samples to help me guide me to connect a PDI transformation from within a C# page. Appreciate any help.

    Regards
    Sanjeev

  8. Hi, Matt

    I want add to my custom kettle plugin to PDI CE 5
    but, I can’t find sample.

    now I built the PDI5 CE[kettle] on eclipse.
    I am looking at the guide of configurating a custom plugin to Kettle project on eclipse.

    do you help me?

  9. Hi Matt

    I have been looking into your responses and they were really helpful and solved a lot of my pentaho issues. Recently, I have come up with this problem of using big data plugin like hadoop file output in my ktr. The ktr runs fines when I run from the spoon. But when I try to run from my java code, I get the following missing plugin error.

    2015/11/18 13:47:23 – Property_Validation – ERROR (version 5.3.0.0-200, build 1 from 2015-01-20_19-50-27 by buildguy) : org.pentaho.di.core.exception.KettleException:
    2015/11/18 13:47:23 – Property_Validation – Unexpected error during transformation metadata load
    2015/11/18 13:47:23 – Property_Validation –
    2015/11/18 13:47:23 – Property_Validation – Missing plugins found while loading a transformation
    2015/11/18 13:47:23 – Property_Validation –
    2015/11/18 13:47:23 – Property_Validation – Step : HadoopFileOutputPlugin
    2015/11/18 13:47:23 – Property_Validation –
    2015/11/18 13:47:23 – Property_Validation – at org.pentaho.di.job.entries.trans.JobEntryTrans.getTransMeta(JobEntryTrans.java:1205)
    2015/11/18 13:47:23 – Property_Validation – at org.pentaho.di.job.entries.trans.JobEntryTrans.execute(JobEntryTrans.java:648)
    2015/11/18 13:47:23 – Property_Validation – at org.pentaho.di.job.Job.execute(Job.java:716)
    2015/11/18 13:47:23 – Property_Validation – at org.pentaho.di.job.Job.access$000(Job.java:115)
    2015/11/18 13:47:23 – Property_Validation – at org.pentaho.di.job.Job$1.run(Job.java:835)
    2015/11/18 13:47:23 – Property_Validation – at java.lang.Thread.run(Thread.java:745)
    2015/11/18 13:47:23 – Property_Validation – Caused by: org.pentaho.di.core.exception.KettleMissingPluginsException:
    2015/11/18 13:47:23 – Property_Validation – Missing plugins found while loading a transformation
    2015/11/18 13:47:23 – Property_Validation –

    I have already added the below big data plugin dependency into my pom.xml file but still I get the above error. Do I need to register this plugin in the java code ? If so, how do I do it ?

    Some suggestions from forums were to copy the plugins folder into my current java working directory and set the System.getProperty(“KETTLE_PLUGIN_BASE_FOLDERS”,”current working directory path”) but still i get the same below error.

    I am using the following java code to trigger the job. It works fine without the hadoop input but once I add the hadoop input in my ktr, i get the above error.

    pentaho
    pentaho-big-data-plugin
    5.2.1.0-148

    Please give us any suggestions or tips to get past this issue.

    Thanks
    Koushik

Leave a Reply to Virendra Rathore Cancel reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.