<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
>

<channel>
	<title>Solution Hacker &#187; pentaho</title>
	<atom:link href="http://www.solutionhacker.com/tag/pentaho/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.solutionhacker.com</link>
	<description>This blog provides solutions for entrepreneurs!</description>
	<lastBuildDate>Mon, 06 Feb 2012 07:19:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=371</generator>
<!-- podcast_generator="Blubrry PowerPress/2.0.4" -->
	<itunes:summary>This blog provides solutions for entrepreneurs!</itunes:summary>
	<itunes:author>Solution Hacker</itunes:author>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://www.solutionhacker.com/wp-content/plugins/powerpress/itunes_default.jpg" />
	<itunes:subtitle>This blog provides solutions for entrepreneurs!</itunes:subtitle>
	<image>
		<title>Solution Hacker &#187; pentaho</title>
		<url>http://www.solutionhacker.com/wp-content/plugins/powerpress/rss_default.jpg</url>
		<link>http://www.solutionhacker.com</link>
	</image>
		<item>
		<title>Reporting solution!</title>
		<link>http://www.solutionhacker.com/implement-your-idea/build-your-website/reporting-solution/</link>
		<comments>http://www.solutionhacker.com/implement-your-idea/build-your-website/reporting-solution/#comments</comments>
		<pubDate>Wed, 21 Jan 2009 08:24:28 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Site Building]]></category>
		<category><![CDATA[AMF]]></category>
		<category><![CDATA[iBatis]]></category>
		<category><![CDATA[jasper]]></category>
		<category><![CDATA[pagination]]></category>
		<category><![CDATA[pdf generation]]></category>
		<category><![CDATA[pentaho]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[streaming]]></category>

		<guid isPermaLink="false">http://www.solutionhacker.com/?p=205</guid>
		<description><![CDATA[<h2>Open source reporting</h2>
<p>My company needs a reporting engine but doesn't want to pay for an expensive commercial one like <strong>MicroStrategy</strong>. In fact, I don't know why we would pay so much when there are tools out there for FREE. As usual, I googled the Net and found two seemingly promising open source reporting solutions.</p>
<ol>
    <li><strong>Pentaho </strong>Reporting</li>
    <li><strong>Jasper </strong>Reporting</li>
</ol>
<p>Both are bundled with a suite of tools for <strong>OLAP</strong>, <strong>Data Mining</strong>, <strong>ETL</strong>, etc. I just want a non-invasive reporting engine that integrates easily into our architecture. To my dismay, I found that Pentaho doesn't go this route. It basically gives you a preconfigured reporting server, and you build and deploy your reports following the manual. However, I have hardly ever seen a reporting solution that satisfies all business requirements without customization. All I expected from Pentaho was a jar file with documentation showing how to use its API to generate reports in different formats and how to integrate it with our database. I looked into the code and tried to extract the parts I wanted. However, I found that the engine is actually not powerful. Once you strip out the workflow part, it is basically a simple SQL executor that renders the result according to the UI info embedded in the report definition. What is wrong with that?</p>
<p><!--more--></p>
<ol>
    <li>We want to handle <strong>pagination </strong>and <strong>data streaming</strong> because our data volume is huge. In Pentaho, you need to take care of these yourself: you write your own SQL, paginate it yourself, and stream it yourself if the result set is huge. Isn't that what we are already doing without it? Apart from that, each report in Pentaho needs a report definition, which supports dynamic SQL only via token replacement. That is too primitive; I want it to support control flow, because I may decide which tables to join based on the input filters.</li>
    <li>On the UI side, Pentaho helps you render your result as a graph, table, etc. Again, I don't like this UI solution either. I found that <strong>JFreeChart </strong>is not as powerful as Flex. I am adopting Flex, which gives me much more powerful visualization tools. All I want is to ship my query's result data to my Flex app.</li>
</ol>
<p>How about <strong>Jasper</strong>? It is pretty much the same, but the good thing about Jasper is that it gives you a jar and documentation on how to use it, instead of a reporting server like Pentaho. So, I can use it as a report renderer to generate PDF and Excel, like the other utility libraries I use. So, what is my final solution?</p>
<p>I finally decided to create my own report definition, which my Flex UI can take and use to render the reporting interface. So, I don't need to create a form for each report. Apart from that, my report definition embeds an iBatis SQL template, so I can leverage its dynamic SQL feature, which supports control flow logic, and its automatic population of result classes. Yes, I still need to handle pagination and streaming myself, but at least it already saves me time. The populated result object is returned to Flex via AMF, so I don't need to marshal and unmarshal it as XML. That saves processing time and bandwidth. In the end, my solution combines the best in the market:</p>
<ol>
    <li>Powerful reporting widgets provided by Flex</li>
    <li>Fast streaming and RPC protocol - <strong>AMF</strong></li>
    <li>Good dynamic SQL generation and mapping tool from <strong>iBatis</strong></li>
    <li>Good report rendering tool from Jasper for PDF and Excel generation</li>
</ol>
<p>My solution is more flexible, as I can plug in Hibernate mappings if I don't want to write my own SQL at all. Apart from that, no UI work is needed to deploy a new report unless my generic reporting interface is not enough.</p>
<p>Later, if I really need the workflow engine provided by Pentaho, I can plug it in. Again, though, the documentation provided doesn't give clear instructions or APIs for how to do that.</p>
<h2>Reference</h2>
<p>Below are references I used to build my solution:</p>
<ol>
    <li><a href="http://Flexible reporting with JasperReports and iBATIS">Flexible reporting with JasperReports and iBATIS</a></li>
    <li><a href="http://e-docs.bea.com/kodo/docs41/full/html/ref_guide_dbsetup_lrs.html">How Kodo JPA handles large result sets</a> (its <a href="http://edocs.bea.com/kodo/docs41/full/html/ref_guide_optimization.html">optimization</a> guide is a good reference even if you do not use Kodo)</li>
    <li><a href="http://my.advisor.com/articles.nsf/aid/14753">Process Large Result Sets in Java Web Application</a></li>
    <li><a href="http://www.ahmadsoft.org/articles/stream/presidents.html">Streaming architecture</a></li>
</ol>
]]></description>
			<content:encoded><![CDATA[<h2>Open source reporting</h2>
<p>My company needs a reporting engine but doesn&#8217;t want to pay for an expensive commercial one like <strong>MicroStrategy</strong>. In fact, I don&#8217;t know why we would pay so much when there are tools out there for FREE. As usual, I googled the Net and found two seemingly promising open source reporting solutions.</p>
<ol>
<li><strong>Pentaho </strong>Reporting</li>
<li><strong>Jasper </strong>Reporting</li>
</ol>
<p>Both are bundled with a suite of tools for <strong>OLAP</strong>, <strong>Data Mining</strong>, <strong>ETL</strong>, etc. I just want a non-invasive reporting engine that integrates easily into our architecture. To my dismay, I found that Pentaho doesn&#8217;t go this route. It basically gives you a preconfigured reporting server, and you build and deploy your reports following the manual. However, I have hardly ever seen a reporting solution that satisfies all business requirements without customization. All I expected from Pentaho was a jar file with documentation showing how to use its API to generate reports in different formats and how to integrate it with our database. I looked into the code and tried to extract the parts I wanted. However, I found that the engine is actually not powerful. Once you strip out the workflow part, it is basically a simple SQL executor that renders the result according to the UI info embedded in the report definition. What is wrong with that?</p>
<p><span id="more-205"></span></p>
<ol>
<li>We want to handle <strong>pagination </strong>and <strong>data streaming</strong> because our data volume is huge. In Pentaho, you need to take care of these yourself: you write your own SQL, paginate it yourself, and stream it yourself if the result set is huge. Isn&#8217;t that what we are already doing without it? Apart from that, each report in Pentaho needs a report definition, which supports dynamic SQL only via token replacement. That is too primitive; I want it to support control flow, because I may decide which tables to join based on the input filters.</li>
<li>On the UI side, Pentaho helps you render your result as a graph, table, etc. Again, I don&#8217;t like this UI solution either. I found that <strong>JFreeChart </strong>is not as powerful as Flex. I am adopting Flex, which gives me much more powerful visualization tools. All I want is to ship my query&#8217;s result data to my Flex app.</li>
</ol>
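<p>To make the difference between token replacement and control flow concrete, here is a minimal sketch in plain Java. It is only an illustration under stated assumptions: the table and column names (orders, customers, region) and the filter keys are hypothetical, and plain Java is used instead of an iBatis template to keep the example self-contained. The point is that the join is added only when the matching filter is present, and that paging parameters are part of the generated query.</p>

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Builds a report query whose joins depend on which filters are present. */
class ReportQueryBuilder {

    /** Holds the generated SQL and the bind values in placeholder order. */
    static final class Query {
        final String sql;
        final List<Object> params;

        Query(String sql, List<Object> params) {
            this.sql = sql;
            this.params = params;
        }
    }

    // All table, column, and filter names below are hypothetical.
    static Query build(Map<String, Object> filters, int offset, int limit) {
        StringBuilder sql = new StringBuilder("SELECT o.id, o.total FROM orders o");
        List<String> where = new ArrayList<>();
        List<Object> params = new ArrayList<>();

        // Control flow: join customers only when the region filter is present.
        if (filters.containsKey("region")) {
            sql.append(" JOIN customers c ON c.id = o.customer_id");
            where.add("c.region = ?");
            params.add(filters.get("region"));
        }
        if (filters.containsKey("minTotal")) {
            where.add("o.total >= ?");
            params.add(filters.get("minTotal"));
        }
        if (!where.isEmpty()) {
            sql.append(" WHERE ").append(String.join(" AND ", where));
        }
        // Pagination; the LIMIT/OFFSET syntax varies by database.
        sql.append(" ORDER BY o.id LIMIT ? OFFSET ?");
        params.add(limit);
        params.add(offset);
        return new Query(sql.toString(), params);
    }

    public static void main(String[] args) {
        Query q = build(Map.of("region", "EMEA"), 0, 50);
        System.out.println(q.sql);
        System.out.println(q.params);
    }
}
```

<p>Values are passed as bind parameters rather than spliced into the SQL text, which is one more thing plain token replacement does not give you for free.</p>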
<p>How about <strong>Jasper</strong>? It is pretty much the same, but the good thing about Jasper is that it gives you a jar and documentation on how to use it, instead of a reporting server like Pentaho. So, I can use it as a report renderer to generate PDF and Excel, like the other utility libraries I use. So, what is my final solution?</p>
<p>I finally decided to create my own report definition, which my Flex UI can take and use to render the reporting interface. So, I don&#8217;t need to create a form for each report. Apart from that, my report definition embeds an iBatis SQL template, so I can leverage its dynamic SQL feature, which supports control flow logic, and its automatic population of result classes. Yes, I still need to handle pagination and streaming myself, but at least it already saves me time. The populated result object is returned to Flex via AMF, so I don&#8217;t need to marshal and unmarshal it as XML. That saves processing time and bandwidth. In the end, my solution combines the best in the market:</p>
<ol>
<li>Powerful reporting widgets provided by Flex</li>
<li>Fast streaming and RPC protocol &#8211; <strong>AMF</strong></li>
<li>Good dynamic SQL generation and mapping tool from <strong>iBatis</strong></li>
<li>Good report rendering tool from Jasper for PDF and Excel generation</li>
</ol>
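<p>Since pagination and streaming still have to be handled by hand, here is a rough sketch of what that part can look like. It is an assumption-laden illustration, not my production code: <code>PageSource</code> is a hypothetical interface standing in for whatever runs the paged query, and the in-memory list in <code>main</code> stands in for a database table. Only one page is held in memory at a time, and each row is handed to a consumer as soon as its page arrives.</p>

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Streams a large result to a consumer one page at a time. */
class PagedStreamer {

    /** Hypothetical data source: returns at most `limit` rows starting at `offset`. */
    interface PageSource<T> {
        List<T> fetch(int offset, int limit);
    }

    /** Pulls pages until a short page signals the end; returns the number of rows seen. */
    static <T> int stream(PageSource<T> source, int pageSize, Consumer<T> sink) {
        if (pageSize <= 0) {
            throw new IllegalArgumentException("pageSize must be positive");
        }
        int offset = 0;
        while (true) {
            List<T> page = source.fetch(offset, pageSize);
            page.forEach(sink);          // forward rows before fetching the next page
            offset += page.size();
            if (page.size() < pageSize) {
                return offset;           // a short (or empty) page means no more data
            }
        }
    }

    public static void main(String[] args) {
        // In-memory stand-in for a database table with 7 rows.
        List<Integer> table = List.of(1, 2, 3, 4, 5, 6, 7);
        PageSource<Integer> source = (offset, limit) ->
                table.subList(Math.min(offset, table.size()),
                              Math.min(offset + limit, table.size()));
        List<Integer> received = new ArrayList<>();
        int count = stream(source, 3, received::add);
        System.out.println(count + " rows: " + received);
    }
}
```

<p>With a page size of 3, the example fetches three pages and prints <code>7 rows: [1, 2, 3, 4, 5, 6, 7]</code>. In a real setup the consumer would write each row to the outgoing response instead of collecting it in a list.</p>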
<p>My solution is more flexible, as I can plug in Hibernate mappings if I don&#8217;t want to write my own SQL at all. Apart from that, no UI work is needed to deploy a new report unless my generic reporting interface is not enough.</p>
<p>Later, if I really need the workflow engine provided by Pentaho, I can plug it in. Again, though, the documentation provided doesn&#8217;t give clear instructions or APIs for how to do that.</p>
<h2>Reference</h2>
<p>Below are references I used to build my solution:</p>
<ol>
<li><a href="http://Flexible reporting with JasperReports and iBATIS">Flexible reporting with JasperReports and iBATIS</a></li>
<li><a href="http://e-docs.bea.com/kodo/docs41/full/html/ref_guide_dbsetup_lrs.html">How Kodo JPA handles large result sets</a> (its <a href="http://edocs.bea.com/kodo/docs41/full/html/ref_guide_optimization.html">optimization</a> guide is a good reference even if you do not use Kodo)</li>
<li><a href="http://my.advisor.com/articles.nsf/aid/14753">Process Large Result Sets in Java Web Application</a></li>
<li><a href="http://www.ahmadsoft.org/articles/stream/presidents.html">Streaming architecture</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.solutionhacker.com/implement-your-idea/build-your-website/reporting-solution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Business Intelligence &#8211; Part 1 Pentaho</title>
		<link>http://www.solutionhacker.com/data-intelligence/collective-intelligence/business-intelligence-part-1-pentaho/</link>
		<comments>http://www.solutionhacker.com/data-intelligence/collective-intelligence/business-intelligence-part-1-pentaho/#comments</comments>
		<pubDate>Thu, 26 Jun 2008 09:00:41 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Design]]></category>
		<category><![CDATA[Extract Intelligence]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[Kettle]]></category>
		<category><![CDATA[pentaho]]></category>
		<category><![CDATA[reporting]]></category>

		<guid isPermaLink="false">http://www.solutionhacker.com/?p=153</guid>
		<description><![CDATA[<h2>Getting into the Business Intelligence World</h2>
<p>When I dug deeper into business intelligence, I found that it is a huge topic ranging from reporting to data mining. As with any learning plan, I set a series of milestones for myself. If you are interested, here is my list:</p>
<p><strong>Get and prepare your data</strong></p>
<ul>
    <li>Data collection - log processing, web services (SOAP and REST), RSS, screen scraping and more.</li>
    <li>Data preparation and crunching - ETL (Kettle)</li>
    <li>Data storage - data warehousing</li>
</ul>
<p><strong>Visualize your data</strong></p>
<ul>
    <li>Reporting and Charting (Pentaho as server and Flex as frontend)</li>
</ul>
<p><strong>Analyze your data</strong></p>
<ul>
    <li>Data modeling</li>
    <li>Data analysis (OLAP)</li>
</ul>
<p><strong>Get smart about your data</strong></p>
<ul>
    <li>Collective intelligence</li>
    <li>Data mining</li>
</ul>
<p><!--more--></p>
<h2>Introduction of Pentaho</h2>
<p>First, I wanted to see whether there is any out-of-the-box open-source solution that captures what I am trying to do here. If so, I can reach my goal much faster. Yes, it has to be open-source, because I don't have the money and I don't want to be just a user. After doing a bit of homework, I found an open-source BI tool named Pentaho that looks pretty solid, so I decided to dive deep into it. As with all the tools I mess around with, I want to integrate Pentaho as a library. However, I couldn't find anything on the Net that shows how to do it. I looked into its download and checked out its pentaho-sample project. What it shows is how to use their tools to create a report on their system using their UI. I definitely need more!</p>
<p>After a few days of effort, I managed to strip all the unnecessary dependencies out of Pentaho. The heart of Pentaho is its xaction interpreter. Pentaho's approach is to write adapters, plug them into its framework, and use xaction to wire them up in a workflow fashion. In fact, most of its functionality comes from other open-source projects, such as Quartz for scheduling, Shark for the workflow engine, and JFreeReport for reporting. I don't think their xaction is clean, but I do like their architectural approach.</p>
<p>Enough talk, let's start! Here I will use a series of articles to shorten your learning curve and show you how to get familiar with Pentaho as a developer rather than as a user. First things first: follow the articles below to set up your environment.</p>
<ol>
    <li><a href="http://wiki.pentaho.com/display/PentahoDoc/07.+Debugging+with+the+Standalone+Platform+Project">http://wiki.pentaho.com/display/PentahoDoc/07.+Debugging+with+the+Standalone+Platform+Project</a></li>
    <li><a href="http://wiki.pentaho.com/display/PentahoDoc/Building+and+Debugging+Pentaho+with+Eclipse">http://wiki.pentaho.com/display/PentahoDoc/Building+and+Debugging+Pentaho+with+Eclipse</a></li>
    <li><a href="http://wiki.pentaho.com/display/PentahoDoc/Manual+Deployment+of+Pentaho">http://wiki.pentaho.com/display/PentahoDoc/Manual+Deployment+of+Pentaho</a></li>
</ol>
]]></description>
			<content:encoded><![CDATA[<h2>Getting into the Business Intelligence World</h2>
<p>When I dug deeper into business intelligence, I found that it is a huge topic ranging from reporting to data mining. As with any learning plan, I set a series of milestones for myself. If you are interested, here is my list:</p>
<p><strong>Get and prepare your data</strong></p>
<ul>
<li>Data collection &#8211; log processing, web services (SOAP and REST), RSS, screen scraping and more.</li>
<li>Data preparation and crunching &#8211; ETL (Kettle)</li>
<li>Data storage &#8211; data warehousing</li>
</ul>
<p><strong>Visualize your data</strong></p>
<ul>
<li>Reporting and Charting (Pentaho as server and Flex as frontend)</li>
</ul>
<p><strong>Analyze your data</strong></p>
<ul>
<li>Data modeling</li>
<li>Data analysis (OLAP)</li>
</ul>
<p><strong>Get smart about your data</strong></p>
<ul>
<li>Collective intelligence</li>
<li>Data mining</li>
</ul>
<p><span id="more-153"></span></p>
<h2>Introduction of Pentaho</h2>
<p>First, I wanted to see whether there is any out-of-the-box open-source solution that captures what I am trying to do here. If so, I can reach my goal much faster. Yes, it has to be open-source, because I don&#8217;t have the money and I don&#8217;t want to be just a user. After doing a bit of homework, I found an open-source BI tool named Pentaho that looks pretty solid, so I decided to dive deep into it. As with all the tools I mess around with, I want to integrate Pentaho as a library. However, I couldn&#8217;t find anything on the Net that shows how to do it. I looked into its download and checked out its pentaho-sample project. What it shows is how to use their tools to create a report on their system using their UI. I definitely need more!</p>
<p>After a few days of effort, I managed to strip all the unnecessary dependencies out of Pentaho. The heart of Pentaho is its xaction interpreter. Pentaho&#8217;s approach is to write adapters, plug them into its framework, and use xaction to wire them up in a workflow fashion. In fact, most of its functionality comes from other open-source projects, such as Quartz for scheduling, Shark for the workflow engine, and JFreeReport for reporting. I don&#8217;t think their xaction is clean, but I do like their architectural approach.</p>
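<p>To show what I mean by adapters wired up in a workflow fashion, here is a generic sketch of the architectural idea. It is not Pentaho code and does not use Pentaho's API: the <code>Component</code> interface, the <code>run</code> method, and the two example steps are all hypothetical names made up for this illustration. Each step reads from and writes to a shared context, and a small runner executes the steps in the order they are wired.</p>

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Generic sketch of pluggable steps executed in sequence over a shared context. */
class WorkflowSketch {

    /** A pluggable step (hypothetical interface, not a Pentaho type). */
    interface Component {
        void execute(Map<String, Object> context);
    }

    /** Runs the steps in wiring order; each step sees the outputs of the previous ones. */
    static Map<String, Object> run(List<Component> steps, Map<String, Object> inputs) {
        Map<String, Object> context = new LinkedHashMap<>(inputs);
        for (Component step : steps) {
            step.execute(context);
        }
        return context;
    }

    public static void main(String[] args) {
        // Stands in for an adapter that runs a query and stores the rows.
        Component query = ctx -> ctx.put("rows", List.of("a", "b"));
        // Stands in for an adapter that renders the rows into a report.
        Component render = ctx -> ctx.put("report", "Rows: " + ctx.get("rows"));
        System.out.println(run(List.of(query, render), Map.of()).get("report"));
    }
}
```

<p>Running it prints <code>Rows: [a, b]</code>. The wiring is done in Java here only for brevity; the appeal of the approach is that the order and inputs of the steps live outside the adapters themselves.</p>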
<p>Enough talk, let&#8217;s start! Here I will use a series of articles to shorten your learning curve and show you how to get familiar with Pentaho as a developer rather than as a user. First things first: follow the articles below to set up your environment.</p>
<ol>
<li><a href="http://wiki.pentaho.com/display/PentahoDoc/07.+Debugging+with+the+Standalone+Platform+Project">http://wiki.pentaho.com/display/PentahoDoc/07.+Debugging+with+the+Standalone+Platform+Project</a></li>
<li><a href="http://wiki.pentaho.com/display/PentahoDoc/Building+and+Debugging+Pentaho+with+Eclipse">http://wiki.pentaho.com/display/PentahoDoc/Building+and+Debugging+Pentaho+with+Eclipse</a></li>
<li><a href="http://wiki.pentaho.com/display/PentahoDoc/Manual+Deployment+of+Pentaho">http://wiki.pentaho.com/display/PentahoDoc/Manual+Deployment+of+Pentaho</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.solutionhacker.com/data-intelligence/collective-intelligence/business-intelligence-part-1-pentaho/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

