<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Jukka Zitting</title>
	<atom:link href="http://jukkaz.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://jukkaz.wordpress.com</link>
	<description>Software craftmanship</description>
	<lastBuildDate>Sun, 22 Nov 2009 23:38:40 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='jukkaz.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/61b91d1104160eee551653d13013e4cc?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Jukka Zitting</title>
		<link>http://jukkaz.wordpress.com</link>
	</image>
			<item>
		<title>Daily Shoot, week 1</title>
		<link>http://jukkaz.wordpress.com/2009/11/23/daily-shoot-week-1/</link>
		<comments>http://jukkaz.wordpress.com/2009/11/23/daily-shoot-week-1/#comments</comments>
		<pubDate>Sun, 22 Nov 2009 23:12:58 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[dailyshoot]]></category>
		<category><![CDATA[photography]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=254</guid>
		<description><![CDATA[A week ago James Duncan Davidson and Mike Clark launched @dailyshoot, a Twitter feed that posts daily photo assignments. The idea is to encourage people who want to learn photography to practice it every day with the help of a simple assignment that fits a single tweet. I&#8217;m following Duncan&#8217;s blog, so I found out [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=254&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>A week ago <a href="http://duncandavidson.com/">James Duncan Davidson</a> and <a href="http://clarkware.com/about.html">Mike Clark</a> launched <a href="http://twitter.com/dailyshoot">@dailyshoot</a>, a Twitter feed that posts daily photo assignments. The idea is to encourage people who want to learn photography to practice it every day with the help of a simple assignment that fits a single tweet. I&#8217;m following Duncan&#8217;s blog, so I <a title="The Daily Shoot" href="http://blog.duncandavidson.com/2009/11/the-daily-shoot.html">found out</a> about Daily Shoot the day it was launched.</p>
<p>So far I&#8217;ve completed all the assignments and I&#8217;ve already learned quite a bit doing so. It&#8217;s very interesting to see how other people interpret the same assignments. I avoid looking at other responses before completing an assignment so that I don&#8217;t end up just copying someone else&#8217;s approach. Once I&#8217;m done I look at what other&#8217;s have done for some nice insight on what I could have done differently. The process is quite educational.</p>
<p>Here&#8217;s what I&#8217;ve shot this week:</p>
<p style="text-align:center;"><a href="http://www.flickr.com/photos/jlz/4110418300/"><img class="alignnone" title="The Red Desert" src="http://farm3.static.flickr.com/2804/4110418300_4368b4b7dd_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4113281186/"><img class="alignnone" title="Martinsgasse" src="http://farm3.static.flickr.com/2762/4113281186_b80c2b5c50_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4115683738/"><img class="alignnone" title="Into the Unknown" src="http://farm3.static.flickr.com/2750/4115683738_6902197e9a_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4117124465/"><img class="alignnone" title="Satrap" src="http://farm3.static.flickr.com/2591/4117124465_f3250abbb6_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4119973735/"><img class="alignnone" title="Four" src="http://farm3.static.flickr.com/2773/4119973735_af899a27e9_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4122393762/"><img class="alignnone" title="Fier Franken fünfundachtzig" src="http://farm3.static.flickr.com/2622/4122393762_46642d7b60_m.jpg" alt="" width="240" height="160" /></a> <a href="http://www.flickr.com/photos/jlz/4125156460/"><img class="alignnone" title="Der Basler" src="http://farm3.static.flickr.com/2755/4125156460_2976ac927d_m.jpg" alt="" width="240" height="160" /></a></p>
<p>You can click on the pictures for more background on each assignment and how I approached it. For more information on Daily Shoot, see the recently launched <a href="http://dailyshoot.com/">website</a>.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/254/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/254/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/254/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=254&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/11/23/daily-shoot-week-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2804/4110418300_4368b4b7dd_m.jpg" medium="image">
			<media:title type="html">The Red Desert</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2762/4113281186_b80c2b5c50_m.jpg" medium="image">
			<media:title type="html">Martinsgasse</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2750/4115683738_6902197e9a_m.jpg" medium="image">
			<media:title type="html">Into the Unknown</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2591/4117124465_f3250abbb6_m.jpg" medium="image">
			<media:title type="html">Satrap</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2773/4119973735_af899a27e9_m.jpg" medium="image">
			<media:title type="html">Four</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2622/4122393762_46642d7b60_m.jpg" medium="image">
			<media:title type="html">Fier Franken fünfundachtzig</media:title>
		</media:content>

		<media:content url="http://farm3.static.flickr.com/2755/4125156460_2976ac927d_m.jpg" medium="image">
			<media:title type="html">Der Basler</media:title>
		</media:content>
	</item>
		<item>
		<title>Content Repository over HTTP</title>
		<link>http://jukkaz.wordpress.com/2009/11/18/content-repository-over-http/</link>
		<comments>http://jukkaz.wordpress.com/2009/11/18/content-repository-over-http/#comments</comments>
		<pubDate>Wed, 18 Nov 2009 01:45:11 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=251</guid>
		<description><![CDATA[Two weeks ago during the BarCamp at the ApacheCon US I chaired a short session titled &#8220;The RESTful Content Repository&#8221;. The idea of the session was to discuss the various ways that existing content repositories support RESTful access over HTTP and to perhaps find some common ground from which a generic content repository protocol could [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=251&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Two weeks ago during the <a href="http://us.apachecon.com/c/acus2009/schedule/barcamp">BarCamp</a> at the <a href="http://us.apachecon.com/c/acus2009/">ApacheCon US</a> I chaired a short session titled &#8220;The RESTful Content Repository&#8221;. The idea of the session was to discuss the various ways that existing content repositories support RESTful access over HTTP and to perhaps find some common ground from which a generic content repository protocol could be formulated.</p>
<p>The <a title="Representational State Transfer" href="http://www.ics.uci.edu/~fielding/pubs/dissertation/rest_arch_style.htm">REST architectural style</a> was generally accepted as a useful set of constraints for the architecture of distributed content-based applications, but as an architectural style it doesn&#8217;t define what the bits on the wire should look like. This is what we set out to define with the <a title="Hypertext Transfer Protocol -- HTTP/1.1" href="http://tools.ietf.org/html/rfc2616">HTTP protocol</a> as a baseline. We didn&#8217;t get too far, but see below for some collected thoughts and a useful set of &#8220;test cases&#8221; that I hope to use to further investigate this idea.</p>
<p><strong>Existing solutions</strong></p>
<p>Many existing content repositories and related products already support one or more HTTP-based access patterns: <a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a> exposes two slightly different WebDAV-based access points. <a href="http://sling.apache.org/">Apache Sling</a> adds the <a href="http://sling.apache.org/site/manipulating-content-the-slingpostservlet-servletspost.html">SlingPostServlet</a> and default JSON and XML renderings of content. <a href="http://couchdb.apache.org/">Apache CouchDB</a> uses JSON over HTTP as the primary access protocol. <a href="http://lucene.apache.org/solr/">Apache Solr</a> uses XML over HTTP. <a href="http://www.midgard-project.org/">Midgard</a> doesn&#8217;t have a built-in HTTP binding for content, but makes it very easy to implement such bindings. This list just scratches the surface&#8230;</p>
<p>There are even existing generic protocols that match at least parts of what we wanted to achieve. <a href="http://tools.ietf.org/html/rfc2518">WebDAV</a> has been around for ten years already, but the way it extends HTTP with extra methods makes it harder to use with existing HTTP clients and libraries. The <a href="http://tools.ietf.org/html/rfc5023">AtomPub protocol</a> solves that issue, but being based on the <a href="http://tools.ietf.org/html/rfc4287">Atom format</a> and leaving much of the server behaviour undefined, AtomPub may not be the best solution for generic content repositories.</p>
<p><strong>Content repository operations over HTTP</strong></p>
<p>To better understand the needs and capabilities of existing solutions, we should come up with a simple set of content operations and find out if and how different systems support those operations over HTTP. The most basic such set of operations is CRUD, i.e. how to create, read, update, and delete a document, so let&#8217;s start with that. I&#8217;m giving each operation a key (CRn, as in &#8220;Content Repository operation N&#8221;) and a brief description of what&#8217;s expected. In later posts I hope to explore how these operations can be implemented with <a href="http://curl.haxx.se/">curl</a> or some other simple HTTP client accessing various kinds of content repositories. I&#8217;m also planning to extend the set of required operations to cover features like search, linking, versioning, transactions, etc.</p>
<p><strong>CR1: Create a document</strong></p>
<p>Documents with simple properties like strings and dates are basic building blocks of all content applications. How can I create a new document with the following properties?</p>
<ul>
<li>title = &#8220;Hello, World!&#8221; (string)</li>
<li>date = 2009-11-17 (date)</li>
</ul>
<p>At the end of this operation I should have a URL that I can use to access the created document.</p>
<p><strong>CR2: Read a document</strong></p>
<p>Given the URL of a document (see CR1), how do I read the properties of that document?</p>
<p>The retrieved property values should match the values given when the document was created.</p>
<p><strong>CR3: Update a document</strong></p>
<p>Given the URL of a document (see CR1), how do update the properties of that document? For example, I want to update the existing date property and add a new string property:</p>
<ul>
<li>date = 2009-11-18 (date)</li>
<li>history = &#8220;Document date updated&#8221; (string)</li>
</ul>
<p>When the document is read (see CR2) after this update, the retrieved information should contain the original title and the above updated date and history values.</p>
<p><strong>CR4: Delete a document</strong></p>
<p>Given the URL of a document (see CR1), how do I delete that document?</p>
<p>Once deleted, it should no longer be possible to read (see CR2) or update (see CR3) the document.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/251/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/251/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/251/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=251&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/11/18/content-repository-over-http/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>
	</item>
		<item>
		<title>NoSQL interests</title>
		<link>http://jukkaz.wordpress.com/2009/10/27/nosql-interests/</link>
		<comments>http://jukkaz.wordpress.com/2009/10/27/nosql-interests/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 10:01:12 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[apachecon]]></category>
		<category><![CDATA[meetup]]></category>
		<category><![CDATA[nosql]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=246</guid>
		<description><![CDATA[We&#8217;re organizing a NoSQL meetup in Oakland on Monday next week. In addition to helping set the meetup agenda, the &#8220;Topics you are interested in&#8221; question in the sign up form provides some interesting insight on the current interests of the NoSQL community. Here&#8217;s a quick breakdown of the key terms distilled from the 88 [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=246&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><a href="http://www.nosqloakland.org/"><img class="alignright size-full wp-image-247" title="NoSQL Oakland" src="http://jukkaz.files.wordpress.com/2009/10/nosqloakland-small.png?w=203&#038;h=44" alt="NoSQL Oakland" width="203" height="44" /></a>We&#8217;re organizing a <a href="http://www.nosqloakland.org/">NoSQL meetup in Oakland</a> on Monday next week. In addition to helping set the meetup agenda, the &#8220;Topics you are interested in&#8221; question in the <a href="http://spreadsheets.google.com/viewform?formkey=dENwRmlTMlhGZ3lfclJqYW9hVGlkTHc6MA">sign up form</a> provides some interesting insight on the current interests of the NoSQL community. Here&#8217;s a quick breakdown of the key terms distilled from the 88 signups we&#8217;ve received so far.</p>
<p>Note that the data is biased towards Apache projects due to the meetup being organized at <a href="http://us.apachecon.com/c/acus2009/">ApacheCon US 2009</a>.</p>
<h2>Projects</h2>
<p>The following open source projects were mentioned. The list is in alphabetical order, as the data set is too small to make any reasonable ordering by popularity.</p>
<ul>
<li><a href="http://incubator.apache.org/cassandra/">Cassandra</a></li>
<li><a href="http://couchdb.apache.org/">CouchDB</a></li>
<li><a href="http://hadoop.apache.org/">Hadoop</a></li>
<li><a href="http://hadoop.apache.org/hbase/">HBase</a></li>
<li><a href="http://hadoop.apache.org/hdfs/">HDFS</a></li>
<li><a href="http://jackrabbit.apache.org/">Jackrabbit</a></li>
<li><a href="http://lucene.apache.org/">Lucene</a></li>
<li><a href="http://lucene.apache.org/mahout/"> Mahout</a></li>
<li><a href="http://www.danga.com/memcached/">memcached</a></li>
<li><a href="http://www.mongodb.org/">MongoDB</a></li>
<li><a href="http://code.google.com/p/redis/">Redis</a></li>
<li><a href="http://riak.basho.com/">Riak</a></li>
<li><a href="http://code.google.com/p/scalaris/">Scalaris</a></li>
<li><a href="http://sling.apache.org/">Sling</a></li>
<li><a href="http://1978th.net/tokyocabinet/">Tokyo Cabinet</a></li>
<li><a href="http://project-voldemort.com/">Voldemort</a></li>
</ul>
<h2>Topics</h2>
<p>Many responses were about the &#8220;big data&#8221; aspect of the NoSQL movement. Some frequent keywords: distributed storage, large transactional data, consistency, failover, availability, reliability, stability, failure detection, failed node replacement, (petabyte) scalability, consistency levels, storage technology, performance, benchmarks, optimization, backup and recovery, map/reduce</p>
<p>Another common theme were the various database types and the NoSQL &#8220;development model&#8221;. Keywods: document stores, key/value stores, consistent hashing, graph databases, object databases, persistent queues, content modeling, migration from the relational model, social graphs, streaming, software as a service, offline applications, full text search, natural language processing</p>
<p>Beyond the above big themes, I found it interesting that the following technologies were specifically named: Erlang, Java, WebSimpleDB, WebDAV</p>
<p>In addition to specific topics, many people were asking for case studies or &#8220;lessons learned&#8221; -type presentations.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/246/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/246/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/246/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/246/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/246/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/246/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/246/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/246/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/246/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/246/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=246&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/10/27/nosql-interests/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/10/nosqloakland-small.png" medium="image">
			<media:title type="html">NoSQL Oakland</media:title>
		</media:content>
	</item>
		<item>
		<title>Putting POI on a diet</title>
		<link>http://jukkaz.wordpress.com/2009/10/16/putting-poi-on-a-diet/</link>
		<comments>http://jukkaz.wordpress.com/2009/10/16/putting-poi-on-a-diet/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 22:39:21 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[ooxml]]></category>
		<category><![CDATA[poi]]></category>
		<category><![CDATA[tika]]></category>
		<category><![CDATA[xmlbeans]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=239</guid>
		<description><![CDATA[The Apache POI team is doing an amazing job at making Microsoft Office file formats more accessible to the open source Java world. One of the projects that benefits from their work is Apache Tika that uses POI to extract text content and metadata from all sorts of Office documents.

However, there&#8217;s one problem with POI [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=239&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The <a href="http://poi.apache.org/">Apache POI</a> team is doing an amazing job at making <a href="http://office.microsoft.com/">Microsoft Office</a> file formats more accessible to the open source Java world. One of the projects that benefits from their work is <a href="http://lucene.apache.org/tika/">Apache Tika</a> that uses POI to extract text content and metadata from all sorts of Office documents.</p>
<p><a href="http://poi.apache.org/"><img class="aligncenter size-full wp-image-240" title="Apache POI" src="http://jukkaz.files.wordpress.com/2009/10/poi.jpg?w=138&#038;h=126" alt="Apache POI" width="138" height="126" /></a></p>
<p>However, there&#8217;s one problem with POI that I&#8217;d like to see fixed: It&#8217;s too big.</p>
<p>More specifically, the <a href="http://www.jarvana.com/jarvana/archive-details/org/apache/poi/ooxml-schemas/1.0/ooxml-schemas-1.0.jar">ooxml-schemas jar</a> used by POI for the pre-generated <a title="Apache XMLBeans" href="http://xmlbeans.apache.org/">XMLBeans</a> bindings for the <a title="ECMA-376: Office Open XML File Formats" href="http://www.ecma-international.org/publications/standards/Ecma-376.htm">Office Open XML</a> schemas is taking up over 50% of the 25MB size of the current Tika application. The pie chart below illustrates the relative sizes of the different parser library dependencies of Tika:</p>
<p><img class="aligncenter size-full wp-image-241" title="Relative sizes of Tika parser dependencies" src="http://jukkaz.files.wordpress.com/2009/10/tika-pie.png?w=500&#038;h=200" alt="Relative sizes of Tika parser dependencies" width="500" height="200" /></p>
<p>Both PDF and the Microsoft Office formats are pretty big and complex, so one can expect the relevant parser libraries to be large. But the 14MB size of the ooxml-schemas jar seems excessive, especially since the standard OOXML schema package from which the ooxml-schemas jar is built is only 220KB in size.</p>
<p>Does anyone have good ideas on how to best trim down this OOXML dependency?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/239/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/239/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/239/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=239&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/10/16/putting-poi-on-a-diet/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/10/poi.jpg" medium="image">
			<media:title type="html">Apache POI</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/10/tika-pie.png" medium="image">
			<media:title type="html">Relative sizes of Tika parser dependencies</media:title>
		</media:content>
	</item>
		<item>
		<title>Some graphics work for a change</title>
		<link>http://jukkaz.wordpress.com/2009/09/23/some-graphics-work-for-a-change/</link>
		<comments>http://jukkaz.wordpress.com/2009/09/23/some-graphics-work-for-a-change/#comments</comments>
		<pubDate>Wed, 23 Sep 2009 17:11:59 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[Jackrabbit]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=235</guid>
		<description><![CDATA[I&#8217;ve recently spent some effort in improving the look of the Apache Jackrabbit website. I&#8217;m no designer, so the results aren&#8217;t that great, but it&#8217;s been a nice break from the regular project work. And I got to brush up my Photoshop and Gimp skills.
One part of the effort was creating an icon for the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=235&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I&#8217;ve recently spent <a title="JCRSITE-20: Site readability improvements" href="https://issues.apache.org/jira/browse/JCRSITE-20">some</a> <a title="fancy download button" href="http://twitter.com/jukkaz/status/3243226733">effort</a> in improving the look of the <a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a> website. I&#8217;m no designer, so the results aren&#8217;t that great, but it&#8217;s been a nice break from the regular project work. And I got to brush up my <a title="Adobe Photoshop" href="http://www.adobe.com/products/photoshop/photoshop/">Photoshop</a> and <a title="The GNU Image Manipulation Program" href="http://www.gimp.org/">Gimp</a> skills.</p>
<p>One part of the effort was <a title="JCRSITE-24: Jackrabbit favicon" href="https://issues.apache.org/jira/browse/JCRSITE-24">creating an icon</a> for the site. Previously the site used the feather icon used as the default on all Apache project sites, but I wanted a Jackrabbit-specific icon that helps me to quickly identify and access Jackrabbit pages among the numerous tabs I usually have open in my browser. The work is a good example of incremental improvements in action:</p>
<p><img class="aligncenter size-full wp-image-236" title="Jackrabbit icon steps" src="http://jukkaz.files.wordpress.com/2009/09/jackrabbit-icon-work.png?w=372&#038;h=218" alt="Jackrabbit icon steps" width="372" height="218" /></p>
<p>I started with a copy of the Jackrabbit logo with nice alpha-layered transparent background. It looked great until I noticed that some browsers lost the smooth alpha layer and instead resulted in a rather badly aliased icon seen above.</p>
<p>The straightforward solution was to add a white background as can be seen in step 2. That worked already pretty well in all browsers.</p>
<p>After a few days of watching the icon I found it a bit too blocky to my taste, so I tried to restore some of the nice transparency effect by rounding the corners a bit. I&#8217;m pretty happy with the result.</p>
<p>Of course, if you have design talent and think you can do better, go for it!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/235/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/235/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/235/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/235/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/235/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/235/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/235/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/235/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/235/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/235/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=235&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/09/23/some-graphics-work-for-a-change/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/09/jackrabbit-icon-work.png" medium="image">
			<media:title type="html">Jackrabbit icon steps</media:title>
		</media:content>
	</item>
		<item>
		<title>Release time</title>
		<link>http://jukkaz.wordpress.com/2009/09/19/release-time/</link>
		<comments>http://jukkaz.wordpress.com/2009/09/19/release-time/#comments</comments>
		<pubDate>Sat, 19 Sep 2009 11:02:43 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[ASF]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=231</guid>
		<description><![CDATA[There&#8217;s lots of upcoming release activity at the Apache projects I&#8217;m more or less involved with:

The incubating Apache PDFBox project is just about to release the eagerly anticipated 0.8.0 release. I&#8217;m expecting to see the release announcement on Tuesday next week. PDFBox is a Java library for working with PDF documents.
Another incubating project, Apache UIMA, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=231&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>There&#8217;s lots of upcoming release activity at the Apache projects I&#8217;m more or less involved with:</p>
<ul>
<li><a href="http://incubator.apache.org/pdfbox/"><img class="alignright" title="Apache PDFBox" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/5981/logo_med.png" alt="" width="64" height="64" /></a>The incubating <a href="http://incubator.apache.org/pdfbox/">Apache PDFBox</a> project is just about to release the <a title="Apache PDFBox status update" href="http://jukkaz.wordpress.com/2009/01/23/apache-pdfbox-status-update/">eagerly anticipated</a> 0.8.0 release. I&#8217;m expecting to see the release announcement on Tuesday next week. PDFBox is a Java library for working with PDF documents.</li>
<li><a href="http://incubator.apache.org/uima/"><img class="alignright" title="Apache UIMA" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/23695/uima-icon_med.png" alt="" width="64" height="64" /></a>Another incubating project, <a href="http://incubator.apache.org/uima/">Apache UIMA</a>, is <a title="Marshall Schor: Making good progress to 2.3.0 release, next checkpoint at close of next week" href="http://markmail.org/message/p3463oyvgohknxev">working towards</a> the 2.3.0 release. I&#8217;m looking forward to seeing both UIMA and PDFBox graduating from the <a href="http://incubator.apache.org/">Apache Incubator</a> shortly after the respective releases. UIMA is a framework and a set of components for analyzing large volumes of unstructured information.</li>
<li><a href="http://sling.apache.org/"><img class="alignright" title="Apache Sling" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/8717/sling_med.png" alt="" width="64" height="64" /></a>The <a href="http://sling.apache.org/">Apache Sling</a> project is a component-based project like <a href="http://felix.apache.org/">Apache Felix</a>, so there is no clear project-wide release cycle.  Instead Sling is about to start releasing new versions of most of the components changed since the all-inclusive incubator releases. Sling is a JCR-based web framework.</li>
<li><a href="http://lucene.apache.org/tika/"><img class="alignright" title="Apache Tika" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/8697/tikaNoText_med.png" alt="" width="64" height="64" /></a><a href="http://lucene.apache.org/tika/">Apache Tika</a> uses PDFBox for extracting text content from PDF documents. I&#8217;m hoping to see a Tika 0.5 release soon with the latest PDFBox dependency and the <a title="TIKA-275: Parse context" href="https://issues.apache.org/jira/browse/TIKA-275">design improvements</a> I&#8217;ve been working on. Tika is a toolkit for extracting text and metadata from all kinds of documents.</li>
<li><a href="http://lucene.apache.org/solr/"><img class="alignright" title="Apache Solr" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/13297/solr_FC_med.jpg" alt="" width="64" height="35" /></a><a href="http://lucene.apache.org/solr/">Apache Solr</a> is about to enter <a title="Yonik Seeley: solr 1.4 release schedule" href="http://markmail.org/message/6mb442fxjtq3dt6m">code freeze</a> in preparation for the 1.4 release that will include the &#8220;<a title="ExtractingRequestHandler" href="http://wiki.apache.org/solr/ExtractingRequestHandler">Solar Cell</a>&#8221; feature based on Tika. Solr is a search server based on Lucene.</li>
<li>The <a href="http://commons.apache.org/io/">Commons IO</a> project has been upgraded to use Java 5 features and I&#8217;m starting to <a title="Jukka Zitting: [io] Towards the 2.0 release" href="http://markmail.org/message/poqipa3yisit53wt">push it</a> towards a 2.0 release. Commons IO is a library of Java IO utilities.</li>
<li><a href="http://lucene.apache.org/java/"><img class="alignright" title="Lucene Java" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/23787/lucene_med.png" alt="" width="64" height="64" /></a><a href="http://lucene.apache.org/java/">Lucene Java</a> is <a title="Yonik Seeley: Re: Lucene 2.9 RC4 now available for testing" href="http://markmail.org/message/5vez3yxwhznxaylv">gearing up</a> for the 2.9 release, and will soon <a title="DM Smith: Lucene 3.0 and Java 5 (was Re: Finishing Lucene 2.9)" href="http://markmail.org/message/3v7aj5yhizggoikm">follow up</a> with the 3.0 release. The<a title="LUCENE-1470: Add TrieRangeFilter to contrib" href="https://issues.apache.org/jira/browse/LUCENE-1470"> trie range</a> feature is an especially welcome addition for many use cases. Lucene is a feature-rich high performance search engine.</li>
<li><a href="http://jackrabbit.apache.org/"><img class="alignright" title="Apache Jackrabbit" src="https://s3.amazonaws.com/bits.ohloh.net/attachments/2233/jlogo64_med.png" alt="" width="64" height="64" /></a>And last but not least, <a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a> is <a title="Jukka Zitting: Re: Jackrabbit 2.0 release plan" href="http://markmail.org/message/ced7d6kgdnos6atw">getting ready</a> to release the 2.0 version based on the <a title="JSR 283 Final Approval Ballot" href="http://jcp.org/en/jsr/results?id=4979">recently approved</a> JCR 2.0 standard. Jackrabbit is a feature-complete JCR content repository implementation.</li>
</ul>
<p>I&#8217;m hoping to see most of these releases happening in time for the <a href="http://us.apachecon.com/c/acus2009/">ApacheCon US 2009</a> conference in early November.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/231/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/231/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/231/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/231/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/231/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/231/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/231/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/231/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/231/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/231/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=231&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/09/19/release-time/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/5981/logo_med.png" medium="image">
			<media:title type="html">Apache PDFBox</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/23695/uima-icon_med.png" medium="image">
			<media:title type="html">Apache UIMA</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/8717/sling_med.png" medium="image">
			<media:title type="html">Apache Sling</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/8697/tikaNoText_med.png" medium="image">
			<media:title type="html">Apache Tika</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/13297/solr_FC_med.jpg" medium="image">
			<media:title type="html">Apache Solr</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/23787/lucene_med.png" medium="image">
			<media:title type="html">Lucene Java</media:title>
		</media:content>

		<media:content url="//s3.amazonaws.com/bits.ohloh.net/attachments/2233/jlogo64_med.png" medium="image">
			<media:title type="html">Apache Jackrabbit</media:title>
		</media:content>
	</item>
		<item>
		<title>Apache Jackrabbit 1.6.0 released</title>
		<link>http://jukkaz.wordpress.com/2009/08/11/apache-jackrabbit-1-6-0-released/</link>
		<comments>http://jukkaz.wordpress.com/2009/08/11/apache-jackrabbit-1-6-0-released/#comments</comments>
		<pubDate>Tue, 11 Aug 2009 15:11:03 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[JCR]]></category>
		<category><![CDATA[Jackrabbit]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=225</guid>
		<description><![CDATA[The Apache Jackrabbit project has just released Jackrabbit version 1.6.0. This release will most likely be the latest JCR 1.0 -based Jackrabbit 1.x minor release before the upcoming Jackrabbit 2.0 and the upgrade to JCR version 2.0. The purpose goal of this release is to push out as many of the recent Jackrabbit trunk improvements [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=225&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The <a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a> project has just released Jackrabbit version 1.6.0. This release will most likely be the latest JCR 1.0 -based Jackrabbit 1.x minor release before the upcoming Jackrabbit 2.0 and the upgrade to JCR version 2.0. The purpose goal of this release is to push out as many of the recent Jackrabbit trunk improvements as possible so that the number of new things in Jackrabbit 2.0 remains manageable.</p>
<p style="text-align:center;"><a href="http://jackrabbit.apache.org/downloads.html"><img class="size-full wp-image-226" title="Download Apache Jackrabbit 1.6.0" src="http://jukkaz.files.wordpress.com/2009/08/jackrabbit-download-1-6-0.png?w=368&#038;h=95" alt="Download Apache Jackrabbit 1.6.0" width="368" height="95" /></a></p>
<p>The most notable changes and new features in this release are:</p>
<ul>
<li>The RepositoryCopier tool makes it easy to backup and migrate repositories (<a href="https://issues.apache.org/jira/browse/JCR-442">JCR-442</a>). There is also improved support for selectively copying content and version histories between repositories (<a href="https://issues.apache.org/jira/browse/JCR-1972">JCR-1972</a>).</li>
<li>A new WebDAV-based JCR remoting layer has been added to complement the existing JCR-RMI layer <a href="https://issues.apache.org/jira/browse/JCR-1877">(JCR-1877</a>, <a href="https://issues.apache.org/jira/browse/JCR-1958">JCR-1958</a>).</li>
<li>Query performance has been further optimized (<a href="https://issues.apache.org/jira/browse/JCR-1820">JCR-1820</a>, <a href="https://issues.apache.org/jira/browse/JCR-1855">JCR-1855</a> and <a href="https://issues.apache.org/jira/browse/JCR-2025">JCR-2025</a>).</li>
<li>Added support for Ingres and MaxDB/SapDB databases (<a href="https://issues.apache.org/jira/browse/JCR-1960">JCR-1960</a>, <a href="https://issues.apache.org/jira/browse/JCR-1527">JCR-1527</a>).</li>
<li>Session.refresh() can now be used to synchronize a cluster node with changes from the other nodes in the cluster (<a href="https://issues.apache.org/jira/browse/JCR-1753">JCR-1753</a>).</li>
<li>Unreferenced version histories are now automatically removed once all the contained versions have been removed (<a href="https://issues.apache.org/jira/browse/JCR-134">JCR-134</a>).</li>
<li>Standalone components like the JCR-RMI layer and the OCM framework have been moved to a separate <a href="http://jackrabbit.apache.org/commons/">JCR Commons</a> subproject of Jackrabbit, and are not included in this release. Updates to those components will be distributed as separate releases.</li>
<li>Development preview: There are even more <a href="http://jcp.org/en/jsr/summary?id=283">JSR 283</a> features in Jackrabbit 1.6 than were included in the 1.5 version. These new features are accessible through special &#8220;jsr283&#8243; interfaces in the Jackrabbit API. Note that none of these features are ready for production use, and will be replaced with final JCR 2.0 versions in Jackrabbit 2.0.</li>
</ul>
<p>This release is the result of contributions from <a title="Jira contribution report for Jackrabbit 1.6.0" href="https://issues.apache.org/jira/secure/ConfigureReport.jspa?versionId=12313459&amp;issueStatus=all&amp;selectedProjectId=10591&amp;reportKey=com.sourcelabs.jira.plugin.report.contributions:contributionreport&amp;Next=Next">quite a few people</a>. Thanks to everyone involved, this is open source in action!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/225/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=225&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/08/11/apache-jackrabbit-1-6-0-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/08/jackrabbit-download-1-6-0.png" medium="image">
			<media:title type="html">Download Apache Jackrabbit 1.6.0</media:title>
		</media:content>
	</item>
		<item>
		<title>JCR 2.0 implementation progress</title>
		<link>http://jukkaz.wordpress.com/2009/07/18/jcr-2-0-implementation-progress/</link>
		<comments>http://jukkaz.wordpress.com/2009/07/18/jcr-2-0-implementation-progress/#comments</comments>
		<pubDate>Sat, 18 Jul 2009 17:38:30 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[JCR]]></category>
		<category><![CDATA[Jackrabbit]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=216</guid>
		<description><![CDATA[The JCR 2.0 API specified by JSR 283 has been in Proposed Final Draft (PFD) stage since March, and Apache Jackrabbit developers have been busy implementing all the specified new features and adding compliance test cases for them.

Both the Reference Implementation (RI) and the Technology Compatibility Kit (TCK) of JSR 283 will be based on [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=216&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The JCR 2.0 API specified by <a title="JSR 283: Content Repository for JavaTM Technology API Version 2.0" href="http://jcp.org/en/jsr/summary?id=283">JSR 283</a> has been in Proposed Final Draft (PFD) stage since March, and <a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a> developers have been busy implementing all the specified new features and adding compliance test cases for them.</p>
<p><a href="http://jackrabbit.apache.org/"><img class="aligncenter size-full wp-image-128" title="Apache Jackrabbit" src="http://jukkaz.files.wordpress.com/2009/01/jackrabbit-logo.gif?w=336&#038;h=100" alt="Apache Jackrabbit" width="336" height="100" /></a></p>
<p>Both the Reference Implementation (RI) and the Technology Compatibility Kit (TCK) of JSR 283 will be based on Jackrabbit code, and we expect the final version of the specification to be released shortly after Jackrabbit trunk becomes feature-complete and the API coverage of the TCK reaches 100%. The following two graphs illustrate our progress on both these fronts.</p>
<p>First a track of all the JCR 2.0 implementation tasks we&#8217;ve filed under the <a title="JSR 283 support" href="https://issues.apache.org/jira/browse/JCR-1104">JCR-1104</a> collection issue. The amount of work per each sub-task is not uniform, so this graph only shows the general trend and does not suggest any specific completion date.</p>
<p><img class="aligncenter size-full wp-image-222" title="jcr-20-implementation" src="http://jukkaz.files.wordpress.com/2009/07/jcr-20-implementation1.png?w=500&#038;h=291" alt="jcr-20-implementation" width="500" height="291" /></p>
<p>The second graph tracks the TCK API coverage. We started with the JCR 1.0 TCK, so the first 300-400 method signatures were already covered with few changes to existing test code. Based on Julian&#8217;s API coverage reports in <a title="test case (TCK) maintenance for JCR 2.0" href="https://issues.apache.org/jira/browse/JCR-2085">JSR-2085</a>, this graph tracks progress in covering the 100+ new method signatures introduced in JCR 2.0. Again, the graph is meant to show just a general trend and should not be used to extrapolate future progress.</p>
<p><img class="aligncenter size-full wp-image-218" title="JCR 2.0 TCK API coverage" src="http://jukkaz.files.wordpress.com/2009/07/jcr-api-coverage.png?w=500&#038;h=291" alt="JCR 2.0 TCK API coverage" width="500" height="291" /></p>
<p>Wan&#8217;t to see JCR 2.0 in action? The latest Jackrabbit 2.0 alpha releases are <a href="http://jackrabbit.apache.org/downloads.html">available for download</a>!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/216/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/216/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/216/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=216&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/07/18/jcr-2-0-implementation-progress/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/01/jackrabbit-logo.gif" medium="image">
			<media:title type="html">Apache Jackrabbit</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/07/jcr-20-implementation1.png" medium="image">
			<media:title type="html">jcr-20-implementation</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/07/jcr-api-coverage.png" medium="image">
			<media:title type="html">JCR 2.0 TCK API coverage</media:title>
		</media:content>
	</item>
		<item>
		<title>Commits per weekday and hour</title>
		<link>http://jukkaz.wordpress.com/2009/06/04/commits-per-weekday-and-hour/</link>
		<comments>http://jukkaz.wordpress.com/2009/06/04/commits-per-weekday-and-hour/#comments</comments>
		<pubDate>Wed, 03 Jun 2009 23:33:39 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[ASF]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=211</guid>
		<description><![CDATA[The punchcard graphs at Github are a nice way to quickly detect the rough geographical distribution (or nighttime coding habits) of the key contributors of an open source project. Here&#8217;s a few selected examples from the ASF.
Apache HTTP Server

Apache Maven (core)

Apache Jackrabbit

       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=211&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The punchcard graphs at <a href="http://github.com/">Github</a> are a nice way to quickly detect the rough geographical distribution (or nighttime coding habits) of the key contributors of an open source project. Here&#8217;s a few selected examples from <a title="The Apache Software Foundation" href="http://www.apache.org/">the ASF</a>.</p>
<p><a href="http://httpd.apache.org/">Apache HTTP Server</a></p>
<p><a href="http://github.com/apache/httpd/graphs/punch_card"><img class="size-full wp-image-212 alignnone" title="Apache HTTP Server" src="http://jukkaz.files.wordpress.com/2009/06/httpd.png?w=500&#038;h=187" alt="Apache HTTP Server" width="500" height="187" /></a></p>
<p><a href="http://maven.apache.org/">Apache Maven</a> (core)</p>
<p><a href="http://github.com/apache/maven/graphs/punch_card"><img class="alignnone size-full wp-image-214" title="Apache Maven" src="http://jukkaz.files.wordpress.com/2009/06/maven.png?w=500&#038;h=187" alt="Apache Maven" width="500" height="187" /></a></p>
<p><a href="http://jackrabbit.apache.org/">Apache Jackrabbit</a></p>
<p><a href="http://github.com/apache/jackrabbit/graphs/punch_card"><img class="alignnone size-full wp-image-213" title="Apache Jackrabbit" src="http://jukkaz.files.wordpress.com/2009/06/jackrabbit.png?w=500&#038;h=187" alt="Apache Jackrabbit" width="500" height="187" /></a></p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/211/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/211/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/211/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=211&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/06/04/commits-per-weekday-and-hour/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/06/httpd.png" medium="image">
			<media:title type="html">Apache HTTP Server</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/06/maven.png" medium="image">
			<media:title type="html">Apache Maven</media:title>
		</media:content>

		<media:content url="http://jukkaz.files.wordpress.com/2009/06/jackrabbit.png" medium="image">
			<media:title type="html">Apache Jackrabbit</media:title>
		</media:content>
	</item>
		<item>
		<title>Would you trust a pirate?</title>
		<link>http://jukkaz.wordpress.com/2009/05/17/would-you-trust-a-pirate/</link>
		<comments>http://jukkaz.wordpress.com/2009/05/17/would-you-trust-a-pirate/#comments</comments>
		<pubDate>Sun, 17 May 2009 22:06:13 +0000</pubDate>
		<dc:creator>Jukka Zitting</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://jukkaz.wordpress.com/?p=209</guid>
		<description><![CDATA[Apparently they&#8217;re now setting up a Pirate Party also in Finland. I guess it&#8217;s good to have a political force that questions the appropriateness of traditional copyright in the digital world. However, as a knowledge worker I&#8217;m not that excited about drastic changes in the protection of immaterial rights.
Anyway, my appreciation for the movement in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=209&subd=jukkaz&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Apparently they&#8217;re now setting up a <a href="http://en.wikipedia.org/wiki/Pirate_Party">Pirate Party</a> also <a href="http://www.piraattipuolue.fi/english">in Finland</a>. I guess it&#8217;s good to have a political force that questions the appropriateness of traditional copyright in the digital world. However, as a knowledge worker I&#8217;m not that excited about drastic changes in the protection of immaterial rights.</p>
<p>Anyway, my appreciation for the movement in Finland went down considerably when I saw their spokesman in the news today. When asked about the main goals of the new party he only mentioned freedom of speech and protection of privacy. Did he just forget the massive overhaul of copyright and patent laws that they&#8217;re primarily after?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/jukkaz.wordpress.com/209/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/jukkaz.wordpress.com/209/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/jukkaz.wordpress.com/209/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/jukkaz.wordpress.com/209/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/jukkaz.wordpress.com/209/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/jukkaz.wordpress.com/209/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/jukkaz.wordpress.com/209/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/jukkaz.wordpress.com/209/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/jukkaz.wordpress.com/209/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/jukkaz.wordpress.com/209/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=jukkaz.wordpress.com&blog=52630&post=209&subd=jukkaz&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://jukkaz.wordpress.com/2009/05/17/would-you-trust-a-pirate/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4442496cf0c59d39cadc84b8bc56fcba?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">jukkaz</media:title>
		</media:content>
	</item>
	</channel>
</rss>