<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Compressors galore: pbzip2, lbzip2, plzip, xz, and lrzip tested on a FASTQ file</title> <atom:link href="https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html/feed" rel="self" type="application/rss+xml" /><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html</link> <description>Tiny bits of bioinformatics, [web-]programming etc</description> <lastBuildDate>Mon, 01 Jan 2024 17:12:20 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>https://wordpress.org/?v=3.8.27</generator> <item><title>By: Bogdan</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-462187</link> <dc:creator><![CDATA[Bogdan]]></dc:creator> <pubDate>Wed, 12 Oct 2016 21:15:54 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-462187</guid> <description><![CDATA[Trotos,
your comment reminded me that I did mention Fastqz in my previous post on the topic: http://bogdan.org.ua/2013/10/17/favourite-file-compressor-gzip-bzip2-7z.html
Looks like I haven&#039;t actually tested it, because of the concern that data recovery _might_ be too complicated with Fastqz.
For comparison, a single block damage with bzip2 would only cause the loss of between 100 and 900 K of compressed data, which - for fastq files - will probably have negligible effects.
Another reason to not test it was that it is not clear if it will see any future support.
If, for example, a change in compiler makes building fastqz not possible without first modifying the code, then it&#039;s... bad :)
Maybe I&#039;ll test it anyway - next time.]]></description> <content:encoded><![CDATA[<p>Trotos,</p><p>your comment reminded me that I did mention Fastqz in my previous post on the topic: <a
href="http://bogdan.org.ua/2013/10/17/favourite-file-compressor-gzip-bzip2-7z.html" rel="nofollow">http://bogdan.org.ua/2013/10/17/favourite-file-compressor-gzip-bzip2-7z.html</a></p><p>Looks like I haven&#8217;t actually tested it, because of the concern that data recovery _might_ be too complicated with Fastqz.<br
/> For comparison, a single block damage with bzip2 would only cause the loss of between 100 and 900 K of compressed data, which &#8211; for fastq files &#8211; will probably have negligible effects.</p><p>Another reason to not test it was that it is not clear if it will see any future support.<br
/> If, for example, a change in compiler makes building fastqz not possible without first modifying the code, then it&#8217;s&#8230; bad <img
src="https://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /></p><p>Maybe I&#8217;ll test it anyway &#8211; next time.</p> ]]></content:encoded> </item> <item><title>By: trotos</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-461901</link> <dc:creator><![CDATA[trotos]]></dc:creator> <pubDate>Tue, 11 Oct 2016 13:02:42 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-461901</guid> <description><![CDATA[Could you please try that derivative of zpaq?
http://mattmahoney.net/dc/fastqz/]]></description> <content:encoded><![CDATA[<p>Could you please try that derivative of zpaq?<br
/> <a
href="http://mattmahoney.net/dc/fastqz/" rel="nofollow">http://mattmahoney.net/dc/fastqz/</a></p> ]]></content:encoded> </item> <item><title>By: Bogdan</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-461001</link> <dc:creator><![CDATA[Bogdan]]></dc:creator> <pubDate>Thu, 06 Oct 2016 19:17:17 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-461001</guid> <description><![CDATA[Thanks Hmage, that sounds interesting. Maybe in my next installment of compressor testing I&#039;ll include &lt;strong&gt;pxz&lt;/strong&gt;, too :)
I did eventually try a newer (already parallel, I think) version of &lt;strong&gt;xz&lt;/strong&gt; on genomic data, and had mixed success.
&lt;strong&gt;lbzip2&lt;/strong&gt; sometimes achieved even better ratios, mostly just a little bit worse, rarely much worse, but was always many times faster.]]></description> <content:encoded><![CDATA[<p>Thanks Hmage, that sounds interesting. Maybe in my next installment of compressor testing I&#8217;ll include <strong>pxz</strong>, too <img
src="https://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /></p><p>I did eventually try a newer (already parallel, I think) version of <strong>xz</strong> on genomic data, and had mixed success.<br
/> <strong>lbzip2</strong> sometimes achieved even better ratios, mostly just a little bit worse, rarely much worse, but was always many times faster.</p> ]]></content:encoded> </item> <item><title>By: hmage</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-459841</link> <dc:creator><![CDATA[hmage]]></dc:creator> <pubDate>Fri, 30 Sep 2016 19:55:18 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-459841</guid> <description><![CDATA[Try &lt;b&gt;pxz&lt;/b&gt;, it&#039;s a parallel version of &lt;b&gt;xz&lt;/b&gt; and is a drop-in replacement in terms of file format.]]></description> <content:encoded><![CDATA[<p>Try <b>pxz</b>, it&#8217;s a parallel version of <b>xz</b> and is a drop-in replacement in terms of file format.</p> ]]></content:encoded> </item> <item><title>By: Bogdan</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-347714</link> <dc:creator><![CDATA[Bogdan]]></dc:creator> <pubDate>Thu, 30 Apr 2015 09:36:16 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-347714</guid> <description><![CDATA[It is good to know, thanks. I was using versions currently available in Debian testing. I guess I&#039;ll make another comparison in a year or so :)
I must say that even with multithreading xz with default settings will likely be &lt;strong&gt;significantly slower&lt;/strong&gt; than lbzip2 - on the order of 200+ seconds on the same test file and hardware, and assuming a really good parallelism implementation. For my use this is way too slow, and probably not worth the extra savings. Also, more complicated xz file format looks like another drawback to me (harder to recover data).
Clearly, everyone&#039;s needs are different, so I&#039;m not saying that lbzip2 is much better overall - but it is for me ;)]]></description> <content:encoded><![CDATA[<p>It is good to know, thanks. I was using versions currently available in Debian testing. I guess I&#8217;ll make another comparison in a year or so <img
src="https://bogdan.org.ua/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /></p><p>I must say that even with multithreading xz with default settings will likely be <strong>significantly slower</strong> than lbzip2 &#8211; on the order of 200+ seconds on the same test file and hardware, and assuming a really good parallelism implementation. For my use this is way too slow, and probably not worth the extra savings. Also, more complicated xz file format looks like another drawback to me (harder to recover data).</p><p>Clearly, everyone&#8217;s needs are different, so I&#8217;m not saying that lbzip2 is much better overall &#8211; but it is for me <img
src="https://bogdan.org.ua/wp-includes/images/smilies/icon_wink.gif" alt=";)" class="wp-smiley" /></p> ]]></content:encoded> </item> <item><title>By: Seumas</title><link>https://bogdan.org.ua/2015/03/28/compressors-galore-pbzip2-lbzip2-plzip-xz-and-lrzip-tested-on-a-fastq-file.html#comment-347552</link> <dc:creator><![CDATA[Seumas]]></dc:creator> <pubDate>Wed, 29 Apr 2015 21:46:10 +0000</pubDate> <guid
isPermaLink="false">http://bogdan.org.ua/?p=2257#comment-347552</guid> <description><![CDATA[Version 5.2 of xz is out, which does have multi-thread support. You may have to compile it yourself but it might be worth testing. I haven&#039;t tested it myself yet.
I use xz for non-realtime compression (e.g. overnight backups), because although it&#039;s slow, it&#039;s so much better than bzip2 and, of course, if it&#039;s overnight I don&#039;t care if it takes half an hour or whatever to run.]]></description> <content:encoded><![CDATA[<p>Version 5.2 of xz is out, which does have multi-thread support. You may have to compile it yourself but it might be worth testing. I haven&#8217;t tested it myself yet.</p><p>I use xz for non-realtime compression (e.g. overnight backups), because although it&#8217;s slow, it&#8217;s so much better than bzip2 and, of course, if it&#8217;s overnight I don&#8217;t care if it takes half an hour or whatever to run.</p> ]]></content:encoded> </item> </channel> </rss>