<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Valentin's Lab &#187; tip</title>
	<atom:link href="https://vaab.blog.kal.fr/category/tip/feed/" rel="self" type="application/rss+xml" />
	<link>https://vaab.blog.kal.fr</link>
	<description>Ratiocination of an opensource techie</description>
	<lastBuildDate>Thu, 15 Nov 2018 08:04:35 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=4.1.1</generator>
	<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F&amp;language=en_US&amp;category=text&amp;title=Valentin%27s+Lab&amp;description=Ratiocination+of+an+opensource+techie&amp;tags=blog" type="text/html" />
	<item>
		<title>bash lore: how to properly parse NUL separated fields</title>
		<link>https://vaab.blog.kal.fr/2015/01/03/bash-lore-how-to-properly-parse-nul-separated-fields/</link>
		<comments>https://vaab.blog.kal.fr/2015/01/03/bash-lore-how-to-properly-parse-nul-separated-fields/#comments</comments>
		<pubDate>Sat, 03 Jan 2015 09:26:28 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[dev]]></category>
		<category><![CDATA[tip]]></category>
		<category><![CDATA[bash]]></category>
		<category><![CDATA[nul]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=513</guid>
		<description><![CDATA[As with many other parts of bash, this is black magic. Let's suppose a friendly command that spits out NUL-separated fields (such as find -print0, shyaml get-values-0, ...). This, may I insist, is the recommended way to &#8230;<p class="read-more"><a href="https://vaab.blog.kal.fr/2015/01/03/bash-lore-how-to-properly-parse-nul-separated-fields/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>As with many other parts of bash, this is black magic.</p>
<p>Let's suppose a friendly command that spits out <tt class="docutils literal">NUL</tt>-separated fields (such as <tt class="docutils literal">find <span class="pre">-print0</span></tt>, <tt class="docutils literal">shyaml <span class="pre">get-values-0</span></tt>, ...). This, may I insist, is the recommended way to pass wild binary data around robustly in bash.</p>
<p>How would you properly parse the individual records, several fields at a time?</p>
<p>For the purpose of demonstration, let's use the fixed content of a simple <tt class="docutils literal">data.bin</tt> file
as our <tt class="docutils literal">NUL</tt>-separated input:</p>
<pre class="literal-block">
cat &lt;&lt;EOF | tr : &quot;\000&quot; &gt; /tmp/data.bin
a:1:b:2 3:c:4
  5:d:6\n7:e::f:9
EOF
</pre>
<p>Let's verify that we have our <tt class="docutils literal">NUL</tt> bytes:</p>
<pre class="literal-block">
$ cat /tmp/data.bin | hexdump -v -e '/1 &quot;%02X &quot;'
61 00 31 00 62 00 32 20 33 00 63 00 34 0A 20 20 35 00 64 00 36 5C 6E 37 00 65 00 00 66 00 39 0A
</pre>
<p>Notice that the data contains:</p>
<blockquote>
<ul class="simple">
<li>spaces (hex: <tt class="docutils literal">20</tt>),</li>
<li>line breaks (hex: <tt class="docutils literal">0A</tt>),</li>
<li>a <tt class="docutils literal">\</tt> followed by an <tt class="docutils literal">n</tt>,</li>
<li>a zero-sized value,</li>
<li>a final value <tt class="docutils literal">9</tt> ending with a <tt class="docutils literal">0A</tt> and no final <tt class="docutils literal">00</tt>.</li>
</ul>
</blockquote>
<p>NUL-separated fields are recommended precisely because they support this kind of data.</p>
<p>I want the implementation of a function <tt class="docutils literal"><span class="pre">read-0</span></tt> that would allow this type of interaction:</p>
<pre class="literal-block">
$ cat /tmp/data.bin | while read-0 f1 f2; do
    echo &quot;f1: '$f1', f2: '$f2'&quot;
  done
f1: 'a', f2: '1'
f1: 'b', f2: '2 3'
f1: 'c', f2: '4
  5'
f1: 'd', f2: '6\n7'
f1: 'e', f2: ''
f1: 'f', f2: '9
'
</pre>
<div class="section" id="first-try">
<h3>First try</h3>
<p>Let's be naive and use <tt class="docutils literal">read f1 f2</tt>:</p>
<pre class="literal-block">
$ cat /tmp/data.bin | while read f1 f2; do echo &quot;f1: '$f1', f2: '$f2'&quot;; done
f1: 'a1b2', f2: '3c4'
f1: '5d6n7ef9', f2: ''
</pre>
<p>Notice that:</p>
<blockquote>
<ul class="simple">
<li>the <tt class="docutils literal">NUL</tt> chars were ignored for field separation,</li>
<li>fields were split on runs of <strong>consecutive</strong> whitespace, using the value stored in the <tt class="docutils literal">IFS</tt> variable,</li>
<li>there are only 2 lines because the <tt class="docutils literal">\n</tt> was used to separate each record. We should use <tt class="docutils literal"><span class="pre">-d</span></tt> to specify the line delimiter,</li>
<li>the <tt class="docutils literal">NUL</tt> chars were also stripped out of the data, as variables don't support the NUL char,</li>
<li>the <tt class="docutils literal">\</tt> was eaten, because the <tt class="docutils literal">read</tt> builtin parses it and gives it a special meaning. We should use <tt class="docutils literal"><span class="pre">-r</span></tt> to avoid that.</li>
</ul>
</blockquote>
<p>But how can we give the <tt class="docutils literal">NUL</tt> delimiter to the read builtin, knowing that you can't put <tt class="docutils literal">NUL</tt> chars on the command line? Fortunately, I stumbled upon this blog post: <a class="reference external" href="http://transnum.blogspot.sg/2008/11/bashs-read-built-in-supports-0-as.html">http://transnum.blogspot.sg/2008/11/bashs-read-built-in-supports-0-as.html</a></p>
<p>The conclusion is that <tt class="docutils literal"><span class="pre">-d</span> ''</tt> is magically understood by the bash <tt class="docutils literal">read</tt> builtin as delimiting lines with <tt class="docutils literal">NUL</tt>
characters.</p>
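<p>As a minimal sketch of that conclusion (the helper name <tt class="docutils literal">count_nul_records</tt> is hypothetical and not part of this article's final solution), the function below counts the <tt class="docutils literal">NUL</tt>-terminated records on its standard input with <tt class="docutils literal">read <span class="pre">-r</span> <span class="pre">-d</span> ''</tt>:</p>

```shell
# Hypothetical helper: count the NUL-terminated records read from stdin.
count_nul_records() {
    local record count=0
    # IFS= keeps leading/trailing blanks, -r keeps backslashes,
    # -d '' makes read stop at each NUL byte instead of each newline.
    while IFS= read -r -d '' record; do
        count=$((count + 1))
    done
    echo "$count"
}

# Three records: one with a space, one with a line break, one with a backslash.
printf 'a b\0c\nd\0\\n\0' | count_nul_records
```

<p>This prints <tt class="docutils literal">3</tt>: the embedded space, line break and backslash all stay inside their own record.</p>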
</div>
<div class="section" id="better-try">
<h3>Better try</h3>
<p>Let's apply our newly acquired knowledge by trying <tt class="docutils literal"><span class="pre">IFS=$'\0'</span> read <span class="pre">-d</span> '' <span class="pre">-r</span> f1 f2</tt>:</p>
<pre class="literal-block">
$ cat /tmp/data.bin | while IFS=$'\0' read -d '' -r f1 f2; do echo &quot;f1: '$f1', f2: '$f2'&quot;; done
f1: 'a', f2: ''
f1: '1', f2: ''
f1: 'b', f2: ''
f1: '2 3', f2: ''
f1: 'c', f2: ''
f1: '4
  5', f2: ''
f1: 'd', f2: ''
f1: '6\n7', f2: ''
f1: 'e', f2: ''
f1: '', f2: ''
f1: 'f', f2: ''
</pre>
<p>That's much better. But notice that:</p>
<blockquote>
<ul class="simple">
<li>we didn't get anything in <tt class="docutils literal">$f2</tt>. That's normal: with <tt class="docutils literal">NUL</tt> as the line delimiter (<tt class="docutils literal"><span class="pre">-d</span> ''</tt>) and NUL as the field delimiter (<tt class="docutils literal">IFS</tt>), we are doomed to get one field per record. We will need to manage the repacking in a <tt class="docutils literal">while</tt> loop. This doesn't sound too difficult.</li>
<li>where's the final field, <tt class="docutils literal">9</tt> followed by <tt class="docutils literal">0A</tt>? As there is no final <tt class="docutils literal">NUL</tt> character in the data, <tt class="docutils literal">read</tt> returned errorlevel 1 on this last field, but filled the variable correctly. A simple <tt class="docutils literal">echo $f1</tt> after the loop prints <tt class="docutils literal">9</tt>, provided you use this form to keep access to the variables of the <tt class="docutils literal">while</tt>: <tt class="docutils literal">while <span class="pre">IFS=''</span> read <span class="pre">-d</span> '' <span class="pre">-r</span> f1 f2; do echo &quot;f1: '$f1', f2: <span class="pre">'$f2'&quot;;</span> done &lt; /tmp/data.bin</tt>.</li>
</ul>
</blockquote>
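<p>One common way to keep that unterminated last field, sketched here under the assumption that an empty trailing field may be dropped (the helper name <tt class="docutils literal">print_all_records</tt> is hypothetical), is to let the loop body run whenever <tt class="docutils literal">read</tt> fails but still filled the variable:</p>

```shell
# Hypothetical helper: print each NUL-separated record from stdin between
# brackets, including a last record that lacks its terminating NUL.
print_all_records() {
    local record
    # read fails at EOF but still fills "record" with the trailing bytes,
    # so the extra test lets the loop body run one last time for them.
    while IFS= read -r -d '' record || [ -n "$record" ]; do
        printf '[%s]\n' "$record"
    done
}

printf 'a\0b\0c' | print_all_records
```

<p>This prints <tt class="docutils literal">[a]</tt>, <tt class="docutils literal">[b]</tt> and <tt class="docutils literal">[c]</tt> on three lines. Note that this idiom cannot see a last field that is both empty and unterminated.</p>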
</div>
<div class="section" id="final-implementation">
<h3>Final Implementation ?</h3>
<p>So knowing this, here is the final implementation of <tt class="docutils literal"><span class="pre">read-0</span></tt>:</p>
<pre class="literal-block">
read-0() {
    local eof
    eof=
    while [ &quot;$1&quot; -a -z &quot;$eof&quot; ]; do
        IFS='' read -r -d '' &quot;$1&quot; || eof=true
        shift
    done
    test &quot;$eof&quot; != true -o -z &quot;$1&quot;
}
</pre>
<p>Final? It certainly works for our specification test. But what happens if <tt class="docutils literal">EOF</tt> occurs before
we have filled all the variables?</p>
<pre class="literal-block">
$ echo -n &quot;a&quot; | while read-0 f1 f2; do echo &quot;f1: '$f1', f2: '$f2'&quot;; done
$
</pre>
<p>Nothing is printed, despite the fact that we sent a character.</p>
<p>This is now a specification issue: do we want <tt class="docutils literal"><span class="pre">read-0</span></tt> to return errorlevel 0 when it hits <tt class="docutils literal">EOF</tt>
while filling the variables? Fine, but we said that the zero-sized string is a possible value, so with the
current specification an empty first variable is not enough to know that <tt class="docutils literal"><span class="pre">read-0</span></tt> hit <tt class="docutils literal">EOF</tt> while filling the variables.
We could make a special case, but I want to be able to tell a last empty element apart from an element.</p>
<p><tt class="docutils literal"><span class="pre">read-0</span></tt>, as currently specified, can't do that. But a slight change in the way you build your while loop makes this parsing possible.</p>
</div>
<div class="section" id="correct-implementation">
<h3>Correct Implementation</h3>
<p>To fill partial records, we need another specification change, as the current implementation fails whenever it encounters EOF. This is an incompatible specification change. Aside from this, we also need to take care to actually set the remaining fields to the empty string. This requires another version of <tt class="docutils literal"><span class="pre">read-0</span></tt>:</p>
<pre class="literal-block">
read-0() {
    local eof
    eof=
    while [ &quot;$1&quot; ]; do
        IFS='' read -r -d '' -- &quot;$1&quot; || eof=true
        shift
    done
    test &quot;$eof&quot; != true
}
</pre>
<p>So this works with the new <tt class="docutils literal"><span class="pre">read-0</span></tt>:</p>
<pre class="literal-block">
$ echo -n a | tr :  '\000' | {  eof= ; while [ -z &quot;$eof&quot; ]; do read-0 f1 f2 || eof=true; echo &quot;f1: '$f1', f2: '$f2'&quot;; done;  }
f1: 'a', f2: ''

$ echo -n a: | tr :  '\000' | {  eof= ; while [ -z &quot;$eof&quot; ]; do read-0 f1 f2 || eof=true; echo &quot;f1: '$f1', f2: '$f2'&quot;; done;  }
f1: 'a', f2: ''
</pre>
<p>Basically, this construct allows one last round in the loop after detecting EOF, and achieves the starting spec:</p>
<pre class="literal-block">
$ cat /tmp/data.bin | {  eof= ; while [ -z &quot;$eof&quot; ]; do read-0 f1 f2 || eof=true; echo &quot;f1: '$f1', f2: '$f2'&quot;; done;  }
f1: 'a', f2: '1'
f1: 'b', f2: '2 3'
f1: 'c', f2: '4
  5'
f1: 'd', f2: '6\n7'
f1: 'e', f2: ''
f1: 'f', f2: '9
'
</pre>
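<p>For convenience, here is the same construct as a self-contained script. It is only a sketch: <tt class="docutils literal"><span class="pre">read-0</span></tt> is copied from above, while the wrapper name <tt class="docutils literal">print_pairs</tt> and the <tt class="docutils literal">key=value</tt> output format are hypothetical choices made for the illustration:</p>

```shell
# read-0 as given in the "Correct Implementation" section above.
read-0() {
    local eof
    eof=
    while [ "$1" ]; do
        IFS='' read -r -d '' -- "$1" || eof=true
        shift
    done
    test "$eof" != true
}

# Hypothetical wrapper: print the NUL-separated stdin as "key=value" lines.
print_pairs() {
    local eof= key value
    while [ -z "$eof" ]; do
        # On EOF we still run the body once, to emit the partial record.
        read-0 key value || eof=true
        printf '%s=%s\n' "$key" "$value"
    done
}

# Five fields: the last record is partial, so its value is the empty string.
printf '%s\0' a 1 b '2 3' c | print_pairs
```

<p>This prints <tt class="docutils literal">a=1</tt>, <tt class="docutils literal">b=2 3</tt> and <tt class="docutils literal">c=</tt>. The price of the extra round is that input made only of complete records yields one additional, entirely empty record at the end.</p>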
<p>Trivial ?</p>
<p>Happy hacking.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2015/01/03/bash-lore-how-to-properly-parse-nul-separated-fields/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2015%2F01%2F03%2Fbash-lore-how-to-properly-parse-nul-separated-fields%2F&amp;language=en_GB&amp;category=text&amp;title=bash+lore%3A+how+to+properly+parse+NUL+separated+fields&amp;description=As+a+lot+of+other+part+of+bash%2C+this+is+black+magic.+Lets+suppose+a+friendly+command+that+spits+out+NUL+separated+fields+%28as+find+-print0%2C+shyaml+get-values-0%2C+...%29.+Which...&amp;tags=bash%2Cnul%2Cblog" type="text/html" />
	</item>
		<item>
		<title>bash lore: command substitution and final new lines</title>
		<link>https://vaab.blog.kal.fr/2014/05/07/bash-lore-command-substitution-and-final-new-lines/</link>
		<comments>https://vaab.blog.kal.fr/2014/05/07/bash-lore-command-substitution-and-final-new-lines/#comments</comments>
		<pubDate>Wed, 07 May 2014 12:38:43 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[tip]]></category>
		<category><![CDATA[bash]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=500</guid>
		<description><![CDATA[Be wary that command substitution will remove all final new lines. Command substitution is the $(command ..) construct, or the older `command ..` one, which is substituted by the standard output of the given command. But that's incorrect: a proper definition &#8230;<p class="read-more"><a href="https://vaab.blog.kal.fr/2014/05/07/bash-lore-command-substitution-and-final-new-lines/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>Be wary that command substitution will remove all final new lines.</p>
<p>Command substitution is the <tt class="docutils literal">$(command ..)</tt> construct, or the older <tt class="docutils literal">`command ..`</tt> one, which is substituted by
the standard output of the given command.</p>
<p>But that's incorrect: a proper definition would be that it is substituted by
the standard output <strong>minus any final new lines</strong>. For example:</p>
<pre class="literal-block">
$ value_w_final_dot=&quot;$(echo -en &quot;hello\n\n\n.&quot;)&quot;
$ value_wo_final_dot=&quot;$(echo -en &quot;hello\n\n\n&quot;)&quot;
$ echo &quot;My first value: &lt;&lt;$value_w_final_dot&gt;&gt;&quot;
My first value: &lt;&lt;hello


.&gt;&gt;
$ echo &quot;My second value: &lt;&lt;$value_wo_final_dot&gt;&gt;&quot;
My second value: &lt;&lt;hello&gt;&gt;
</pre>
<p>When you are bitten by these types of &quot;features&quot;, don't you value
the Python motto &quot;explicit is better than implicit&quot; even more?</p>
<p>This chopping occurs at command substitution time. You can definitely store
trailing new lines in a bash variable, pass them in command parameters, and of course
send them over a pipe; just avoid command substitution:</p>
<pre class="literal-block">
$ value_wo_final_dot=&quot;hello&quot;$'\n\n\n'
$ echo &quot;My corrected value: &lt;&lt;$value_wo_final_dot&gt;&gt;&quot;
My corrected value: &lt;&lt;hello


&gt;&gt;
$ /bin/echo &quot;As system process argument: &quot; &quot;$value_wo_final_dot&quot;
As system process argument:  hello



$
</pre>
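<p>If you do need command substitution, a common workaround is to make the command print a sentinel character after its output and to strip it afterwards. The sketch below assumes bash (for <tt class="docutils literal">printf <span class="pre">-v</span></tt>); the helper name <tt class="docutils literal">capture_exact</tt> is hypothetical, and the exit status of the captured command is lost:</p>

```shell
# Hypothetical helper: run a command and store its exact standard output,
# trailing new lines included, in the variable named by the first argument.
capture_exact() {
    local __name=$1 __out
    shift
    # The extra "x" printed after the command protects the final new lines
    # from being chopped by the command substitution...
    __out=$("$@"; printf x)
    # ...and is removed afterwards with a suffix-stripping expansion.
    __out=${__out%x}
    printf -v "$__name" '%s' "$__out"
}

capture_exact value printf 'hello\n\n\n'
echo "length: ${#value}"
```

<p>This prints <tt class="docutils literal">length: 8</tt>: the five letters of <tt class="docutils literal">hello</tt> plus the three new lines.</p>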
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2014/05/07/bash-lore-command-substitution-and-final-new-lines/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2014%2F05%2F07%2Fbash-lore-command-substitution-and-final-new-lines%2F&amp;language=en_GB&amp;category=text&amp;title=bash+lore%3A+command+substitution+and+final+new+lines&amp;description=Be+wary+that+command+substitution+will+remove+all+final+new+lines.+Command+substitution+are+%24%28command+..%29+construct+or+older+%60command+..%60+one+that+will+be+substitued+by+the+standard+output...&amp;tags=bash%2Cblog" type="text/html" />
	</item>
		<item>
		<title>bash lore: NUL character and variables</title>
		<link>https://vaab.blog.kal.fr/2014/05/03/bash-lore-nul-character-and-variables/</link>
		<comments>https://vaab.blog.kal.fr/2014/05/03/bash-lore-nul-character-and-variables/#comments</comments>
		<pubDate>Sat, 03 May 2014 10:29:27 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[dev]]></category>
		<category><![CDATA[tip]]></category>
		<category><![CDATA[bash]]></category>
		<category><![CDATA[linux]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=489</guid>
		<description><![CDATA[Bash variables are often thought of as being able to store any binary content. Please bear in mind that they can't store the NUL character, and only that one. Here you are: $ ascii_table() { echo -en &#34;$(echo '\'0{0..3}{0..7}{0..7} &#124; tr -d &#8230;<p class="read-more"><a href="https://vaab.blog.kal.fr/2014/05/03/bash-lore-nul-character-and-variables/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>Bash variables are often thought of as being able to store any binary content.</p>
<p>Please bear in mind that they can't store the <tt class="docutils literal">NUL</tt> character, and only that one.</p>
<p>Here you are:</p>
<pre class="literal-block">
$ ascii_table() { echo -en &quot;$(echo '\'0{0..3}{0..7}{0..7} | tr -d &quot; &quot;)&quot;; }
$ ascii_table | hd
00000000  00 01 02 03 04 05 06 07  08 09 0a 0b 0c 0d 0e 0f  |................|
00000010  10 11 12 13 14 15 16 17  18 19 1a 1b 1c 1d 1e 1f  |................|
00000020  20 21 22 23 24 25 26 27  28 29 2a 2b 2c 2d 2e 2f  | !&quot;#$%&amp;'()*+,-./|
00000030  30 31 32 33 34 35 36 37  38 39 3a 3b 3c 3d 3e 3f  |0123456789:;&lt;=&gt;?|
00000040  40 41 42 43 44 45 46 47  48 49 4a 4b 4c 4d 4e 4f  |&#64;ABCDEFGHIJKLMNO|
00000050  50 51 52 53 54 55 56 57  58 59 5a 5b 5c 5d 5e 5f  |PQRSTUVWXYZ[\]^_|
00000060  60 61 62 63 64 65 66 67  68 69 6a 6b 6c 6d 6e 6f  |`abcdefghijklmno|
00000070  70 71 72 73 74 75 76 77  78 79 7a 7b 7c 7d 7e 7f  |pqrstuvwxyz{|}~.|
00000080  80 81 82 83 84 85 86 87  88 89 8a 8b 8c 8d 8e 8f  |................|
00000090  90 91 92 93 94 95 96 97  98 99 9a 9b 9c 9d 9e 9f  |................|
000000a0  a0 a1 a2 a3 a4 a5 a6 a7  a8 a9 aa ab ac ad ae af  |................|
000000b0  b0 b1 b2 b3 b4 b5 b6 b7  b8 b9 ba bb bc bd be bf  |................|
000000c0  c0 c1 c2 c3 c4 c5 c6 c7  c8 c9 ca cb cc cd ce cf  |................|
000000d0  d0 d1 d2 d3 d4 d5 d6 d7  d8 d9 da db dc dd de df  |................|
000000e0  e0 e1 e2 e3 e4 e5 e6 e7  e8 e9 ea eb ec ed ee ef  |................|
000000f0  f0 f1 f2 f3 f4 f5 f6 f7  f8 f9 fa fb fc fd fe ff  |................|
00000100
</pre>
<p>But:</p>
<pre class="literal-block">
$ echo -n &quot;$(ascii_table)&quot; | hd
00000000  01 02 03 04 05 06 07 08  09 0a 0b 0c 0d 0e 0f 10  |................|
00000010  11 12 13 14 15 16 17 18  19 1a 1b 1c 1d 1e 1f 20  |............... |
00000020  21 22 23 24 25 26 27 28  29 2a 2b 2c 2d 2e 2f 30  |!&quot;#$%&amp;'()*+,-./0|
00000030  31 32 33 34 35 36 37 38  39 3a 3b 3c 3d 3e 3f 40  |123456789:;&lt;=&gt;?&#64;|
00000040  41 42 43 44 45 46 47 48  49 4a 4b 4c 4d 4e 4f 50  |ABCDEFGHIJKLMNOP|
00000050  51 52 53 54 55 56 57 58  59 5a 5b 5c 5d 5e 5f 60  |QRSTUVWXYZ[\]^_`|
00000060  61 62 63 64 65 66 67 68  69 6a 6b 6c 6d 6e 6f 70  |abcdefghijklmnop|
00000070  71 72 73 74 75 76 77 78  79 7a 7b 7c 7d 7e 7f 80  |qrstuvwxyz{|}~..|
00000080  81 82 83 84 85 86 87 88  89 8a 8b 8c 8d 8e 8f 90  |................|
00000090  91 92 93 94 95 96 97 98  99 9a 9b 9c 9d 9e 9f a0  |................|
000000a0  a1 a2 a3 a4 a5 a6 a7 a8  a9 aa ab ac ad ae af b0  |................|
000000b0  b1 b2 b3 b4 b5 b6 b7 b8  b9 ba bb bc bd be bf c0  |................|
000000c0  c1 c2 c3 c4 c5 c6 c7 c8  c9 ca cb cc cd ce cf d0  |................|
000000d0  d1 d2 d3 d4 d5 d6 d7 d8  d9 da db dc dd de df e0  |................|
000000e0  e1 e2 e3 e4 e5 e6 e7 e8  e9 ea eb ec ed ee ef f0  |................|
000000f0  f1 f2 f3 f4 f5 f6 f7 f8  f9 fa fb fc fd fe ff     |...............|
000000ff
</pre>
<p>So bash variables (and the <tt class="docutils literal"><span class="pre">$(...)</span></tt> idiom) are reasonably strong: if you know your content doesn't contain any <tt class="docutils literal">NUL</tt> characters, you can safely use them. All this is probably
linked to the fact that command line arguments cannot hold <tt class="docutils literal">NUL</tt> characters either, as <tt class="docutils literal">NUL</tt> is the usual string terminator in C programs.</p>
<p>If you really need to store full binary content in a bash variable, you should think about encoding it (base64, <tt class="docutils literal">xxd</tt>, or any format of yours).</p>
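<p>As a small illustration of the encoding route, here is a sketch that assumes a <tt class="docutils literal">base64</tt> command accepting <tt class="docutils literal"><span class="pre">-d</span></tt> for decoding (as the GNU coreutils one does); the variable names are arbitrary:</p>

```shell
# Encode arbitrary bytes (NUL included) as text that a variable can hold.
encoded=$(printf 'a\0b\0' | base64)
echo "stored: $encoded"

# Decode on the way out; the raw bytes only ever travel through pipes.
decoded_size=$(printf '%s' "$encoded" | base64 -d | wc -c)
echo "decoded bytes: $((decoded_size))"
```

<p>This prints <tt class="docutils literal">stored: YQBiAA==</tt>, then <tt class="docutils literal">decoded bytes: 4</tt>: both <tt class="docutils literal">NUL</tt> bytes survived the trip through the variable.</p>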
<p>Bear in mind that <tt class="docutils literal">bash</tt> may also be able to do what you want without variables, using only pipes (stdin and stdout), which of course don't suffer from this limitation.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2014/05/03/bash-lore-nul-character-and-variables/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2014%2F05%2F03%2Fbash-lore-nul-character-and-variables%2F&amp;language=en_GB&amp;category=text&amp;title=bash+lore%3A+NUL+character+and+variables&amp;description=Bash+variables+are+often+thought+as+being+able+to+store+any+binary+content.+Please+bear+in+mind+that+it+can%27t+store+NUL+character%2C+and+only+this+one.+Here+you+are%3A...&amp;tags=bash%2Clinux%2Cblog" type="text/html" />
	</item>
		<item>
		<title>SpamAssassin, Amavis and ubuntu</title>
		<link>https://vaab.blog.kal.fr/2012/04/02/spamassassin-amavis-and-ubuntu/</link>
		<comments>https://vaab.blog.kal.fr/2012/04/02/spamassassin-amavis-and-ubuntu/#comments</comments>
		<pubDate>Mon, 02 Apr 2012 19:59:01 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[admin]]></category>
		<category><![CDATA[tip]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=291</guid>
		<description><![CDATA[If you run spamassassin (SA) with amavis without spamd on ubuntu 10.04 and feel that sa-learn isn't working right, here are a few tips. I had mails that were false positives and I had to dig a little &#8230;<p class="read-more"><a href="https://vaab.blog.kal.fr/2012/04/02/spamassassin-amavis-and-ubuntu/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>If you run <tt class="docutils literal">spamassassin</tt> (SA) with <tt class="docutils literal">amavis</tt> <em>without</em> <tt class="docutils literal">spamd</tt> on ubuntu 10.04 and feel that <tt class="docutils literal"><span class="pre">sa-learn</span></tt> isn't working right, here are a few tips.</p>
<p>I had mails that were false positives and I had to dig a little more than usual to get it working.</p>
<dl class="docutils">
<dt>learning</dt>
<dd><p class="first">Remember that SA is executed by the <tt class="docutils literal">amavis</tt> user, so:</p>
<pre class="literal-block">
sudo -u amavis -H sa-learn --showdots {--ham|--spam} YOUR_MAIL_OR_MBOX_OR_MAILDIR
</pre>
<p class="last">The trick is the <em>sudo</em> part; the <tt class="docutils literal"><span class="pre">sa-learn</span></tt> docs can be found everywhere on the net.</p>
</dd>
<dt>checking</dt>
<dd><p class="first">This is how to check how spamassassin will rate your mail:</p>
<pre class="literal-block">
cat badmail | sudo -u amavis -H spamassassin -t
</pre>
<p class="last">This is very useful when run on the same mail before and after learning.</p>
</dd>
<dt>statistics</dt>
<dd><p class="first">Looking at the statistics can be useful to check that the numbers of spam and ham are consistent with your learning sessions:</p>
<pre class="last literal-block">
sudo -u amavis -H sa-learn --dump magic | grep am
</pre>
</dd>
</dl>
<div class="note">
<p class="first admonition-title">Note</p>
<p class="last">Do not forget to restart <tt class="docutils literal">amavis</tt> with <tt class="docutils literal">/etc/init.d/amavis restart</tt>.</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2012/04/02/spamassassin-amavis-and-ubuntu/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2012%2F04%2F02%2Fspamassassin-amavis-and-ubuntu%2F&amp;language=en_GB&amp;category=text&amp;title=SpamAssassin%2C+Amavis+and+ubuntu&amp;description=If+running+spamassassin+%28SA%29+with+amavis+without+usage+of+spamd+on+ubuntu+10.04+and+you+feel+that+sa-learn+isn%27t+working+right%2C+here+are+a+few+tips.+I+had+mails+that...&amp;tags=blog" type="text/html" />
	</item>
		<item>
		<title>extracting a subpart of a git repository with history</title>
		<link>https://vaab.blog.kal.fr/2011/06/29/extracting-a-subpart-of-a-git-repository-with-history/</link>
		<comments>https://vaab.blog.kal.fr/2011/06/29/extracting-a-subpart-of-a-git-repository-with-history/#comments</comments>
		<pubDate>Wed, 29 Jun 2011 08:10:24 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[comp]]></category>
		<category><![CDATA[dev]]></category>
		<category><![CDATA[sci]]></category>
		<category><![CDATA[tip]]></category>
		<category><![CDATA[git]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=233</guid>
		<description><![CDATA[I recently had to extract a subdirectory of an existing git repository towards its own new repository. And I wanted to keep all the commit information and history that was related to this directory. EDIT: Even if the solution I &#8230;<p class="read-more"><a href="https://vaab.blog.kal.fr/2011/06/29/extracting-a-subpart-of-a-git-repository-with-history/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>I recently had to extract a subdirectory of an existing git repository into its own new repository. And I wanted to keep all the commit information and history related to this directory.</p>
<p><strong>EDIT</strong>: Even if the solution I used was simpler than I would have expected, I was far from knowing that git had a 'filter-branch' command which does the job perfectly well (and it is <a class="reference external" href="http://progit.org/book/ch6-4.html#making_a_subdirectory_the_new_root">documented in ProGIT</a>). Thanks Jon for the tip.</p>
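<p>For reference, here is a hedged sketch of that <tt class="docutils literal"><span class="pre">filter-branch</span></tt> route, played on a throwaway repository. The function names, the directory names <tt class="docutils literal">sub</tt> and <tt class="docutils literal">other</tt> and the commit messages are all hypothetical; <tt class="docutils literal">FILTER_BRANCH_SQUELCH_WARNING</tt> only silences the warning that recent git versions print:</p>

```shell
# Hypothetical helper: commit everything currently in repository $1.
commit_all() {
    git -C "$1" add -A
    git -C "$1" commit -q -m "$2"
}

# Hypothetical demo: build a throwaway repository, then keep only "sub/".
extract_subdir_demo() {
    local repo
    repo=$(mktemp -d) || return 1
    git init -q "$repo" || return 1
    git -C "$repo" config user.name "Demo"
    git -C "$repo" config user.email "demo@example.com"
    mkdir "$repo/sub" "$repo/other"
    # Two commits touch sub/, one touches only other/.
    echo one > "$repo/sub/file.txt"
    commit_all "$repo" "first in sub"
    echo noise > "$repo/other/noise.txt"
    commit_all "$repo" "unrelated"
    echo two >> "$repo/sub/file.txt"
    commit_all "$repo" "second in sub"
    # Rewrite history so that sub/ becomes the root of the repository.
    FILTER_BRANCH_SQUELCH_WARNING=1 git -C "$repo" filter-branch \
        --subdirectory-filter sub -- --all > /dev/null || return 1
    printf '%s\n' "$repo"
}

repo=$(extract_subdir_demo)
git -C "$repo" log --format=%s
```

<p>The resulting log should list only the two commits that touched <tt class="docutils literal">sub/</tt>, with <tt class="docutils literal">file.txt</tt> now sitting at the root. On a real repository, work on a fresh clone, as the rewrite changes every commit hash.</p>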
<div class="section" id="sketching-the-tools-used">
<h3>Sketching the tools used</h3>
<p>The first step is to retrieve only the commit hashes related to this directory:</p>
<pre class="literal-block">
git log --format=%H -- MY_DIRECTORY
</pre>
<p>Then, for each of these hashes, the following command outputs a patch file with the commit information (subject, author...):</p>
<pre class="literal-block">
git format-patch HASH -1
</pre>
<p>Then, with some <tt class="docutils literal">sed</tt>, you'll have to change the directories used in all the generated patches:</p>
<pre class="literal-block">
sed -i s%MYOLDDIRECTORY%MYNEWDIRECTORY%g *.patch
</pre>
<p>The new directory in my case was &quot;&quot; (nothing) as it was in the root folder.</p>
<p>Finally, you'll have to apply your patches to your other repository with:</p>
<pre class="literal-block">
git am *.patch
</pre>
</div>
<div class="section" id="complete-commands-used">
<h3>Complete commands used</h3>
<p>The complete commands that I've used:</p>
<pre class="literal-block">
cd PARENT_OF_MYDIRECTORY
git log --format=%H -- MYDIRECTORY | tac | \
    (i=0
     while read -r h; do
         i=$((i+1))
         git format-patch &quot;$h&quot; -1 --start-number &quot;$i&quot;
     done)
</pre>
<p>This produces <tt class="docutils literal"><span class="pre">NNNN-*.patch</span></tt> files, one for each commit. Then update the paths in all patches:</p>
<pre class="literal-block">
sed -i s%MYDIRECTORY%%g *.patch
</pre>
<p>Finally, move the patches to the new repository and apply them:</p>
<pre class="literal-block">
mv *.patch MY_NEW_REPO
cd MY_NEW_REPO
git am *.patch
</pre>
<p>A quick check with <tt class="docutils literal">gitk</tt> confirmed that the commit messages and contents were transferred correctly.</p>
<p>Et voilà !</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2011/06/29/extracting-a-subpart-of-a-git-repository-with-history/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2011%2F06%2F29%2Fextracting-a-subpart-of-a-git-repository-with-history%2F&amp;language=en_GB&amp;category=text&amp;title=extracting+a+subpart+of+a+git+repository+with+history&amp;description=I+recently+had+to+extract+a+subdirectory+of+an+existing+git+repository+towards+its+own+new+repository.+And+I+wanted+to+keep+all+the+commit+information+and+history+that+was...&amp;tags=git%2Cblog" type="text/html" />
	</item>
		<item>
		<title>Forcing monospace font in HTML edit post in wordpress</title>
		<link>https://vaab.blog.kal.fr/2011/06/19/forcing-monospace-font-in-html-edit-post-in-wordpress/</link>
		<comments>https://vaab.blog.kal.fr/2011/06/19/forcing-monospace-font-in-html-edit-post-in-wordpress/#comments</comments>
		<pubDate>Sun, 19 Jun 2011 15:00:42 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[tip]]></category>
		<category><![CDATA[rest]]></category>
		<category><![CDATA[wordpress]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=209</guid>
		<description><![CDATA[I'm writing my blog posts in ReST, so naturally I want the HTML editing tab of WordPress to use a fixed-width font such as monospace.
Here's how to achieve this.<p class="read-more"><a href="https://vaab.blog.kal.fr/2011/06/19/forcing-monospace-font-in-html-edit-post-in-wordpress/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>I'm writing my blog posts in <a class="reference external" href="http://docutils.sourceforge.net/rst.html">ReST</a>, so naturally I want the <cite>HTML</cite> editing tab of WordPress to use a fixed-width font such as <tt class="docutils literal">monospace</tt>.
Here's how to achieve this.</p>
<p>Edit your own theme's file <tt class="docutils literal"><span class="pre">wp-content/themes/mytheme/functions.php</span></tt>, and add these lines:</p>
<pre class="literal-block">
...
add_action( 'admin_print_styles-post.php', 'my_admin_css' );
add_action( 'admin_print_styles-post-new.php', 'my_admin_css' );
function my_admin_css() {
     ?&gt;
     &lt;style type=&quot;text/css&quot;&gt;
     #editorcontainer textarea#content { font-family: monospace!important }
     &lt;/style&gt;
     &lt;?php
}
...
</pre>
<p>That's it!</p>
<p>Thanks a lot to the <a class="reference external" href="http://wordpress.org/support/topic/change-the-visual-editor-font-size?replies=11">WordPress forum</a>.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2011/06/19/forcing-monospace-font-in-html-edit-post-in-wordpress/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2011%2F06%2F19%2Fforcing-monospace-font-in-html-edit-post-in-wordpress%2F&amp;language=en_GB&amp;category=text&amp;title=Forcing+monospace+font+in+HTML+edit+post+in+wordpress&amp;description=I%27m+writing+my+blog+post+in+ReST+code%2C+so+naturally+I+want+that+the+HTML+edition+tab+of+wordpress+have+a+fixed-width+font+as+monospace.+Here%27s+how+to+achieve+this....&amp;tags=rest%2Cwordpress%2Cblog" type="text/html" />
	</item>
		<item>
		<title>Using ACL on linux system to mimic samba&#8217;s &#8220;force user&#8221;</title>
		<link>https://vaab.blog.kal.fr/2011/05/21/using-acl-on-linux-system-to-mimic-sambas-force-user/</link>
		<comments>https://vaab.blog.kal.fr/2011/05/21/using-acl-on-linux-system-to-mimic-sambas-force-user/#comments</comments>
		<pubDate>Sat, 21 May 2011 18:05:49 +0000</pubDate>
		<dc:creator><![CDATA[vaab]]></dc:creator>
				<category><![CDATA[tip]]></category>
		<category><![CDATA[acl]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[permission]]></category>
		<category><![CDATA[samba]]></category>

		<guid isPermaLink="false">http://vaab.blog.kal.fr/?p=115</guid>
		<description><![CDATA[How to enforce samba-like policy upon file or directory creation at the filesystem level thanks to linux ACL.<p class="read-more"><a href="https://vaab.blog.kal.fr/2011/05/21/using-acl-on-linux-system-to-mimic-sambas-force-user/">Read more &#187;</a></p>]]></description>
				<content:encoded><![CDATA[
<div class="document">


<!-- -*- mode: rst -*- -->
<p>I'm using Samba to share files across a network. I found <tt class="docutils literal">force user</tt> and <tt class="docutils literal">force group</tt> very useful for a typical team working together on a project.</p>
<p>Here is a sample excerpt from /etc/samba/smb.conf:</p>
<pre class="literal-block">
...
force user = %U
force group = dev-team
create mask = 0775
directory mask = 0775
force create mode = 0660
force directory mode = 0660
...
</pre>
<p>This ensures that any newly created file gets read and write permissions for members of the &quot;dev-team&quot; group.</p>
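<p>How these settings combine can be sketched with a bit of shell arithmetic: Samba ANDs the mode it computed for the new file with <tt class="docutils literal">create mask</tt>, then ORs the result with <tt class="docutils literal">force create mode</tt> (directories work the same way with their two settings). The function name and the starting modes 0744 and 0755 are only assumptions for illustration.</p>
<pre class="literal-block">
# Hypothetical helper: combine a requested mode with a mask and a force mode.
# $1: mode before masking, $2: mask, $3: force mode (octal, with leading 0)
samba_effective_mode() {
    printf '%04o\n' "$(( ($1 & $2) | $3 ))"
}

# A file that would be created as 0744, with the settings above.
samba_effective_mode 0744 0775 0660   # prints 0764

# A directory that would be created as 0755, with the settings above.
samba_effective_mode 0755 0775 0660   # prints 0775
</pre>
<p>Whatever mode the client asks for, the OR step sets the read and write bits for the group.</p>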
<p>But I have a bunch of users who access these directories directly, and when they don't go through Samba, the policy is not enforced.</p>
<p>The solution is quite simple and involves Linux <a class="reference external" href="http://en.wikipedia.org/wiki/Access_control_list">ACLs</a>.</p>
<div class="section" id="install-acl">
<h3>Install ACL</h3>
<p>Your system probably already has the <tt class="docutils literal">acl</tt> package installed. You can install it (or make sure it is already there) by typing:</p>
<pre class="literal-block">
apt-get install acl
</pre>
<p>Then edit your <tt class="docutils literal">/etc/fstab</tt> to add the <tt class="docutils literal">acl</tt> option to the partition on which you need ACLs. My fstab looks like:</p>
<pre class="literal-block">
...
UUID=9cd6deba-e3a6-427c-82e1-48cfaf0793b1 /shares ext4 defaults,acl 0 2
...
</pre>
<p>Once <tt class="docutils literal">/etc/fstab</tt> is edited, you can remount your partition on the fly with:</p>
<pre class="literal-block">
mount -o remount /shares
</pre>
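<p>To check that the remounted partition really accepts ACLs, one option is to try setting an entry on a throwaway file. This is only a sketch: <tt class="docutils literal">acl_supported</tt> is a made-up helper name, and it assumes that <tt class="docutils literal">setfacl</tt> and <tt class="docutils literal">mktemp</tt> are available and that you can write to the directory.</p>
<pre class="literal-block">
# Hypothetical helper: returns 0 if a named-user ACL entry can be set on a
# temporary file created in the given directory, 1 if it cannot, and 2 if
# the temporary file could not be created at all.
acl_supported() {
    probe=$(mktemp "$1/acl-probe.XXXXXX") || return 2
    rc=1
    # A named entry (u:USER:rw) needs real ACL support, unlike plain mode bits.
    if setfacl -m "u:$(id -un):rw" "$probe"; then
        rc=0
    fi
    rm -f "$probe"
    return "$rc"
}

if acl_supported /shares; then
    echo "ACLs are usable on /shares"
else
    echo "ACLs are not usable on /shares (or it is not writable)"
fi
</pre>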
</div>
<div class="section" id="using-acl">
<h3>Using ACL</h3>
<p>I can mimic the behavior of Samba's policy by simply typing:</p>
<pre class="literal-block">
setfacl -R -d -m u::rwx,g:dev-team:rwx,o::r-x /shares
</pre>
<p>Options are:</p>
<table class="docutils option-list" frame="void" rules="none">
<col class="option" />
<col class="description" />
<tbody valign="top">
<tr><td class="option-group">
<kbd><span class="option">-R</span></kbd></td>
<td>is the same as -R for chown or chmod: it applies recursively to everything below the directory.</td></tr>
<tr><td class="option-group">
<kbd><span class="option">-d</span></kbd></td>
<td>operates on the default ACL, which newly created files and directories inherit.</td></tr>
<tr><td class="option-group">
<kbd><span class="option">-m</span></kbd></td>
<td><p class="first">modifies the ACL by adding the following permission rules, expressed as:</p>
<div class="last line-block">
<div class="line">u::rwx               same as chmod u+rwx</div>
<div class="line">g:dev-team:rwx       roughly chgrp dev-team AND chmod g+rwx, but as an extra entry that leaves the owning group unchanged</div>
<div class="line">o::r-x               same as chmod o+rx</div>
</div>
</td></tr>
</tbody>
</table>
<p>You can check ACLs of a given directory with <tt class="docutils literal">getfacl</tt>:</p>
<pre class="literal-block">
$ getfacl /shares/assistance/
# file: shares/assistance/
# owner: musicalta
# group: assistance
# flags: -s-
user::rwx
group::rwx
other::r-x
default:user::rwx
default:group::rwx
default:group:dev-team:rwx
default:mask::rwx
default:other::r-x
</pre>
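<p>To see a default ACL at work without touching <tt class="docutils literal">/shares</tt>, the same steps can be replayed on a scratch directory. This is only a sketch: <tt class="docutils literal">demo_default_acl</tt> is a made-up helper name, and it uses <tt class="docutils literal">g::rwx</tt> (the owning group) instead of <tt class="docutils literal"><span class="pre">g:dev-team:rwx</span></tt> so that it does not depend on a dev-team group existing.</p>
<pre class="literal-block">
# Hypothetical helper: set a default ACL on a scratch directory and show
# what a new file and a new subdirectory inherit from it.
demo_default_acl() {
    demo_dir=$(mktemp -d) || return 1

    # Default ACL on the scratch directory itself (no -R needed here).
    setfacl -d -m u::rwx,g::rwx,o::r-x "$demo_dir"

    # Entries created afterwards pick up the default ACL.
    touch "$demo_dir/newfile"
    mkdir "$demo_dir/newdir"

    # newdir carries the "default:" entries on; newfile only gets an access ACL.
    getfacl "$demo_dir/newdir" "$demo_dir/newfile"

    rm -rf "$demo_dir"
}

demo_default_acl
</pre>
<p>The listing for the subdirectory should show the same <tt class="docutils literal">default:</tt> lines as its parent, which is what keeps the policy going down the tree.</p>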
<p>If you need more info on the topic, this blog post helped me:</p>
<p><a class="reference external" href="http://brunogirin.blogspot.com/2010/03/shared-folders-in-ubuntu-with-setgid.html">http://brunogirin.blogspot.com/2010/03/shared-folders-in-ubuntu-with-setgid.html</a></p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>https://vaab.blog.kal.fr/2011/05/21/using-acl-on-linux-system-to-mimic-sambas-force-user/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<atom:link rel="payment" title="Flattr this!" href="https://flattr.com/submit/auto?user_id=vaab&amp;popout=1&amp;url=https%3A%2F%2Fvaab.blog.kal.fr%2F2011%2F05%2F21%2Fusing-acl-on-linux-system-to-mimic-sambas-force-user%2F&amp;language=en_GB&amp;category=text&amp;title=Using+ACL+on+linux+system+to+mimic+samba%26%238217%3Bs+%26%238220%3Bforce+user%26%238221%3B&amp;description=I%27m+using+samba+to+share+files+across+a+network.+I+found+force+user+and+force+group+very+usefull+in+a+classical+team+working+together+on+project.+This+is+the+sample...&amp;tags=acl%2Clinux%2Cpermission%2Csamba%2Cblog" type="text/html" />
	</item>
	</channel>
</rss>
