How to escape commands passed to find with the exec option on Linux

Tags: linux, find, escaping, sed, exec

Let me break down my problem into the simplest example I can.

Create a test file containing one line of text.

[root@myserver ] /tmp> echo "test ReplaceMe DoNotReplaceMe" > /tmp/daj.txt

We have an existing find command that we use to substitute text in all the files that match it (in this example I've simplified this command to only work on one file, and stripped out the other stuff it does).

The problem is that it substitutes "ReplaceMe" everywhere it appears, instead of only when it is a word on its own.

[root@myserver ] /tmp> find /tmp/daj.txt -exec sh -c 'f="{}"; sed -e 's/ReplaceMe/#DONE#/gi' "${f#.}" ' \;
test #DONE# DoNot#DONE#

I've written a new sed command to only substitute "ReplaceMe" when it is a word on its own, but NOT when it is a substring of another word. The output from this command is correct.

[root@myserver ] /tmp> cat /tmp/daj.txt | sed -e 's/\(\W\)\(ReplaceMe\)\(\W\)/\1#DONE#\3/gi'    
test #DONE# DoNotReplaceMe

When I try to incorporate the updated sed command into the find command, it breaks. It looks like I am hitting an escaping problem, but I haven't managed to solve it by adding extra escaping.

[root@myserver ] /tmp> find /tmp/daj.txt -exec sh -c 'f="{}"; sed -e 's/\(\W\)\(ReplaceMe\)\(\W\)/\1#DONE#\3/gi' "${f#.}" ' \;
sh: -c: line 0: syntax error near unexpected token `('
sh: -c: line 0: `f="/tmp/daj.txt"; sed -e s/(W)(ReplaceMe)(W)/1#DONE#3/gi "${f#.}" '

Is there a way to escape my sed command so that I can run it via find, or do I have to look for an alternative solution?

Update: The full find command we are running prints out the filename and permissions, and then pipes the output of the sed to md5sum. Here's an example of it running and matching multiple files:

[root@myserver ] ~> find /tmp -regex '.*daj.*\.txt' -printf '%p %m ' -exec sh -c 'f="{}"; sed  -e 's/ReplaceMe/#DONE#/gi' "${f#.}" | md5sum' \;
/tmp/daj2.txt 644 d52bbd311552234b761bcae694c2055a  -
/tmp/daj.txt 644 d52bbd311552234b761bcae694c2055a  -

340

asked Jan 06 '11 00:01

Dan J

2 Answers

You should not use {} directly inside the shell script; pass the file names in as shell parameters instead. Also, if you want to limit matches to whole words, use \<word\> in sed (a GNU extension). Unlike \(\W\), these anchors also match when the word sits at the start or end of a line.
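As for why the original attempt broke: a single-quoted string cannot contain a single quote, so the first inner ' closes the outer string and the sed script ends up unquoted. Here is a small sketch of what the outer shell does with it, and of the usual '\'' workaround. The variable names `broken` and `escaped` are just for illustration.

```shell
# Nested single quotes: the first inner ' ends the outer string, so the sed
# script is unquoted and the outer shell strips its backslashes.
set -- 'sed -e 's/\(\W\)\(ReplaceMe\)\(\W\)/\1#DONE#\3/gi' "$f"'
broken=$1
printf '%s\n' "$broken"

# Writing each inner quote as '\'' (close, escaped quote, reopen) keeps the
# sed script intact in the string that sh -c receives.
set -- 'sed -e '\''s/\(\W\)\(ReplaceMe\)\(\W\)/\1#DONE#\3/gi'\'' "$f"'
escaped=$1
printf '%s\n' "$escaped"
```

The first line printed has the same backslash-free s/(W)(ReplaceMe)(W)/1#DONE#3/gi that shows up in the syntax error from the question. The second still has its quotes and backslashes.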

Update

find /tmp -regex '.*daj.*\.txt' -printf '%p %m ' -exec sh -c "sed  -e 's/\<ReplaceMe\>/#DONE#/gi' \$@ | md5sum" _ {} \;

Output

$ find . -regex '.*daj.*\.txt' -printf '%p %m ' -exec sh -c "sed  -e 's/\<ReplaceMe\>/#DONE#/gi' \$@ | md5sum" _ {} \;
./daj2.txt 664 ea324b4721ed037dbc2402ded4446005  -
./daj.txt 664 0bbb9104da99c1c1187a2a35e6ac0e9b  -

176

answered Jan 02 '23 12:01

SiegeX

This doesn't answer your question about the escape sequence but does solve the problem. I'd basically use xargs with sed like this:

$ find ~/tmp/data.txt | xargs sed -e 's/\<replaceme\>/1234/'
1234 in this sentance
donotreplaceme in this sentance
$

and the contents of data.txt:

replaceme in this sentance
donotreplaceme in this sentance

Also, if your filenames might contain spaces, the -print0 option tells find to output the list of files as null-terminated strings. Otherwise xargs will treat the space in a filename as the end of that filename. When using xargs you then need the -0 option to tell it that the input is a list of null-terminated strings. Example below:

find /somedir -print0 | xargs -0 command
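To make that concrete, here is a sketch that runs the same sed substitution on a file whose name contains a space, assuming GNU sed for \< and \>. The scratch directory and the name "my data.txt" are invented for the example. It also shows find's own -exec ... {} + form, which batches file names without going through xargs.

```shell
# Scratch directory holding one file whose name contains a space.
tmpdir=$(mktemp -d)
printf 'replaceme in this sentence\ndonotreplaceme in this sentence\n' \
  > "$tmpdir/my data.txt"

# NUL-delimited hand-off: the space in "my data.txt" survives intact.
xargs_out=$(find "$tmpdir" -type f -print0 |
  xargs -0 sed -e 's/\<replaceme\>/1234/')

# Alternative without xargs: find passes the file names to sed directly.
exec_out=$(find "$tmpdir" -type f -exec sed -e 's/\<replaceme\>/1234/' {} +)

printf '%s\n' "$xargs_out"
rm -rf "$tmpdir"
```

Both commands should give the same text, with only the standalone word replaced.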