I'm interested whether I could use a single grep command for the following situation. I have a dhcpd.conf file in which DHCP hosts are defined. Given the hostname, I need to find its MAC address in the dhcpd.conf file. I need to use it to disable its PXE boot config, but that's not part of this question. The file's syntax is uniform, but I still want to make it a little fool-proof. Here is how the hosts are defined: <pre class="prettyprint"><code> host client1 { hardware ethernet 12:23:34:56:78:89; fixed-address 192.168.1.11; filename "pxelinux.0"; } host client2 { hardware ethernet 23:34:45:56:67:78; fixed-address 192.168.1.12; filename "pxelinux.0"; } host client3 { hardware ethernet AB:CD:EF:01:23:45; fixed-address 192.168.1.13; filename "pxelinux.0"; } host client4 { hardware ethernet C1:CA:88:FA:F4:90; fixed-address 192.168.1.14; filename "pxelinux.0"; } </code></pre> We assume that all configurations take only one line, even though the dhcpd.conf syntax would allow to break options to several lines. We assume that the order of options may differ, however. I came up with the following grep command: <pre class="prettyprint"><code>grep -o "^[^#]*host.*${DHCP_HOSTNAME}.*hardware ethernet.*..:..:..:..:..:..;" /etc/dhcp/dhcpd-hosts.conf </code></pre> It is supposed to ignore lines those are commented, allow arbitrary whitespace between tokens, and match until the end of the MAC address. When I run it, I get lines like this: <pre class="prettyprint"><code>host client1 { hardware ethernet 12:23:34:56:78:89; </code></pre> This is great! But the point is that I only need a MAC address, without the preceding trash. Now I know that using another grep, or cut, or awk to cut only the MAC address from this output would be trivial. But I wonder, is there a way to use a single grep command to get the end result, without having to pipe this output into another filter? Obviously I can't leave out the beginning of the pattern, because I want to get a specific hostname, thus matching for "<code>..:..:..:..:..:..</code>" would give me all the MAC addresses. Once again, I want a single command (not necessarily grep) which cuts out only the proper MAC address from the file. Thus I am not interested in any solutions those say "grep ... | grep ..." or "grep ... | cut ...", etc.. Of course, in practice, nothing bad happens if I use multiple filters and pipe them, I am just curious whether it is possible to solve with one filter. I would assign the output to a variable.

You can use a Perl one-liner to match each line of the file against a single regex with an appropriate capture group, and for each line that matches you can print the submatch. There are several ways to use Perl for this task. I suggest going with the <code>perl -ne {program}</code> idiom, which implicitly loops over the lines of stdin and executes the one-liner <code>{program}</code> once for each line, with the current line made available as the <code>$_</code> special variable. (Note: The <code>-n</code> option does not cause the final value of <code>$_</code> to be automatically printed at the end of each iteration of the implicit loop, which is what the <code>-p</code> option would do; that is, <code>perl -pe {program}</code>.) Below is the solution. Note that I decided to pass the target hostname using the obscure <code>-s</code> option, which enables parsing of variable assignment specifications after the <code>{program}</code> argument, similar to awk's <code>-v</code> option. (It is not possible to pass normal command-line arguments with the <code>-n</code> option because the implicit <code>while (<>) { ... }</code> loop gobbles up all such arguments for file names, but the <code>-s</code> mechanism provides an excellent solution. See Is it possible to pass command-line arguments to @ARGV when using the -n or -p options?.) This design prevents the need to embed the <code>$DHCP_HOSTNAME</code> variable in the <code>{program}</code> string itself, which allows us to single-quote it and save a few (actually 8) backslashes. <pre class="prettyprint"><code>DHCP_HOSTNAME='client3'; perl -nse 'print($1) if m(^\s*host\s*$host\s*\{.*\bhardware\s*ethernet\s*(..:..:..:..:..:..));' -- -host="$DHCP_HOSTNAME" <dhcpd.cfg; ## AB:CD:EF:01:23:45 </code></pre> I often prefer Perl to <code>sed</code> for the following reasons: <ul> <li>Perl provides a complete general-purpose programming environment, whereas <code>sed</code> is more limited.</li> <li>Perl has an enormous repository of publicly available modules on CPAN which can easily be installed and then used with the <code>-M{module}</code> option. <code>sed</code> is not extensible.</li> <li>Perl has a much more powerful regular expression engine than sed, with lookaround assertions, backtracking control verbs, within-regex and replacement Perl code, many more options and special escapes, embedded group options, and more. See perlre.</li> <li>Counter-intuitively, despite its greater sophistication, Perl is often much faster than <code>sed</code> due to its two-pass process and highly optimized opcode implementation. See http://rc3.org/2014/08/28/surprisingly-perl-outperforms-sed-and-awk/ for example.</li> <li>I often find that the equivalent Perl implementation is more intuitive than that of <code>sed</code>, since <code>sed</code> has a more primitive set of commands for manipulating the underlying text.</li> </ul>

I'd choose sed for this, because you can use a regexp for line addressing: <pre class="prettyprint"><code>sed -e "/host *${DHCP_HOSTNAME}/!d" -e "s/*.$hardware [^;]*$.*/\1/g" </code></pre> The first expression deletes all lines not matching <code>${DHCP_HOSTNAME}</code> (you might want to massage this in the shell if you might have any regexp metacharacters in your hostnames, but I'll assume you don't). The second expression matches the hardware address portion, and deletes the rest of the line.

Print only a part of a match with grep

Tags:

linux

grep

shell

unix

gnu

I'm interested whether I could use a single grep command for the following situation.

I have a dhcpd.conf file in which DHCP hosts are defined. Given the hostname, I need to find its MAC address in the dhcpd.conf file. I need to use it to disable its PXE boot config, but that's not part of this question.

The file's syntax is uniform, but I still want to make it a little fool-proof. Here is how the hosts are defined:

    host client1 { hardware ethernet 12:23:34:56:78:89; fixed-address 192.168.1.11; filename "pxelinux.0"; }
    host client2 { hardware ethernet 23:34:45:56:67:78; fixed-address 192.168.1.12; filename "pxelinux.0"; }
    host client3 { hardware ethernet AB:CD:EF:01:23:45; fixed-address 192.168.1.13; filename "pxelinux.0"; }
    host client4 { hardware ethernet C1:CA:88:FA:F4:90; fixed-address 192.168.1.14; filename "pxelinux.0"; }

We assume that all configurations take only one line, even though the dhcpd.conf syntax would allow to break options to several lines. We assume that the order of options may differ, however.

I came up with the following grep command:

grep -o "^[^#]*host.*${DHCP_HOSTNAME}.*hardware ethernet.*..:..:..:..:..:..;" /etc/dhcp/dhcpd-hosts.conf

It is supposed to ignore lines those are commented, allow arbitrary whitespace between tokens, and match until the end of the MAC address. When I run it, I get lines like this:

host client1 { hardware ethernet 12:23:34:56:78:89;

This is great! But the point is that I only need a MAC address, without the preceding trash. Now I know that using another grep, or cut, or awk to cut only the MAC address from this output would be trivial. But I wonder, is there a way to use a single grep command to get the end result, without having to pipe this output into another filter? Obviously I can't leave out the beginning of the pattern, because I want to get a specific hostname, thus matching for "..:..:..:..:..:.." would give me all the MAC addresses.

Once again, I want a single command (not necessarily grep) which cuts out only the proper MAC address from the file. Thus I am not interested in any solutions those say "grep ... | grep ..." or "grep ... | cut ...", etc..

Of course, in practice, nothing bad happens if I use multiple filters and pipe them, I am just curious whether it is possible to solve with one filter.

I would assign the output to a variable.

689

asked Aug 30 '16 15:08

MegaBrutal

2 Answers

You can use a Perl one-liner to match each line of the file against a single regex with an appropriate capture group, and for each line that matches you can print the submatch.

There are several ways to use Perl for this task. I suggest going with the perl -ne {program} idiom, which implicitly loops over the lines of stdin and executes the one-liner {program} once for each line, with the current line made available as the $_ special variable. (Note: The -n option does not cause the final value of $_ to be automatically printed at the end of each iteration of the implicit loop, which is what the -p option would do; that is, perl -pe {program}.)

Below is the solution. Note that I decided to pass the target hostname using the obscure -s option, which enables parsing of variable assignment specifications after the {program} argument, similar to awk's -v option. (It is not possible to pass normal command-line arguments with the -n option because the implicit while (<>) { ... } loop gobbles up all such arguments for file names, but the -s mechanism provides an excellent solution. See Is it possible to pass command-line arguments to @ARGV when using the -n or -p options?.) This design prevents the need to embed the $DHCP_HOSTNAME variable in the {program} string itself, which allows us to single-quote it and save a few (actually 8) backslashes.

DHCP_HOSTNAME='client3';
perl -nse 'print($1) if m(^\s*host\s*$host\s*\{.*\bhardware\s*ethernet\s*(..:..:..:..:..:..));' -- -host="$DHCP_HOSTNAME" <dhcpd.cfg;
## AB:CD:EF:01:23:45

I often prefer Perl to sed for the following reasons:

Perl provides a complete general-purpose programming environment, whereas sed is more limited.
Perl has an enormous repository of publicly available modules on CPAN which can easily be installed and then used with the -M{module} option. sed is not extensible.
Perl has a much more powerful regular expression engine than sed, with lookaround assertions, backtracking control verbs, within-regex and replacement Perl code, many more options and special escapes, embedded group options, and more. See perlre.
Counter-intuitively, despite its greater sophistication, Perl is often much faster than sed due to its two-pass process and highly optimized opcode implementation. See http://rc3.org/2014/08/28/surprisingly-perl-outperforms-sed-and-awk/ for example.
I often find that the equivalent Perl implementation is more intuitive than that of sed, since sed has a more primitive set of commands for manipulating the underlying text.

114

answered Oct 16 '22 04:10

bgoldst

I'd choose sed for this, because you can use a regexp for line addressing:

sed -e "/host  *${DHCP_HOSTNAME}/!d" -e "s/*.\(hardware [^;]*\).*/\1/g"

The first expression deletes all lines not matching ${DHCP_HOSTNAME} (you might want to massage this in the shell if you might have any regexp metacharacters in your hostnames, but I'll assume you don't).

The second expression matches the hardware address portion, and deletes the rest of the line.

answered Oct 16 '22 03:10

Toby Speight

Related questions
                            
                                Check if Qt c++ application is running as sudo [duplicate]
                            
                                Is any kind of keepalive necessary on localhost socket?
                            
                                django settings.py os.environ.get("X") not fetching correct values
                            
                                Linux Virtual USB device driver
                            
                                BigQuery: Does bq load command support loading from named pipe as a source?
                            
                                POSIX partial write()
                            
                                Where do my Python prints got with Flask deployed under nginx with uWSGI?
                            
                                File Descriptors and File Handles (and C)
                            
                                How Linux decides what `malloc` to use?
                            
                                List the directories whose the file sizes are all within a range
                            
                                Java on ubuntu is out of memory, but plenty of cached memory present
                            
                                How to grab a particular log file and show its content in jenkins Console output
                            
                                What does the "cxa" prefix mean in "__cxa_demangle"?
                            
                                Monitoring (Sniffing) /dev/ttyUSB0 created by FTDI USB Serial Converter
                            
                                Seg fault...on hello world
                            
                                How do I run a Docker container on a USB drive?
                            
                                Calling Rscript from linux shell script
                            
                                Get timestamps by line with iperf3 in bash script
                            
                                Recording the java stack-trace when a specific system call is made?
                            
                                Sending float values on socket C/C++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With