pipe one long line as multiple lines

Question

Say I have a bunch of XML files which contain no newlines, but basically contain a long list of records, delimited by </record><record>

If the delimiter were </record>\n<record> (a newline between the tags), I would be able to do something like cat *.xml | grep xyz | wc -l to count the records of interest, because cat would emit the records one per line.

Is there a way to write SOMETHING *.xml | grep xyz | wc -l where SOMETHING streams out the records one per line? I tried using awk for this but couldn't find a way to avoid reading the whole file into memory.

Hopefully the question is clear enough :)

Beta · Accepted Answer

This is a little ugly, but it works:

sed 's|</record>|</record>\
|g' *.xml | grep xyz | wc -l

(Yes, I know I could make it a little bit shorter, but only at the cost of clarity.)
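To see the effect on a small input, here is a minimal sketch. The file sample.xml and its contents are made up for illustration, and the awk variant is an assumed alternative, not part of the answer above: it relies on a multi-character RS, which POSIX leaves unspecified (gawk and mawk treat it as a regular expression).

```shell
# Hypothetical sample: three records on a single line, no newlines anywhere.
tmpdir=$(mktemp -d)
printf '%s' '<record>a xyz</record><record>b</record><record>xyz c</record>' > "$tmpdir/sample.xml"

# The sed pipeline from the answer: break the line after each </record>,
# then count the lines that mention xyz.
count_sed=$(sed 's|</record>|</record>\
|g' "$tmpdir"/*.xml | grep xyz | wc -l)

# Assumed alternative: let awk split the input on the closing tag and
# count the matching records itself (needs an awk with multi-character RS).
count_awk=$(awk -v RS='</record>' '/xyz/ { n++ } END { print n + 0 }' "$tmpdir"/*.xml)

echo "sed: $count_sed, awk: $count_awk"
rm -rf "$tmpdir"
```

Both counts should come out as 2 for this sample. Keep in mind that sed still reads each input line whole before editing it, so on a file with no newlines the awk form may be the gentler one on memory.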

Prince John Wesley · Answer

If the record body contains no <, / or > characters, you can try one of these:

grep -E -o 'SEARCH_STRING[^<]*</record>' *.xml | wc -l

or

grep -E -o 'SEARCH_STRING[^/]*/record>' *.xml | wc -l

or

grep -E -o 'SEARCH_STRING[^>]*>' *.xml | wc -l
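As a quick check of the first form, here is a small sketch on a made-up file, with xyz standing in for SEARCH_STRING. It assumes a grep that supports -o (GNU and BSD grep do; the option is not in POSIX).

```shell
# Hypothetical sample file; "xyz" stands in for SEARCH_STRING.
tmpdir=$(mktemp -d)
printf '%s' '<record>a xyz</record><record>b</record><record>xyz c</record>' > "$tmpdir/sample.xml"

# -o prints each match on its own line, so the matches can be inspected...
grep -E -o 'xyz[^<]*</record>' "$tmpdir"/*.xml

# ...and wc -l then counts matching records rather than matching lines
# (the whole file is a single line here).
count=$(grep -E -o 'xyz[^<]*</record>' "$tmpdir"/*.xml | wc -l)

echo "count: $count"
rm -rf "$tmpdir"
```

The count for this sample should be 2. With several input files grep prefixes each match with its file name, which does not change the line count.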

Tags: bash, shell, scripting, zsh, awk

nicolaskruchten
