How to find all words appearing between parentheses?

Tags:

grep

bash

I have a file containing some words in parentheses. I'd like to compile a list of all of the unique words appearing there, e.g.:

This is some (text).
This (text) has some (words) in parenthesis.
Sometimes, there are numbers, such as (123) in parenthesis too.

This would be the resulting list:

text
words
123

How can I list all of the items appearing between parentheses?

271

asked May 19 '12 01:05

Village

3 Answers

You can use awk like this:

awk -F "[()]" '{ for (i=2; i<NF; i+=2) print $i }' file.txt

prints:

text
text
words
123

You can use an array to print the unique values:

awk -F "[()]" '{ for (i=2; i<NF; i+=2) if (!array[$i]++) print $i }' file.txt

prints:

text
words
123
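To see why the loop visits only the even-numbered fields: splitting on [()] leaves the text outside the parentheses in the odd fields and the text inside them in the even ones. Here is a minimal sketch, assuming balanced, non-nested parentheses. The sample line, the seen array and the unique_paren_fields function are illustrative names, not part of the answer above.

```shell
line='This (text) has some (words) and more (text).'

# Show how -F "[()]" splits the line: the even-numbered fields are the
# parenthesised parts.
printf '%s\n' "$line" |
    awk -F "[()]" '{ for (i=1; i<=NF; i++) printf "%d: [%s]\n", i, $i }'

# Print each parenthesised field the first time it is seen.
# seen[$i]++ is 0 (false) on the first encounter and non-zero afterwards.
unique_paren_fields() {
    awk -F "[()]" '{ for (i=2; i<NF; i+=2) if (!seen[$i]++) print $i }' "$@"
}

printf '%s\n' "$line" | unique_paren_fields
```

The last command prints text and words once each, in order of first appearance.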

HTH

195

answered Oct 07 '22 16:10

Steve

With GNU grep, you can use a Perl-compatible regex with look-around assertions to exclude the parens:

grep -Po '(?<=\().*?(?=\))' file.txt | sort -u
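Where grep has no -P option (the BSD grep shipped with macOS, for instance), something along these lines might serve as a rough equivalent. It is only a sketch: it assumes the parentheses are not nested and that grep supports -o, and paren_words is a made-up helper name.

```shell
# Hypothetical helper: list the unique strings found between parentheses
# in the file named by $1.
paren_words() {
    # In a basic regex an unescaped ( or ) is a literal character, so this
    # matches "(", a run of non-paren characters, then ")".
    grep -o '([^()]*)' "$1" |
        tr -d '()' |         # strip the surrounding parens
        LC_ALL=C sort -u     # sort bytewise and drop duplicates
}

# Sample input copied from the question.
sample=$(mktemp)
printf '%s\n' \
    'This is some (text).' \
    'This (text) has some (words) in parenthesis.' \
    'Sometimes, there are numbers, such as (123) in parenthesis too.' > "$sample"

paren_words "$sample"
rm -f "$sample"
```

Unlike the awk version, the result comes out sorted rather than in order of appearance.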

answered Oct 07 '22 17:10

glenn jackman

grep -oE '\([[:alnum:]]*?\)' file.txt | sed 's/[()]//g' | sort | uniq

-o Only prints the matching text
-E means use extended regular expressions
\( means match a literal paren
[[:alnum:]] is the POSIX character class for letters and numbers.

The sed script strips out the parens. This was tested with GNU grep but BSD sed, so be wary.

answered Oct 07 '22 18:10

Matt K
