Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Performing regex capture and then substitute using SED/PERL

Tags:

regex

sed

perl

I have a data that looks like this (let's call this file submit.txt):

dir1/pmid_5409464.txt
dir1/pmid_5788247.txt
dir1/pmid_4971884.txt

What I want to do is to perform an inline file regex change so that it results in the following

perl mycode.pl /home/neversaint/dir1/pmid_5409464.txt > /home/neversaint/dir1/pmid_5409464.output
perl mycode.pl/home/neversaint/dir1/pmid_5788247.txt > /home/neversaint/dir1/pmid_5788247.output
perl mycode.pl /home/neversaint/dir1/pmid_4971884.txt > /home/neversaint/dir1/pmid_4971884.output

Is there a SED/Perl one liner to do that?

My difficulty is in capturing the input file name and then create the output file (.output) - for each line - based on that. I'm stuck with this:

sed 's/^/perl mycode.pl \/home\/neversaint\/dir1\//g' submit.txt |
sed 's/$/ >/'
like image 598
neversaint Avatar asked Jul 31 '13 02:07

neversaint


3 Answers

You asked for a Sed one-liner, you got it.

sed 's/\([^.]*\)\.txt/perl mycode.pl \/home\/neversaint\/\1.txt > \/home\/neversaint\/\1.output/' submit.txt > output.txt

like image 37
AlienHoboken Avatar answered Nov 11 '22 10:11

AlienHoboken


You can use escaped parenthesis to capture groups, and access the groups with \1, \2, etc.

sed 's/^\(.*\).txt$/perl mycode.pl \/home\/neversaint\/\1\.txt > \/home\/neversaint\/\1.output/' submit.sh

output:

perl mycode.pl /home/neversaint/dir1/pmid_5409464.txt > /home/neversaint/dir1/pmid_5409464.output
perl mycode.pl /home/neversaint/dir1/pmid_5788247.txt > /home/neversaint/dir1/pmid_5788247.output
perl mycode.pl /home/neversaint/dir1/pmid_4971884.txt > /home/neversaint/dir1/pmid_4971884.output

edit: it doesn't look like sed has a built-in in place file editing (GNU sed has the -i option). It still possible to do but this solution just prints to standard out. You could also use a Perl one liner as shown here: sed edit file in place

like image 128
hmatt1 Avatar answered Nov 11 '22 10:11

hmatt1


The perl oneliner for doing the same is

perl -pe "s@(.*?)(\.txt)@perl mycode.pl /home/neversaint/\\1\\2 > /home/neversaint/\\1.output@g" submit.txt

The above command will produce a replaced string in the console and you have to redirect the output to another file.

For replacing within the file (inline replace) you can add -i option . For eg

perl -pe "s@(.*?)(.txt)@perl mycode.pl /home/neversaint/\1\2 > /home/neversaint/\1.output@g" -i submit.txt

The above will perform a replace within the submit.txt file itself.

like image 1
Ramg Avatar answered Nov 11 '22 11:11

Ramg