My CSV file is tab delimited, and I am trying to filter out the p values that are greater than 0.05 (in another way, I want to keep the entries with p <= 0.05). The p values are in the 7th column, and I tried to use the following:
 awk '$7 <= 0.05 {print $0}' rawFile.csv > filteredFile.csv
But this filtering does not work, it returns the file without filtering.
The p-values in the column #7 are something like this: 0.33532935, 0.0, 0.8591287
Try this one:
awk 'BEGIN {FS='\t'} {if ($7 < 0.05) print $0}'
The BEGIN clause gives you a place to change the default Field Separator (FS) to a tab character ('\t'). This won't work in older versions of awk (and gawk might be a helpful alternative).
The main logic happens inside the second set of curly braces ... where you'll say to print the line if column 7 is <= 0.05.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With