Sorry if this is too basic. I have a CSV file where the columns have a header row (v1, v2, etc.). I understand that to extract columns 1 and 2, I can do:
awk -F "," '{print $1 "," $2}' infile.csv > outfile.csv
But what if I have to extract, say, columns 1 to 10, 20 to 25, and 30 and 33? As an addendum, is there any way to extract columns directly by their header names rather than by column numbers?
cut is another command-line utility worth considering. You need to specify the delimiter (-d) because some files use spaces, tabs, or colons rather than commas to separate columns. From its synopsis:
cut -b list [-n] [file ...]
cut -c list [file ...]
cut -f list [-d delim] [-s] [file ...]
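For example, to pull the login name and home directory (fields 1 and 6) from the colon-delimited /etc/passwd:
cut -d: -f1,6 /etc/passwd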
The awk field separator (FS) controls how awk splits a record into fields. It can be a single character or a regular expression; once you set FS to a regular expression, awk splits each input record wherever that pattern matches.
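For instance, a character class as FS splits on either delimiter (mixed.txt here is just an illustrative file name):
awk -F '[,;]' '{print $1, $3}' mixed.txt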
I don't know if it's possible to do ranges in awk. You could do a for loop, but you would have to add handling to filter out the columns you don't want. It's probably easier to do this:
awk -F, '{OFS=",";print $1,$2,$3,$4,$5,$6,$7,$8,$9,$10,$20,$21,$22,$23,$24,$25,$30,$33}' infile.csv > outfile.csv
Something else to consider, and this is faster and more concise:
cut -d "," -f1-10,20-25,30-33 infile.csv > outfile.csv
As to the second part of your question, I would probably write a script in perl that knows how to handle header rows, parsing the column names from stdin or a file and then doing the filtering. It's probably a tool I would want to have around for other things. I am not sure about doing it in a one-liner, although I am sure it can be done.
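That said, here is a rough awk sketch of the idea (awk rather than the perl script I described): read the header row, map each name to its column number, then print the requested names in order. The cols list (v1, v5, v20) is just an example of names you might ask for:
awk -F, -v OFS=, -v cols="v1,v5,v20" '
NR == 1 {
  # remember which column each header name lives in
  for (i = 1; i <= NF; i++) pos[$i] = i
  n = split(cols, want, ",")
}
{
  line = ""
  for (j = 1; j <= n; j++) line = line (j > 1 ? OFS : "") $(pos[want[j]])
  print line
}' infile.csv > outfile.csv
Because the header line also falls through to the second block, the selected header names are printed on the first output line as well.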