
How to delete the first column (which is in fact the row names) from a data file in Linux?

Tags:

linux

bash

shell

I have a data file with many thousands of columns and rows. I want to delete the first column, which is in fact the row counter. I used this command in Linux:

cut -d " " -f 2- input.txt > output.txt 

but nothing changed in my output. Does anybody know why it does not work and what I should do?

This is what my input file looks like:

col1 col2 col3 col4 ...
     1 0 0 0 1
     2 0 1 0 1
     3 0 1 0 0
     4 0 0 0 0
     5 0 1 1 1
     6 1 1 1 0
     7 1 0 0 0
     8 0 0 0 0
     9 1 0 0 0
     10 1 1 1 1
     11 0 0 0 1
     .
     .
     .

I want my output to look like this:

col1 col2 col3 col4 ...
0 0 0 1
0 1 0 1
0 1 0 0
0 0 0 0
0 1 1 1
1 1 1 0
1 0 0 0
0 0 0 0
1 0 0 0
1 1 1 1
0 0 0 1
.
.
.

I also tried the sed command:

 sed '1d' input.file > output.file 

But it deletes the first row, not the first column.

Could anybody guide me?

asked Sep 27 '15 21:09 by zara




2 Answers

The idiomatic use of cut is

cut -f2- input > output

if your delimiter is a tab ("\t"), which is cut's default delimiter.

Or simply use awk, which works for both space and tab delimiters:

 awk '{$1=""}1' input | awk '{$1=$1}1' > output 

The first awk empties field 1 but leaves its separator behind; the second awk re-splits the line and rebuilds it, which removes that leading separator. The default output delimiter is a space; if you want a tab instead, add -v OFS="\t" to the second awk.
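To see the two stages at work, here is a small sketch run against a made-up sample. The temporary directory and the file names are assumptions for illustration, not from the question. The sample mimics the question's layout: a header without a row counter, and data rows that start with spaces. Note that the two-stage pipeline treats the header like any other line, so col1 is dropped as well; the last awk call is one possible variation, not part of the answer above, that leaves line 1 untouched.

```shell
# Hypothetical sample mimicking the question: the header has no row counter,
# data rows start with spaces followed by the counter.
tmp=$(mktemp -d)
printf 'col1 col2 col3 col4\n     1 0 0 0 1\n     2 0 1 0 1\n' > "$tmp/input.txt"

# Stage 1 empties field 1 (a leading separator remains);
# stage 2 re-splits the line and rebuilds it with tabs between fields.
awk '{$1=""}1' "$tmp/input.txt" | awk -v OFS='\t' '{$1=$1}1' > "$tmp/two_stage.txt"

# Variation (an assumption, not from the answer): print the header as is
# and strip the counter from the data rows only, in a single awk call.
awk 'NR==1 {print; next} {$1=""; sub(/^ +/, ""); print}' "$tmp/input.txt" > "$tmp/keep_header.txt"

cat "$tmp/two_stage.txt"
cat "$tmp/keep_header.txt"
```

Whether the header should be skipped depends on the file: if the header also carried a label for the counter column, the plain two-stage pipeline would already be the right choice.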

UPDATED

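Based on your updated input, the problem is the leading spaces: with -d " ", cut treats every single space as a delimiter, so each leading space produces an empty field and -f2- removes only the first of them. One way to address this is to strip the leading spaces before feeding the lines to cut: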

sed 's/^ *//' input | cut -d" " -f2- > output 

or use the awk alternative above, which works in this case as well.
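The effect of the leading spaces can be checked on a single line. This is a minimal sketch; the exact number of leading spaces (five here) is an assumption, since the question does not show it precisely.

```shell
# One data row in the question's layout: five leading spaces, then the counter.
line='     1 0 0 0 1'

# With -d" " every single space separates two fields, so the five leading
# spaces produce five empty fields and the counter lands in field 6.
counter=$(printf '%s\n' "$line" | cut -d" " -f6)

# -f2- therefore removes only the first empty field (one leading space).
unchanged=$(printf '%s\n' "$line" | cut -d" " -f2-)

# Stripping the leading spaces first makes the counter field 1 again.
fixed=$(printf '%s\n' "$line" | sed 's/^ *//' | cut -d" " -f2-)

# Brackets make the remaining leading spaces visible.
printf '[%s]\n[%s]\n[%s]\n' "$counter" "$unchanged" "$fixed"
```

The second result still contains the counter, preceded by four spaces, which is why the output looked almost the same as the input.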

answered Sep 22 '22 05:09 by karakfa


@karakfa I had CSV files, so I used "," as the separator (you can replace it with yours):

cut -d"," -f2- input.csv  > output.csv 

Then I used a loop to go over all files inside the directory:

# Files are in the directory tmp/; results go to new/ under the same names.
mkdir -p new
for f in tmp/*
do
    name=$(basename "$f")
    echo "processing file: $name"
    # keep all columns except the first one of each CSV file
    cut -d"," -f2- "$f" > "new/$name"
done
answered Sep 22 '22 05:09 by Fouad Djebbar