I want to convert all the .odt
.doc
.xls
.pdf
files to .txt
files.
I want to convert these files to text files using a shell script or a perl script
Google docs does indeed open . odt files. I just checked that function. I start from Google Drive, right click, open with, Google docs.
To change into Word format, you can open the existing text file in either Wordpad or Word program by Pressing CTRL+O and browsing to the folder where text file is. Select it and open it. After opening, Save As *. docx to convert into Word format.
There's a program for odt files and alikes:
odt2txt - avaliable in repos.
$ unoconv --format=txt document1.odt
Should produce document1.txt.
OpenOffice has a built-in document converter capable of handling a bunch of formats- take a look at unoconv: http://dag.wieers.com/home-made/unoconv/
That being said, I have had some troubles getting that to work in the past- If you're having trouble, take a look at similar programs for AbiWord (another open source word processor).
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With