Open an lzo file in python, without decompressing the file

Question

I'm currently working on a 3rd year project involving data from Twitter. The department have provided me with .lzo's of a months worth of Twitter. The smallest is 4.9gb and when decompressed is 29gb so I'm trying to open the file and read as I'm going. Is this possible or do I need to decompress and work with the data that way?

EDIT: Have attempted to read it line by line and decompress the read line

UPDATE: Found a solution - reading the STDOUT of lzop -dc works like a charm

EDIT: Have attempted to read it line by line and decompress the read line

UPDATE: Found a solution - reading the STDOUT of lzop -dc works like a charm

eumiro · Accepted Answer

How about starting an lzop binary in a subprocess with -c switch and then read its STDOUT line by line?

cleg · Answer

I know only one library for LZO with Python — https://github.com/jd-boyd/python-lzo and it requires full decompression (moreover — it decompress contents in memory).

So I think you'll need to decompress files before work with them.

Open an lzo file in python, without decompressing the file

Tags:

python

lzo

DrugCrazed

2 Answers

eumiro

cleg

Recent Activity

Donate For Us

Open an lzo file in python, without decompressing the file

Tags:

python

lzo

DrugCrazed

2 Answers

eumiro

cleg

Related questions

Recent Activity

Donate For Us