How to find table region for camelot

Question

As mentioned in camelot, we can extract table from particular region like:

tables = camelot.read_pdf('table_regions.pdf', table_regions=['170,370,560,270'])

But how can I find these regions for my pdf.

Stefano Fiorucci - anakin87 · Accepted Answer

You can detect this regions, by some visual debugging.

https://camelot-py.readthedocs.io/en/master/user/advanced.html#visual-debugging

Benedict Witzenberger · Answer

I know it's a late reply - but I just came across a possible solution.

If you're looking for a automated extraction method, you could use lattice in a first step, retrieve the table boundaries with tables[0]._bbox and use these numbers in a second call to camelot.read_pdf() into the argument table_areas.

Be aware that they are in a weirdly sorted format for a bbox.

How to find table region for camelot

Tags:

python-camelot

Shubham Mishra

2 Answers

Stefano Fiorucci - anakin87

Benedict Witzenberger

Recent Activity

Donate For Us

How to find table region for camelot

Tags:

python-camelot

Shubham Mishra

2 Answers

Stefano Fiorucci - anakin87

Benedict Witzenberger

Related questions

Recent Activity

Donate For Us