Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Java library for reading Word documents

Tags:

java

ms-word

Is there an open-source Java library for reading Word documents (both .docx and the older .doc format)?

Read-only access if sufficient; I do not need to modify the Word documents using Java. However, I would like to have access to images and style information.

EDIT

I've checked out Apache POI, but it doesn't look like it is being actively maintained. See http://poi.apache.org/hwpf/index.html:

At the moment we unfortunately do not have someone taking care for HWPF and fostering its development.

like image 288
Tony the Pony Avatar asked Mar 02 '26 23:03

Tony the Pony


1 Answers

Apache POI HWPF for .doc and XWPF for .docx files

like image 178
Guido Avatar answered Mar 04 '26 13:03

Guido