Data Extraction
GroupDocs.Parser for Java, which is part of Conholdate.Total for Java, is a powerful data extraction API. It lets you extract data from a wide range of document formats, including PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, ODT, ODS, RTF, EPUB, and many others.
Why Use GroupDocs.Parser?
- Accurate and fast Raw text extraction modes;
- No additional software is required to extract data from documents;
- Parsing documents by user-generated templates;
- A free online document data extraction app for simple cases and a powerful Java library for many data extraction scenarios;
- Image extraction;
- Document information extraction: file type, page count, etc.;
- Metadata extraction;
- Parsing PDF forms;
- Attachment extraction.