Automatic document field detection

Our parsers automatically recognize and detect the unique fields in uploaded documents.

Document language detection

Detect the language of scanned or printed documents, images, and PDFs.

Optical Character Recognition (OCR)

Convert scanned or printed documents, including images and PDFs, into machine-readable text.

Integration and automation

Our document parsers can be integrated into existing software systems or workflows.

Marketplace

  • Created by: paleicikas
  • Total usage: 0

Editorials

Extract data from editorial documents
A total of 15 fields will be extracted from the uploaded documents:
  • Title
  • Date published
  • Author
  • Name of publication
  • Main headline
  • Subheadline
  • Body text
  • Page number
  • Column location
  • Section
  • Keywords
  • Relevant Image(s)
  • Pull Quote(s)
  • Reference(s)
  • Summary
What format will be used for exporting the data?
  • JSON
  • CSV (Comma Separated Values)
  • Excel
  • XML (Extensible Markup Language)
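As a rough illustration of what an exported record might look like, the sketch below builds one record with the 15 fields listed above and serializes it to JSON, CSV, and XML using only the Python standard library. The field keys (such as `date_published`), the `editorial` root element, and the helper names are assumptions made for this example; the parser's actual export schema is not documented here. Excel export is left out because it would need a third-party library.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical keys derived from the 15 field names listed above;
# the real export may name or nest them differently.
FIELDS = [
    "title", "date_published", "author", "name_of_publication",
    "main_headline", "subheadline", "body_text", "page_number",
    "column_location", "section", "keywords", "relevant_images",
    "pull_quotes", "references", "summary",
]


def empty_record():
    """Return a record with every field present and empty."""
    return {name: "" for name in FIELDS}


def to_json(record):
    """Serialize one record as a JSON object."""
    return json.dumps(record, indent=2, ensure_ascii=False)


def to_csv(records):
    """Serialize records as CSV: a header row, then one row per record."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


def to_xml(record):
    """Serialize one record as XML with one child element per field."""
    root = ET.Element("editorial")  # assumed root element name
    for name in FIELDS:
        ET.SubElement(root, name).text = str(record[name])
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    record = empty_record()
    record["title"] = "Sample editorial"
    print(to_json(record))
    print(to_csv([record]))
    print(to_xml(record))
```

Keeping a single ordered field list means all three formats expose the same columns in the same order, which makes it easier to switch export formats without changing downstream code.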