Apache Tika RUNNING

PDF text extraction service for Pellucid Labs

What It Does

Converts PDF documents into raw text using the Apache Tika content analysis toolkit. It is part of the Pellucid Labs entity extraction pipeline: PDFs go in, searchable text comes out.

Endpoints

PUT /tika      Extract text from a document

GET /tika      Server status

GET /version   Tika version info
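
The endpoints above can be exercised with a small client. The following is a minimal sketch, not the service's official client: the base URL (localhost on port 9998, the Tika server default) is an assumption about where this deployment listens, and the function names build_extract_request, extract_text, and get_text are hypothetical helpers introduced for illustration. It assumes the service accepts the PDF as the raw PUT body and returns plain text when asked for text/plain.

```python
import urllib.request

# Assumption: the service listens on localhost at Tika server's default port.
# Replace with the real host and port of the deployment.
BASE_URL = "http://localhost:9998"


def build_extract_request(pdf_bytes: bytes, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a PUT /tika request that carries the PDF as the request body."""
    return urllib.request.Request(
        url=f"{base_url}/tika",
        data=pdf_bytes,
        method="PUT",
        headers={
            "Content-Type": "application/pdf",  # tell the server what we are sending
            "Accept": "text/plain",             # ask for plain text back
        },
    )


def extract_text(pdf_path: str, base_url: str = BASE_URL, timeout: float = 60.0) -> str:
    """Send a local PDF file to PUT /tika and return the extracted text."""
    with open(pdf_path, "rb") as f:
        pdf_bytes = f.read()
    request = build_extract_request(pdf_bytes, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


def get_text(path: str, base_url: str = BASE_URL, timeout: float = 10.0) -> str:
    """Issue a GET against a path such as /tika or /version and return the body."""
    with urllib.request.urlopen(f"{base_url}{path}", timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With the service running, get_text("/tika") would return the status message, get_text("/version") the version string, and extract_text("report.pdf") the raw text of a local file (the file name here is only an example). Building the request separately from sending it keeps the request shape easy to inspect without a live server.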