Apache Tika RUNNING

PDF text extraction service for Pellucid Labs

What It Does

Converts PDF documents into raw text using the Apache Tika content analysis toolkit. Part of the Pellucid Labs entity extraction pipeline — PDFs go in, searchable text comes out.

Endpoints

PUT /tika       Extract text from a document
GET /tika       Server status
GET /version    Tika version info
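
As a rough illustration of the extraction endpoint listed above, the following Python sketch sends a PDF to PUT /tika and reads back plain text. The base URL is an assumption (Tika Server's usual default port, 9998, on localhost), since the real host for this service is not stated here. The helper names build_extract_request and extract_text are hypothetical, and the Accept: text/plain header reflects common Tika Server usage rather than anything confirmed for this deployment.

```python
import urllib.request

# Assumption: Tika Server's default port on localhost; replace with the
# actual host and port of this deployment.
BASE_URL = "http://localhost:9998"


def build_extract_request(pdf_bytes: bytes, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a PUT /tika request carrying the PDF bytes."""
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/tika",
        data=pdf_bytes,
        method="PUT",
        headers={
            "Content-Type": "application/pdf",  # tells the server the input is a PDF
            "Accept": "text/plain",             # asks for plain text back
        },
    )


def extract_text(pdf_bytes: bytes, base_url: str = BASE_URL, timeout: float = 60.0) -> str:
    """Send the PDF to the service and return the response body as text."""
    request = build_extract_request(pdf_bytes, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example usage (hypothetical file name; requires a running server):
#     with open("example.pdf", "rb") as f:
#         print(extract_text(f.read()))
```

Building the request separately from sending it keeps the HTTP details (method, path, headers) easy to inspect without a live server. GET /tika and GET /version can be queried the same way with urllib.request.urlopen and no request body.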