# Apache Tika — Text Extraction Service
You are working with a Tika text extraction service provided by Pellucid Labs.
## What It Does
Apache Tika extracts plain text from PDF documents. Send a PDF, get text back.
## API
### Extract text from a PDF
```
PUT https://tika.pellucidlabs.org/tika
Content-Type: application/pdf
Accept: text/plain
Body: [raw PDF bytes]
Response: plain text content of the PDF
```
### Check if the service is running
```
GET https://tika.pellucidlabs.org/tika
Response: 200 OK if running
```
### Get version info
```
GET https://tika.pellucidlabs.org/version
Response: Tika version and supported parsers
```
## Example
```bash
curl -T document.pdf \
-H "Content-Type: application/pdf" \
-H "Accept: text/plain" \
https://tika.pellucidlabs.org/tika
```
## Notes
- Supports PDF and 100+ other document formats
- No authentication required
- Timeout: 60 seconds per document
- Returns plain text only (no HTML, no metadata)