Memory leak

We're exploring using the unstructured API at work. 

We're running `quay.io/unstructured-io/unstructured-api:c9b74d4` on a ["Pro" (private service) Render instance](https://render.com/pricing#compute) (i.e. 4 GB RAM).

We're using the service to process PDFs with the following parameters: `strategy=hi_res`, `pdf_infer_table_structure=true` and `skip_infer_table_types=[]`. We're also using parallel mode via `UNSTRUCTURED_PARALLEL_MODE_ENABLED=true` (with the defaults for the other environment variables).
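For reference, here is a minimal sketch of the kind of request we send. The form field values are the ones listed above; the host, port, helper names (`build_form_fields`, `partition_pdf`) and timeout are assumptions for illustration, and the `"[]"` string encoding of `skip_infer_table_types` is how we pass an empty list as a form field, not something confirmed as the only accepted form. `UNSTRUCTURED_PARALLEL_MODE_ENABLED` is set on the server container, not in the request.

```python
from typing import Any, Dict

# Assumption: the container is reachable on localhost:8000; adjust for your deployment.
API_URL = "http://localhost:8000/general/v0/general"


def build_form_fields() -> Dict[str, str]:
    """Return the multipart form fields described in this issue."""
    return {
        "strategy": "hi_res",
        "pdf_infer_table_structure": "true",
        # Empty list, sent as a plain form string (assumed encoding).
        "skip_infer_table_types": "[]",
    }


def partition_pdf(path: str, api_url: str = API_URL, timeout: float = 600.0) -> Any:
    """POST one PDF to the API and return the parsed JSON response.

    Hypothetical helper; requires the third-party `requests` package.
    """
    import requests  # imported lazily so the field builder works without it

    with open(path, "rb") as fh:
        response = requests.post(
            api_url,
            files={"files": (path, fh, "application/pdf")},
            data=build_form_fields(),
            timeout=timeout,
        )
    response.raise_for_status()
    return response.json()
```

A processing run is then a loop of `partition_pdf("some.pdf")` calls over a batch of documents.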

We've seen the service fall over several times due to OOM. Looking at the metrics, memory does not appear to be released after each processing run, so usage climbs from one run to the next.

![image](https://github.com/Unstructured-IO/unstructured-api/assets/12185627/6ad86875-36fd-4004-8060-3f317c87ad09)

Each spike represents a processing run, with about 10 minutes between each.
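To make the pattern in the graph checkable, here is a rough sketch of how the idle memory readings between runs could be compared. It assumes you can export one memory reading (in MB) taken while the service is idle before each run; the function names and the 50 MB tolerance are hypothetical, and this is only a heuristic, not a definitive leak detector.

```python
from typing import List, Sequence


def baseline_growth(idle_mb: Sequence[float]) -> List[float]:
    """Difference between consecutive idle readings, one value per completed run."""
    return [later - earlier for earlier, later in zip(idle_mb, idle_mb[1:])]


def looks_like_leak(idle_mb: Sequence[float], tolerance_mb: float = 50.0) -> bool:
    """True if the idle baseline rose by more than the tolerance after every run.

    Needs at least three readings (two runs) to say anything useful.
    """
    deltas = baseline_growth(idle_mb)
    return len(deltas) >= 2 and all(delta > tolerance_mb for delta in deltas)
```

If the baseline only jumps once and then stays flat, that points more towards models being loaded and cached on first use than towards a leak.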

Memory leak #197
