We're exploring using the unstructured API at work.
We're running quay.io/unstructured-io/unstructured-api:c9b74d4 on a "Pro" (private service) Render instance (i.e. 4GB RAM)
We're using the service to process PDFs with the following parameters strategy=hi_res, pdf_infer_table_structure=true and skip_infer_table_types=[]. We're also using parallel mode via UNSTRUCTURED_PARALLEL_MODE_ENABLED=true (using the defaults for the other environment vars).
We've seen the service fall over several times due to OOM, and looking at metrics it looks as if there are resources not being freed after processing runs.

Each spike represents a processing run, with about 10 minutes between each.
We're exploring using the unstructured API at work.
We're running
quay.io/unstructured-io/unstructured-api:c9b74d4on a "Pro" (private service) Render instance (i.e. 4GB RAM)We're using the service to process PDFs with the following parameters
strategy=hi_res,pdf_infer_table_structure=trueandskip_infer_table_types=[]. We're also using parallel mode viaUNSTRUCTURED_PARALLEL_MODE_ENABLED=true(using the defaults for the other environment vars).We've seen the service fall over several times due to OOM, and looking at metrics it looks as if there are resources not being freed after processing runs.
Each spike represents a processing run, with about 10 minutes between each.