Skip to content

Commit 5b36a11

Browse files
infinityCuratorRAGFlow Curator
andauthored
Synchronize documentation. (#432)
Co-authored-by: RAGFlow Curator <infinitydocs.curator@users.noreply.github.com>
1 parent 523e554 commit 5b36a11

File tree

1 file changed

+10
-0
lines changed

1 file changed

+10
-0
lines changed

website/docs/faq.mdx

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -536,5 +536,15 @@ uv pip install -U "mineru[core]" -i https://mirrors.aliyun.com/pypi/simple
536536
4. In the web UI, navigate to the **Configuration** page of your dataset. Click **Built-in** in the **Ingestion pipeline** section, select a chunking method from the **Built-in** dropdown, which supports PDF parsing, and slect **MinerU** in **PDF parser**.
537537
5. If you use a custom ingestion pipeline instead, you must also complete the first three steps before selecting **MinerU** in the **Parsing method** section of the **Parser** component.
538538

539+
---
540+
541+
### How to configure MinerU-specific settings?
539542

543+
1. Set `MINERU_EXECUTABLE` (default: `mineru`) to the path of the MinerU executable.
544+
2. Set `MINERU_DELETE_OUTPUT` to `0` to keep MinerU's output. (Default: `1`, which deletes temporary output)
545+
3. Set `MINERU_OUTPUT_DIR` to specify the output directory for MinerU.
546+
4. Set `MINERU_BACKEND` to `"pipeline"`. (Options: `"pipeline"` (default) | `"vlm-transformers"`)
540547

548+
:::tip NOTE
549+
For information about other environment variables natively supported by MinerU, see [here](https://opendatalab.github.io/MinerU/usage/cli_tools/#environment-variables-description).
550+
:::

0 commit comments

Comments
 (0)