Is infiniflow/ragflow safe?

ragflow is an AI python_package analyzed by SkillTotal's deterministic static scanner. The scan found malicious indicators (14 findings with evidence); treat it as unsafe until reviewed. It can: dynamic code execution, filesystem read, filesystem write, install time execution, mcp tools detected, network egress, prompt surface risk and shell execution — capabilities are what the code can do, not a verdict on intent. Risk score 90/100 (critical).

ragflow 0.26.3

python_package · https://github.com/infiniflow/ragflow

CRITICAL

/ 100 risk score

Snapshot · scanned Jul 3, 2026 · ragflow@0.26.3 · engine 0.24.0 / ruleset 25

Malicious indicators found

Why:

Prompt injection / instruction override
Sensitive-data access combined with network egress
Python shell/command execution

Malicious indicators detected — review findings before use.

Automated static-analysis result. It can contain false positives and false negatives, and is not a claim about the intent of infiniflow/ragflow's authors. Report a false positive.

Capabilities — what this component can do (not a risk score):

dynamic code executionfilesystem readfilesystem writeinstall time executionmcp tools detectednetwork egressprompt surface riskshell execution

Findings (14)

CRITICALSensitive-data access combined with network egressST-COMBO-EXFIL

The component both reads secret locations (keys, tokens) and can send data over the network.

common/data_source/box_connector.py:165-165

#     OAuthConfig(client_id="8suvn9ik7qezsq2dub0ye6ubox61081z", client_secret="QScv…[redacted, 32 chars]")

conf/service_conf.yaml:29-29

password: 'infi…[redacted, 21 chars]'

example/sdk/dataset_example.py:25-25

API_KEY = "ragf…[redacted, 40 chars]"

admin/client/http_client.py:22-22

import requests

admin/client/http_client.py:91-91

session = requests.Session()

admin/client/ragflow_cli.py:27-27

import requests

Why it matters: That combination is the classic path for quietly stealing credentials off your machine.

Fix: Verify that secrets read from disk are never transmitted off-host without explicit, auditable user consent.

HIGHUnsafe deserializationST-DESERIALIZE-PY

It loads data with a format that can rebuild arbitrary objects (e.g. pickle, or unsafe YAML).

api/db/services/compilation_template_service.py:213-213

template = yaml.load(f) or {}

common/config_utils.py:34-34

return yaml.load(f)

tools/scripts/mysql_migration.py:78-78

config = yaml.load(f)

Why it matters: Feeding such a loader untrusted data can execute code hidden inside that data.

Fix: Deserialize untrusted data with a safe format/loader: JSON, or yaml.safe_load / Loader=SafeLoader. Reserve pickle/marshal for data you fully control.

HIGHNode.js dynamic code executionST-DYN-NODE

The code turns strings into live code at runtime (eval / new Function / exec).

web/public/pdfjs-dist/pdf.worker.min.js:22-22

!function webpackUniversalModuleDefinition(e,t){"object"==typeof exports&&"object"==typeof module?module.exports=t():"function"==typeof define&&define.amd?define("pdfjs-dist/build/pdf.worker",[],t):"object"==typeof exports?exports["pdfjs-di …

Why it matters: If those strings aren't fixed and trusted, they become a way to run arbitrary code.

Fix: Avoid evaluating dynamically constructed code; if unavoidable, ensure the input is a trusted constant and never derived from external data.

HIGHEmbedded secret / credentialST-SECRET-EMBEDDED

A hardcoded credential (API key, token, or private key) is shipped in the code.

common/data_source/box_connector.py:165-165

#     OAuthConfig(client_id="8suvn9ik7qezsq2dub0ye6ubox61081z", client_secret="QScv…[redacted, 32 chars]")

conf/service_conf.yaml:29-29

password: 'infi…[redacted, 21 chars]'

example/sdk/dataset_example.py:25-25

API_KEY = "ragf…[redacted, 40 chars]"

sdk/python/test.py:3-3

rag_object = RAGFlow(api_key="ragf…[redacted, 51 chars]", base_url="http://localhost:9222")

Why it matters: Anyone who gets the package gets the secret — rotate it and load secrets at runtime instead.

Fix: Remove the secret from the code, rotate it immediately, and load credentials from the environment or a secrets manager at runtime.

HIGHNode.js shell/command executionST-SHELL-NODE

The component can run operating-system commands or spawn processes.

web/public/pdfjs-dist/pdf.worker.min.js:22-22

!function webpackUniversalModuleDefinition(e,t){"object"==typeof exports&&"object"==typeof module?module.exports=t():"function"==typeof define&&define.amd?define("pdfjs-dist/build/pdf.worker",[],t):"object"==typeof exports?exports["pdfjs-di …

web/src/components/floating-chat-widget-markdown.tsx:312-312

const match = /language-(\w+)/.exec(className || '');

web/src/components/highlight-markdown/index.tsx:49-49

const match = /language-(\w+)/.exec(className || '');

web/src/components/jsonjoy-builder/utils/json-validator.ts:56-56

const match = propPattern.exec(line);

web/src/components/markdown-content/index.tsx:277-277

const match = /language-(\w+)/.exec(className || '');

web/src/components/markdown-content/reference-utils.ts:16-16

while ((match = currentReg.exec(text)) !== null) {

web/src/components/next-markdown-content/index.tsx:414-414

const match = /language-(\w+)/.exec(className || '');

web/src/constants/setting.ts:38-38

const match = /^GMT(?<sign>\+|-)(?<hours>\d{2}):(?<minutes>\d{2})$/i.exec(

web/src/pages/agent/canvas/node/variable-display.tsx:35-35

while ((match = regex.exec(content)) !== null) {

web/src/pages/agent/form/components/prompt-editor/utils.ts:45-45

const match = PromptVariableLeadingPathRegex.exec(text);

web/src/pages/agent/form/components/prompt-editor/variable-picker-plugin.tsx:373-373

const match = triggerRegex.exec(text);

web/src/pages/agent/form/components/prompt-editor/variable-picker-plugin.tsx:589-589

while ((match = regex.exec(line)) !== null) {

web/src/pages/next-search/markdown-content/index.tsx:265-265

const match = /language-(\w+)/.exec(className || '');

web/src/pages/skills/components/markdown-viewer.tsx:60-60

const match = /language-(\w+)/.exec(className || '');

Why it matters: Powerful and often legitimate — confirm the commands aren't built from untrusted input.

Fix: Confirm the command and its arguments are fully controlled and not derived from untrusted input; prefer execFile with an argument array.

HIGHPython shell/command executionST-SHELL-PY

The component can run operating-system commands or spawn processes.

agent/sandbox/executor_manager/services/execution.py:233-233

tar_proc = await asyncio.create_subprocess_exec("tar", "czf", "-", "-C", workdir, code_name, runner_name, str(bundle["args_name"]), stdout=asyncio.subprocess.PIPE)

agent/sandbox/executor_manager/services/execution.py:236-238

docker_proc = await asyncio.create_subprocess_exec(
            "docker", "exec", "-i", container, "tar", "xzf", "-", "-C", f"/workspace/{task_id}", stdin=asyncio.subprocess.PIPE, stderr=asyncio.subprocess.PIPE
        )

agent/sandbox/executor_manager/utils/common.py:22-22

proc = await asyncio.create_subprocess_exec(*args, stdout=asyncio.subprocess.PIPE, stderr=asyncio.subprocess.PIPE)

agent/sandbox/providers/local.py:128-138

process = subprocess.Popen(
            command,
            cwd=instance_dir,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            text=True,
            encoding="utf-8",
            errors="replace", …

api/channels/whatsapp/gateway.py:110-116

proc = await asyncio.create_subprocess_exec(
                npm,
                "install",
                "--no-fund",
                "--no-audit",
                cwd=str(gateway_dir),
            )

api/channels/whatsapp/gateway.py:141-145

self._process = await asyncio.create_subprocess_exec(
            *cfg.command,
            cwd=cfg.cwd,
            env=env,
        )

api/utils/file_utils.py:179-184

proc = subprocess.run(
                cmd,
                capture_output=True,
                text=True,
                timeout=GHOSTSCRIPT_TIMEOUT_SEC,
            )

common/doc_store/infinity_conn_base.py:891-896

result = subprocess.run(
                psql_cmd,
                capture_output=True,
                text=True,
                timeout=10,  # 10 second timeout
            )

common/misc_utils.py:241-241

subprocess.check_call([sys.executable, "-m", "pip", "install", *pkg_names])

common/versions.py:39-39

version_info = subprocess.check_output(["git", "describe", "--tags", "--match=v*", "--first-parent", "--always"]).strip().decode("utf-8")

run_tests.py:183-183

result = subprocess.run(cmd, check=False)

tools/hooks/check_files.py:27-32

proc = subprocess.run(
        ["git", "diff", "--cached", "--name-only", "--diff-filter=ACMR"],
        check=True,
        capture_output=True,
        text=True,
    )

tools/hooks/check_files.py:150-155

proc = subprocess.run(
        ["git", "ls-files"],
        check=True,
        capture_output=True,
        text=True,
    )

Why it matters: Powerful and often legitimate — confirm the commands aren't built from untrusted input.

Fix: Confirm the command and its arguments are fully controlled and not derived from untrusted input; avoid shell=True.

MEDIUMServer bound to all network interfacesST-EXPOSE-BIND

A server is bound to all network interfaces (0.0.0.0), not just your own machine.

admin/server/admin_server.py:71-71

hostname="0.0.0.0",

api/channels/line/channel.py:37-37

webhook_host: str = "0.0.0.0"

api/channels/line/channel.py:222-222

webhook_host=str(cfg.get("webhook_host", "0.0.0.0")),

api/channels/wecom/channel.py:35-35

webhook_host: str = "0.0.0.0"

api/channels/wecom/channel.py:560-560

webhook_host=str(cfg.get("webhook_host", "0.0.0.0")),

api/ragflow_server.py:103-103

debugpy.listen(("0.0.0.0", RAGFLOW_DEBUGPY_LISTEN))

api/utils/health_utils.py:263-263

if "0.0.0.0" in url:

api/utils/health_utils.py:264-264

url = url.replace("0.0.0.0", "127.0.0.1")

test.py:15-15

uvicorn.run(app, host="0.0.0.0", port=8000)

Why it matters: Without authentication, other hosts on the network can reach it.

Fix: Bind to 127.0.0.1 for local-only use, or require authentication and restrict access if remote exposure is intended.

MEDIUMPython filesystem readST-FS-PY-READ

The component reads files from disk.

agent/component/base.py:218-218

with open(param_validation_path, "r") as fin:

agent/component/browser.py:595-595

blob = path.read_bytes()

agent/component/docs_generator.py:251-251

with open(file_path, "rb") as f:

agent/component/docs_generator.py:410-410

with open(file_path, "rb") as f:

agent/component/docs_generator.py:432-432

with open(file_path, "rb") as f:

agent/component/docs_generator.py:583-583

with open(file_path, "rb") as f:

agent/component/message.py:525-525

with open(tmp_name, "rb") as f:

agent/sandbox/providers/local.py:340-340

"content_b64": base64.b64encode(path.read_bytes()).decode("ascii"),

api/db/init_data.py:122-122

with open(template_path, "r", encoding="utf-8") as f:

api/db/init_data.py:157-157

with open(os.path.join(get_project_base_directory(), "conf", "system_settings.json"), "r") as f:

api/db/services/compilation_template_service.py:212-212

with open(template_path, "r", encoding="utf-8") as f:

api/utils/crypt.py:31-31

rsa_key = RSA.importKey(Path(file_path).read_text(), "Welcome")

api/utils/crypt.py:40-40

rsa_key = RSA.importKey(Path(file_path).read_text(), "Welcome")

api/utils/crypt.py:56-56

pem = Path(file_path).read_text()

common/config_utils.py:32-32

with open(conf_path) as f:

common/doc_store/es_conn_base.py:51-51

with open(fp_mapping, "r") as f:

common/doc_store/es_conn_base.py:144-144

with open(fp_mapping, "r") as f:

common/doc_store/infinity_conn_base.py:198-198

with open(fp_mapping) as f:

common/doc_store/infinity_conn_base.py:408-408

with open(fp_mapping) as f:

common/doc_store/infinity_conn_base.py:522-522

with open(fp_mapping) as f:

common/doc_store/infinity_conn_base.py:806-806

with open(fp_mapping) as f:

common/settings.py:249-249

with open(os.path.join(get_project_base_directory(), "conf", "llm_factories.json"), "r") as f:

common/versions.py:29-29

with open(version_path, "r") as f:

deepdoc/parser/docling_parser.py:374-374

with open(src_path, "rb") as f:

Why it matters: Usually legitimate, but worth confirming it can't be steered into reading sensitive files.

Fix: Confirm which files are read and that paths cannot be influenced by untrusted input to reach sensitive locations.

MEDIUMPython filesystem write/deleteST-FS-PY-WRITE

The component writes or deletes files on disk.

agent/component/browser.py:238-238

with open(local_path, "wb") as f:

agent/component/browser.py:250-250

os.remove(local_path)

agent/component/browser.py:259-259

os.remove(local_path)

agent/component/browser.py:561-561

with open(local_path, "wb") as f:

agent/component/browser.py:710-710

shutil.rmtree(profile_dir, ignore_errors=True)

agent/component/docs_generator.py:159-159

os.remove(file_path)

agent/component/docs_generator.py:212-212

with open(file_path, "wb") as f:

agent/component/docs_generator.py:428-428

with open(temp_file, "wb") as f:

agent/component/docs_generator.py:609-609

os.remove(header_path)

agent/component/message.py:530-530

os.remove(tmp_name)

agent/sandbox/executor_manager/services/execution.py:218-218

with open(code_path, "wb") as f:

agent/sandbox/executor_manager/services/execution.py:222-222

with open(runner_path, "w", encoding="utf-8") as f:

agent/sandbox/executor_manager/services/execution.py:226-226

with open(args_path, "w", encoding="utf-8") as f:

agent/sandbox/providers/local.py:178-178

shutil.rmtree(instance_dir)

agent/sandbox/providers/local.py:273-273

script_path.write_text(build_python_wrapper(code, args_json), encoding="utf-8")

agent/sandbox/providers/local.py:277-277

script_path.write_text(build_javascript_wrapper(code, args_json), encoding="utf-8")

api/apps/restful_apis/chat_api.py:1075-1075

os.remove(temp_audio_path)

api/apps/restful_apis/chat_api.py:1089-1089

os.remove(temp_audio_path)

common/config_utils.py:43-43

with open(conf_path, "w") as f:

common/data_source/google_drive/connector.py:1255-1255

with open("stats.txt", "w") as f:

common/token_utils.py:37-37

shutil.copyfile(bundled_encoding_path, cached_encoding_path)

deepdoc/parser/docling_parser.py:540-540

with open(tmp_pdf, "wb") as f:

deepdoc/parser/mineru_parser.py:201-201

with zip_ref.open(member) as src, open(dest_path, "wb") as dst:

deepdoc/parser/mineru_parser.py:202-202

shutil.copyfileobj(src, dst)

deepdoc/parser/mineru_parser.py:333-333

with open(output_zip_path, "wb") as f:

Why it matters: Usually legitimate, but worth confirming the paths can't be controlled by untrusted input.

Fix: Confirm which files are written/deleted and that paths cannot be influenced by untrusted input.

MEDIUMnpm prepare hookST-INSTALL-NPM-PREPARE

package.json has a 'prepare' script (runs on git/local installs and before publishing).

web/package.json:12-12

"prepare": "cd .. && git rev-parse --git-dir >/dev/null 2>&1 && lefthook install || true",

Why it matters: Usually a build step, but confirm it doesn't fetch or run remote code.

Fix: Usually a legitimate build step; confirm it only builds and does not fetch or execute remote code.

MEDIUMNode.js network egressST-NET-NODE

The component makes outbound network requests.

api/channels/whatsapp/gateway-node/index.js:1-1

import http from 'node:http';

web/public/pdfjs-dist/pdf.worker.min.js:22-22

!function webpackUniversalModuleDefinition(e,t){"object"==typeof exports&&"object"==typeof module?module.exports=t():"function"==typeof define&&define.amd?define("pdfjs-dist/build/pdf.worker",[],t):"object"==typeof exports?exports["pdfjs-di …

web/src/components/document-preview/hooks.ts:8-8

import axios from 'axios';

web/src/components/document-preview/hooks.ts:88-88

const ret = await axios.get(api);

web/src/components/document-preview/hooks.ts:105-105

const ret = await axios.get(api, {

web/src/components/document-preview/hooks.ts:159-159

const { data } = await axios.get(url, { headers: httpHeaders });

web/src/components/image/index.tsx:49-49

item.promise = fetch(url, { headers: { [Authorization]: authorization } })

web/src/components/ui/audio-button.tsx:257-257

const response = await fetch(api.chatsTranscriptions, {

web/src/constants/agent.tsx:20-20

[ProgrammingLanguage.Javascript]: `const axios = require('axios');

web/src/constants/agent.tsx:23-23

const response = await axios.get('https://github.com/infiniflow/ragflow');

web/src/hooks/logic-hooks.ts:18-18

import axios from 'axios';

web/src/hooks/logic-hooks.ts:165-165

const ret = await axios.get('/conf.json');

web/src/hooks/logic-hooks.ts:249-249

const response = await fetch(url, {

web/src/hooks/logic-hooks.ts:351-351

const response = await fetch(url, {

web/src/hooks/use-send-message.ts:140-140

const response = await fetch(url, {

web/src/pages/admin/login.tsx:1-1

import { type AxiosResponseHeaders } from 'axios';

web/src/pages/skills/components/upload-modal.tsx:359-359

const response = await fetch(url, { headers });

web/src/pages/skills/components/upload-modal.tsx:496-496

const response = await fetch(downloadUrl);

web/src/pages/skills/hooks.ts:623-623

const response = await fetch('/api/v1/skills/search', {

web/src/pages/skills/hooks.ts:932-932

const indexResponse = await fetch('/api/v1/skills/index', {

web/src/pages/skills/hooks.ts:1540-1540

const response = await fetch('/api/v1/skills/status', {

web/src/pages/user-setting/setting-model/modal/provider-modal/hooks/use-list-models-picker.ts:68-68

* - the API fetch (with edit-mode pre-check seeding) and modal-reset effect

web/src/services/admin-service.ts:2-2

import axios from 'axios';

web/src/services/admin-service.ts:15-15

const request = axios.create({

web/src/utils/next-request.ts:9-9

import axios from 'axios';

Why it matters: Usually legitimate, but confirm the destinations are expected and no sensitive data leaves.

Fix: Confirm the destination hosts are expected and that no sensitive data is sent off-host.

MEDIUMPython network egressST-NET-PY

The component makes outbound network requests.

admin/client/http_client.py:22-22

import requests

admin/client/http_client.py:91-91

session = requests.Session()

admin/client/ragflow_cli.py:27-27

import requests

admin/client/ragflow_cli.py:60-60

self.session = requests.Session()

admin/client/ragflow_client.py:21-21

import urllib.parse

admin/client/ragflow_client.py:483-483

encoded_key: str = urllib.parse.quote(key, safe="")

admin/server/config.py:25-25

from urllib.parse import urlparse

admin/server/config.py:237-237

parsed = urlparse(url)

agent/component/browser.py:29-29

from urllib.error import HTTPError, URLError

agent/component/browser.py:30-30

from urllib.parse import unquote, urlparse

agent/component/browser.py:31-31

from urllib.request import Request, urlopen

agent/component/browser.py:171-171

parsed = urlparse(token)

agent/component/browser.py:181-181

name = unquote(m.group(1).strip().strip('"'))

agent/component/browser.py:195-195

parsed = urlparse(url)

agent/component/browser.py:197-197

name = unquote(raw_name).strip()

agent/component/browser.py:227-227

req = Request(url, headers={"User-Agent": "RAGFlow-Browser-Node/1.0"})

agent/component/browser.py:228-228

with urlopen(req, timeout=30) as response:

agent/component/invoke.py:23-23

from urllib.parse import urlparse

agent/component/invoke.py:25-25

import requests

agent/component/invoke.py:196-196

parsed = urlparse(url)

agent/sandbox/providers/self_managed.py:30-30

import requests

agent/sandbox/providers/self_managed.py:148-148

response = requests.post(url, json=payload, timeout=exec_timeout, headers={"Content-Type": "application/json"})

agent/sandbox/providers/self_managed.py:210-210

response = requests.get(url, timeout=5)

Why it matters: Usually legitimate, but confirm the destinations are expected and no sensitive data leaves.

Fix: Confirm the destination hosts are expected and that no sensitive data is sent off-host.

MEDIUMPrompt injection / instruction overrideST-PROMPT-INJECTION

Text that tries to override an AI's instructions was found (e.g. 'ignore previous instructions').

internal/cli/filesystem/skill_hub/security/patterns.go:74-74

Description: "prompt injection: ignore previous instructions",

internal/cli/filesystem/skill_hub/security/patterns.go:81-81

Description: "DAN (Do Anything Now) jailbreak attempt",

internal/cli/filesystem/skill_install.go:400-400

- Prompt injection attempts (DAN mode, ignore instructions, etc.)

Why it matters: Embedded in a component, it can hijack an agent's behavior or hide actions from you.

Fix: Treat embedded instructions as untrusted. Review whether this component attempts to manipulate an agent's behavior or hide actions from users.

LOWMCP tool surface detectedST-MCP-DETECTED

An MCP tool surface (manifest or tool definitions) was found.

rag/llm/tool_decorator.py:16-16

"""Lightweight ``@tool`` decorator and matching ``ToolCallSession`` adapter.

rag/llm/tool_decorator.py:23-23

@tool

rag/llm/tool_decorator.py:162-162

``@tool`` callable apart from a raw schema dict.

rag/llm/tool_decorator.py:176-176

"""Adapter that lets a list of ``@tool``-decorated callables satisfy the

rag/llm/tool_decorator.py:191-191

raise TypeError(f"{getattr(fn, '__name__', fn)!r} is not a @tool-decorated callable")

Why it matters: Just context — review which tools it offers and their permissions.

Fix: Review the declared MCP tools and their permissions.

Check your own component

Run the same evidence-backed scan on any MCP server, agent skill, or package.

Scan your own component

Or get notified if this component's risk changes:

How we determine this: deterministic static analysis (regex + AST), evidence-anchored, no code execution. Methodology →