Is firecrawl/firecrawl safe?

repo is an AI ai_component analyzed by SkillTotal's deterministic static scanner. The scan found malicious indicators (14 findings with evidence); treat it as unsafe until reviewed. It can: dynamic code execution, filesystem read, filesystem write, install time execution, mcp tools detected, network egress, prompt surface risk and shell execution — capabilities are what the code can do, not a verdict on intent. Risk score 30/100 (medium).

repo

ai_component · https://github.com/firecrawl/firecrawl

MEDIUM

/ 100 risk score

Snapshot · scanned Jul 3, 2026 · repo@f4464e1 · engine 0.24.0 / ruleset 25

Malicious indicators found

Why:

Prompt injection / instruction override
Python shell/command execution
Node.js shell/command execution

Malicious indicators detected — review findings before use.

Automated static-analysis result. It can contain false positives and false negatives, and is not a claim about the intent of firecrawl/firecrawl's authors. Report a false positive.

Capabilities — what this component can do (not a risk score):

dynamic code executionfilesystem readfilesystem writeinstall time executionmcp tools detectednetwork egressprompt surface riskshell execution

Findings (14)

HIGHNode.js dynamic code executionST-DYN-NODE

The code turns strings into live code at runtime (eval / new Function / exec).

apps/api/src/controllers/v2/parse-upload.ts:193-193

const result = await getRedisConnection().eval(

apps/api/src/lib/keyless.ts:145-145

const result = (await redisRateLimitClient.eval(

apps/api/src/lib/keyless.ts:190-190

const total = (await redisRateLimitClient.eval(

Why it matters: If those strings aren't fixed and trusted, they become a way to run arbitrary code.

Fix: Avoid evaluating dynamically constructed code; if unavoidable, ensure the input is a trusted constant and never derived from external data.

HIGHUntrusted-instruction surface with file access and network egressST-FLOW-TRIFECTA

The component is exposed to untrusted instructions (a prompt-injection surface) and also can read files and send data over the network. Together these are the 'lethal trifecta' an attacker needs to turn an injected instruction into data exfiltration.

apps/api/openapi.json:153-153

"description": "Headers to send to the webhook URL.",

apps/api/openapi.json:797-797

"description": "Headers to send to the webhook URL.",

.github/scripts/check_version_has_incremented.py:48-48

version_file = Path(file_path).read_text()

.github/scripts/check_version_has_incremented.py:62-62

with open(file_path, 'r') as file:

.github/scripts/check_version_has_incremented.py:41-41

import requests

.github/scripts/check_version_has_incremented.py:56-56

response = requests.get(f"https://pypi.org/pypi/{package_name}/json")

Fix: Remove the injectable instruction surface, or constrain the component so untrusted input cannot drive file reads and outbound network requests.

HIGHnpm install-time lifecycle hookST-INSTALL-NPM

package.json runs scripts automatically when the package is installed.

apps/api/native/package.json:52-52

"install": "pnpm build"

Why it matters: Install scripts are a favorite supply-chain foothold — they execute on every machine that installs the package.

Fix: Inspect the hook command. Install-time scripts are a common supply chain execution vector; ensure they do nothing beyond a documented build step.

HIGHNode.js shell/command executionST-SHELL-NODE

The component can run operating-system commands or spawn processes.

.github/scripts/audit-ci-vuln-scan.mjs:3-3

import { spawnSync } from "node:child_process";

.github/scripts/audit-ci-vuln-scan.mjs:42-42

return spawnSync(command, args, {

apps/api/scripts/knip-with-typescript5.cjs:3-3

const { spawnSync } = require("node:child_process");

apps/api/scripts/knip-with-typescript5.cjs:38-38

const result = spawnSync(

apps/api/src/controllers/v2/parse-upload.ts:227-227

.exec();

apps/api/src/harness.ts:2-2

import { type ChildProcess, spawn } from "child_process";

apps/api/src/harness.ts:258-258

child = spawn("cmd", ["/c", command], {

apps/api/src/harness.ts:264-264

child = spawn("sh", ["-c", command], {

apps/api/src/harness.ts:273-273

child = spawn(cmd, args, {

apps/api/src/harness.ts:359-359

const killer = spawn(

apps/api/src/lib/cclog.ts:142-142

await pipeline.exec();

apps/api/src/lib/concurrency-limit.ts:70-70

await pipeline.exec();

apps/api/src/lib/concurrency-limit.ts:130-130

await pipeline.exec();

apps/api/src/lib/crawl-redis.ts:138-138

await pipeline.exec();

apps/api/src/lib/crawl-redis.ts:160-160

await pipeline.exec();

apps/api/src/lib/crawl-redis.ts:187-187

await pipeline.exec();

apps/api/src/lib/crawl-redis.ts:484-484

const results = await pipeline.exec();

apps/api/src/lib/crawl-redis.ts:492-492

await uniquePipeline.exec();

apps/api/src/lib/crawl-redis.ts:536-536

const results = await pipeline.exec();

apps/api/src/lib/fire-privacy-chunker.ts:49-49

while ((match = SENTENCE_END.exec(window)) !== null) {

apps/api/src/scraper/WebScraper/crawler.ts:574-574

const results = await pipeline.exec();

apps/api/src/services/index-cache.ts:224-224

const results = await pipeline.exec();

Why it matters: Powerful and often legitimate — confirm the commands aren't built from untrusted input.

Fix: Confirm the command and its arguments are fully controlled and not derived from untrusted input; prefer execFile with an argument array.

HIGHPython shell/command executionST-SHELL-PY

The component can run operating-system commands or spawn processes.

.github/scripts/resolve_api_image_version.py:29-29

return subprocess.check_output(["git", *args], text=True).strip()

Why it matters: Powerful and often legitimate — confirm the commands aren't built from untrusted input.

Fix: Confirm the command and its arguments are fully controlled and not derived from untrusted input; avoid shell=True.

MEDIUMNode.js filesystem readST-FS-NODE-READ

The component reads files from disk.

apps/api/src/scraper/scrapeURL/lib/mock.ts:64-64

const load = JSON.parse(await fs.readFile(mockPath, "utf8"));

apps/api/src/services/system-monitor.ts:51-51

const data = fs.readFileSync("/sys/fs/cgroup/memory.current", "utf8");

apps/api/src/services/system-monitor.ts:56-56

const data = fs.readFileSync("/sys/fs/cgroup/memory.max", "utf8").trim();

apps/api/src/services/system-monitor.ts:114-114

const data = fs.readFileSync("/sys/fs/cgroup/cpu.stat", "utf8");

apps/api/src/services/system-monitor.ts:126-126

const data = fs.readFileSync(cpusetPath, "utf8").trim();

apps/api/utils/logview.js:4-4

// const logs = fs.readFileSync("7a373219-0eb4-4e47-b2df-e90e12afd5c1.log", "utf8")

apps/api/utils/logview.js:15-15

].flatMap(x => JSON.parse(fs.readFileSync(x, "utf8"))).map(x => x.jsonPayload);

examples/scrape_and_analyze_airbnb_data_e2b/index.ts:59-59

return fs.readFileSync('airbnb_listings.json', 'utf8')

examples/scrape_and_analyze_airbnb_data_e2b/scraping.ts:91-91

const listingsData = fs.readFileSync('airbnb_listings.json', 'utf8')

Why it matters: Usually legitimate, but worth confirming it can't be steered into reading sensitive files.

Fix: Confirm which files are read and that paths cannot be influenced by untrusted input to reach sensitive locations.

MEDIUMNode.js filesystem write/deleteST-FS-NODE-WRITE

The component writes or deletes files on disk.

apps/api/scripts/knip-with-typescript5.cjs:22-22

fs.unlinkSync(linkPath);

apps/api/src/lib/extract/completions/batchExtract.ts:133-133

// await fs.writeFile(

apps/api/src/lib/extract/completions/singleAnswer.ts:121-121

// await fs.writeFile(

apps/api/src/lib/extract/extraction-service.ts:1047-1047

// fs.writeFile(

apps/api/src/lib/extract/reranker.ts:48-48

// await fs.writeFile(

apps/api/src/lib/extract/reranker.ts:96-96

// fs.writeFile(

apps/api/src/lib/extract/reranker.ts:189-189

// await fs.writeFile(

apps/api/src/lib/extract/reranker.ts:253-253

// fs.writeFile(

apps/api/src/lib/scrape-interact/browser-agent.ts:58-58

await fs.appendFile(this.filePath, this.lines.join("\n") + "\n");

apps/api/src/scraper/scrapeURL/engines/utils/downloadFile.ts:190-190

await fs.unlink(tempFilePath);

apps/api/src/scraper/scrapeURL/lib/mock.ts:20-20

await fs.writeFile(

apps/api/src/scraper/scrapeURL/transformers/llmExtract.ts:785-785

// await fs.writeFile(

apps/api/utils/logview.js:24-24

fs.writeFileSync("crawl-" + crawlId + ".log", crawlLogs.map(x => JSON.stringify(x)).join("\n"));

apps/api/utils/logview.js:30-30

fs.writeFileSync(crawlId + ".md",

examples/scrape_and_analyze_airbnb_data_e2b/index.ts:111-111

fs.writeFileSync(filename, pngData)

examples/scrape_and_analyze_airbnb_data_e2b/scraping.ts:86-86

fs.writeFileSync(

Why it matters: Usually legitimate, but worth confirming the paths can't be controlled by untrusted input.

Fix: Confirm which files are written/deleted and that paths cannot be influenced by untrusted input.

MEDIUMPython filesystem readST-FS-PY-READ

The component reads files from disk.

.github/scripts/check_version_has_incremented.py:48-48

version_file = Path(file_path).read_text()

.github/scripts/check_version_has_incremented.py:62-62

with open(file_path, 'r') as file:

.github/scripts/check_version_has_incremented.py:76-76

build_file = Path(file_path).read_text()

.github/scripts/check_version_has_incremented.py:84-84

version_file = Path(file_path).read_text()

.github/scripts/check_version_has_incremented.py:118-118

csproj_file = Path(file_path).read_text()

.github/scripts/check_version_has_incremented.py:138-138

version_file = Path(file_path).read_text()

apps/python-sdk/firecrawl/firecrawl.backup.py:32-32

version_file = Path(os.path.join(package_path, '__init__.py')).read_text()

apps/python-sdk/firecrawl/v1/client.py:33-33

version_file = Path(os.path.join(package_path, '__init__.py')).read_text()

apps/python-sdk/firecrawl/v2/methods/parse.py:91-91

file_bytes = file_path.read_bytes()

apps/python-sdk/firecrawl/v2/utils/get_version.py:8-8

version_file = (package_path / "__init__.py").read_text()

apps/python-sdk/setup.py:7-7

long_description_content = (this_directory / "README.md").read_text()

apps/python-sdk/setup.py:12-12

version_file = (this_directory / "firecrawl" / "__init__.py").read_text()

examples/gemini-2.5-screenshot-editor/cli.py:584-584

with open(args.batch, 'r') as f:

examples/gemini-2.5-screenshot-editor/cli.py:610-610

with open(url, 'rb') as f:

examples/gemini-2.5-screenshot-editor/cli.py:616-616

with open(img_path, 'rb') as f:

Why it matters: Usually legitimate, but worth confirming it can't be steered into reading sensitive files.

Fix: Confirm which files are read and that paths cannot be influenced by untrusted input to reach sensitive locations.

MEDIUMPython filesystem write/deleteST-FS-PY-WRITE

The component writes or deletes files on disk.

.github/scripts/resolve_api_image_version.py:70-70

with open(github_output, "a", encoding="utf-8") as output:

examples/blog-articles/scheduling_scrapers/scripts/bs4_scraper.py:91-91

with open(filename, "w") as f:

examples/blog-articles/scheduling_scrapers/scripts/firecrawl_scraper.py:54-54

with open(filename, "w") as f:

examples/claude-3.7-stock-analyzer/claude-3.7-stock-analyzer.py:80-80

with open(os.path.join(current_dir, 'chart.png'), 'wb') as f:

examples/claude_stock_analyzer/claude_stock_analyzer.py:80-80

with open(os.path.join(current_dir, 'chart.png'), 'wb') as f:

examples/gemini-2.5-screenshot-editor/cli.py:304-304

with open(intermediate_path, 'wb') as f:

examples/gemini-2.5-screenshot-editor/cli.py:426-426

with open(output_file, 'wb') as f:

examples/gemini-github-analyzer/gemini-github-analyzer.py:241-241

with open(filename, 'w', encoding='utf-8') as f:

examples/hacker_news_scraper/bs4_scraper.py:91-91

with open(filename, "w") as f:

examples/hacker_news_scraper/firecrawl_scraper.py:54-54

with open(filename, "w") as f:

Why it matters: Usually legitimate, but worth confirming the paths can't be controlled by untrusted input.

Fix: Confirm which files are written/deleted and that paths cannot be influenced by untrusted input.

MEDIUMnpm prepare hookST-INSTALL-NPM-PREPARE

package.json has a 'prepare' script (runs on git/local installs and before publishing).

apps/api/package.json:45-45

"prepare": "cd ../.. && husky ./apps/api/.husky",

Why it matters: Usually a build step, but confirm it doesn't fetch or run remote code.

Fix: Usually a legitimate build step; confirm it only builds and does not fetch or execute remote code.

MEDIUMNode.js network egressST-NET-NODE

The component makes outbound network requests.

.github/scripts/audit-ci-vuln-scan.mjs:254-254

const response = await fetch(url, {

apps/api/src/controllers/auth.ts:204-204

const response = await fetch(introspectUrl, {

apps/api/src/controllers/v0/admin/check-fire-engine.ts:27-27

const response = await fetch(`${config.FIRE_ENGINE_BETA_URL}/scrape`, {

apps/api/src/controllers/v2/agent-cancel.ts:32-32

const resp = await fetch(

apps/api/src/controllers/v2/agent-status.ts:35-35

const optionsRequest = await fetch(

apps/api/src/controllers/v2/agent.ts:81-81

const passthrough = await fetch(

apps/api/src/controllers/v2/browser.ts:145-145

const res = await fetch(url, {

apps/api/src/controllers/v2/research-proxy.ts:237-237

return fetch(url, {

apps/api/src/controllers/v2/support-proxy.ts:35-35

const upstream = await fetch(target, {

apps/api/src/index.ts:19-19

import http from "node:http";

apps/api/src/index.ts:20-20

import https from "node:https";

apps/api/src/lib/admin-integration-integrations-proxy.ts:55-55

upstream = await fetch(url, {

apps/api/src/lib/avgrab-resolve.ts:24-24

const res = await fetch(`${config.AVGRAB_SERVICE_URL}/supported-urls`);

apps/api/src/lib/avgrab-resolve.ts:82-82

const response = await fetch(`${config.AVGRAB_SERVICE_URL}/resolve`, {

apps/api/src/lib/data-layer.ts:92-92

const response = await fetch(url, {

apps/api/src/lib/fire-privacy-client.ts:247-247

response = await fetch(`${config.FIRE_PRIVACY_URL}/redact`, {

apps/api/src/lib/html-to-markdown-client.ts:8-8

import axios, { AxiosInstance, AxiosError } from "axios";

apps/api/src/lib/html-to-markdown-client.ts:30-30

* Convert HTML to Markdown using direct axios call

apps/api/src/lib/html-to-markdown-client.ts:71-71

const response = await axios.post<ConvertResponse>(

apps/api/src/lib/html-to-markdown-client.ts:99-99

if (axios.isAxiosError(error)) {

Why it matters: Usually legitimate, but confirm the destinations are expected and no sensitive data leaves.

Fix: Confirm the destination hosts are expected and that no sensitive data is sent off-host.

MEDIUMPython network egressST-NET-PY

The component makes outbound network requests.

.github/scripts/check_version_has_incremented.py:41-41

import requests

.github/scripts/check_version_has_incremented.py:56-56

response = requests.get(f"https://pypi.org/pypi/{package_name}/json")

.github/scripts/check_version_has_incremented.py:70-70

response = requests.get(f"https://registry.npmjs.org/{package_name}/latest")

.github/scripts/check_version_has_incremented.py:92-92

response = requests.get(f"https://rubygems.org/api/v1/versions/{package_name}/latest.json")

.github/scripts/check_version_has_incremented.py:105-105

response = requests.get(url)

.github/scripts/check_version_has_incremented.py:127-127

response = requests.get(url)

.github/scripts/check_version_has_incremented.py:147-147

response = requests.get(url)

.github/scripts/check_version_has_incremented.py:181-184

response = requests.get(
        f"https://crates.io/api/v1/crates/{package_name}",
        headers={"User-Agent": "firecrawl-version-check"}
    )

.github/scripts/eval_run.py:1-1

import requests

.github/scripts/eval_run.py:16-26

response = requests.post(
            f"{args.api_url}/run",
            json={
                "experiment_id": args.experiment_id,
                "api_key": args.api_key,
                "label": args.label
            },
            hea …

apps/python-sdk/firecrawl/firecrawl.backup.py:20-20

import requests

apps/python-sdk/firecrawl/firecrawl.backup.py:23-23

import aiohttp

apps/python-sdk/firecrawl/firecrawl.backup.py:591-596

response = requests.post(
            f'{self.api_url}/v1/scrape',
            headers=_headers,
            json=scrape_params,
            timeout=(timeout / 1000.0 + 5 if timeout is not None else None)
        )

apps/python-sdk/firecrawl/firecrawl.backup.py:687-691

response = requests.post(
            f"{self.api_url}/v1/search",
            headers={"Authorization": f"Bearer {self.api_key}"},
            json=params_dict
        )

apps/python-sdk/firecrawl/firecrawl.backup.py:1253-1257

response = requests.post(
            f"{self.api_url}/v1/map",
            headers={"Authorization": f"Bearer {self.api_key}"},
            json=params_dict
        )

apps/python-sdk/firecrawl/firecrawl.backup.py:2207-2207

response = requests.post(url, headers=headers, json=data, timeout=((data["timeout"] / 1000.0 + 5) if "timeout" in data and data["timeout"] is not None else None))

apps/python-sdk/firecrawl/firecrawl.backup.py:2236-2236

response = requests.get(url, headers=headers)

apps/python-sdk/firecrawl/firecrawl.backup.py:2265-2265

response = requests.delete(url, headers=headers)

apps/python-sdk/firecrawl/firecrawl.backup.py:2359-2359

raise requests.exceptions.HTTPError(message, response=response)

Why it matters: Usually legitimate, but confirm the destinations are expected and no sensitive data leaves.

Fix: Confirm the destination hosts are expected and that no sensitive data is sent off-host.

MEDIUMPrompt injection / instruction overrideST-PROMPT-INJECTION

Text that tries to override an AI's instructions was found (e.g. 'ignore previous instructions').

apps/api/openapi.json:153-153

"description": "Headers to send to the webhook URL.",

apps/api/openapi.json:797-797

"description": "Headers to send to the webhook URL.",

apps/api/v1-openapi.json:153-153

"description": "Headers to send to the webhook URL.",

apps/api/v1-openapi.json:797-797

"description": "Headers to send to the webhook URL.",

Why it matters: Embedded in a component, it can hijack an agent's behavior or hide actions from you.

Fix: Treat embedded instructions as untrusted. Review whether this component attempts to manipulate an agent's behavior or hide actions from users.

LOWMCP tool surface detectedST-MCP-DETECTED

An MCP tool surface (manifest or tool definitions) was found.

apps/playwright-service-ts/api.ts:100-100

const server = new Server({

Why it matters: Just context — review which tools it offers and their permissions.

Fix: Review the declared MCP tools and their permissions.

Check your own component

Run the same evidence-backed scan on any MCP server, agent skill, or package.

Scan your own component

Or get notified if this component's risk changes:

How we determine this: deterministic static analysis (regex + AST), evidence-anchored, no code execution. Methodology →