OS Command Injection in 2026: Metacharacters and argv Arrays

OS command injection is the textbook RCE primitive and the one I still find first when I review a small PHP or Node service that talks to the operating system. The shape is always the same: the application builds a shell command by concatenating user input, hands the whole string to /bin/sh -c, and the shell parses the attacker's metacharacters before the target binary ever sees them. Everything that follows in this article is a variation on that one mistake.

This is the variant deep dive that sits under the remote code execution practitioner guide. I cover the canonical shell_exec sink, the full shell-metacharacter catalogue, the same exploit run four different ways against the rce-basic lab, why escapeshellarg is only partial protection, the argv-array pattern that actually closes the class across PHP, Python, Node, and Go, and the blast-radius controls that limit damage when the prevention layer fails.

TL;DR

OS command injection happens when an application builds a shell command from user input and passes the resulting string to a shell. Shell metacharacters in the user value (;, |, &&, `, $(), newline, redirection) are parsed by the shell as syntax, not data, and run whatever the attacker chose alongside the intended command. The textbook sink is shell_exec('ping -c 1 ' . $_GET['host']) and a ?host=localhost;id request runs both ping and id on the server. escapeshellarg blocks the metacharacter escape but leaves an argument injection hole through flags that the called binary itself parses. The real fix is to never invoke a shell at all: pass arguments as an argv array (execFile, subprocess.run with a list, proc_open with an array, exec.Command), validate each value against a strict allowlist, and prefer a library call over a subprocess when one exists. Containment defences (unprivileged user, dropped capabilities, seccomp, read-only root, egress filtering) limit how far an exploit travels once the bug is reached.

The textbook sink

The canonical vulnerable PHP fits on one line:

php

$host = $_GET['host'];
$output = shell_exec('ping -c 1 ' . $host);
echo "<pre>$output</pre>";

A well-behaved request ?host=example.com runs ping -c 1 example.com. The bug is what happens when host contains anything other than a hostname. Take ?host=example.com;id. shell_exec does not call ping directly: it forks /bin/sh -c "ping -c 1 example.com;id" and the shell parses that string first. The semicolon is shell syntax for "end of one command, start of another", so the shell runs ping, then runs id, and the application echoes both outputs.

The mistake is treating the user value as if it were a single argument to ping. It is not. It is a substring of a shell program, and the shell looks for its own metacharacters before any binary is invoked.

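Every language ships the same family of footguns: PHP shell_exec/exec/system/backticks, Python os.system and subprocess.run(..., shell=True), Node child_process.exec, Ruby backticks and Kernel#system with a single string, Java Runtime.exec or ProcessBuilder wrapped around sh -c. (Runtime.exec(String) on its own splits the string on whitespace instead of invoking a shell, which avoids metacharacter parsing but still permits argument injection.) The common factor is "single string handed to a shell".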
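The same mistake is easy to reproduce outside PHP. Below is a minimal Python sketch, assuming a POSIX /bin/sh is available and using echo as a harmless stand-in for ping. The function name run_via_shell and the INJECTED marker are illustrative and are not part of the lab.

```python
import subprocess


def run_via_shell(host: str) -> str:
    """Vulnerable on purpose: one concatenated string handed to /bin/sh -c."""
    command = "echo pinging " + host  # the user value becomes shell source code
    result = subprocess.run(command, shell=True, capture_output=True, text=True)
    return result.stdout


benign = run_via_shell("example.com")
hostile = run_via_shell("example.com; echo INJECTED")

print(benign, end="")   # one line: the intended command only
print(hostile, end="")  # two lines: the shell also ran the attacker's command
```

The hostile call never "breaks out" of anything. The shell simply parses the string it was given, and that string happens to contain two commands.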

The shell metacharacter catalogue

The shell has a small but rich vocabulary for combining commands. Every one of these is a working injection vector when user input lands inside a shell string:

; ends one command, starts the next. ping foo;id runs both unconditionally.
| pipes the first command's stdout into the second's stdin. The side effect is that id runs.
|| runs the right-hand command only if the left one fails (ping invalid||id).
&& runs the right-hand command only if the left one succeeds.
` ` is backtick command substitution. ping `id` runs id first and substitutes its output into the ping arguments.
$( ) is the modern syntax for the same substitution. Nests cleanly, no quoting trips.
>, >>, < redirect stdout/stdin. ping foo > /tmp/pwn writes the command's output to an attacker-named path. Combined with a separator or substitution that controls what gets written, this is how webshells get dropped when the path is web-accessible.
& runs the command in the background. ping foo & id runs both, ping detached.
Newline (%0a URL-encoded) acts like ; in most shells. Useful when a filter strips semicolons but not newlines.

Most payload lists fixate on ;, |, and $(). Every one of the above is a fully functional injection point. Filtering a subset and forgetting the rest is one of the most common partial-fixes I see in code review.
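To see that the less famous entries really fire, here is a small Python sketch that runs one payload per metacharacter through a shell. It assumes a POSIX /bin/sh, uses echo and false as harmless base commands, and treats a standalone MARK line in stdout as proof that a second command ran. The case names and the MARK marker are my own labels. Redirection is left out because it needs a scratch file.

```python
import subprocess

# (label, base command, user-controlled suffix)
CASES = [
    ("semicolon", "echo ", "x; echo MARK"),
    ("pipe", "echo ", "x | echo MARK"),
    ("and", "echo ", "x && echo MARK"),
    ("or", "false ", "x || echo MARK"),  # the left side must fail for || to fire
    ("backtick", "echo ", "`echo MARK`"),
    ("dollar-paren", "echo ", "$(echo MARK)"),
    ("background", "echo ", "x & echo MARK"),
    ("newline", "echo ", "x\necho MARK"),
]


def shell_ran_second_command(base: str, payload: str) -> bool:
    """True if the shell executed the payload instead of printing it verbatim."""
    out = subprocess.run(base + payload, shell=True,
                         capture_output=True, text=True).stdout
    # A literal echo of the payload would print e.g. "x; echo MARK", never a bare "MARK".
    return "MARK" in out.splitlines()


results = {name: shell_ran_second_command(base, payload)
           for name, base, payload in CASES}

for name, fired in results.items():
    print(f"{name:13} {'injected' if fired else 'inert'}")
```

On a POSIX shell every row should report injected, which is the point: none of these characters is more exotic than the others from the shell's perspective.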

Walking the lab

The rce-basic lab in the techearl-labs repo ships the exact shell_exec('ping -c 1 -W 1 ' . $host) sink at /ping.php. Boot it:

bash

docker compose up rce-basic

It listens on http://localhost:8085. The four payloads below all run id as the web user, each through a different metacharacter, each ending with uid=33(www-data) gid=33(www-data) groups=33(www-data) appended to the ping output:

code

GET /ping.php?host=localhost;id      # command separator
GET /ping.php?host=localhost|id      # pipe
GET /ping.php?host=`id`              # backtick substitution
GET /ping.php?host=$(id)             # dollar-paren substitution

All four converge on the same outcome through different metacharacters. A filter that blocks ; alone (an actual fix I have seen shipped) leaves three working bypasses. A filter that blocks ;|& still misses command substitution. A filter that blocks all of those still misses newline injection through %0a. Filtering metacharacters is fighting the symptom; the disease is the shell sitting between the application and the binary.
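The partial-filter failure mode can be sketched in a few lines of Python. This assumes a POSIX /bin/sh and uses echo as a stand-in for ping. The naive_filter function is a deliberately incomplete blocklist of the kind described above, not code from the lab.

```python
import subprocess


def naive_filter(value: str) -> str:
    """Deliberately incomplete: strips ; | & and nothing else."""
    for ch in ";|&":
        value = value.replace(ch, "")
    return value


def run_filtered(value: str) -> list:
    """Filter the value, then make the original mistake anyway."""
    command = "echo " + naive_filter(value)
    out = subprocess.run(command, shell=True, capture_output=True, text=True).stdout
    return out.splitlines()


blocked = run_filtered("x; echo MARK")       # separator stripped, payload is inert
substituted = run_filtered("$(echo MARK)")   # command substitution survives
newline = run_filtered("x\necho MARK")       # newline acts as a separator

print(blocked)
print(substituted)
print(newline)
```

The filter does its job on the one payload it was written for and is bypassed by the next two, which is why I treat any blocklist in front of a shell string as a finding in its own right.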

Why escapeshellarg is not enough alone

The natural next step in PHP is escapeshellarg. It wraps the value in single quotes and escapes any embedded single quotes, so the shell sees one quoted argument. The metacharacter exploits above all stop working because the metacharacters are now inside a quoted string.

php

$output = shell_exec('ping -c 1 ' . escapeshellarg($host));

A ?host=localhost;id request now runs ping -c 1 'localhost;id', which ping rejects with "unknown host". The semicolon does nothing because the shell never sees it as syntax.
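Python's closest analogue is shlex.quote, which I am using here as a rough stand-in for escapeshellarg (the two are similar in intent, not byte-for-byte identical). A minimal sketch, assuming a POSIX /bin/sh and echo in place of ping:

```python
import shlex
import subprocess


def run_quoted(value: str) -> str:
    """Still goes through a shell, but the value arrives as one quoted word."""
    command = "echo " + shlex.quote(value)
    return subprocess.run(command, shell=True, capture_output=True, text=True).stdout


quoted_form = shlex.quote("localhost; echo MARK")
quoted_out = run_quoted("localhost; echo MARK")

print(quoted_form)         # the single-quoted word the shell will see
print(quoted_out, end="")  # the payload comes back as literal text
```

The semicolon is printed back as data and no second command runs. That is exactly the property quoting buys, and no more than that.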

What escapeshellarg does not solve is the called binary's own argument parser. The shell hands one argument to the binary; that argument starts with a dash; the binary interprets the dash as a flag. The canonical case in the lab is /lookup.php:

php

$output = shell_exec('dig ' . escapeshellarg($domain));

A request ?domain=-f /etc/passwd runs dig '-f /etc/passwd'. Shell-wise that is correct: one quoted argument. dig then sees the leading -f, interprets it as the batch-file flag, opens /etc/passwd, fails to parse the lines as DNS queries, and dumps the file contents through its error output.

This is the argument injection variant, with its own deep dive. Any tool that takes flags is a candidate: curl -K/-o, find -exec, tar --use-compress-program, git --upload-pack, wget --use-askpass, ssh -oProxyCommand. The fix at the call site is -- before the user argument so the binary stops looking for flags, or refuse any value starting with -. The real fix is to stop using a shell at all.
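A short Python sketch of why quoting does not help here, and of the "refuse a leading dash" guard. Nothing is executed: shlex.split is used only to show how a POSIX shell would tokenise the quoted command line. The function name reject_option_like is hypothetical.

```python
import shlex


def reject_option_like(value: str) -> str:
    """Refuse any value the called binary could mistake for a flag."""
    if value.startswith("-"):
        raise ValueError(f"option-like value rejected: {value!r}")
    return value


# Quoting keeps the shell from splitting the value, and nothing more.
command = "dig " + shlex.quote("-f /etc/passwd")
argv_seen_by_binary = shlex.split(command)

print(argv_seen_by_binary)  # ['dig', '-f /etc/passwd']

try:
    reject_option_like("-f /etc/passwd")
    guard_result = "accepted"
except ValueError:
    guard_result = "rejected"

print(guard_result)
```

The binary receives a single argument that still begins with a dash, so the decision about whether it is a flag belongs to the binary's parser, not the shell. The guard makes that decision before the call instead.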

The argv-array pattern that actually works

The pattern that closes the whole class is the same across every modern language: bypass the shell entirely, pass the command name and each argument as separate elements of an array. The OS executes the binary directly via execve; no shell parses anything.

PHP, using proc_open with an argv array (PHP 7.4+):

php

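// Array form (PHP 7.4+): PHP executes ping directly, no shell is involved.
$process = proc_open(
    ['ping', '-c', '1', '-W', '1', $host],
    [1 => ['pipe', 'w'], 2 => ['pipe', 'w']],
    $pipes
);
if (!is_resource($process)) {
    exit('ping failed');
}
$output = stream_get_contents($pipes[1]);
fclose($pipes[1]);
fclose($pipes[2]);
proc_close($process);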

Python, using subprocess.run with a list (shell=False is the default):

python

import subprocess

result = subprocess.run(
    ['ping', '-c', '1', '-W', '1', host],
    capture_output=True, text=True, timeout=5,
)

Node, using child_process.execFile:

javascript

const { execFile } = require('child_process');
execFile('ping', ['-c', '1', '-W', '1', host], (err, stdout) => {
    if (err) return res.status(500).send('ping failed');
    res.type('text/plain').send(stdout);
});

Go, using exec.Command:

go

out, err := exec.Command("ping", "-c", "1", "-W", "1", host).Output()

The pattern is identical across all four because the OS primitive is identical. execve takes a binary path and a char *argv[]. A shell is one program among many that you can invoke through execve; the bug class only exists when you choose to invoke that program with a string assembled from user input. Pair the argv form with strict per-field allowlists (re.fullmatch(r"[a-zA-Z0-9][a-zA-Z0-9.-]*", host) and similar) so even argument-injection style flag values get rejected before the call. Note the first character class: a pattern that lets the value start with - would still accept -f.
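Putting the two halves together, here is a minimal Python sketch of an allowlist in front of an argv list. To keep it runnable without ping, the demonstration child is the Python interpreter itself, printing the one argument it received. The names HOSTNAME, ping_argv, and echo_argv are illustrative, and the hostname pattern is an assumption about what this particular field should accept.

```python
import re
import subprocess
import sys

# First character must be alphanumeric, so option-like values such as "-f" fail.
HOSTNAME = re.compile(r"[a-zA-Z0-9][a-zA-Z0-9.-]*")


def ping_argv(host: str) -> list:
    """Validate the field, then build an argv list; no shell string anywhere."""
    if not HOSTNAME.fullmatch(host):
        raise ValueError(f"rejected host: {host!r}")
    return ["ping", "-c", "1", "-W", "1", host]


def echo_argv(value: str) -> str:
    """Run a child via an argv list; the child prints the argument it was given."""
    child = "import sys; print(sys.argv[1])"
    result = subprocess.run([sys.executable, "-c", child, value],
                            capture_output=True, text=True)
    return result.stdout


print(ping_argv("example.com"))
print(echo_argv("localhost; echo MARK"), end="")  # arrives as literal text
```

The allowlist and the argv list cover different failures. The argv list removes the shell, and the allowlist removes values the target binary might still misread.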

The safer pattern: do not shell out at all

Argv arrays are an improvement on the vulnerable baseline. The ideal is to not call an external binary in the first place. The reflex to shell out usually comes from familiarity with the command-line tool, not from any actual requirement that the work happen in a subprocess.

For DNS lookups, every language has a resolver in or next to the standard library: dns_get_record in PHP, socket.getaddrinfo in Python (or dns.resolver from the third-party dnspython package when you need specific record types), dns.resolve in Node, net.LookupHost in Go. For HTTP requests, every language ships an HTTP client. For ICMP ping checks, raw ICMP sockets are available with a small library (icmplib in Python, golang.org/x/net/icmp in Go). For file operations, image processing, compression, encryption, the standard library or a binding is almost always better than a subprocess.

The library version is faster (no fork/exec), produces structured output instead of stdout to scrape, has no shell to misparse anything, and runs in the application's own privilege boundary. When I review a service and find five different shell_exec calls, four of them collapse to a single library call. The fifth is usually a niche binary with no library equivalent, and that is the one that ends up wrapped in proc_open with an argv array and a strict allowlist.
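As a small illustration of the library route, here is a Python sketch that resolves a name with socket.getaddrinfo instead of shelling out to dig or host. The resolve helper is my own name. The example resolves a numeric address so it runs without network access.

```python
import socket


def resolve(host: str) -> list:
    """Resolve a host to its addresses in-process; no subprocess, no shell."""
    infos = socket.getaddrinfo(host, None, proto=socket.IPPROTO_TCP)
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the address.
    return sorted({info[4][0] for info in infos})


addresses = resolve("127.0.0.1")
print(addresses)
```

A hostile value such as localhost;id is just a name that fails to resolve (socket.gaierror). There is no parser on this path for the semicolon to mean anything to.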

Blast-radius mitigation

Argv arrays prevent the bug. Containment controls limit what the bug does when the prevention layer fails. The two layers compose:

Run as a non-root user. The web process runs as www-data or an app-specific UID. An RCE that lands as a low-privilege user has limited reach.
Drop Linux capabilities. --cap-drop=ALL in Docker, then add back only what the app needs (usually nothing).
Read-only root filesystem. --read-only plus writable tmpfs for the few paths the app legitimately writes to. A webshell needs to write itself somewhere.
seccomp, AppArmor, or SELinux. A seccomp profile that blocks execve from the web process is a remarkable defence: even if the attacker reaches code execution inside the language runtime, they cannot fork a shell.
Separate user per service. Database, web, workers each run as a different UID. Lateral movement needs another privilege boundary to cross.
Network segmentation. The web container does not need to reach the database admin port, the secrets manager over an unencrypted channel, or the cloud metadata service. Enforcing IMDSv2 with a hop limit of 1 closes the SSRF-into-credentials chain on AWS.

None of these prevent the bug. They make the bug less useful when it happens.
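For the container-level controls, a hedged sketch of what the launch line might look like, expressed as a Python argv builder so that it follows the article's own advice. The image name and UID are placeholders, the function name hardened_run_argv is hypothetical, and the list only covers the Docker flags named above. A seccomp or AppArmor profile would be layered on separately.

```python
def hardened_run_argv(image: str, uid: int = 33) -> list:
    """Build a docker run argv with the containment flags discussed above."""
    return [
        "docker", "run", "--rm",
        "--user", f"{uid}:{uid}",  # non-root user inside the container
        "--cap-drop=ALL",          # drop every Linux capability
        "--read-only",             # read-only root filesystem
        "--tmpfs", "/tmp",         # the one writable scratch path
        image,
    ]


argv = hardened_run_argv("example/rce-basic")  # placeholder image name
print(" ".join(argv))
```

The list is only built and printed here. Handing it to subprocess.run as a list would start the container without a shell in between.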

Real-world incidents

A short tour of command-injection-shaped CVEs. For per-version specifics I would rather link out than risk a stale number; the lessons below are what I want to remember.

Shellshock, CVE-2014-6271 (September 2014). Bash before 4.3 patch 25 mis-parsed function definitions in exported environment variables: if a variable's value started with a function definition followed by a trailing command, Bash would parse the function and then execute the trailing command at shell startup. Combined with CGI, which exports HTTP headers as environment variables, this turned any CGI-backed endpoint into an unauthenticated RCE through a crafted User-Agent header. The lesson is what happens when data containers (env vars) get parsed as code (function definitions). The fix shipped within days; the cleanup took years because Bash was everywhere.
GitLab ExifTool injection, CVE-2021-22205 (April 2021). GitLab Community and Enterprise editions before 13.10.3, 13.9.6, and 13.8.8 passed user-uploaded image files to ExifTool to strip metadata. ExifTool's DjVu handler allowed embedded Perl code in image metadata to be executed during parsing. Unauthenticated RCE by uploading a crafted image to a public project. Patched in April 2021, then re-exploited in the wild through 2022 against unpatched self-hosted GitLabs. Image-processing libraries are RCE sinks, and every web app that runs them on uploaded files inherits the risk.
Confluence OGNL injection, CVE-2022-26134 (June 2022). Atlassian Confluence Server and Data Center across every supported branch (1.3.0 before 7.4.17, plus 7.13.0 before 7.13.7, 7.14.0 before 7.14.3, 7.15.0 before 7.15.2, 7.16.0 before 7.16.4, 7.17.0 before 7.17.4, and 7.18.0 before 7.18.1) evaluated OGNL expressions inside the request URI. A request like /${@java.lang.Runtime@getRuntime().exec("id")}/ triggered evaluation as part of URL routing, executing the command. Unauthenticated, single-request RCE, exploited in the wild before the patch shipped. Not shell-metacharacter injection, but the underlying pattern (user string reaching an evaluator that calls Runtime.exec) is the same shape one layer up.

The version-specific details for each CVE live in the NVD entries linked above; pull the current advisory before quoting a CVSS or patched-version number.

Frequently asked questions

Does escapeshellarg stop command injection?

It stops shell-metacharacter injection, where the attacker uses ;, |, backticks, or $() to chain a second command. It does not stop argument injection, where the value starts with - and the called binary interprets it as a flag. dig -f, curl -K, find -exec, tar --use-compress-program, and similar are all reachable through a correctly quoted value. The right pattern is escapeshellarg plus a leading -- separator, plus ideally not shelling out at all in favour of a library call.

Why are argv arrays safer than shell strings?

The OS execve syscall takes a binary path and an array of argument strings. When you pass an argv array to a language API (execFile in Node, subprocess.run with a list in Python, proc_open with an array in PHP, exec.Command in Go), the API calls execve directly. No shell is involved, so metacharacters like ; or $() in a user value pass through to the binary as literal characters in one argument. The shell only runs when you explicitly invoke one, which is what the unsafe APIs (shell_exec, shell=True, child_process.exec) do under the hood.

Can a WAF reliably block command injection?

A WAF with a current ruleset blocks the obvious payloads and slows opportunistic scanners. It will not block argument injection like dig -f because the payload looks benign. It will not catch bypasses using newline encoding, hex encoding, or characters the rule did not anticipate. Treat the WAF as one layer behind the actual fix (argv arrays plus allowlists), never as the primary control.

Is shell=False enough in Python subprocess?

shell=False with a list argument is the safe shape. shell=True with a string is the vulnerable shape. The reliable pattern is shell=False (the default) plus a list of arguments. If you find shell=True in a code review, replace it; there is almost never a good reason.

What about Windows command injection?

Same class, different shell. cmd.exe metacharacters include & (separator), && (logical and), || (logical or), | (pipe), ^ (escape), and % for variable expansion. PowerShell adds ; and $(...). Process.Start in .NET with UseShellExecute=true goes through the shell; the safe pattern is UseShellExecute=false with a properly escaped argument list. Windows also has the additional twist that argument quoting is the calling process's responsibility, so there are subtle escaping vectors that do not exist on Unix.
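The "quoting is the caller's responsibility" point can be seen from any platform with subprocess.list2cmdline, the helper Python's subprocess module uses to turn an argument list into a Windows command line. This is a sketch for illustration only. It shows what the quoting rules cover and is not a recommendation to build cmd.exe strings by hand.

```python
import subprocess

# Whitespace triggers quoting under the Microsoft C runtime rules.
spaced = subprocess.list2cmdline(["ping", "-n", "1", "my host"])

# cmd.exe metacharacters are not part of those rules, so & passes through untouched.
hostile = subprocess.list2cmdline(["ping", "-n", "1", "localhost&whoami"])

print(spaced)   # ping -n 1 "my host"
print(hostile)  # ping -n 1 localhost&whoami
```

The second line is harmless when the process is created directly, because no shell ever reads the &. It becomes an injection the moment the same string is handed to cmd.exe, which is why the no-shell rule matters on Windows as much as on Unix.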

Where to go next

This article is the deep dive on the classic shell-metacharacter sink. The variants and the wider map:

Up to the remote code execution practitioner guide for the full taxonomy: command injection, argument injection, server-side template injection, and direct eval.
Across to argument injection for the variant that gets through escapeshellarg by abusing the called binary's own flag parser.
Across to server-side template injection for the same data-becomes-code mistake one layer up in template engines.
Across to eval injection for the dumbest version of the bug: user input handed straight to the language runtime.
Back to the web application security vulnerabilities taxonomy for the hub.
Hands-on: the commix tutorial against a vulnerable app walks this exact sink end to end, from detection to a popped shell.

The recurring lesson across the whole RCE family is the same one I keep writing about. Every place untrusted input crosses into something that parses bytes as code is a sink. For OS command injection that something is /bin/sh. The only reliable defence is to make the crossing not happen: pass arguments as arguments, never as a concatenated string, and prefer a library call over a subprocess whenever one exists.

Ishan Karunaratne
