grep Regex: BRE vs ERE vs PCRE Explained (2026)

grep does not have one regex engine. It has three, and which one you get depends on the flag. The default is basic regex (BRE). Add -E and you get extended regex (ERE). Add -P and you get Perl-compatible regex (PCRE). The same pattern string can match different text, or fail to compile, depending on which mode is active.

The mode that surprises everyone is the default. In BRE, the characters +, ?, |, (, ), {, and } are literal text. They match themselves. To use them as metacharacters you have to backslash-escape them: \+, \?, \|, $, $, \{, \}. That inversion (escape the metacharacter to make it special, leave it bare to make it literal) is the single biggest reason people give up on grep and reach for grep -E.

This article is the deep dive on the three modes. For the full flag reference, see the grep cheat sheet.

Set your values

Try it with your own values

Set your OS, search path, and a test pattern. Every grep example below updates with your values.

Operating systemSearch pathPattern

The three modes at a glance

Mode	Flag	Metacharacters bare	Best for
Basic (BRE)	none (default)	`.` `*` `^` `$` `[...]` `\{` `\}` `$` `$` `\+` `\?` `\|`	Simple literal-ish searches; portable scripts
Extended (ERE)	`-E` (or `egrep`)	adds `+` `?` `	( ) ` bare
Perl-compatible (PCRE)	`-P`	adds lookaround, `\d` `\w` `\s`, non-greedy, backreferences	Anything BRE and ERE cannot express

The practical advice: reach for -E by default. Use plain grep only when the pattern is genuinely basic, and use -P only when you need something PCRE-exclusive and you are on GNU grep.

BRE: the default, where metacharacters are literal

In basic regex, this list of characters means themselves, not their regex function:

code

+   matches a literal plus sign
?   matches a literal question mark
|   matches a literal pipe character
(   matches a literal open paren
)   matches a literal close paren
{   matches a literal open brace
}   matches a literal close brace

To get the regex behavior, you escape them. So in BRE, "one or more digits" is written with an escaped plus:

bash

grep '[0-9]\+' app.log

That \+ is "one or more of the preceding". Without the backslash, [0-9]+ would match a digit followed by a literal + character. Grouping and alternation work the same way, escaped:

bash

grep '\(error\|warn\)' app.log

The escaped $ and $ form a group; the escaped \| is alternation. Interval quantifiers also need escaping. To match "between 2 and 4 of the preceding", you write the braces escaped:

bash

grep 'a\{2,4\}' app.log

What does work bare in BRE: . (any character), * (zero or more of the preceding), ^ (start of line), $ (end of line), [...] (character class), and [^...] (negated class). Those five are the BRE toolkit. Everything else is escape-to-activate.

BRE exists because it is the original 1970s grep behavior, frozen by POSIX for backward compatibility. The default stays, and -E is the opt-in to sanity.

ERE: extended regex, the one you actually want

Extended regex flips the rule. In ERE, + ? | ( ) { } are metacharacters directly, no backslash needed. To match them literally you escape them, which is what every other regex flavor does and what your instincts expect.

The same three patterns from above, rewritten for ERE:

bash

grep -E '[0-9]+' app.log
grep -E '(error|warn)' app.log
grep -E 'a{2,4}' app.log

Cleaner, and it matches how regex works in Python, JavaScript, Perl, and every editor's find dialog. This is why ERE is the right default for interactive use.

bash· Linux (GNU)

grep -E ':pattern' :search_path/*.log

egrep is the historical shorthand for grep -E. It still works on most systems but modern GNU and BSD grep print a deprecation warning and tell you to use grep -E. Treat egrep as legacy; write grep -E in anything you commit.

One thing ERE does not add: the Perl shorthand classes. \d, \w, and \s are not part of ERE. More on that below.

PCRE: the full Perl engine

grep -P switches to PCRE, the regex library that backs Perl. This is a genuinely different and far larger engine. It adds everything ERE has plus:

Lookahead (?=...) and negative lookahead (?!...)
Lookbehind (?<=...) and negative lookbehind (?<!...)
Non-greedy quantifiers: *?, +?, ??, {n,m}?
Shorthand classes: \d (digit), \w (word char), \s (whitespace), and their negations \D, \W, \S
Named groups: (?<name>...)
Backreferences by number \1 and by name \k<name>
Word boundaries \b that work reliably across the engine

Lookbehind is the headline feature. To extract the value after user= without including user= itself in the match, you anchor with a lookbehind:

bash

grep -oP '(?<=user=)\w+' app.log

The (?<=user=) lookbehind asserts "preceded by user=" without consuming those characters, so -o prints just the username. There is no way to write that in BRE or ERE. The closest you get is a capture group plus sed or awk to pull the group out.

Non-greedy matching is the other one ERE cannot do. .* is greedy and grabs as much as possible; .*? stops at the first opportunity:

bash

grep -oP '".*?"' data.json

-P is GNU only. It is a compile-time option in GNU grep, and even on Linux some minimal builds omit it (you get grep: support for the -P option has not been compiled in). It does not exist at all in BSD grep, which is what macOS ships. That platform gap is the next section.

The same match in all three flavors

Here is one task (find lines with one or more digits followed by ms) written three ways:

code

BRE:  grep    '[0-9]\+ms'  app.log
ERE:  grep -E '[0-9]+ms'   app.log
PCRE: grep -P '\d+ms'      app.log

All three match the same lines. BRE escapes the +; ERE uses it bare; PCRE uses it bare and swaps [0-9] for the \d shorthand. ERE is the portable choice that still reads cleanly.

A second example, "a word repeated 2 to 3 times", shows the brace difference:

code

BRE:  grep    '\(foo\)\{2,3\}'  app.log
ERE:  grep -E '(foo){2,3}'      app.log
PCRE: grep -P '(foo){2,3}'      app.log

ERE and PCRE are identical here; only BRE needs the escaping.

Feature comparison

Feature	BRE	ERE	PCRE
Anchors `^` `$`	Yes	Yes	Yes
Any char `.`, star `*`	Yes	Yes	Yes
Character class `[...]`	Yes	Yes	Yes
Grouping	``	`( )`	`( )`
Alternation	`\|`	`	`
One-or-more, zero-or-one	`\+` `\?`	`+` `?`	`+` `?`
Interval quantifier	`\{n,m\}`	`{n,m}`	`{n,m}`
Shorthand `\d` `\w` `\s`	No	No	Yes
POSIX class `[[:digit:]]`	Yes	Yes	Yes
Backreference `\1`	Yes	No (POSIX), GNU adds it	Yes
Non-greedy `*?`	No	No	Yes
Lookahead, lookbehind	No	No	Yes
Named groups	No	No	Yes

The two rows that catch people: shorthand classes are PCRE-only, and ERE actually drops backreference support that BRE has (GNU re-adds it as an extension, but POSIX ERE has no \1).

\d \w \s are not in BRE or ERE

This is the most common false assumption. \d looks universal because it works in Python, JavaScript, and PCRE. But in BRE and ERE, \d is just an escaped d, which matches a literal d. So grep -E '\d' finds the letter d, not digits.

The portable replacement is a POSIX character class or an explicit range:

Perl shorthand	POSIX class (BRE/ERE)	Explicit range
`\d`	`[[:digit:]]`	`[0-9]`
`\w`	`[[:alnum:]_]`	`[A-Za-z0-9_]`
`\s`	`[[:space:]]`	(no clean range)
`\D`	`[^[:digit:]]`	`[^0-9]`

So "three digits" in ERE is:

bash

grep -E '[[:digit:]]{3}' app.log

POSIX classes have an advantage over [0-9]: they are locale-aware. In a non-ASCII locale, [[:alpha:]] matches accented letters that [A-Za-z] misses. For pure ASCII data the explicit ranges are fine. If you genuinely want \d and \w, that is your signal to use -P on GNU grep.

macOS BSD grep vs GNU grep

macOS ships BSD grep, not GNU grep. They agree on BRE and ERE. They diverge hard on PCRE.

Capability	GNU grep	BSD grep (macOS default)
BRE (default)	Yes	Yes
ERE (`-E`)	Yes	Yes
PCRE (`-P`)	Yes (if compiled in)	Not supported at all
`\d` `\w` `\s` in `-E`	Literal `d` `w` `s`	Literal `d` `w` `s`
POSIX classes `[[:digit:]]`	Yes	Yes
Backreference `\1` in ERE	Yes (GNU extension)	No

On macOS, grep -P fails immediately with grep: invalid option -- P. There is no PCRE engine behind BSD grep to enable. Three fixes:

Install GNU grep. brew install grep puts it on PATH as ggrep. Run ggrep -P '...', or alias grep='ggrep' in your shell rc.
Use pcregrep. A separate Homebrew package (brew install pcre) that is purpose-built for PCRE and also does multi-line matching with -M.
Use ripgrep. brew install ripgrep, then rg --pcre2 '...'. ripgrep defaults to its own ERE-like engine and switches to PCRE2 on the --pcre2 flag.

bash· Linux (GNU)

grep -oP '(?<=v)[0-9]+' :search_path/*.log

PowerShell's Select-String uses the .NET regex engine, which supports lookaround and \d natively, so the PCRE-style patterns just work on Windows without any extra install.

Common mistakes

1. Using + in BRE and expecting one-or-more. Plain grep '[0-9]+' looks for a digit followed by a literal plus sign, because in BRE the + is literal. You wanted grep '[0-9]\+' or, better, grep -E '[0-9]+'. This is the number-one BRE trap.

2. Expecting \d to work under -E. grep -E '\d{3}' does not match three digits. ERE has no \d; the engine reads it as a literal d. Use grep -E '[0-9]{3}' or grep -P '\d{3}'.

3. Reaching for lookahead without -P. (?=...) and (?<=...) are PCRE constructs. Under plain grep or grep -E they are parsed as a literal group containing a literal ? and =. If you need lookaround, you need -P, full stop.

4. Running grep -P on macOS. BSD grep has no -P and never will. The command fails with invalid option. Install GNU grep, pcregrep, or ripgrep instead of fighting it.

5. Escaping in the wrong direction. In ERE, \( matches a literal paren and ( starts a group. People coming from BRE escape their groups out of habit, then wonder why the grouping vanished. Pick a mode and commit to its rules.

6. Forgetting POSIX intervals need a closing brace. grep -E 'a{2,' with an unterminated {2, is sometimes accepted as literal text and sometimes errors, depending on the build. Always close the interval.

When NOT to use this

Regex is not always the right tool. Skip it when:

The pattern is a fixed literal string. If you are searching for 192.168.1.1 or Cmd+Shift+P, use grep -F (fixed strings). It is faster, and it means the . and + in your search term are treated as literal characters with zero escaping. No regex mode needed.
You need to actually parse structured data. Regex is a poor JSON, HTML, or CSV parser. For JSON use jq; for columnar text use awk; for real grammar use a proper parser. A regex that "mostly works" on structured input is a bug waiting for the one edge case that breaks it.
You need fields, arithmetic, or multi-line logic. That is awk territory. grep finds lines; awk processes them. If your pattern is growing capture groups just to pull out a column, switch tools.
You are matching across newlines. grep is line-oriented and no regex mode changes that. Use pcregrep -M, ripgrep --multiline, or preprocess with tr.

FAQ

-E selects extended regex (ERE), which is the POSIX standard. It makes +, ?, |, parentheses, and braces work as metacharacters without backslashes, but it has no lookaround and no shorthand classes.

-P selects Perl-compatible regex (PCRE), a much larger engine. It adds lookahead, lookbehind, non-greedy quantifiers, named groups, backreferences, and the d w s shorthands. -P is GNU-only and absent from macOS BSD grep.

Plain grep uses basic regex (BRE), where + is a literal plus sign, not the one-or-more quantifier. To get quantifier behavior in BRE you escape it: [0-9]\+. The cleaner fix is grep -E, which switches to extended regex where + works bare like every other regex flavor.

Only under grep -P. The \d shorthand for a digit is a PCRE construct. In basic and extended regex it is read as a literal letter d, so grep -E '\d' matches the character d, not a number.

For BRE and ERE, use the POSIX class [[:digit:]] or the explicit range [0-9] instead. Both are portable across GNU and BSD grep.

Lookaround is PCRE-only, so you need grep -P. A positive lookbehind looks like (?<=prefix) and a positive lookahead like (?=suffix). They assert context without consuming it, which pairs well with -o to print just the matched core.

If you are on macOS, BSD grep has no -P. Install GNU grep with brew install grep and run ggrep -P, or use pcregrep.

macOS ships BSD grep, which has no PCRE engine compiled in. The -P flag does not exist there, so the command fails with an invalid option error.

Install GNU grep through Homebrew (brew install grep) and call it as ggrep, install pcregrep as a standalone PCRE tool, or use ripgrep with its --pcre2 flag. Any of the three gives you full PCRE on macOS.

Yes. egrep is a historical shorthand for grep -E (extended regex), and fgrep is shorthand for grep -F (fixed strings). Modern GNU and BSD grep keep both wrappers working but print a deprecation warning recommending the grep -E and grep -F forms. Write the long form in any script you commit.

grep Regex: BRE vs ERE vs PCRE Explained

Set your values

The three modes at a glance

BRE: the default, where metacharacters are literal

ERE: extended regex, the one you actually want

PCRE: the full Perl engine

The same match in all three flavors

Feature comparison

\d \w \s are not in BRE or ERE

macOS BSD grep vs GNU grep

Common mistakes

When NOT to use this

See also

FAQ

Ishan Karunaratne

Related posts

The Regex (*ACCEPT) Control Verb, Explained

find -regex vs -name: When to Use Regex in find

The Git Staging Area Explained

Mode	Flag	Metacharacters bare	Best for
Basic (BRE)	none (default)	`.` `*` `^` `$` `[...]` `\{` `\}` `\(` `\)` `\+` `\?` `\|`	Simple literal-ish searches; portable scripts
Extended (ERE)	`-E` (or `egrep`)	adds `+` `?` `	( ) ` bare
Perl-compatible (PCRE)	`-P`	adds lookaround, `\d` `\w` `\s`, non-greedy, backreferences	Anything BRE and ERE cannot express

What is the difference between grep -E and grep -P?

Why does + not work in plain grep?

Does \d work in grep?

How do I use lookahead or lookbehind in grep?

Why does grep -P fail on my Mac?

Is egrep the same as grep -E?

Ishan Karunaratne