Equifax Breach: Unpatched Apache Struts RCE Explained

In 2017, attackers breached Equifax, one of the three major US credit bureaus, and stole the personal data of about 147 million people, including names, Social Security numbers, dates of birth, and addresses. The way in was a remote code execution vulnerability in Apache Struts, CVE-2017-5638, for which a patch had been available for over two months and which Equifax had been specifically warned to apply. Once inside, the attackers operated for months largely undetected, because a digital certificate on Equifax's traffic-monitoring system had been expired for nineteen months, leaving the sensor that should have caught the data theft effectively blind.

This is the case I use to teach two lessons at once: that an unpatched, publicly-known RCE is one of the most dangerous things you can run, and that detection which is silently broken is worse than no detection at all, because everyone assumes it is working.

The vulnerability: CVE-2017-5638

The root bug is a clean example of remote code execution through injection. Apache Struts is a Java web framework, and CVE-2017-5638 was a flaw in how its Jakarta Multipart parser handled the Content-Type HTTP header. By sending a crafted Content-Type value containing an OGNL (Object-Graph Navigation Language) expression, an attacker could get the server to evaluate that expression, which is to say, run code of the attacker's choosing on the server. It is the same family of "untrusted input parsed as code" failure as server-side template injection, with OGNL as the interpreter.

Crucially, this was not a zero-day at the time of the breach. The Apache Struts project disclosed the vulnerability and released a fixed version in early March 2017, with a public advisory (S2-045). The US Department of Homeland Security separately notified organisations, including Equifax, to patch it. Exploitation in the wild began within days of disclosure. By the time the attackers used it against Equifax, the patch had existed for weeks.

The attack chain

Editorial illustration of the Equifax breach: an unlocked door in a wall, a figure slipping through, a watchtower whose eye is closed, and document files streaming out unseen behind it. — An unpatched door, an intruder, and a blinded watchtower: the monitoring that should have caught the theft had been dead for nineteen months.

The unpatched portal. Equifax ran an online dispute portal (a public-facing web application) on a version of Apache Struts that was still vulnerable to CVE-2017-5638. Despite the advisory and the DHS notification, the patch was not applied to this system.
Remote code execution. In mid-May 2017, attackers exploited the Struts flaw to run code on the dispute-portal server, giving them a foothold inside Equifax's environment.
Credentials and lateral movement. On that server and nearby, the attackers found credentials stored in plaintext. They used them to move laterally across a network that was not well segmented, reaching far beyond the single compromised application into around fifty databases.
Months of quiet exfiltration. Over roughly ten weeks, the attackers ran thousands of queries and exfiltrated the data in small, encrypted chunks to blend in with normal traffic. They were able to do this for months because the monitoring that should have flagged it was not working.

The detail that defines the case: a blind sensor

The single most instructive failure is not the unpatched server, common as that is. It is why the breach went undetected for so long.

Equifax inspected the encrypted traffic leaving its network using a device that needed a valid digital certificate to decrypt and examine that traffic. That certificate had expired about nineteen months earlier and had not been renewed. For nineteen months, the system that was supposed to be watching outbound traffic for exactly this kind of data theft was passing it through without looking, because it could not decrypt what it was meant to inspect.

The breach was discovered on 29 July 2017, and the way it was discovered is the lesson in one sentence: Equifax renewed the expired certificate, the monitoring system started inspecting traffic again, and it immediately surfaced the suspicious activity that had been flowing out unseen. The attack was not stealthy in some sophisticated sense. The alarm had simply been unplugged, and the moment it was plugged back in, it went off.

This is why I treat "is our monitoring actually working?" as a question you have to verify, not assume. A detection control that has silently failed gives you the confidence of coverage with none of the protection, which is the worst of both.

The scale and the aftermath

Figure	What it was
~147 million	People whose personal data was exposed.
Names, SSNs, DOB, addresses	The core data taken, the full kit for identity theft.
~209,000	Credit card numbers also exposed.
At least $575M, up to $700M	The 2019 settlement with the FTC, CFPB, and US states.

The data taken was uniquely damaging because of what Equifax is: a credit bureau holds exactly the identifiers (Social Security numbers, dates of birth) that are used to verify identity, and unlike a password, you cannot change your SSN after it leaks. The regulatory and political response was severe: a settlement of up to around $700 million, the departure of the CEO, CIO, and chief security officer, and a Congressional investigation whose report read as a catalogue of preventable failures. In 2020, the US Department of Justice indicted four members of the Chinese military over the intrusion.

The lessons I take from it

Patch known RCEs immediately, and verify the patch reached every system. The vulnerability had a patch and a public warning. The failure was operational: Equifax did not get the fix onto the vulnerable dispute portal. A publicly-known remote code execution flaw on an internet-facing system is an emergency, and "we issued the patch instruction" is not the same as "every affected system is patched." You need an inventory accurate enough to know what you run and confirmation that the fix actually landed.

Verify that detection is alive. The expired certificate is the detail to carry into your own environment. Monitoring, logging, and inspection controls fail silently, and a dead sensor looks exactly like a quiet network. Test your detection: confirm certificates are current, confirm logs are flowing, confirm the alerts fire. A control you have not verified recently is a control you are only assuming you have.

Segment the network and never store plaintext credentials. The RCE compromised one application. What turned that into 147 million records was a flat network and plaintext credentials that let the attackers roam. Segmentation and proper secrets management would have contained the foothold. This is the same lesson as the Heartland breach: the entry bug is rarely the whole story, and the blast radius is decided by what the attacker can reach next.

RCE is the worst case, so treat its inputs accordingly. Remote code execution collapses the gap between "a bug in one app" and "the attacker runs the company's servers." Any framework or library that parses untrusted input is a candidate, and keeping that dependency patched is not optional maintenance, it is front-line security.

A remote code execution flaw in Apache Struts, CVE-2017-5638, in how the framework's Jakarta Multipart parser handled the Content-Type HTTP header. A crafted header containing an OGNL expression caused the server to execute attacker-controlled code. Equifax ran an unpatched, internet-facing dispute portal on a vulnerable Struts version, even though a patch had been available for over two months and DHS had warned organisations to apply it.

Because a digital certificate on Equifax's traffic-monitoring device had been expired for about nineteen months. The device needed that certificate to decrypt and inspect outbound encrypted traffic, so for nineteen months it passed traffic through without looking. The attackers exfiltrated data unseen for roughly ten weeks. When Equifax finally renewed the certificate on 29 July 2017, the monitoring immediately surfaced the suspicious activity, which is how the breach was discovered.

About 147 million people. The exposed data included names, Social Security numbers, dates of birth, and addresses, plus around 209,000 credit card numbers. The data was especially damaging because a credit bureau holds the exact identifiers used to verify identity, and a Social Security number, unlike a password, cannot be changed after it leaks. Equifax later agreed to a settlement of up to roughly $700 million.

By applying the available Apache Struts patch to the vulnerable dispute portal promptly (the patch and a DHS warning both predated the attack), and by keeping the traffic-monitoring certificate current so the data theft would have been detected immediately. Network segmentation and not storing plaintext credentials would also have contained the attacker's lateral movement, limiting a single-server compromise from becoming a 147-million-record breach.

Where to go next

For the mechanics of the bug class, the remote code execution deep dive covers how untrusted input becomes code execution and how to defend against it, with server-side template injection as the closest cousin of the OGNL flaw here. The other RCE case study in this cluster is Log4Shell, where a vulnerability in a logging library produced mass exploitation. For the full set of web attack classes, see the web application security vulnerabilities taxonomy.

The Equifax Breach: An Unpatched Bug and a Blind Sensor

The vulnerability: CVE-2017-5638

The attack chain

The detail that defines the case: a blind sensor

The scale and the aftermath

The lessons I take from it

Where to go next

Sources

Ishan Karunaratne

Related posts

The 2022 Uber Breach: MFA Fatigue and a Hardcoded Password

The Capital One Breach: SSRF and the Cloud Metadata Service

The interview question that cost me the job, and what I'd ask now

What vulnerability caused the Equifax breach?

Why did the Equifax breach go undetected for so long?

How many people were affected by the Equifax breach?

How could the Equifax breach have been prevented?

Sources

Ishan Karunaratne