Recovering from the global tech outage could be a long, arduous process | CNN Business (2024)

Recovering from the global tech outage could be a long, arduous process | CNN Business (1)

Travelers wait in Terminal 1 for check-in at Hamburg Airport, in Hamburg, Germany, Friday July 19, 2024. A widespread Microsoft outage disrupted flights, banks, media outlets and companies around the world on Friday.

CNN

The company that caused a massive computer outage across the world says a flawed update has been rolled back – but that doesn’t necessarily help the thousands of businesses that have been affected by the glitch.

The CrowdStrike software issue at the heart of the outage runs at such a deep level in affected computers and systems that getting them up and running just to be fixed will be, in many cases, an enormous challenge.

That’s compounded by the fact that many of the servers that may contain information needed to get these systems working again are themselves caught in a cycle of crashing and rebooting.

And some affected computers might not even be easily accessible, set up in remote locations and intended to run without human intervention.

“I don’t think it’s too early to call it: this will be the largest IT outage in history,” said security expert Troy Hunt in a post on X.

The CrowdStrike software at fault operates at what’s called the kernel level of a computer, a much deeper level than what more ordinary applications such as browsers or video games do. This portion of a device has much greater visibility and control over a computer and its components, making it critical for the operation of all other systems — and far more sensitive.

Running at the kernel level means CrowdStrike’s software can do more to detect cyberattacks, but it also means the current bug is causing Windows computers to crash to a Blue Screen of Death before users can take any actions to correct it.

The issue appears to be recoverable, CrowdStrike has said, but in many cases it requires painstaking work: Each affected device must be accessed by an administrator and manually rebooted into safe mode. Then, the offending CrowdStrike file must be deleted by hand.

For businesses with hundreds or thousands of laptops, desktops and servers running CrowdStrike’s security software, an individual human may have to perform that process over and over and over again.

“You can’t automate that,” said Kevin Beaumont, a security researcher and former Microsoft threat analyst, in a post on X. “So this is going to be incredibly painful for CrowdStrike customers.”

On Friday, a Microsoft status page reported that some Windows Virtual Machine users have successfully recovered from the issue by repeatedly rebooting, in some situations up to 15 times in a row.

“We have received feedback from customers that several reboots (as many as 15 have been reported) may be required, but overall feedback is that reboots are an effective troubleshooting step at this stage,” Microsoft said on the page. The company did not speculate as to why the technique appears to work.

Affected organizations can also try to restore their machines to an earlier state by reverting to a previous system backup, Microsoft added, though it acknowledged that may not be possible in all cases.

“Companies that haven’t invested in rapid backup solutions are stuck in a catch-22,” said Eric O’Neill, a cybersecurity expert and former FBI counterintelligence official.

It gets worse.

Organizations that take security seriously will have likely encrypted their computers’ hard drives, making it even more challenging to access the file that needs to be deleted.

For those organizations, “you need to manually decrypt the disk with a BitLocker Recovery Key, which is probably — for most companies — stored digitally on one of the servers that is currently booting over and over,” said Ira Bailey, a security researcher, in a post on BlueSky.

Every affected computer that is BitLocker-encrypted will need to be unlocked with a recovery key before organizations can begin the process of deleting the bad CrowdStrike file and restoring normal operation, said the cybersecurity expert who goes by the pseudonymous handle SwiftOnSecurity in a post on X.

Recovery will be enormously expensive for Fortune 500 companies with large teams of IT staff and likely even more challenging for smaller firms, Kenn White, an independent security researcher who specializes in network security, told CNN.

“If you don’t have physical staff that can actually touch it, this is going to take many, many days for much of corporate America to recover from,” White said. “It’s just a ton of labor-intensive manual work.”

“It’s a fairly complicated procedure for non-technical people,” White added, “and even a lot of skilled IT professionals will find it difficult to do this at the scale that’s going to be required given the number of machines that are affected.”

How did the CrowdStrike bug lead to such widespread effects?

Because CrowdStrike’s security software is running on countless individual computers all around the globe, the update that got pushed to those devices caused them all to shut down, virtually simultaneously.

And in today’s networked economy, an outage in one part of a supply chain can cause domino effects up and down the line. When multiple parts of a supply chain go down, it touches off a cascade of problems.

Imagine a person trying to buy a coffee, said Andrew Peck, a cybersecurity expert at Loughborough University in the UK. What may seem like a simple transaction relies on multiple computers working in tandem, from the coffee shop’s point of sale to the payment processor’s own back-end systems.

“There are a lot of computers in this chain, and usually the larger the business, the larger the chain,” Peck said. “If any one of the computers are down in the chain, the transaction will not complete.”

It could take millions of person-hours of work by corporate IT professionals to fix all the computers that were affected, said O’Neill, the former FBI counterintelligence operative. But, he said, coming up with a firm estimate is difficult because it’s unknown how many computers were affected.

Imagine something like the massive aviation industry, the critical financial services sector or the life-or-death operations of a health care provider, and the scope of the disaster becomes starkly clear.

With many people now working from home, he said, IT professionals can’t just go desk-to-desk to fix different computers. Instead, they’ll have to communicate with individual employees and talk them through the process remotely.

“That magnifies the issue,” he said. “Something that could have been fixed in hours is going to take days.”

Some affected machines may be rarely serviced by people or located in remote areas. Others may not even have monitors or keyboards plugged in, because they don’t regularly require humans to directly interact with them.

The most extreme examples may include weather monitoring sensors or devices in railway signal boxes, Peck said, which could require technicians to physically visit potentially hundreds of thousands of machines to perform the recovery process.

Recovery will cost the world “thousands of hours and millions, potentially billions of dollars,” Peck said, which quickly adds up to “some very exhausted IT support teams burning budget they didn’t have.”

What is Microsoft’s role in all this?

A separate issue earlier, on Thursday, did lead to significant impacts on many of Microsoft’s own cloud customers, but it was resolved overnight and was unrelated to the CrowdStrike issue, Microsoft and multiple cybersecurity experts told CNN.

The CrowdStrike bug may have initially been conflated with the Microsoft issue because CrowdStrike’s error affected only Windows machines.

“Both are Microsoft-related, but Microsoft had nothing to do with the second incident,” White told CNN.

That appears to be supported by Microsoft’s own status account on X, which on Thursday announced an issue affecting “Microsoft 365 apps and services” and a separate announcement Friday addressing the CrowdStrike outage. The two issues are being tracked using different reference numbers.

As of Friday morning, Microsoft said the issue with Microsoft 365 had been resolved and that the situation was improving.

“The ongoing CrowdStrike issue is unrelated to a previous outage in the Central US Azure region on July 18, impacting Azure customers using that region as well as some Microsoft 365 services,” Microsoft said.

Microsoft CEO Satya Nadella acknowledged the CrowdStrike issue in a post on X Friday morning, saying Microsoft is “working closely with CrowdStrike and across the industry to provide customers technical guidance and support to safely bring their systems back online.”

Since the update to CrowdStrike’s software was delivered by the company’s own systems, it appears unlikely that Microsoft bears direct responsibility for Friday’s outages, said Beaumont, who said he reviewed a copy of CrowdStrike’s flawed update.

The problem with CrowdStrike’s update was that it wasn’t formatted correctly “and causes Windows to crash every time,” Beaumont posted on X.

CNN’s Olesya Dmitracova and Chris Isidore contributed reporting.

This story has been updated with additional context and developments.

Recovering from the global tech outage could be a long, arduous process | CNN Business (2024)

FAQs

Has the CrowdStrike outage been resolved? ›

CrowdStrike itself was able to identify and deploy a fix for the issue in 79 minutes. While CrowdStrike quickly identified and deployed a fix for the issue, the recovery process for businesses is complex and time-consuming.

What was the cause of the global tech outage? ›

When Windows devices using CrowdStrike's cybersecurity tools tried to access the flawed file, it caused an “out-of-bounds memory read” that “could not be gracefully handled, resulting in a Windows operating system crash,” CrowdStrike said.

Is global tech outage fixed? ›

The global tech outage that forced local airlines, hospitals and banks to a standstill on Friday was caused by a faulty update from a cybersecurity company delivered to Microsoft Windows customers.

What businesses are affected by Microsoft outage? ›

Synopsis. The global outage due to a technical issue around Microsoft's applications and services has largely affected businesses and institutions in different countries. Flight operations, banks, retailers and IT firms were at the receiving end of this IT outage, that has left people in panic.

Why is CrowdStrike falling? ›

While investors were sleeping, CrowdStrike released a defective update to its software that caused Microsoft-based IT systems to go down. The outage was perhaps most notable for airlines, which were forced to cancel and delay flights.

What caused the CrowdStrike failure? ›

Bottom line: An issue with the software that checks CrowdStrike updates are working properly meant that the firm missed a problem with a content file. The update was pushed out to Windows systems, causing them to crash.

How to fix global outage? ›

How to manually fix your affected computer
  1. Boot Windows into safe mode or the Windows Recovery Environment.
  2. Navigate to the C:\Windows\System32\drivers\CrowdStrike directory.
  3. Locate the file matching “C-00000291*. sys” and delete it.
  4. Boot the machine normally.
Jul 19, 2024

Which banks are affected by Microsoft outage? ›

Which banks were affected by the Microsoft outage? What we know.
  • TD Bank.
  • Charles Schwab.
  • VISA.
  • Bank of America.
  • Chase.
  • Capital One.
  • Arvest Bank.
  • Wells Fargo.
Jul 19, 2024

What is the massive global tech outage? ›

Key Background. CrowdStrike, a widely used cybersecurity firm, took responsibility for the massive global outage on Friday, a disruption that shut down systems on Microsoft 365 apps and services and brought users on Windows systems to the dreaded so-called blue screen of death.

Is Microsoft at fault for the CrowdStrike outage? ›

Microsoft, Not CrowdStrike, to Blame For Global IT Outage

Guy, CEO of Sevco Security, an asset intelligence provider, blames Microsoft, not CrowdStrike, for the global IT outage. “Yes, CrowdStrike pushed a kernel-level update that causes widespread blue screens,” he said.

Is the Microsoft CrowdStrike fixed? ›

CrowdStrike's faulty update took down these machines, but the company has fixed the issue and has been working with businesses to get them back online. Microsoft has also released a bootable USB drive tool that can help restore crashed systems.

How much will CrowdStrike outage cost? ›

What's been described as the largest IT outage in history will cost Fortune 500 companies alone more than $5 billion in direct losses, according to one insurer's analysis of the incident published Wednesday.

Is Microsoft Global outage resolved? ›

A global outage affecting Microsoft products including email service Outlook and video game Minecraft has been resolved, the technology giant said in an update. The firm said preliminary investigations show the outage was caused by a cyber-attack and a failure to properly defend against it.

What is causing the Global IT outage? ›

The disruption was caused by a flawed update to a cloud-based security software of CrowdStrike, one of the global top cybersecurity companies.

Will Microsoft outage affect grocery stores? ›

The global Microsoft outage impacted point-of-sales systems at U.S. stores. “Store Closed” signs were posted on a Key Foods grocery store on Flatbush Avenue in Brooklyn, The New York Times reported. McDonald's suffered cash register malfunctions at locations in Japan, according to its website.

Has the CrowdStrike issue been fixed? ›

This is not a security incident or cyber attack. “The issue has been identified, isolated and a fix has been deployed. We refer customers to the support portal for the latest updates and will continue to provide complete and continuous updates on our website.

Is CrowdStrike down today? ›

Current CrowdStrike status is UP.

Is CrowdStrike back up and running? ›

We are working closely with impacted customers and partners to ensure that all systems are restored, so you can deliver the services your customers rely on. CrowdStrike is operating normally, and this issue does not affect our Falcon platform systems.

What is the current status of CrowdStrike? ›

User reports indicate no current problems at CrowdStrike.

Top Articles
Latest Posts
Recommended Articles
Article information

Author: Twana Towne Ret

Last Updated:

Views: 5847

Rating: 4.3 / 5 (64 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Twana Towne Ret

Birthday: 1994-03-19

Address: Apt. 990 97439 Corwin Motorway, Port Eliseoburgh, NM 99144-2618

Phone: +5958753152963

Job: National Specialist

Hobby: Kayaking, Photography, Skydiving, Embroidery, Leather crafting, Orienteering, Cooking

Introduction: My name is Twana Towne Ret, I am a famous, talented, joyous, perfect, powerful, inquisitive, lovely person who loves writing and wants to share my knowledge and understanding with you.