Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror
×
IT

Cloudflare Dashboard and APIs Down After Data Center Power Outage (bleepingcomputer.com) 22

An ongoing Cloudflare outage has taken down many of its products, including the company's dashboard and related application programming interfaces (APIs) customers use to manage and read service configurations. From a report: The complete list of services whose functionality is wholly or partially impacted includes the Cloudflare dashboard, the Cloudflare API, Logpush, WARP / Zero Trust device posture, Stream API, Workers API, and the Alert Notification System. "This issue is impacting all services that rely on our API infrastructure including Alerts, Dashboard functionality, Zero Trust, WARP, Cloudflared, Waiting Room, Gateway, Stream, Magic WAN, API Shield, Pages, Workers," Cloudflare said. "Customers using the Dashboard / Cloudflare APIs are impacted as requests might fail and/or errors may be displayed."

Customers currently have issues when attempting to log into their accounts and are seeing 'Code: 10000' authentication errors and internal server errors when trying to access the Cloudflare dashboard. Cloudflare says the service issues don't affect the cached file delivery via the Cloudflare CDN or Cloudflare Edge security features.

This discussion has been archived. No new comments can be posted.

Cloudflare Dashboard and APIs Down After Data Center Power Outage

Comments Filter:
  • by Rosco P. Coltrane ( 209368 ) on Thursday November 02, 2023 @12:24PM (#63974594)

    I still have zero trust in CloudFlare.

    • by PPH ( 736903 )

      I trust them completely. To slap their silly little message on my screen, ensuring that I'm running JavaScript and have not installed a popup blocker.

      Thank goodness I have a plug-in that auto checks the "I am a human" test.

    • by gweihir ( 88907 )

      Nobody sane trusts any modern tech company. They all are far too optimized for profit to deserve any level of trust.

  • by NickDngr ( 561211 ) on Thursday November 02, 2023 @12:27PM (#63974608) Journal
    Dreamhost had a power outage that took down their Portland datacenter for several hours this morning too (so much for their backup power and 100% uptime guarantee). Are they in the same facility?
    • by Ksevio ( 865461 )

      Turns out Cloudflare just rented a shared hosting server from Dreamhost!

    • Re:Dreamhost too (Score:4, Informative)

      by Anonymous Coward on Thursday November 02, 2023 @01:20PM (#63974792)

      Dreamhost had a power outage that took down their Portland datacenter for several hours this morning too (so much for their backup power and 100% uptime guarantee). Are they in the same facility?

      Yes, it is Flexential datacenter PDX02 in Portland.
      Dreamhost lives there too.

      At last update, the datacenter lost utility, switched to generators, and the generators failed (unspecified how) in a way that took out battery backup to the building.

      In the case of Cloudflare, they began a sitewide-live-failover when utility was lost, but when the generators took out the UPS systems, the data center had a complete unclean shutdown much sooner than expected.
      I can only imagine how that would f-up a full site migration in progress!

      Personally I'm a bit surprised Cloudflare is renting datacenter space, instead of having their own.

      • battery backup just needs to last for the time it takes generators to start and to stabilize the power not run the full site for hours.

        • by jbengt ( 874751 )
          Typically, I've seen UPS batteries sized for a theoretical 15 minute runtime, but they've achieved double that or more in practice. This has been enough time for orderly shutdowns without a generator, but I don't know that that would be enough for transferring any unsynced data to another site.
  • by gavron ( 1300111 ) on Thursday November 02, 2023 @12:57PM (#63974706)

    From the article:
    > ...the company revealed that the ongoing issues are due to power outages at multiple data centers.

    Odds of that for a 4-9s datacenter is 0.001% multiplied by N datacenter.

    Company says:
    >Cloudflare is assessing a loss of power impacting data centres while simultaneously failing over services.

    This a multiple-hour multiple-service outage.
    Odds that they're "simultaenously" failing over services is a bit lower than the odds of multiple datacenters losing power.

    >This is the second large outage that has hit Cloudflare since the start of the week

    The odds of such "coincidences" are worse than the odds of winning the grand prize in the Powerball lottery.

    What's more likely is someone pushed a network configuration change outside a maintenance window, with no peer review, and locked themselves out of fixing it, not having used safe-mode or auto-restore procedures.

    • "Datacenter Engineers a aware of ongoing stability issues with power at our Portland 2 facility. Engineers are fully engaged and have vendors on-site working on the facility power. Here is a current status: - This facility has 2 power feeds. One is currently running on utility power and the other on generator. Both feeds are stable and there is fuel on-site to run for at least 24 hours. - All cooling at the facility is operating as designed with no known issue. - The facility has 32 UPS modules in a redunda
      • It's always fantastic when your redundant capacity knocks you offline when there isn't a need for the redundancy to be active to begin with.

        Reminds me of spanning-tree failures when someone plugs a cable into the wrong ports and causes an unintended loop.

    • Interesting observation.

      Going to become even more interesting if somehow, someway a corporate liability policy actually covers the harm done from a 1-in-Powerball power outage mass event. I'd love to know how much they paid the insurance adjuster.

  • ...got much better service and less drama.
  • "No there is no outage. You must be mistaken! Check again."
    *unplugs coffee machine and plugs server back in*
  • Was not the cloud supposed to provide cheapo five 9â(TM)s reliability?
    • Was not the cloud supposed to provide cheapo five 9s reliability?

      Good, Cheap, or Fast. Pick any two.

      Alright, alright, you got me. Pick any one.

Don't tell me how hard you work. Tell me how much you get done. -- James J. Ling

Working...