This page is an excerpt and archive of the corresponding post on MITBBS (未名空间).
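PDA board - Outages at Amazon's and Google's cloud services are a wake-up call for digital life and may signal bigger risks. A single latent bug can set off a chain of incidents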
B********f
Posts: 142
1
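Outages at Amazon's and Google's cloud services are a wake-up call for digital life
and may signal bigger risks. A single latent bug can set off a chain of incidents.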
Amazon, Google Cloud Outages Highlight Bigger Risk
Just when you thought the list of possible gotchas causing cloud outages
could not get much longer, a post-mortem of the recent Amazon outage that
took out Reddit and Heroku, among others, fingered a memory bug. The finding
follows a Google outage on Wednesday that locked folks out of some of its
most popular consumer services briefly.
The Amazon Web Services team notes in a blog post:
We’d like to share more about the service event that occurred on Monday,
October 22nd in the US-East Region. We have now completed the analysis of
the events that affected AWS customers, and we want to describe what
happened, our understanding of how customers were affected, and what we are
doing to prevent a similar issue from occurring in the future.
The short of the Amazon matter:
At 10:00 AM PDT Monday, a small number of Amazon Elastic Block Store (EBS)
volumes in one of our five Availability Zones in the US-East Region began
seeing degraded performance, and in some cases, became “stuck” (i.e.
unable to process further I/O requests). The root cause of the problem was a
latent bug in an operational data collection agent that runs on the EBS
storage servers.
And the long of it:
We apologize for the inconvenience and trouble this caused for affected
customers. We know how critical our services are to our customers’
businesses, and will work hard (and expeditiously) to apply the learning
from this event to our services. While we saw that some of the changes that
we previously made helped us mitigate some of the impact, we also learned
about new failure modes. We will spend many hours over the coming days and
weeks improving our understanding of the event and further investing in the
resiliency of our services.
Cloud outages are not new, but Amazon, being the 800-pounder in the room, has
been the whipping boy. With Google recently joining it as an outage punching
bag, a bigger issue has emerged: cloud platform outages
translate into actual downtime and can affect multiple tools and services.
And this week, Google's Gmail, Drive and Reader were down for a while,
sending panic signals out across the Web.
Wired Enterprise reports:
At about 10:47 p.m. British time on Wednesday, Paul O’Brien couldn’t
reach Google. At all.
“Strange,” he said, with a post to Twitter. “My phone just completely
lost connectivity to all Google services. Anyone else?”
The response was immediate. “Same here in Mexico,” said someone who
calls himself orb3000, who tells us he does work at Veracruz State
University. “All google services are out…”
Here at the Wired newsroom in San Francisco, we saw much the same thing.
“Gmail, Drive, Reader…everything is down for me,” wrote one reporter on
our communal chat system, and soon countless others were complaining as well.
“It’s a good thing we’re not beholden to google or anything for our
digital lives,” said one particularly sarcastic type.
Six minutes later and Google was back, writes Wired Enterprise’s Cade Metz.
During those six minutes, about 10 percent of people trying to reach a
long list of Google services were unable to do so, according to a statement
from the company. “We apologize to everyone affected and have worked hard
to get our services back to normal as quickly as possible,” Google said.
Google declined to discuss the matter further. But this massive outage
— however brief — shows how tenuous our “digital lives” can be. And how
much we’re dependent on Google in particular. Google has gone to extreme
lengths to minimize outages. But it too is fallible, and clearly, multiple
services can go down in the event of an engineering mistake, technical
malfunction, or natural disaster.
This reality, highlighted by the mainstream media attention these cloud
outages received, is making me think of these cloud platform outages as akin
to RIM’s BlackBerry Enterprise Server outages. Centralized platforms mean
outages are bigger trouble than with decentralized systems.
Have your say in the comments section or forum thread below: Is centralizing
on cloud platforms a risk you can take? Will the cloud platforms be able to
get on top of outages to the point of them being inconsequential? If
outages are the new normal, will moving to private or hybrid clouds give you
a leg up on your rivals?
h*******s
Posts: 8454
2
At this rate of outages, it's still far safer and more reliable than running your own machines, isn't it?
d*******3
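Posts: 6550
3
This was only down for a short while. When your own computer breaks, you have to wrestle with it for ages, and what you lose is time.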
b********7
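Posts: 12906
4
For individual users it may not matter much. But the cloud's real money comes from business customers, and for
them every minute of downtime costs a lot of money.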