Remove AWS Remove Exercise Remove Infrastructure Remove Storage
article thumbnail

Failure Modes and Continuous Resilience

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. This is why most AWS regions have three availability zones. The third team is the infrastructure platform team, who deal with datacenter and cloud based resources. If something fails, there should be another way for the system to succeed.

Latency 52
article thumbnail

Failure Modes and Continuous Resilience

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. This is why most AWS regions have three availability zones. The third team is the infrastructure platform team, who deal with datacenter and cloud based resources. If something fails, there should be another way for the system to succeed.

Latency 53
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

About two years ago, we, at our newly formed Machine Learning Infrastructure team started asking our data scientists a question: “What is the hardest thing for you as a data scientist at Netflix?” Our job as a Machine Learning Infrastructure team would therefore not be mainly about enabling new technical feats.

article thumbnail

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

About two years ago, we, at our newly formed Machine Learning Infrastructure team started asking our data scientists a question: “What is the hardest thing for you as a data scientist at Netflix?” Our job as a Machine Learning Infrastructure team would therefore not be mainly about enabling new technical feats.

article thumbnail

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

Our straining database infrastructure on Oracle led us to evaluate if we could develop a purpose-built database that would support our business needs for the long term. As we began growing the AWS business, we realized that external customers might find our Dynamo database just as useful as we found it within Amazon.com.

Internet 128
article thumbnail

Amazon EC2 Cluster GPU Instances - All Things Distributed

All Things Distributed

The power of worlds most advanced GPUs is now available for everyone to use without any up-front investment, removing the risks and uncertainties that owning your own GPU infrastructure would involve. Configuring kernel execution is not a trivial exercise and requires GPU device specific knowledge. Countdown to What is Next in AWS.

AWS 136