Architecture, Big Data and Network - Technology Performance Pulse

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

In this blog post, we explain what Greenplum is, and break down the Greenplum architecture, advantages, major use cases, and how to get started. Its architecture was specially designed to manage large-scale data warehouses and business intelligence workloads by giving you the ability to spread your data out across a multitude of servers.

Big Data

Big Data Database Artificial Intelligence Open Source

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

By Alok Tiagi, Hariharan Ananthakrishnan, Ivan Porto Carrero and Keerti Lakshminarayan. Netflix has developed a network observability sidecar called Flow Exporter that uses eBPF tracepoints to capture TCP flows in near real time. Without having network visibility, it’s difficult to improve our reliability, security and capacity posture.

Network

Network Transportation AWS Cloud

In-Stream Big Data Processing

Highly Scalable

AUGUST 20, 2013

The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the Big Data community quite a long time ago. In the previous section, we noted that many distributed query processing algorithms resemble message passing networks. Towards Unified Big Data Processing. Pipelining.

Big Data

Big Data Processing Lambda Database

What is IT operations analytics? Extract more data insights from more sources

Dynatrace

MAY 1, 2023

IT operations analytics is the process of unifying, storing, and contextually analyzing operational data to understand the health of applications, infrastructure, and environments and streamline everyday operations. ITOA collects operational data to identify patterns and anomalies for faster incident management and near-real-time insights.

Analytics

Analytics Artificial Intelligence Big Data Open Source

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

Their design emphasizes increasing availability by spreading out files among different nodes or servers — this approach significantly reduces risks associated with losing or corrupting data due to node failure. This strategy reduces the volume needed during retrieval operations.

Storage

Storage Systems Big Data Azure

What is software automation? Optimize the software lifecycle with intelligent automation

Dynatrace

JUNE 26, 2023

Software analytics offers the ability to gain and share insights from data emitted by software systems and related operational processes to develop higher-quality software faster while operating it efficiently and securely. This involves big data analytics and applying advanced AI and machine learning techniques, such as causal AI.

Software

Software Software Analytics Big Data

Kubernetes for Big Data Workloads

Abhishek Tiwari

DECEMBER 27, 2017

Kubernetes has emerged as the go-to container orchestration platform for data engineering teams. In 2018, widespread adoption of Kubernetes for big data processing is anticipated. Organisations are already using Kubernetes for a variety of workloads [1] [2] and data workloads are up next. Key challenges.

Big Data

Big Data Storage Benchmarking Hardware

Mastering Hybrid Cloud Strategy

Scalegrid

MARCH 14, 2024

Defining Hybrid Cloud Strategy The decision-making process about where to situate data and applications is vital to any hybrid cloud solution.

Strategy

Strategy Cloud Artificial Intelligence Infrastructure

What is cloud monitoring? How to improve your full-stack visibility

Dynatrace

JANUARY 11, 2023

As cloud and big data complexity scales beyond the ability of traditional monitoring tools to handle, next-generation cloud monitoring and observability are becoming necessities for IT teams. With agent monitoring, third-party software collects data and reports from the component that’s attached to the agent.

Cloud

Cloud Monitoring Best Practices Infrastructure

What is ITOps? Why IT operations is more crucial than ever in a multicloud world

Dynatrace

DECEMBER 15, 2022

Besides the traditional system hardware, storage, routers, and software, ITOps also includes virtual components of the network and cloud infrastructure. A network administrator sets up a network, manages virtual private networks (VPNs), creates and authorizes user profiles, allows secure access, and identifies and solves network issues.

Artificial Intelligence

Artificial Intelligence DevOps Hardware Virtualization

Any analysis, any time: Dynatrace Log Management and Analytics powered by Grail

Dynatrace

OCTOBER 4, 2022

Limited data availability constrains value creation. Modern IT environments — whether multicloud, on-premises, or hybrid-cloud architectures — generate exponentially increasing data volumes. The number and variety of applications, network devices, serverless functions, and ephemeral containers grows continuously.

Analytics

Analytics Artificial Intelligence Storage Serverless

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. Memcached shines in scenarios where a simple, fast, and efficient caching solution is required without data persistence.

Cache

Cache Storage Scalability Architecture

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Dynatrace

JUNE 29, 2022

To drive better outcomes using hybrid cloud architectures, it helps to understand their benefits—and how to orchestrate them seamlessly. What is hybrid cloud architecture? Hybrid cloud architecture is a computing environment that shares data and applications on a combination of public clouds and on-premises private clouds.

Infrastructure

Infrastructure Cloud Azure AWS

What is container orchestration?

Dynatrace

MARCH 24, 2023

But managing the deployment, modification, networking, and scaling of multiple containers can quickly outstrip the capabilities of development and operations teams. This orchestration includes provisioning, scheduling, networking, ensuring availability, and monitoring container lifecycles. How does container orchestration work?

Infrastructure

Infrastructure Open Source Operating System Cloud

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

The Morning Paper

MAY 14, 2019

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices, Gan et al., ASPLOS’19. Using network queue depths alone is enough to signal a large fraction of QoS violations, although smaller than when the full instrumentation is available. Distributed tracing and instrumentation.

Big Data

Big Data Cloud Performance Hardware

Tackling the Pipeline Problem in the Architecture Research Community

ACM Sigarch

APRIL 8, 2019

Computer architecture is an important and exciting field of computer science, which enables many other fields (e.g., big-data processing, machine learning, quantum computing, and so on). For those of us who pursued computer architecture as a career, this is well understood. Why is that? Should we be alarmed as a community?

Architecture

Architecture Open Source Hardware Software Engineering

The Need for Real-Time Device Tracking

ScaleOut Software

JULY 19, 2021

Today’s streaming analytics architectures are not equipped to make sense of this rapidly changing information and react to it as it arrives. This data is also periodically uploaded to a data lake for offline batch analysis that calculates key statistics and looks for big trends that can help optimize operations.

IoT

IoT Analytics Big Data Architecture

No need to compromise visibility in public clouds with the new Azure services supported by Dynatrace

Dynatrace

JULY 6, 2020

Azure Virtual Network Gateways. Our customers have frequently requested support for this first new batch of services, which cover databases, big data, networks, and computing. See the health of your big data resources at a glance. Azure DB for PostgreSQL. Azure SQL Managed Instance. Azure HDInsight.

Azure

Azure Cloud Big Data Virtualization

Applying real-world AIOps use cases to your operations

Dynatrace

OCTOBER 17, 2022

Artificial intelligence for IT operations, or AIOps, combines big data and machine learning to provide actionable insight for IT teams to shape and automate their operational strategy. It may have third-party calls, such as content delivery networks, or more complex requests to a back end or microservice-based application.

DevOps

DevOps Artificial Intelligence Healthcare Innovation

Use Digital Twins for the Next Generation in Telematics

ScaleOut Software

NOVEMBER 24, 2020

However, telematics architectures face challenges in responding to telemetry in real time. Current Telematics Architecture. The volume of incoming telemetry challenges current telematics systems to keep up and quickly make sense of all the data. Challenges for Current Architectures. Solution: Real-Time Digital Twins.

Analytics

Analytics Architecture Scalability Software Architecture

Fast key-value stores: an idea whose time has come and gone

The Morning Paper

JUNE 23, 2019

The stateless + RInk (S+RInK) architecture attempts to provide the best of both worlds: to simultaneously offer both the implementation and operational simplicity of stateless application servers and the performance benefits of servers caching state in RAM. (We’ve seen similar high marshalling overheads in big data systems too.)

Cache

Cache Latency Google Lambda

Hyper Scale VPC Flow Logs enrichment to provide Network Insight

The Netflix TechBlog

MAY 26, 2020

Without having network visibility, it’s not possible to improve our reliability, security and capacity posture. Network Availability: The expected continued growth of our ecosystem makes it difficult to understand our network bottlenecks and potential limits we may be reaching.

Network

Network Tuning AWS Big Data

Web Performance Bookshelf

Rigor

JANUARY 13, 2020

Take, for example, The Web Almanac, the golden collection of Big Data combined with the collective intelligence from most of the authors listed below, brilliantly spearheaded by Google’s @rick_viscomi. High Performance Browser Networking. Information Architecture. ” – Andy King, 2003. Time is Money.

Performance

Performance Social Media Website Website Performance

The AWS Pop-up Loft opens in New York City

All Things Distributed

MAY 27, 2015

The goal of opening the loft was to give developers an opportunity to get in-person support and education on AWS, to network, get some work done, or just hang out with peers. Bring your questions about AWS architecture, cost optimization, services and features, and anything else AWS related. And don’t be shy — walk-ins are welcome too.

AWS

AWS Education Big Data Games

Expanding the AWS Cloud: Introducing the AWS Canada (Central) Region

All Things Distributed

DECEMBER 8, 2016

It adopted Amazon Redshift, Amazon EMR and AWS Lambda to power its data warehouse, big data, and data science applications, supporting the development of product features at a fraction of the cost of competing solutions. Kik Interactive is a Canadian chat platform with hundreds of millions of users around the globe.

AWS

AWS Cloud Lambda Innovation

Expanding the Cloud: Introducing the AWS Asia Pacific (Mumbai) Region

All Things Distributed

JUNE 26, 2016

AdiMap uses Amazon Kinesis to process real-time streaming online ad data and job feeds, and processes them for storage in petabyte-scale Amazon Redshift warehouses to glean business insights for jobs, ad spend, or financials for mobile apps. Advanced problem solving that connects big data with machine learning.

AWS

AWS Cloud Healthcare Blockchain

AWS Pop-up Loft 2.0: Returning to San Francisco on October 1st

All Things Distributed

SEPTEMBER 26, 2014

Be sure to bring your questions about AWS architecture, cost optimization, services and features, and anything else AWS-related. Topics include Introduction to AWS, Big Data, Compute & Networking, Architecture, Mobile & Gaming, Databases, Operations, Security, and more. AWS Technical Bootcamps.

AWS

AWS Games Education Innovation

Register for AWS re:Invent - All Things Distributed

All Things Distributed

JULY 16, 2012

There are sessions in many different categories: Architecture, Big Data, HPC, Computer & Networking, Storage, Databases, Security, Tools & Languages, Media Sharing & Content Delivery, Managing AWS Resources, Enterprise IT, Mobile, Start-up, and more.

AWS

AWS Big Data Media Storage

The Winds of Architecture Changes at the USENIX ATC 2019

ACM Sigarch

NOVEMBER 1, 2019

This blog post gives a glimpse of the computer systems research papers presented at the USENIX Annual Technical Conference (ATC) 2019, with an emphasis on systems that use new hardware architectures. As a consequence, the vast majority of the papers in the past have usually focused on conventional x86 or GPU-accelerated architectures.

Architecture

Architecture Hardware Cache Storage

Dutch Enterprises and The Cloud

All Things Distributed

SEPTEMBER 6, 2013

Shell leverages AWS for big data analytics to help achieve these goals. It makes use of the Eagle Genomics platform running on AWS; as a result, Unilever’s digital data program now processes genetic sequences twenty times faster without incurring higher compute costs.

Cloud

Cloud Energy AWS Healthcare

USENIX LISA 2018: CFP Now Open

Brendan Gregg

APRIL 30, 2018

Apart from networking, attending conferences like LISA in person is an effective way to upgrade your skills: you can block out work interruptions and absorb new knowledge that's been neatly summarized into sessions. We first met each other at LISA, in addition to making many other important industry connections over the years.

DevOps

DevOps Network Best Practices Programming

Rethinking the 'production' of data

All Things Distributed

DECEMBER 20, 2017

Marketers use big data and artificial intelligence to find out more about the future needs of their customers. The main goal in all this is to have the possibility to quickly iterate experiments through the widest range of architectures, combine services with each other, and compare approaches.

Artificial Intelligence

Artificial Intelligence Social Media Logistics AWS

Expanding the Cloud - Cluster Compute Instances for Amazon EC2.

All Things Distributed

JULY 13, 2010

Customers with complex computational workloads such as tightly coupled, parallel processes, or with applications that are very sensitive to network performance, can now achieve the same high compute and networking performance provided by custom-built infrastructure while benefiting from the elasticity, flexibility and cost advantages of Amazon EC2.

Cloud

Cloud AWS Automotive Latency

Expanding the Cloud with DNS - Introducing Amazon Route 53 - All.

All Things Distributed

DECEMBER 5, 2010

The naming system that we are all most familiar with on the internet is the Domain Name System (DNS), which manages the naming of the many different entities in our global network; its most common use is to map a name to an IP address, but it also provides facilities for aliases, finding mail servers, managing security keys, and much more.

Cloud

Cloud Internet Internet AWS

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits. This article will list some of the use cases of AutoOptimize, discuss the design principles that help enhance efficiency, and present the high-level architecture.

Storage

Storage Latency Efficiency Data Engineering

5 data integration trends that will define the future of ETL in 2018

Abhishek Tiwari

DECEMBER 27, 2017

A common theme across all these trends is to remove the complexity by simplifying data management as a whole. In 2018, we anticipate that ETL will either lose relevance or the ETL process will disintegrate and be consumed by new data architectures. Unified data management architecture.

Big Data

Big Data Artificial Intelligence Storage Hardware

The Amazon.com 2010 Shareholder Letter Focusses on Technology.

All Things Distributed

APRIL 27, 2011

Look inside a current textbook on software architecture, and you'll find few patterns that we don't apply at Amazon. Our technologies are almost exclusively implemented as services: bits of logic that encapsulate the data they operate on and provide hardened interfaces as the only way to access their functionality.

Technology

Technology Technology AWS Storage

Amazon EC2 Cluster GPU Instances - All Things Distributed

All Things Distributed

NOVEMBER 15, 2010

Building general purpose architectures has always been hard; there are often so many conflicting requirements that you cannot derive an architecture that will serve all, so we have often ended up focusing on one side of the requirements that allow you to serve that area really well. From CPU to GPU.

AWS

AWS Latency Programming Architecture
