2024

The State of Observability 2024: Navigating Complexity With AI-Driven Insights

DZone

In today's fast-paced digital landscape, organizations are increasingly embracing multi-cloud environments and cloud-native architectures to drive innovation and deliver seamless customer experiences. However, the 2024 State of Observability report from Dynatrace reveals that the explosion of data generated by these complex ecosystems is pushing traditional monitoring and analytics approaches to their limits.

Analytics 328

The future of work: How to zig, zag, and steer your career in the AI era

Dynatrace

Today’s macroeconomic environment is dynamic and uncertain, generating many questions about the future of work. New technologies are disrupting the landscape, while company mergers, acquisitions, and economic volatility abound. As artificial intelligence becomes more pervasive in organizations, the workforce senses that the future of work is undergoing massive shifts.

Insiders

Introducing SafeTest: A Novel Approach to Front End Testing

The Netflix TechBlog

By Moshe Kolodny. In this post, we’re excited to introduce SafeTest, a revolutionary library that offers a fresh perspective on End-To-End (E2E) tests for web-based User Interface (UI) applications. The Challenges of Traditional UI Testing: Traditionally, UI tests have been conducted through either unit testing or integration testing (also referred to as End-To-End (E2E) testing).

Testing 237

District heating: Using data centers to heat communities

All Things Distributed

An inside look at the Tallaght District Heating Scheme, where Heat Works is using recycled heat from an AWS data center to warm a community in Dublin, Ireland.

AWS 147

eBPF Documentary

Brendan Gregg

eBPF is a crazy technology – like putting JavaScript into the Linux kernel – and getting it accepted had so far been an untold story of strategy and ingenuity. The eBPF documentary, published late last year, tells this story by interviewing key players from 2014 including myself, and touches on new developments including Windows. (If you are new to eBPF, it is the name of a kernel execution engine that runs a variety of new programs in a performant and safe sandbox in the kernel, lik

The psychology of site speed and human happiness

Speed Curve

In the fourteen years that I've been working in the web performance industry, I've done a LOT of research, writing, and speaking about the psychology of page speed – in other words, why we crave fast, seamless online experiences. In fact, the entire first chapter of my book, Time Is Money (reprinted here courtesy of the good folks at O'Reilly), is dedicated to the subject.

Speed 136

Uber Builds Scalable Chat Using Microservices with GraphQL Subscriptions and Kafka

InfoQ

Uber replaced a legacy architecture built using the WAMP protocol with a new solution that takes advantage of GraphQL subscriptions. The main drivers for creating a new architecture were challenges around reliability, scalability, and observability/debuggability, as well as technical debt impeding the team’s ability to maintain the existing solution.

More Trending

Beyond Problem and Solution Space: Better models for modern product development

Strategic Tech

I often encounter the phrases problem space and solution space. People use these words to try and articulate the types of work and activities they are referring to, or where they are in the process of building something new. Unfortunately, rather than aiding communication, I notice that these words are so highly ambiguous that more time is spent debating what they mean than is gained by using them to improve communication and collaboration.

Setting Up Your Environment for Kubernetes Operators Using Docker, kubectl, and k3d

Percona Community

If you are just starting out in the world of Kubernetes operators, like me, preparing the environment for their installation should not be difficult. This blog will quickly guide you through setting up a minimal environment. Kubernetes operators are invaluable for automating complex database operations, tasks that Kubernetes does not handle directly.

Generative AI in the Real World: Chip Huyen on Finding Business Use Cases for Generative AI

O'Reilly

O’Reilly’s Generative AI in the Enterprise survey reported that people have trouble coming up with appropriate enterprise use cases for AI. Why is it hard to come up with appropriate use cases? Chip Huyen, co-founder of Claypot AI and author of Designing Machine Learning Systems, will talk about why many companies have trouble coming up with appropriate use cases for AI, how to evaluate possible use cases, and the skills your company will need to put these use cases into practice.

Essential Techniques for Performance Tuning in Snowflake

DZone

Performance tuning in Snowflake means optimizing the configuration and SQL queries to improve the efficiency and speed of data operations. It involves adjusting various settings and writing queries to reduce execution time and resource consumption, ultimately leading to cost savings and enhanced user satisfaction.

Tuning 286

Easily monitor IBM i with updated Dynatrace extension

Dynatrace

What is IBM i? IBM i, formerly known as iSeries, is an operating system developed by IBM for its line of IBM i Power Systems servers. It is based on the IBM AS/400 system and is known for its reliability, scalability, and security features. IBM i is designed to integrate seamlessly with legacy and modern applications, allowing businesses to run critical workloads and applications.

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

By David J. Berg, Romain Cledat, Kayla Seeley, Shashank Srikanth, Chaoying Wang, and Darin Yu. Netflix uses data science and machine learning across all facets of the company, powering a wide range of business applications from our internal infrastructure and content demand modeling to media understanding. The Machine Learning Platform (MLP) team at Netflix provides an entire ecosystem of tools around Metaflow, an open source machine learning infrastructure framework we started, to empower data sc

Systems 219

What I've been reading since re:Invent

All Things Distributed

After a busy conference season, I've taken some time to catch up on reading and make a dent in the pile of books on my nightstand. Here's what I've started, finished, and picked up since re:Invent.

131

What is System Testing? – Getting Started, Tips, and Tools

Testlodge

System testing involves analyzing the behavior and functionality of a fully integrated application. It is the third of the four levels of testing, performed after unit and integration testing but before user acceptance testing. A QA team member will usually do the assessing, or occasionally the task will fall to other team members such as product or project managers.

Systems 73

Building the future of performance with SpeedCurve

Speed Curve

I’m beyond excited to announce that I’m joining the SpeedCurve team this year! I’ll still be doing some consulting work, but I’ll be taking on fewer clients this year so I can focus on helping to make an already amazing performance tool even better, working alongside some of my favorite people in the performance community.

Uber Improves Resiliency of Microservices with Adaptive Load Shedding

InfoQ

Uber created a new load-shedding library for its microservice platform, which serves over 130 million customers and handles aggregated peaks of millions of requests per second (RPS). The company replaced its QALM-based solution with the Cinnamon library, which, in addition to graceful degradation, can dynamically and continuously adjust the capacity of the service and the amount of load shedding.

Latency 105

In Search of Transparency at FOSDEM

Percona

FOSDEM is surely one of the most respected free and open source software conferences in Europe, and it is loved by many, myself included. It is out of that love that I am raising concerns about transparency, which seems to be lacking in some processes at FOSDEM.

C++ safety, in context

Sutter's Mill

Scope. To talk about C++’s current safety problems and solutions well, I need to include the context of the broad landscape of security and safety threats facing all software. I chair the ISO C++ standards committee and I work for Microsoft, but these are my personal opinions and I hope they will invite more dialog across programming language and security communities.

C++ 131

Master MySQL Point in Time Recovery

Scalegrid

Data loss or corruption can be daunting. With MySQL point-in-time recovery, you can restore your database to the moment before the problem occurred. This article delivers a practical roadmap for using backups and binary logs to achieve accurate MySQL recovery, detailed steps for setting up your server, and tips for managing recovery and backups effectively without overwhelming you with complexity.

Database 162

I Actually Chatted with ChatGPT

O'Reilly

ChatGPT was released just over a year ago (at the end of November 2022), and countless people have already written about their experiences using it in all sorts of settings. (I even contributed my own hot take last year with my O’Reilly Radar article Real-Real-World Programming with ChatGPT.) What more is left to say by now? Well, I bet very few of those people have actually chatted with ChatGPT.

Achieving Kubernetes Monitoring Nirvana: Prometheus and Grafana Unleashed

DZone

In the ever-evolving landscape of container orchestration, Kubernetes has emerged as a frontrunner, offering unparalleled flexibility and scalability. However, with great power comes great responsibility — the responsibility to monitor and understand your Kubernetes clusters effectively. This is where Prometheus and Grafana step in, forming a dynamic duo that provides comprehensive insights into Kubernetes clusters.

The benefits of unified observability and security for BizDevSecOps use cases

Dynatrace

BizDevSecOps might sound like a mouthful, but it marks a necessary evolution. As business goals and technology efforts continue to converge, organizations need to ensure teams are performing to their full potential. Business considerations are now part of the security, operations, and development framework. During a session at Dynatrace Perform 2024, Dynatrace colleagues Kristof Renders, director of innovation services, and Brian Chandler, principal solutions architect, demonstrated four BizD

Evolving from Rule-based Classifier: Machine Learning Powered Auto Remediation in Netflix Data…

The Netflix TechBlog

By Binbing Hou, Stephanie Vezich Tamayo, Xiao Chen, Liang Tian, Troy Ristow, Haoyuan Wang, Snehal Chennuru, and Pawan Dixit. This is the first in a series on our work at Netflix on leveraging data insights and Machine Learning (ML) to improve the operational automation around the performance and cost efficiency of big data jobs.

Tuning 203

The Return of the Frame Pointers

Brendan Gregg

Sometimes debuggers and profilers are obviously broken; sometimes it's subtle and hard to spot. From my flame graphs page: CPU flame graph (partly broken). This is pretty common and usually goes unnoticed as the flame graph looks ok at first glance. But there are 15% of samples on the left, above "[unknown]", that are in the wrong place and missing frames.

Java 145

Home Screen Advantage

Alex Russell

After weeks of confusion and intentional chaos, Apple's plan to kneecap the web has crept into view, menacing a PWApocalypse as the March 6th compliance deadline approaches for the EU's Digital Markets Act (DMA). The view from Cupertino. The DMA requires Apple to open the iPhone to competing app stores, and its lopsided proposal for "enabling" them is getting most of the press.

Storage 100

Hello INP! Here's everything you need to know about the newest Core Web Vital

Speed Curve

After years of development and testing, Google has added Interaction to Next Paint (INP) to its trifecta of Core Web Vitals – the performance metrics that are a key ingredient in its search ranking algorithm. INP replaces First Input Delay (FID) as the Vitals responsiveness metric. Not sure what INP means or why it matters? No worries – that's what this post is for. :) What is INP?

Google 85

How RevenueCat Manages Caching for Handling over 1.2 Billion Daily API Requests

InfoQ

RevenueCat extensively uses caching to improve the availability and performance of its product API while ensuring consistency. The company shared its techniques to deliver the platform, which can handle over 1.2 billion daily API requests. The team at RevenueCat created an open-source memcache client that provides several advanced features.

Cache 97

Percona PG_TDE and Docker: Please Test Transparent Data Encryption for PostgreSQL

Percona

Securing your data can be a tough task. Unlike some other databases, PostgreSQL does not have a native solution to encrypt data on disk transparently. Percona is developing PG_TDE to encrypt that data using the AES-CTR encryption algorithm. This code is in development, and we need your help testing it.

Testing 111

Reporting Core Web Vitals With The Performance API

Smashing Magazine

By Geoff Graham, 2024-02-27. This article is sponsored by DebugBear. There’s quite a buzz in the performance community with the Interaction to Next Paint (INP) metric becoming an official Core Web Vitals (CWV) metric in a few short weeks.

Forming an Architecture Modernization Enabling Team (AMET)

Strategic Tech

This article was co-authored with Eduardo da Silva (also published on his blog). Architecture modernization initiatives are strategic efforts involving many teams, usually for many months or years. They often compete with product/feature development work, resulting in them falling flat and failing to deliver the promised business benefits that triggered them.

Can Language Models Replace Compilers?

O'Reilly

Kevlin Henney and I recently discussed whether automated code generation, using some future version of GitHub Copilot or the like, could ever replace higher-level languages. Specifically, could ChatGPT N (for large N) quit the game of generating code in a high-level language like Python, and produce executable machine code directly, like compilers do today?

Automate Application Load Balancers With AWS Load Balancer Controller and Ingress

DZone

Automating AWS Load Balancers is essential for managing cloud infrastructure efficiently. This article delves into the importance of automation using the AWS Load Balancer controller and Ingress template. Whether you're new or experienced, grasping these configurations is vital to streamlining Load Balancer settings on Amazon Web Services, ensuring a smoother and more effective setup.

AWS 307

Cloud observability now mandatory for organizations to thrive amid digital disruption

Dynatrace

As organizations face an increasingly competitive, dynamic, and disruptive macroeconomic environment, they have turned to cloud services and digitization for an edge. But as they embrace digital transformation in the cloud, organizations often confront significant challenges. Multicloud environments create massive complexity and a lack of visibility into the potential problems that cloud applications create, from application performance problems and user experience issues to cybersecurity threat

Cloud 226

Bending pause times to your will with Generational ZGC

The Netflix TechBlog

The surprising and not so surprising benefits of generations in the Z Garbage Collector. By Danny Thomas, JVM Ecosystem Team. The latest long-term support release of the JDK delivers generational support for the Z Garbage Collector. More than half of our critical streaming video services are now running on JDK 21 with Generational ZGC, so it’s a good time to talk about our experience and the benefits we’ve seen.

Latency 198

Bridging the Observability Gap for Modern Cloud Architectures

DZone

Cloud-native architectures have brought immense complexity along with increased business agility. But with this complexity comes fragility and lack of transparency into system performance and reliability. At Perform 2024, Dynatrace announced three major platform enhancements aimed squarely at bridging this observability gap for engineering teams. According to Steve Tack, SVP of Product Management at Dynatrace, a key goal is to "help organizations adopt new technologies with confidence.

Unlocking Advanced AWS Cost Optimization With Trusted Advisor

DZone

Amazon Web Services (AWS) offers a range of tools to help users manage their resources effectively, ensuring they are secure, well-performing, and cost-optimized. One such tool is AWS Trusted Advisor, an application that inspects your AWS environment and provides real-time recommendations in various categories, including cost optimization. While many AWS customers are familiar with the essential cost-saving tips Trusted Advisor provides, a wealth of more profound insights and advanced strategi

AWS 300