What is Greenplum Database? Intro to the Big Data Database


Greenplum Database is a massively parallel processing (MPP) SQL database that is built and based on PostgreSQL. Greenplum Database is an open-source , hardware-agnostic MPP database for analytics, based on PostgreSQL and developed by Pivotal who was later acquired by VMware.

2019 Database Trends – SQL vs. NoSQL, Top Databases, Single vs. Multiple Database Use


Wondering which databases are trending in 2019? Get the latest insights on MySQL , MongoDB , PostgreSQL , Redis , and many others to see which database management systems are most favored this year. SQL Databases. NoSQL Databases. NoSQL Database Use: 39.52%.

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Database Design and Logical Asseveration for SQL Query Optimization

SQL Shack

Database design and Logical Asseveration play a vital role in database performance and SQL Query optimization. Both have different parameters to make your database and the query accurate. General database design Performance

Millions of tiny databases

The Morning Paper

Millions of tiny databases , Brooker et al., It takes you through the thinking processes and engineering practices behind the design of a key part of the control plane for AWS Elastic Block Storage (EBS): the Physalia database that stores configuration information.

Dynatrace achieves Amazon RDS Service Ready designation


We’re therefore excited to announce that Dynatrace has received the Amazon RDS Service Ready designation. Tasks such as hardware provisioning, database setup, patching, and backups are fully automated, making Amazon RDS cost efficient and scalable. Dynatrace news.

Design 174

2019 PostgreSQL Trends Report: Private vs. Public Cloud, Migrations, Database Combinations & Top Reasons Used


PostgreSQL is an open source relational database system that has soared in popularity over the past 30 years from its active, loyal, and growing community. 2019 PostgreSQL Trends Report: Private vs. Public Cloud, Migrations, Database Combos & Top Reasons Used Click To Tweet.

Towards multiverse databases

The Morning Paper

Towards multiverse databases Marzoev et al., The central idea behind multiverse databases is to push the data access and privacy rules into the database itself. Such a design rules out an entire class of application errors, protecting private data from accidentally leaking.

Design Of A Modern Cache—Part Deux

High Scalability

This structure works surprisingly well for many important workloads like database, search, and analytics. This is a guest post by Benjamin Manes , who did engineery things for Google and is now doing engineery things as CTO of Vector.

Cache 239

Updated Azure SQL Database Tier Options

SQL Performance

Azure SQL Database is Microsoft's database-as-a-service offering that provides a tremendous amount of flexibility. Microsoft is continually working on improving their products and Azure SQL Database is no different. HyperScale Database. Serverless Database.

Azure 56

Amazon Aurora ascendant: How we designed a cloud-native relational database

All Things Distributed

Relational databases have been around for a long time. The core technologies underpinning the major relational database management systems of today were developed in the 1980–1990s. Those fundamentals helped make relational databases immensely popular with users everywhere.

Design 109

Distributed Algorithms in NoSQL Databases

Highly Scalable

These developments gradually highlight a system of relevant database building blocks with proven practical efficiency. In this article I’m trying to provide more or less systematic description of techniques related to distributed operations in NoSQL databases.

2019 Open Source Database Report: Top Databases, Public Cloud vs. On-Premise, Polyglot Persistence


Ready to transition from a commercial database to open source, and want to know which databases are most popular in 2019? Wondering whether an on-premise vs. public cloud vs. hybrid cloud infrastructure is best for your database strategy? Or, considering adding a new database to your application and want to see which combinations are most popular? Polyglot Persistence Trends : Number of Databases Used & Top Combinations. 2019 Top Databases Used.

Weekend Reading: Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases.

All Things Distributed

In many, high-throughput, OLTP style applications the database plays a crucial role to achieve scale, reliability, high-performance and cost efficiency. For a long time, these requirements were almost exclusively served by commercial, proprietary databases.

Microservices, events, and upside-down databases

O'Reilly Software

Data is all-important—vital for the continued success of our businesses—but has also been seen as a massive constraint in how we design and evolve our systems. All too often, the software wasn’t designed in a way that made it easy to ship.

Tuning PostgreSQL Database Parameters to Optimize Performance


It has default settings for all of the database parameters. It is primarily the responsibility of the database administrator or developer to tune PostgreSQL according to their system’s workload. In this blog, we will establish basic guidelines for setting PostgreSQL database parameters to improve database performance according to workload. It is important to pay attention to performance when writing database queries.

Organisational Fluidity in Digital Ecosystem Platform Design: Strategic Alliance Teams

Strategic Tech

If you’re among the thousands of organisations building a platform-powered digital ecosystem, you face serious organisation design challenges. Without an explicit organisation design strategy, you will over-commit resources to low priority initiatives and starve your big bets?—?the

How to design a RESTful API architecture from a human-language spec

O'Reilly Software

Despite being consumed directly by machines, APIs are made to satisfy the needs of human beings, so designing them should follow a user-centered process, but often it doesn’t. Continue reading How to design a RESTful API architecture from a human-language spec

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

All Things Distributed

a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications. Today is a very exciting day as we release Amazon DynamoDB , a fast, highly reliable and cost-effective NoSQL database service designed for internet scale applications. DynamoDB is the result of 15 years of learning in the areas of large scale non-relational databases and cloud services. s business meant that many of these services needed more scalable database solutions.

Using SQL CREATE INDEX to create clustered and non-clustered indexes

SQL Shack

An index in a database is very similar to an index in a book. General database design Indexes PerformanceThe SQL CREATE INDEX statement is used to create clustered as well as non-clustered indexes in SQL Server.

The Best Way to Host MongoDB on DigitalOcean


MongoDB is the #3 open source database and the #1 NoSQL database in the world. It’s a cross-platform document-oriented database that uses JSON-like documents with schema, and is leveraged broadly across startup apps up to enterprise-level businesses developing modern apps. While adequate for low-traffic applications, small databases, and dev/test environments, we recommend against leveraging shared clusters for your MongoDB production deployments.

View-centric performance optimization for database-backed web applications

The Morning Paper

View-centric performance optimization for database-backed web applications Yang et al., Moreover, users like the resulting experience at least as much if not more than the original page designs. you are doing the count in the database, right?).

IPA: invariant-preserving applications for weakly consistent replicated databases

The Morning Paper

IPA: invariant-preserving applications for weakly consistent replicated databases Balegas et al., For bonus points, revisit your application design so that more parts of it can be coordination free.

Which Is the Best PostgreSQL GUI? 2019 Comparison


PostgreSQL graphical user interface (GUI) tools help these open source database users to manage, manipulate, and visualize their data. PostgreSQL is the fourth most popular database management system in the world, and heavily used in all sizes of applications from small to large.

The top 5 reasons to run your own database benchmarks


This post addresses some of the opinions around database benchmarking and gives the top 5 reasons why industry standard benchmarking is important and should be an essential foundation of your database engineering strategy. So are standard database benchmarks meaningless?

Designing Schemaless, Uber Engineering’s Scalable Datastore Using MySQL

Uber Engineering

The making of Schemaless, Uber Engineering’s custom designed datastore using MySQL, which has allowed us to scale from 2014 to beyond. In Project Mezzanine we described … The post Designing Schemaless, Uber Engineering’s Scalable Datastore Using MySQL appeared first on Uber Engineering Blog. Architecture Data Database Infra Mezzanine MySQLBy Jakob Holdgaard Thomsen.

New Metadata-Only Column Changes in SQL Server 2016

SQL Performance

Depending on the type of change and the configuration of the database, an ALTER COLUMN command will need to perform one of the following actions: Change metadata in system tables only. The ALTER TABLE. ALTER COLUMN command is very powerful.

A one size fits all database doesn't fit anyone

All Things Distributed

A common question that I get is why do we offer so many database products? To do this, they need to be able to use multiple databases and data models within the same application. Seldom can one database fit the needs of multiple distinct use cases. The days of the one-size-fits-all monolithic database are behind us, and developers are now building highly distributed applications using a multitude of purpose-built databases. Purpose-built databases.

Clustered Index vs. Heap

SQL Shack

Development General database design Indexes PerformanceSummary There are few topics so widely misunderstood and that generates such frequent bad advice as that of the decision of how to index a table.

Upcoming Webinar Tuesday, 7/31: Using MySQL for Distributed Database Architectures


Please join Percona’s CEO, Peter Zaitsev as he presents Using MySQL for Distributed Database Architectures on Tuesday, July 31st, 2018 at 7:00 AM PDT (UTC-7) / 10:00 AM EDT (UTC-4).

Architecting App Logic for Scalability, Reuse, and Agility


App Logic for transactional systems enforces critical integrity for database web/mobile apps. rest api business rules app dev architecture & design business agility integration architectureArchitect your app logic!

Dynein – An Asynchronous Background Job Service from Airbnb


In a blog post Andy Fang from Airbnb describes the background and challenges designing and building the service. Dynamo DB Key-Value Store Distributed Systems NoSQL Asynchronous Architecture Database Architecture SQS Performance & Scalability Development Architecture & Design DevOps news

Embrace event-driven computing: Amazon expands DynamoDB with streams, cross-region replication, and database triggers

All Things Distributed

I am excited to share with you that today we are expanding DynamoDB with streams, cross-region replication, and database triggers. In traditional database architectures, database engines often run a small search engine or data warehouse engines on the same hardware as the database.

Lambda 126

Job Openings in AWS - Senior Leader in Database Services - All.

All Things Distributed

Job Openings in AWS - Senior Leader in Database Services. This week it is an opening for senior leaders with AWS Database Services. AWS Database Services is responsible for setting the database strategy and delivering distributed structured storage services to our AWS customers. The ideal candidate will be someone who has built and ran large scale distributed systems and/or databases. Job Openings in AWS - Senior Leader in Database Services.

Intro to Redis Cluster Sharding – Advantages, Limitations, Deploying & Client Connections


This is something to be careful about when designing your data structures. Only Supports One Database. Unlike standalone databases, Redis Clusters support only one database (database 0), and the SELECT command isn’t allowed.

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

Andreas Andreakis , Ioannis Papapanagiotou Overview Change-Data-Capture (CDC) allows capturing committed changes from a database in real-time and propagating those changes to downstream consumers [1][2]. In databases like MySQL and PostgreSQL, transaction logs are the source of CDC events.

Consumer-driven Coupling: Patterns and Anti-patterns

Strategic Tech

Perhaps led by strong managers, they will fight vehemently for their work to be prioritised and software systems to be designed to their needs. On the frontend, the user researchers and design team were experimenting how to best lay the questions out for the simplest user experience.

Connecting MongoDB to Ruby with Self-Signed Certificates for SSL


Our yml file: development: # Configure available database clients. Alternatively, you can define the following: # # Define the name of the default database that Mongoid can connect to. # (required). database: test # Provide the hosts the default client can connect to. The user's database roles. is :plain) auth_mech: :scram # The database or source to authenticate the user against.

C++ 100

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

Andreas Andreakis , Ioannis Papapanagiotou Overview Change-Data-Capture (CDC) allows capturing committed changes from a database in real-time and propagating those changes to downstream consumers [1][2]. In databases like MySQL and PostgreSQL, transaction logs are the source of CDC events.

PyMongo Tutorial: Testing MongoDB Failover in Your Python App


It comes as no surprise that Python developers commonly leverage MongoDB hosting , the most popular NoSQL database , for their deployments due to its flexible nature and lack of schema requirements. It is a fairly mature driver that supports most of the common operations with the database, and you can check out this tutorial for an introduction to the PyMongo driver. It is also recommended that SSL connections be enabled to encrypt the client-database traffic.

C++ 39

Testing the CP Subsystem with Jepsen


In addition to well-grounded designs and proven algorithms, reliability also requires a substantial amount of testing. testing test cases distributed databases jepsen cp subsystemAt Hazelcast we take reliability very seriously. With the new CP Subsystem module, Hazelcast has become the first and only IMDG that offers a linearizable distributed implementation of the Java concurrency primitives backed by the Raft consensus algorithm.

Java 100

MySQL High Availability Framework Explained – Part II: Semisynchronous Replication


MySQL semisynchronous replication, by design, can fall back to asynchronous mode based on the slave acknowledgement timeout settings or based on the number of semisynchronous-capable slaves available at any point in time. The default design of the ScaleGrid HA framework is to avoid falling back to asynchronous mode. This is a conscious design decision we have made as the default behavior which is, of course, configurable based on the application requirements.

A quick overview of In-Memory OLTP in SQL Server

SQL Shack

This is in continuation of the previous articles How to monitor internal data structures of SQL Server In-Memory database objects and SQL Server In-Memory database internal memory structure monitoring. This article describes the concepts of In-memory-optimization and design principles.

Scalable MicroService Architecture


But that fully self sufficient nature means that each of these services have their own end-to-end full stack of technology, including database. data-driven business logic stays close to the data, i.e. in the database. Application State Management and Data-Driven Decision Making.