General FAQ

YugabyteDB

What is YugabyteDB?

YugabyteDB is a high-performance, highly available, and scalable distributed SQL database designed for powering global, internet-scale applications. It is fully compatible with PostgreSQL and provides strong ACID guarantees for distributed transactions. It can be deployed in single-region, multi-region, and multi-cloud setups.

Explore YugabyteDB

What makes YugabyteDB unique?

YugabyteDB combines the strengths of traditional SQL databases and modern NoSQL systems. It is horizontally scalable, supports global geo-distribution, offers both SQL (YSQL) and NoSQL (YCQL) APIs, delivers high performance, and guarantees strong transactional consistency.

Design goals

Is YugabyteDB open source?

YugabyteDB is 100% open source. It is licensed under Apache 2.0.

Source code

How many major releases has YugabyteDB had so far?

YugabyteDB released its first beta, v0.9, in November 2017. Since then, several stable and preview versions have been released. The current stable version is v2024.1.2.0, and the current preview version is v2.23.0.0.

Releases

What is the difference between preview and stable versions?

Preview releases include features under active development and are recommended for development and testing only. Stable releases undergo rigorous testing for a longer period of time and are ready for production use.

Release versioning

What are the upcoming features?

The roadmap for upcoming releases and the list of recently released features can be found in the yugabyte-db repository on GitHub.

Current roadmap

Which companies are currently using YugabyteDB in production?

Global organizations of all sizes leverage YugabyteDB to fulfill their application requirements.

Success Stories

How do I report a security vulnerability?

Follow the steps in the vulnerability disclosure policy to report a vulnerability to our security team. The policy outlines our commitments to you when you disclose a potential vulnerability, the reporting process, and how we will respond.

What are YugabyteDB Anywhere and YugabyteDB Aeon?

YugabyteDB is the 100% open source core database. It is the best choice for startup organizations with strong technical operations expertise looking to deploy to production with traditional DevOps tools.

YugabyteDB Anywhere is commercial software for running a self-managed YugabyteDB-as-a-Service. It has built-in cloud native operations, enterprise-grade deployment options, and world-class support.

YugabyteDB Aeon is a fully-managed cloud service on Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). Sign up to get started.

Compare deployment options

When is YugabyteDB a good fit?

YugabyteDB is a good fit for fast-growing, cloud-native applications that need to serve business-critical data reliably, with zero data loss, high availability, and low latency. Common use cases include:

Distributed Online Transaction Processing (OLTP) applications needing multi-region scalability without compromising strong consistency and low latency. For example, user identity, retail product catalogs, and financial data services.
Hybrid Transactional/Analytical Processing (HTAP) (also known as Translytical) applications needing real-time analytics on transactional data. For example, user personalization, fraud detection, machine learning.
Streaming applications needing to efficiently ingest, analyze, and store ever-growing data. For example, IoT sensor analytics, time series metrics, real-time monitoring.

When is YugabyteDB not a good fit?

YugabyteDB is not a good fit for traditional Online Analytical Processing (OLAP) use cases that need complete ad-hoc analytics. Use an OLAP store such as Druid or a data warehouse such as Snowflake.

What are the trade-offs of using YugabyteDB?

Ensuring ACID transactions and full compatibility with the PostgreSQL API presents challenges in a distributed environment. The trade-offs can also vary depending on the database you're comparing it to. Here are a few key considerations when switching to YugabyteDB:

Consistency vs. Latency: YugabyteDB uses the Raft consensus algorithm for strong consistency in distributed systems. While this guarantees data integrity, it can result in higher write latency compared to eventually consistent databases like Cassandra.
Data Distribution vs. Query Latency: Transactions and JOINs that span multiple nodes incur inter-node latency, making such queries slower than in single-node databases like PostgreSQL. Many projects are currently in progress to match the performance of a single-node database.
Multi-Region vs. Latency: In multi-region or globally distributed setups, YugabyteDB replicates data across regions to ensure availability and resilience. However, this can lead to higher write latency due to cross-region coordination.
Availability vs. Resource Requirements: Being a distributed database, YugabyteDB demands more hardware and networking resources to maintain high availability and fault tolerance than traditional monolithic databases that run on a single machine.
PostgreSQL Feature Support: Every new PostgreSQL feature must be optimized for distributed environments, which is not a simple task. Be sure to verify that the PostgreSQL features your application relies on are supported in the current version of YugabyteDB.

PostgreSQL compatibility

What is a YugabyteDB universe?

A YugabyteDB universe comprises one primary cluster and zero or more read replica clusters that collectively function as a resilient and scalable distributed database. Because many deployments have only a primary cluster, the terms cluster and universe are sometimes used interchangeably, but they are not the same thing.

More key concepts

Are there any performance benchmarks available?

YugabyteDB is regularly benchmarked using a variety of standard benchmarks like TPC-C, YCSB, and sysbench.

Benchmark

How is YugabyteDB tested for correctness?

In addition to rigorous failure testing, YugabyteDB passes most Jepsen test scenarios. Jepsen is a methodology and toolset for verifying the correctness of distributed systems, particularly their consistency models and fault tolerance. It has become a standard for stress-testing distributed databases, data stores, and other distributed systems.

Jepsen testing

How does YugabyteDB compare to other databases?

We have published detailed comparison information against multiple SQL and NoSQL databases:

SQL - CockroachDB, TiDB, Vitess, Amazon Aurora, Google Spanner
NoSQL - MongoDB, FoundationDB, Cassandra, DynamoDB, CosmosDB

Compare YugabyteDB to other databases

PostgreSQL support

How compatible is YugabyteDB with PostgreSQL?

YugabyteDB is wire-protocol, syntax, feature, and runtime compatible with PostgreSQL. However, supporting every PostgreSQL feature in a distributed system is not always feasible.

PostgreSQL compatibility

Can I use my existing PostgreSQL tools and drivers with YugabyteDB?

Yes. YugabyteDB is wire-protocol compatible with PostgreSQL and works with most PostgreSQL tools and drivers.

Integrations

Are PostgreSQL extensions supported?

YugabyteDB pre-bundles many popular extensions, which should be readily available on your cluster. However, given the distributed nature of YugabyteDB, not all extensions are supported by default.

PostgreSQL extensions

How can I migrate from PostgreSQL?

YugabyteDB is fully compatible with PostgreSQL, so most PostgreSQL applications should work as is. To address corner cases, we have published a comprehensive guide to help you migrate from PostgreSQL.

Migrate data

Architecture

How does YugabyteDB distribute data?

The table data is split into tablets and the table rows are mapped to the tablets via sharding. The tablets themselves are distributed across the various nodes in the cluster.
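
The mapping from rows to tablets to nodes can be sketched as follows. This is only an illustration of hash sharding in general: the 16-bit hash space, the MD5 stand-in hash, the round-robin placement, and all function names are assumptions made for the example, not YugabyteDB internals.

```python
import hashlib
from bisect import bisect_right

HASH_SPACE = 0x10000  # assumed 16-bit hash space, for illustration only


def key_hash(primary_key: str) -> int:
    """Map a primary key to a value in [0, HASH_SPACE). MD5 is a stand-in hash."""
    digest = hashlib.md5(primary_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:2], "big")


def tablet_bounds(num_tablets: int) -> list:
    """Return the inclusive start of each tablet's contiguous hash range."""
    return [i * HASH_SPACE // num_tablets for i in range(num_tablets)]


def tablet_for(primary_key: str, bounds: list) -> int:
    """Find the tablet whose hash range contains the key's hash."""
    return bisect_right(bounds, key_hash(primary_key)) - 1


def place_tablets(num_tablets: int, nodes: list) -> dict:
    """Spread tablets across nodes round-robin (hypothetical placement policy)."""
    return {t: nodes[t % len(nodes)] for t in range(num_tablets)}


bounds = tablet_bounds(4)
placement = place_tablets(4, ["node-a", "node-b", "node-c"])
tablet = tablet_for("user-42", bounds)
print(f"row 'user-42' -> tablet {tablet} -> {placement[tablet]}")
```

Because the hash is deterministic, the same key always lands in the same tablet, while different keys spread across the hash space and therefore across nodes.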

Data distribution

How does YugabyteDB scale?

YugabyteDB scales seamlessly, without service disruption, when new nodes are added to the cluster. Table data is distributed across tablets. When new nodes are added, the rebalancer moves some tablets to the new nodes so that each node holds roughly the same number of tablets. As data grows, tablets also split into two, and the resulting tablets can be moved to other nodes.
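
The idea of keeping tablet counts roughly even after a node is added can be sketched as a simple greedy loop. This is a minimal illustration, assuming one copy of each tablet and ignoring replicas, leaders, and placement constraints; the `rebalance` function is hypothetical and is not the actual rebalancer.

```python
def rebalance(placement: dict) -> list:
    """Move tablets from the fullest node to the emptiest node until the
    tablet counts differ by at most one. Returns (tablet, source, target) moves.
    """
    moves = []
    while True:
        busiest = max(placement, key=lambda node: len(placement[node]))
        idlest = min(placement, key=lambda node: len(placement[node]))
        if len(placement[busiest]) - len(placement[idlest]) <= 1:
            return moves
        tablet = placement[busiest].pop()
        placement[idlest].append(tablet)
        moves.append((tablet, busiest, idlest))


# Two loaded nodes plus one newly added, empty node.
cluster = {
    "node-a": ["t1", "t2", "t3"],
    "node-b": ["t4", "t5", "t6"],
    "node-c": [],
}
for tablet, source, target in rebalance(cluster):
    print(f"move {tablet}: {source} -> {target}")
```

Only the tablets that need to move are touched, so the rest of the cluster keeps serving requests while the new node fills up.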

Horizontal scalability

How does YugabyteDB provide high availability?

YugabyteDB replicates tablet data onto the tablet's followers via Raft consensus. This ensures that a consistent copy of the data is available in case of failures. When a tablet leader fails, one of its followers is promoted to leader.
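
Follower promotion can be sketched roughly as below. This is a heavily simplified stand-in for a Raft election (no terms, votes, or timeouts): it assumes a new leader can be chosen only while a majority of replicas is alive, and it picks the live replica with the most complete log. The `Replica` class and `elect_leader` function are hypothetical names for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Replica:
    node: str
    last_log_index: int  # how much of the replicated log this replica holds
    alive: bool = True


def elect_leader(replicas: List[Replica]) -> Optional[Replica]:
    """Pick a new leader among live replicas, or None if no majority is alive."""
    live = [r for r in replicas if r.alive]
    majority = len(replicas) // 2 + 1
    if len(live) < majority:
        return None  # the tablet cannot elect a leader until a majority is back
    # Prefer the replica with the most up-to-date log.
    return max(live, key=lambda r: r.last_log_index)


tablet_peers = [
    Replica("node-a", last_log_index=10, alive=False),  # failed leader
    Replica("node-b", last_log_index=10),
    Replica("node-c", last_log_index=9),
]
new_leader = elect_leader(tablet_peers)
print(new_leader.node if new_leader else "no leader available")
```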

Resiliency and high availability

How is data consistency maintained across multiple nodes?

Every write (insert, update, delete) is replicated via Raft consensus to the tablet's followers according to the replication factor (RF) of the cluster. Before acknowledging the write back to the client, YugabyteDB ensures that the data is replicated to a quorum of the tablet's replicas (floor(RF/2) + 1, counting the leader).
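
The quorum arithmetic can be worked through with a few lines of code. This is a small sketch using integer division for RF/2; the function names are made up for the example, and the acknowledgement count is assumed to include the leader's own copy.

```python
def quorum_size(rf: int) -> int:
    """Smallest majority of rf replicas: floor(rf / 2) + 1."""
    return rf // 2 + 1


def tolerated_failures(rf: int) -> int:
    """Replicas that can be lost while a quorum can still be formed."""
    return rf - quorum_size(rf)


def write_acknowledged(rf: int, acks: int) -> bool:
    """A write is acknowledged once a quorum of replicas has persisted it."""
    return acks >= quorum_size(rf)


for rf in (1, 3, 5, 7):
    print(f"RF={rf}: quorum={quorum_size(rf)}, tolerates {tolerated_failures(rf)} failure(s)")
```

For example, with RF 3 a write needs 2 copies and the tablet survives the loss of 1 replica; with RF 5 it needs 3 copies and survives the loss of 2.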

Synchronous replication

What is tablet splitting?

Data is stored in tablets. As a tablet grows, it splits into two, which allows some of its data to be moved to other nodes in the cluster.
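
A split can be pictured as dividing one tablet's key range into two halves. The sketch below is only illustrative: the size threshold, the midpoint split of the range, the even division of data, and the `Tablet` class are all assumptions for the example rather than the actual splitting logic.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Tablet:
    start: int       # inclusive start of the key range
    end: int         # exclusive end of the key range
    size_bytes: int


def split(tablet: Tablet) -> Tuple[Tablet, Tablet]:
    """Split a tablet at the midpoint of its range, dividing its data evenly."""
    mid = (tablet.start + tablet.end) // 2
    half = tablet.size_bytes // 2
    return (
        Tablet(tablet.start, mid, half),
        Tablet(mid, tablet.end, tablet.size_bytes - half),
    )


def split_if_needed(tablets: List[Tablet], threshold_bytes: int) -> List[Tablet]:
    """Split every tablet that has grown past the (assumed) size threshold."""
    result = []
    for tablet in tablets:
        if tablet.size_bytes > threshold_bytes and tablet.end - tablet.start > 1:
            result.extend(split(tablet))
        else:
            result.append(tablet)
    return result


GIB = 1024 ** 3
tablets = split_if_needed([Tablet(0, 65536, 12 * GIB)], threshold_bytes=10 * GIB)
for t in tablets:
    print(f"range [{t.start}, {t.end}) holds {t.size_bytes / GIB:.0f} GiB")
```

Each resulting tablet covers a contiguous half of the original range, so either half can be moved to another node independently.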

Tablet splitting

Are indexes colocated with tables?

Indexes are not typically colocated with the base table. An index is sharded on its own primary key, independently of how the main table is sharded and distributed, which is based on the primary key of the table.

Explore indexes
Colocating tables

How can YugabyteDB be both CP and ensure high availability at the same time?

In terms of the CAP theorem, YugabyteDB is a consistent and partition-tolerant (CP) database. It ensures high availability (HA) for most practical situations even while remaining strongly consistent. While this may seem to be a violation of the CAP theorem, that is not the case. CAP treats availability as a binary option whereas YugabyteDB treats availability as a percentage that can be tuned to achieve high write availability (reads are always available as long as a single node is available).

Partition Tolerance - CAP