Prepare to choose a data store in Azure

2025-09-23

When you prepare your landing zone environment for your cloud adoption, you need to determine the data requirements for hosting your workloads. Azure database products and services support various data storage scenarios and capabilities. How you configure your landing zone environment to support your data requirements depends on your workload governance, technical, and business requirements.

Identify data services requirements

As part of your landing zone evaluation and preparation, you need to identify the data stores that your landing zone needs to support. This process involves assessing each of the applications and services that make up your workloads to determine their data storage and access requirements. After you identify and document these requirements, you can create policies for your landing zone to control allowed resource types based on your workload needs.

For each application or service that you deploy to your landing zone environment, use the following information as a starting point to help you determine the appropriate data store services to use.

Functional requirements

Consider the nature of your data and how you plan to use it:

Data format: Structured (tables), semi-structured (JSON, XML, and key-value), or unstructured (images and documents)
Purpose: Online transactional processing (OLTP) for transactional data or online analytical processing (OLAP) for complex, ad-hoc data analysis
Search needs: Indexing capability or full-text search capability
Specialized: Vector stores for highly dimensional data or graph databases for highly interconnected data
Data relationships: Joins, graph traversal, or hierarchical structures
Consistency model: Strong, eventual, or configurable consistency
Schema flexibility: Schema-on-write (rigid) versus schema-on-read (flexible)
Concurrency needs: Optimistic versus pessimistic locking, and high-write scenarios
Data life cycle: Short-lived versus long-term archival, and hot versus cold data
Data movement: Extract, transform, and load (ETL) requirements; extract, load, and transform (ELT) requirements; and integration with pipelines

Nonfunctional requirements

Evaluate performance and scalability expectations:

Latency and throughput: Real-time versus batch processing
Scalability: Vertical versus horizontal scaling, and global distribution
Reliability and availability: Service-level agreement (SLA) requirements and failover strategies
Limits: Storage size, throughput caps, and partitioning constraints

Cost and management considerations

Factor in operational overhead and budget:

Managed versus self-hosted: Platform as a service (PaaS) versus infrastructure as a service (IaaS) trade-offs
Region availability: Data residency and compliance needs
Cost optimization: Tiered storage, partitioning, and caching
Licensing and portability: Vendor lock-in and open-source compatibility

Security and governance

Ensure alignment with organizational policies:

Encryption: At-rest and in-transit encryption
Authentication and authorization: Role-based access and identity integration
Auditing and monitoring: Activity logs, alerts, and diagnostics
Networking: Private endpoints, firewall rules, and virtual network integration

DevOps and team readiness

Assess your team's ability to support and evolve the solution:

Skill sets: Familiarity with query languages, SDKs, and tooling
Client support: Language bindings and driver availability
Tooling integration: Continuous integration and continuous delivery (CI/CD) pipelines and observability tools

Key questions

Answer the following questions about your workloads to make decisions based on the Azure database services decision tree:

What level of control do you need over the OS and database engine? Some scenarios require you to have a high degree of control or ownership of the software configuration and host servers for your database workloads. In these scenarios, you can deploy custom IaaS virtual machines (VMs) to fully control the deployment and configuration of data services. You might not need this level of control, but maybe you're not ready to move to a full PaaS solution. In that case, a managed instance can provide higher compatibility with your on-premises database engine while providing the benefits of a managed platform.
Will your workloads use a relational database technology? If so, choose from Azure SQL Database, Azure Database for MySQL, and Azure Database for PostgreSQL, which all provide managed PaaS database capabilities.
Will your workloads use SQL Server? In Azure, your workloads can run on IaaS-based SQL Server on Azure Virtual Machines or on the PaaS-based SQL Database hosted service. Your choice depends on whether you want to manage your database, apply patches, and take backups, or delegate these operations to Azure. Some scenarios require IaaS-hosted SQL Server because of capability requirements. For more information, see Choose the right SQL Server option in Azure.
Will your workloads use key-value database storage? Azure Managed Redis is a managed in-memory data store based on the latest Redis Enterprise version. It provides low latency and high throughput. Azure Cosmos DB also provides key-value storage capabilities.
Will your workloads use document or graph data? Azure Cosmos DB is a multimodel database service that supports various data types and APIs. It also provides document and graph database capabilities.
Will your workloads use column-family data? Azure Managed Instance for Apache Cassandra provides a managed Apache Cassandra cluster that can extend your existing datacenters into Azure or serve as a cloud-only cluster and datacenter.
Will your workloads require high-capacity data analytics capabilities? Microsoft Fabric is an enterprise-ready, end-to-end analytics platform. It unifies data movement, data processing, ingestion, transformation, real-time event routing, and report building.
Will your workloads require search engine capabilities? You can use Azure AI Search to build AI-enhanced cloud-based search indexes that can integrate into your applications.
Will your workloads use time-series data? Azure Data Explorer is a managed, high-performance, big data analytics platform that analyzes high volumes of data in near real time.

Note

For more information about how to assess database options for each of your applications or services, see Understand data store models.

Common database scenarios

The following table lists common use scenario requirements and the recommended database services to handle them.

Your goal	Recommended database service
Build apps that scale with a managed and intelligent SQL database in the cloud.	SQL Database
Modernize SQL Server applications by using a managed, up-to-date SQL instance in the cloud.	Azure SQL Managed Instance
Migrate your SQL workloads to Azure while maintaining complete SQL Server compatibility and OS-level access.	SQL Server on Virtual Machines
Build scalable, managed enterprise-ready apps on open-source PostgreSQL, scale out single-node PostgreSQL with high performance, or migrate PostgreSQL and Oracle workloads to the cloud.	Azure Database for PostgreSQL
Deliver high availability and elastic scaling to open-source mobile and web apps by using a managed community MySQL database service, or migrate MySQL workloads to the cloud.	Azure Database for MySQL
Build applications that have guaranteed low latency and high availability anywhere, at any scale, or migrate Cassandra, MongoDB, Gremlin, and other NoSQL workloads to the cloud.	Azure Cosmos DB
Modernize existing Cassandra data clusters and apps, and gain flexibility by using a managed instance service.	Azure Managed Instance for Apache Cassandra
Build a managed elastic data warehouse that has security at every level of scale at no extra cost.	Azure Synapse Analytics
Deliver fast, scalable applications by using an open-source-compatible in-memory data store.	Azure Cache for Redis

Database feature comparison

The following table lists features available in Azure database services.

Feature	SQL Database	SQL Managed Instance	Azure Database for PostgreSQL	Azure Database for MySQL	Azure Managed Instance for Apache Cassandra	Azure Cosmos DB	Azure Managed Redis	Azure Cosmos DB for MongoDB
Database type	Relational	Relational	Relational	Relational	NoSQL	NoSQL	In-memory	NoSQL
Data model	Relational	Relational	Relational	Relational	Wide-column	Multimodel: Document, wide-column, key-value, graph	Key-value	Document
Distributed multiprimary writes	No	No	No	No	Yes	Yes	Yes	Yes
Virtual network connectivity support	Virtual network service endpoint	Native virtual network implementation	Virtual network injection (Flexible Server only)	Virtual network injection (Flexible Server only)	Native virtual network implementation	Virtual network service endpoint	Virtual network service endpoint	Virtual network service endpoint

Note

Azure Private Link service simplifies networking design by enabling Azure services to communicate over private networking. All Azure database services support Azure Private Link service. For managed instance database services, these instances are deployed in virtual networks, so you don't need to deploy private endpoints for them.

Regional availability

Azure helps you deliver services at the scale needed to reach customers and partners anywhere. When you plan your cloud deployment, determine the Azure region to host your workload resources.

Most Azure regions support most database services. A few regions support only a subset of these products, but they mostly target governmental customers. Before you decide which regions to deploy your database resources to, see Products available by region to check the latest status of regional availability.

For more information about Azure global infrastructure, see Azure geographies.

Data residency and compliance requirements

Legal and contractual requirements related to data storage often apply to workloads. These requirements might vary based on the location of your organization, the jurisdiction of the physical assets that host your data stores, and your applicable business sector. Consider the following components of data obligations:

Data classification
Data location
Responsibilities for data protection under the shared responsibility model

For information about these requirements, see Achieve compliant data residency and security with Azure.

Part of your compliance efforts might include controlling where your database resources are physically located. Azure regions are organized into groups called geographies. An Azure geography honors data residency, sovereignty, compliance, and resiliency requirements within geographical and political boundaries. If your workloads are subject to data sovereignty or other compliance requirements, you must deploy your storage resources to regions in a compliant Azure geography.

Establish controls for database services

When you prepare your landing zone environment, you can establish controls that limit the data stores that users can deploy. Controls can help you manage costs and limit security risks. Developers and IT teams can still deploy and configure resources that support your workloads.

After you identify and document your landing zone's requirements, you can use Azure Policy to control the database resources that you allow users to create. Controls can allow or deny the creation of database resource types.

For example, you might restrict users to creating only SQL Database resources. Use policies to control the options that users can select when they create resources. For example, you can restrict SQL Database SKUs that users can provision by allowing only specific versions of SQL Server to be installed on an IaaS VM. For more information, see Azure Policy built-in policy definitions.

You can apply policies to resources, resource groups, subscriptions, and management groups. Include your policies in Azure Blueprints definitions, and apply them repeatedly throughout your cloud estate.

Next steps

Understand data models

Use the following articles to choose a specialized data store:

Feedback

Was this page helpful?