Microsoft Dynamics 365 connector FAQs
This feature is in Public Preview.
This page answers frequently asked questions about the Microsoft Dynamics 365 connector for Lakeflow Connect.
For general questions about managed ingestion, see Managed connector FAQs.
Connector-specific FAQs
What Dynamics 365 applications are supported?
The Dynamics 365 connector supports:
Dataverse-native applications (direct access, no virtual entities or direct tables required). Examples include:
- Dynamics 365 Sales
- Dynamics 365 Customer Service
- Dynamics 365 Marketing
- Dynamics 365 Field Service
Non-Dataverse-native applications (require either virtual entities or direct tables). Examples include:
- Dynamics 365 Finance & Operations (F&O)
See Configure data source for Microsoft Dynamics 365 ingestion for setup details.
How does the connector access D365 data?
The Dynamics 365 connector uses Azure Synapse Link for Dataverse as an intermediary:
- Azure Synapse Link continuously exports D365 data to ADLS Gen2 in CSV format.
- Synapse Link maintains changelogs with VersionNumber timestamps for change tracking.
- Databricks reads the exported files from ADLS Gen2 using Microsoft Entra ID authentication.
- The connector processes changelogs to perform incremental ingestion.
With this architecture, you can ingest D365 data without making OData-based API calls to D365, which reduces load on your D365 environment.
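To make this file-based access pattern concrete, the following is a minimal PySpark sketch of reading one Synapse Link export folder directly from ADLS Gen2. It is illustrative only: the connector performs this access for you, and the storage account, container, folder timestamp, and table name are placeholders.

# Illustrative sketch only; the connector handles this access internally.
# Assumes the cluster is configured with Microsoft Entra ID credentials for ADLS Gen2.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Synapse Link writes each table's CSV files under timestamped folders in the
# container you configured. All names below are placeholders.
export_path = (
    "abfss://dataverse-container@mystorageaccount.dfs.core.windows.net/"
    "2024-01-15T10.30.00Z/account/*.csv"
)

df = spark.read.csv(export_path)  # header/schema handling depends on your export configuration
df.show(5)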
What's the difference between Dataverse-native and Non-Dataverse-native apps?
Dataverse-native apps store data directly in Dataverse tables. The connector can access these tables immediately after configuring Azure Synapse Link.
Non-Dataverse-native apps (such as F&O) store data in their own database, not in Dataverse. To ingest data from these applications, you can use either virtual entities or direct tables.
Definitions:
- A virtual table (or entity) appears and behaves like a regular table within Dataverse, but it doesn't store any data itself. Instead, it retrieves data on-demand, directly from an external data source (for example, F&O). This means the data is not materialized within the Dataverse database, allowing users to interact with the data without duplicating or storing it.
- A direct table is a physical copy of application data that is exported and materialized outside Dataverse (for example, in Azure Data Lake Storage via Azure Synapse Link). The data is replicated from the source system (for example, F&O) and stored as raw transactional tables that closely match the source schema. Because the data is persisted, it supports scalable analytics and historical analysis without querying the source system.
How they are used (using F&O as an example):
- If using virtual entities, data is exposed in Dataverse through the F&O Virtual Entity solution. These virtual entities appear as read-through tables in Dataverse (for example, tables prefixed with mserp_) and can be exported to ADLS using Azure Synapse Link. From there, Lakeflow Connect pipelines ingest the exported folders into your target tables. This option typically provides business-friendly, pre-joined views with less downstream transformation effort, since F&O entities often aggregate data from multiple underlying tables into a denormalized view.
- If using direct tables, Azure Synapse Link enables you to export raw F&O tables directly. These tables are available in a separate section during Synapse Link setup and are written to ADLS as raw transactional data.
Considerations for deciding which option to use:
- Data granularity and control: Direct tables provide the most granular, raw transactional data for ultimate control over data modeling. Virtual tables offer a more flattened, aggregated view—potentially including computed fields.
- Data readiness and transformation effort: Relatedly, virtual tables often offer pre-joined, business-ready data, which can help to minimize downstream transformations in Databricks. Direct tables often require additional data engineering effort for complex joins and transformations.
- Performance: While both options are reasonably efficient, they introduce overhead at different points in the flow. Virtual tables can add overhead on the source side (with the magnitude depending on the complexity of the entity), while direct tables may require more downstream compute for business logic.
Why do I need Azure Synapse Link?
Azure Synapse Link for Dataverse is required because:
- Change tracking: Synapse Link provides changelogs with VersionNumber fields that enable incremental ingestion.
- Performance: Reading exported files from ADLS Gen2 is more efficient than making OData-based API calls to D365.
How does incremental ingestion work?
The Dynamics 365 connector uses the VersionNumber field from Azure Synapse Link changelogs to track changes:
- Synapse Link exports data to timestamped folders in ADLS Gen2.
- Each export includes a changelog file with VersionNumber values indicating when records changed.
- The connector processes folders in chronological order based on timestamps.
- For each folder, the connector reads the changelog and applies changes (inserts, updates, deletes).
- The connector stores the last processed VersionNumber as a cursor.
- Subsequent pipeline runs only process new folders created after the last cursor.
This approach ensures the connector captures all changes without reprocessing unchanged data.
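The cursor logic can be illustrated with a small, self-contained Python sketch. It uses in-memory stand-ins for changelog rows and is not the connector's actual implementation.

# Conceptual sketch of the cursor logic described above; folder names and rows
# are in-memory stand-ins, not real Synapse Link exports.
def incremental_sync(folders: dict, last_cursor: int) -> int:
    """Apply changes from folders in chronological order and return the new cursor."""
    new_cursor = last_cursor
    for folder_name in sorted(folders):          # timestamped names sort chronologically
        for change in folders[folder_name]:      # one dict per changelog row
            if change["VersionNumber"] > last_cursor:
                # In the real pipeline this is an insert, update, or delete on the target.
                print(f"apply {change['op']} for {change['id']}")
                new_cursor = max(new_cursor, change["VersionNumber"])
    return new_cursor

# Example: only the newer folder's row is applied because the stored cursor is 100.
folders = {
    "2024-01-01T00.00.00Z": [{"id": "a1", "op": "insert", "VersionNumber": 90}],
    "2024-01-02T00.00.00Z": [{"id": "a1", "op": "update", "VersionNumber": 120}],
}
print(incremental_sync(folders, last_cursor=100))  # applies the update, returns 120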
See SCD types for information about how the connector handles updates and deletes.
What happens if Azure Synapse Link stops running?
If Azure Synapse Link stops exporting data:
- The connector continues running but doesn't find new folders to process.
- No new changes are ingested until Synapse Link resumes.
- When Synapse Link resumes, it exports all missed changes to new folders.
- The connector processes these folders during the next pipeline run.
If Synapse Link stops for an extended period, you might need to perform a full refresh to ensure data consistency, especially if Synapse Link's retention period expires.
Monitor your Synapse Link connection in the Power Apps maker portal to ensure continuous operation.
Can I ingest attachments from D365?
The Dynamics 365 connector ingests attachment metadata (file name, size, MIME type, record associations) but doesn't download attachment file contents. This is because Synapse Link exports table data, not binary file contents.
To access attachment files:
- Ingest attachment metadata tables (for example, annotation, attachment).
- Use the metadata to identify files you need.
- Download files directly from D365 using the Dynamics 365 Web API or Power Automate.
- Store files in your preferred storage location (for example, ADLS Gen2, volume).
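As a hedged sketch of steps 3 and 4, the following Python example downloads one attachment through the Dynamics 365 Web API using metadata from the ingested annotation table. The org URL, annotation ID, access token, and output path are placeholders, and token acquisition (for example, MSAL client credentials) is not shown.

# Minimal sketch: download one attachment via the Dynamics 365 Web API and
# write it to storage. All names below are placeholders.
import base64
import requests

ORG_URL = "https://yourorg.crm.dynamics.com"             # placeholder org URL
ANNOTATION_ID = "00000000-0000-0000-0000-000000000000"   # from the ingested annotation table
TOKEN = "<entra-id-access-token>"                        # placeholder; obtain via Entra ID

resp = requests.get(
    f"{ORG_URL}/api/data/v9.2/annotations({ANNOTATION_ID})?$select=filename,documentbody",
    headers={"Authorization": f"Bearer {TOKEN}", "Accept": "application/json"},
    timeout=30,
)
resp.raise_for_status()
record = resp.json()

# documentbody is base64-encoded; decode and write to your preferred location
# (for example, a Unity Catalog volume). The path below is a placeholder.
with open(f"/Volumes/main/d365/attachments/{record['filename']}", "wb") as f:
    f.write(base64.b64decode(record["documentbody"]))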
Do I need separate connections for different D365 apps?
No, you can use a single Unity Catalog connection for all D365 applications in the same Dataverse environment. The connection authenticates to your ADLS Gen2 storage account, not to individual D365 applications.
However, you need separate pipelines for each Dataverse environment (source_schema). For example:
- Single connection: Authenticates to your ADLS Gen2 container.
- Multiple pipelines: One pipeline per Dataverse environment, each specifying a different source_schema value.
This approach simplifies authentication management while allowing you to ingest from multiple environments.
What permissions are required in D365?
To set up the Dynamics 365 connector, you need:
In Dynamics 365 and Dataverse:
- System Administrator role or equivalent permissions to configure Azure Synapse Link.
- Read permissions for all tables you want to ingest.
- Permissions to configure virtual entities or direct tables (for applications like F&O).
In Azure:
- Permissions to create and configure ADLS Gen2 storage accounts and containers.
- Permissions to create and configure Microsoft Entra ID applications.
- Permissions to assign the Storage Blob Data Contributor role to the Entra ID application.
In Databricks:
- Workspace administrator or metastore administrator permissions to create Unity Catalog connections.
- CREATE permissions on the target catalog and schema.
See Configure data source for Microsoft Dynamics 365 ingestion for detailed permission requirements.
Can I ingest data from multiple Dataverse environments?
Yes, you can ingest from multiple Dataverse environments using a single connection. Create separate pipelines for each environment:
# Imports assume the Databricks SDK for Python (databricks-sdk).
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.pipelines import IngestionPipelineDefinition

w = WorkspaceClient()

# Pipeline for production environment
prod_pipeline = w.pipelines.create(
    name="d365_prod_ingestion",
    ingestion_definition=IngestionPipelineDefinition(
        channel="PREVIEW",
        connection_name="d365_connection",  # Same connection
        source_schema="https://prod.crm.dynamics.com",  # Production environment URL
        source_table=["account", "contact"],
        destination_catalog="main",
        destination_schema="d365_prod",
        scd_type="SCD_TYPE_2"
    )
)

# Pipeline for test environment
test_pipeline = w.pipelines.create(
    name="d365_test_ingestion",
    ingestion_definition=IngestionPipelineDefinition(
        channel="PREVIEW",
        connection_name="d365_connection",  # Same connection
        source_schema="https://test.crm.dynamics.com",  # Test environment URL
        source_table=["account", "contact"],
        destination_catalog="main",
        destination_schema="d365_test",
        scd_type="SCD_TYPE_2"
    )
)
You must either have all environments export to the same ADLS Gen2 storage account and container, or create separate connections for each storage location.
How can I reduce ingestion costs?
To optimize costs:
- Select specific columns: Use column selection to ingest only required columns.
- Filter tables: Only include tables you need in the pipeline.
- Optimize history tracking: Turn off history tracking (for example, use SCD type 1) if you don't need it, which reduces storage.
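For example, the following sketch follows the same simplified pipeline definition used in the multi-environment example earlier on this page, ingesting a single table with history tracking turned off; all names are placeholders.

from databricks.sdk import WorkspaceClient
from databricks.sdk.service.pipelines import IngestionPipelineDefinition

w = WorkspaceClient()

# Ingest only the table you need and skip history tracking to reduce storage.
minimal_pipeline = w.pipelines.create(
    name="d365_minimal_ingestion",
    ingestion_definition=IngestionPipelineDefinition(
        channel="PREVIEW",
        connection_name="d365_connection",
        source_schema="https://prod.crm.dynamics.com",
        source_table=["account"],        # only the tables you need
        destination_catalog="main",
        destination_schema="d365_prod",
        scd_type="SCD_TYPE_1"            # no history tracking
    )
)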
See Microsoft Dynamics 365 ingestion connector limitations for more considerations.
Can I transform data during ingestion?
Lakeflow Connect ingests raw data from D365 without transformations. To transform data:
- Ingest raw data into a landing schema (for example, d365_landing).
- Create downstream Lakeflow Spark Declarative Pipelines for transformations.
- Use SQL or Python to transform data into curated schemas.
This separation of concerns preserves raw data while allowing flexible transformations downstream.
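As a hedged sketch of this pattern, the following declarative pipeline definition assumes a landing table named main.d365_landing.account created by the ingestion pipeline; the curated table name and selected columns are placeholders.

# Runs inside a declarative pipeline, where the spark session is provided.
import dlt
from pyspark.sql import functions as F

@dlt.table(name="account_curated", comment="Curated D365 accounts")
def account_curated():
    return (
        spark.read.table("main.d365_landing.account")      # raw landing table (placeholder name)
        .select("accountid", "name", "createdon")           # placeholder columns
        .withColumn("ingested_at", F.current_timestamp())
    )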
How do I handle schema changes in D365?
At this time, all schema changes require a full refresh of the table. Monitor your D365 schema changes and plan full refreshes accordingly.
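If you manage pipelines with the Databricks SDK for Python, triggering a full refresh of an affected table might look like the following sketch; the pipeline ID and table name are placeholders.

from databricks.sdk import WorkspaceClient

w = WorkspaceClient()

# Fully refresh only the table whose D365 schema changed.
w.pipelines.start_update(
    pipeline_id="<pipeline-id>",          # placeholder
    full_refresh_selection=["account"],   # placeholder table name
)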
Does the Dataverse Synapse Link instance need to be in the same region as the Databricks workspace?
No.