Troubleshoot Confluence ingestion
The Confluence connector is in Beta.
This page describes common issues with the Confluence connector in Databricks Lakeflow Connect and how to resolve them.
General pipeline troubleshooting
If a pipeline fails while executing, click the step that failed and confirm whether the error message gives sufficient information about the nature of the error.

You can also check and download the cluster logs from the pipeline details page by clicking Update details in the right-hand panel, then Logs. Scan the logs for errors or exceptions.

Authentication errors
Error:
The user does not have access to the specified site 'https://your-domain.atlassian.net' with the given token, or the URL is incorrect. Please verify the URL and the OAuth token's permissions.
Resolution:
- Check the OAuth scopes granted in the OAuth app.
- Check that your Confluence instance URL is correct and includes the full domain (for example,
https://your-domain.atlassian.net).
Rate limit errors
Error:
Confluence API rate limit exceeded.
Resolution:
- The Confluence connector automatically retries with exponential backoff when rate limits are encountered.
- If rate limit errors persist, consider scheduling pipeline runs during off-peak hours.
- For large Confluence instances, you might need to increase the time between pipeline runs.
- Contact Atlassian support if you need to increase your API rate limits.## Space not found errors
Missing pages or incomplete data
Issue: Some pages are not appearing in the ingested data, or the data appears incomplete.
Resolution:
- Verify that the pages exist in the source Confluence space and aren't archived. Archived pages aren't supported.
- Check that the pages aren't in a draft status. Only published pages are ingested.
- Confirm that the user account has read access to all of the pages in the space.
- Check the pipeline event logs for any warnings or errors related to specific pages.
Slow ingestion performance
Issue:
Pipeline runs are taking longer than expected.
Resolution:
- Initial pipeline runs (full snapshots) typically take longer than incremental runs.
- Large pages with extensive content or many attachments might slow down ingestion.
- Check if you're hitting Confluence API rate limits by reviewing the pipeline logs.
- If performance issues persist, create a support ticket.
Refresh token expiration
Issue:
The pipeline was working but suddenly started failing with authentication errors.
Resolution:
- Check the OAuth scopes granted in the OAuth app.
- Update the connection.
- Retry the pipeline.
Column selection using Databricks Asset Bundles
Issue:
You're unable to use the column selection feature when you create a managed ingestion pipeline using Databricks Asset Bundles.
Resolution:
Check your Databricks CLI version. If the version is below v0.251.0, reinstall the CLI.
Need more help?
If you encounter an issue that's not covered in this troubleshooting guide, create a support ticket.