Databricks

What It Is

The Databricks connection connects your Databricks SQL Warehouse to Duvo, enabling your assignments to run SQL queries and analyze data on your behalf. This is a user-configured connection—you'll need to provide your server hostname, HTTP path, and personal access token before your assignments can use it.


What It Can Do

Query Execution

  • Run SQL queries against your Databricks SQL Warehouse

  • Retrieve and process query results

  • Execute analytical queries across your data lakehouse

Data Exploration

  • Browse catalogs, schemas, and tables

  • Explore table structures and column definitions

  • Discover available datasets

Analysis

  • Aggregate and summarize data across tables

  • Pull business metrics and performance indicators

  • Extract data for reports and downstream workflows


Why It Matters

Databricks unifies data engineering, analytics, and AI on a single lakehouse platform. This connection allows assignments to:

  • Query your lakehouse — Access data directly from Databricks SQL Warehouse

  • Leverage unified data — Work with structured and semi-structured data in one place

  • Automate reporting — Pull metrics and insights without manual SQL sessions

  • Enrich workflows — Combine Databricks data with other systems for end-to-end automation


When to Use It

Use the Databricks connection when your assignment needs to:

  • Run analytical queries against your data lakehouse

  • Pull business metrics or performance data for reports

  • Validate data before making business decisions

  • Combine lakehouse data with other tools in a workflow

  • Monitor data conditions and trigger actions based on query results


How It Works

  1. Connect the connection — Provide your Databricks server hostname, HTTP path, and personal access token

  2. Configure access — Specify which catalog and schema to use

  3. Build your assignment — Add Databricks actions to your automation

  4. Query data — Your assignment can now execute SQL queries and retrieve results


Key Benefits

  • Lakehouse access — Query your unified data platform directly from workflows

  • SQL analytics — Use familiar SQL to retrieve exactly the data you need

  • Secure access — Uses personal access tokens for controlled authentication

  • Real-time data — Access current data rather than relying on exports

  • Data-driven automation — Make intelligent workflow decisions based on your lakehouse data

Last updated