Create Databricks-Connected Dashboards in Grafana



Connect to Databricks Data via CData Connect Cloud in Grafana and create custom business dashboards with real-time access to Databricks data.

Grafana is an open-source platform that allows users to analyze, monitor, and visualize telemetry from various data sources. When combined with CData Connect Cloud, you gain immediate access to Databricks data for business dashboards. This article outlines the process of connecting to Databricks using Connect Cloud and creating a basic dashboard from Databricks data within Grafana.

CData Connect Cloud offers a pure SQL Server, cloud-to-cloud interface for Databricks, enabling the creation of dashboards directly from live Databricks data within Grafana, all without the need for data replication to a native database. As you build visualizations, Grafana generates SQL queries to gather data. With its inherent optimized data processing capabilities, CData Connect Cloud efficiently channels all supported SQL operations, including filters and JOINs, directly to Databricks. This leverages server-side processing to swiftly deliver the requested Databricks data.

About Databricks Data Integration

Accessing and integrating live data from Databricks has never been easier with CData. Customers rely on CData connectivity to:

  • Access all versions of Databricks from Runtime Versions 9.1 - 13.X to both the Pro and Classic Databricks SQL versions.
  • Leave Databricks in their preferred environment thanks to compatibility with any hosting solution.
  • Secure authenticate in a variety of ways, including personal access token, Azure Service Principal, and Azure AD.
  • Upload data to Databricks using Databricks File System, Azure Blog Storage, and AWS S3 Storage.

While many customers are using CData's solutions to migrate data from different systems into their Databricks data lakehouse, several customers use our live connectivity solutions to federate connectivity between their databases and Databricks. These customers are using SQL Server Linked Servers or Polybase to get live access to Databricks from within their existing RDBMs.

Read more about common Databricks use-cases and how CData's solutions help solve data problems in our blog: What is Databricks Used For? 6 Use Cases.


Getting Started


Configure Databricks Connectivity for Grafana

Connectivity to Databricks from Grafana is made possible through CData Connect Cloud. To work with Databricks data from Grafana, we start by creating and configuring a Databricks connection.

  1. Log into Connect Cloud, click Connections and click Add Connection
  2. Select "Databricks" from the Add Connection panel
  3. Enter the necessary authentication properties to connect to Databricks.

    To connect to a Databricks cluster, set the properties as described below.

    Note: The needed values can be found in your Databricks instance by navigating to Clusters, and selecting the desired cluster, and selecting the JDBC/ODBC tab under Advanced Options.

    • Server: Set to the Server Hostname of your Databricks cluster.
    • HTTPPath: Set to the HTTP Path of your Databricks cluster.
    • Token: Set to your personal access token (this value can be obtained by navigating to the User Settings page of your Databricks instance and selecting the Access Tokens tab).
  4. Click Create & Test
  5. Navigate to the Permissions tab in the Add Databricks Connection page and update the User-based permissions.

Add a Personal Access Token

If you are connecting from a service, application, platform, or framework that does not support OAuth authentication, you can create a Personal Access Token (PAT) to use for authentication. Best practices would dictate that you create a separate PAT for each service, to maintain granularity of access.

  1. Click on your username at the top right of the Connect Cloud app and click User Profile.
  2. On the User Profile page, scroll down to the Personal Access Tokens section and click Create PAT.
  3. Give your PAT a name and click Create.
  4. The personal access token is only visible at creation, so be sure to copy it and store it securely for future use.

With the connection configured, you are ready to connect to Databricks data from Grafana.

Visualize Live Databricks Data in Grafana

To establish a connection from Grafana to the CData Connect Cloud Virtual SQL Server API, follow these steps.

  1. If you have not already done so, download and install Grafana for your operating system from the Grafana Website. Once installed, access Grafana at http://localhost:3000/.
  2. Log in to Grafana with your username and password for Grafana. If this is your first time logging in, the username is admin and the password is admin.
  3. On the navigation menu, click Connections > Add new connection. On this page, you can search for Microsoft SQL Server and select it as your data source.
  4. Select Microsoft SQL Server and click Add new data source.
  5. Enter a name for the new data source and then enter the following connection settings:
    • Host: tds.cdata.com:14333
    • Database: enter the Connection Name of the CData Connect Cloud data source you want to connect to (for example, Databricks1).
    • Username: enter your CData Connect Cloud username. This is displayed in the top-right corner of the CData Connect Cloud interface. For example, test@cdata.com.
    • Password: enter the PAT you previously generated.
  6. Click Save & Test. A Database Connect OK message appears when you have a successful connection established.

Create a Dashboard

Once you create the data source for Databricks, you can start building dashboards on Databricks data. Start in the navigation menu and click Dashboards

  1. On the Dashboards page, click + Create dashboard, then click + Add visualization.
  2. This opens the Select data source window where you can select your created connection.
  3. After selecting your connection, you can choose the tables and columns that you want to query for your visualization. Then press Run Query to run the generated query.
  4. After running your query, the resulting data is populated above the query editor. Here you can select the visualization that you want to use to present your data on the dashboard panel.
  5. When you have finished editing your panel, you can click Save dashboard to save the changes made to your Dashboard.

To get live data access to 100+ SaaS, Big Data, and NoSQL sources directly from your cloud applications, try CData Connect Cloud today!

Ready to get started?

Learn more about CData Connect Cloud or sign up for free trial access:

Free Trial