Databricks System Table Workspace Health SQL Toolkit


20 System Table queries to understand Databricks usage across the organization

Sync Computing


As data engineers, we need to understand the intricacies of our Databricks environment. You can’t optimize performance, manage budgets, or ensure efficient resource allocation without it. Thankfully, Databricks gives you a behind-the-scenes look at how your workspace is running through system tables. Everything from query performance to job execution and cluster activity is in those tables.

But raw system data can be tricky to navigate, and sometimes you just need a quick answer to that burning question. That’s why we’ve created a Databricks Health SQL Toolkit — a free set of queries you can copy, paste, and run to get instant insights into your environment. Whether you’re debugging slow queries, tracking costs, or just curious about what’s happening under the hood, these queries should help you get there faster.

If you find these queries useful, you can download our 1-click dashboard for Databricks to visualize all of them in a single, shareable dashboard. Wow the team with insights into your Jobs, SQL warehouses, All-Purpose Compute (APC) clusters, and DLT usage.

General Usage

Let’s start with the basics. Here are a few queries to help you get a quick overview of how your organization is using Databricks and determine your next steps.

Who submits the most queries?


Knowing who your most active users are by SQL query count sheds valuable light on workspace utilization and helps you:

  • Identify power users who might benefit from additional training or resources
  • Detect unusual activity patterns that could indicate security issues or inefficient practices
  • Better understand how your organization uses Databricks

This is essential for understanding how to optimize your Databricks environment to meet the needs and usage patterns of your team. It is also useful for cost allocation and user management.

Go here for the query
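
If you want a quick starting point before grabbing the full version, a minimal sketch against the system.query.history table might look like the following (column names assume the standard query history schema):

    -- Top query submitters over the last 30 days (illustrative sketch)
    SELECT
      executed_by,
      COUNT(*) AS query_count
    FROM system.query.history
    WHERE start_time >= current_timestamp() - INTERVAL 30 DAYS
    GROUP BY executed_by
    ORDER BY query_count DESC
    LIMIT 20;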

Who owns the most expensive clusters?


Pinpointing the users who own and run your most expensive Jobs and All-Purpose Compute clusters is the next logical step on the road to cutting costs.

These insights enable you to:

  • Identify the owners of your most expensive clusters
  • Attribute costs to users and groups
  • Detect anomalies and potential misuse

Use this query to identify the users to follow up with to understand why costs are so high and devise a plan to reduce them.

Go here for the query
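
As a rough illustration, a sketch along these lines joins billing usage to cluster metadata and list prices. It assumes the standard system.billing.usage, system.compute.clusters, and system.billing.list_prices schemas, and uses list prices, which ignore any negotiated discounts; the linked query is more complete.

    -- Approximate 30-day list cost per cluster owner (illustrative sketch)
    WITH latest_clusters AS (
      -- system.compute.clusters keeps a row per cluster change; keep the latest
      SELECT cluster_id, cluster_name, owned_by
      FROM (
        SELECT *, ROW_NUMBER() OVER (PARTITION BY cluster_id ORDER BY change_time DESC) AS rn
        FROM system.compute.clusters
      ) c WHERE rn = 1
    ),
    prices AS (
      -- latest list price per SKU
      SELECT sku_name, pricing.default AS unit_price
      FROM (
        SELECT *, ROW_NUMBER() OVER (PARTITION BY sku_name ORDER BY price_start_time DESC) AS rn
        FROM system.billing.list_prices
      ) lp WHERE rn = 1
    )
    SELECT
      c.owned_by,
      c.cluster_name,
      ROUND(SUM(u.usage_quantity * p.unit_price), 2) AS approx_cost
    FROM system.billing.usage u
    JOIN latest_clusters c ON u.usage_metadata.cluster_id = c.cluster_id
    LEFT JOIN prices p ON u.sku_name = p.sku_name
    WHERE u.usage_date >= date_sub(current_date(), 30)
    GROUP BY c.owned_by, c.cluster_name
    ORDER BY approx_cost DESC
    LIMIT 20;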

How many clusters are currently active, and how were they created?


Understanding your cluster creation activity is crucial for resource management and cost optimization.

This enables you to:

  • Ensure cluster creation practices are in line with policies
  • Understand cluster creation trends
  • Identify rogue clusters created in alternative ways

Knowing where cluster creation originates can help optimize your workflows and ensure better governance.

Go here for the query
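
A minimal sketch of this kind of check against system.compute.clusters (which keeps one row per cluster change, so we take the latest record and filter out deleted clusters) could look like this:

    -- Active clusters grouped by how they were created (illustrative sketch)
    WITH latest AS (
      SELECT *, ROW_NUMBER() OVER (PARTITION BY cluster_id ORDER BY change_time DESC) AS rn
      FROM system.compute.clusters
    )
    SELECT
      cluster_source,              -- e.g. UI, API, JOB, PIPELINE
      COUNT(*) AS active_clusters
    FROM latest
    WHERE rn = 1
      AND delete_time IS NULL      -- still active (not deleted)
    GROUP BY cluster_source
    ORDER BY active_clusters DESC;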

Which markets (Spot, On-Demand) do my clusters use?


Market usage data helps you balance the cost and reliability of your infrastructure. While Spot instances may offer up to 90% lower costs, this comes at the expense of reliability: cloud providers can reclaim Spot nodes at any moment, with little to no notice, causing interruptions and even job failures.

This insight will enable you to:

  • Understand spot vs on-demand usage across the organization
  • Identify opportunities to leverage Spot
  • Ensure efficient use of Spot with on-demand fallback

Go here for the query
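
For an AWS workspace, a rough sketch could read the market from each cluster’s cloud attributes; this assumes aws_attributes.availability in system.compute.clusters mirrors the Clusters API (SPOT, ON_DEMAND, SPOT_WITH_FALLBACK), so adapt it for azure_attributes or gcp_attributes as needed.

    -- Cluster count by market / availability type on AWS (illustrative sketch)
    WITH latest AS (
      SELECT *, ROW_NUMBER() OVER (PARTITION BY cluster_id ORDER BY change_time DESC) AS rn
      FROM system.compute.clusters
    )
    SELECT
      aws_attributes.availability AS market,   -- assumption: AWS workspace
      COUNT(*) AS clusters
    FROM latest
    WHERE rn = 1
      AND delete_time IS NULL
    GROUP BY aws_attributes.availability
    ORDER BY clusters DESC;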

How “sticky” is my workspace?


Databricks stickiness is a critical metric for understanding user engagement. It is calculated by dividing daily active users (DAU) by monthly active users (MAU) and helps you:

  • Gauge the extent of adoption across the organization
  • Identify trends and changes in usage over time
  • Alert on usage anomalies and potential misuse

A high stickiness ratio means that users are consistently using the workspace, which indicates that you have successfully adopted Databricks within your organization.

Go here for the query
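
As a rough sketch, stickiness can be approximated from SQL activity in system.query.history by dividing distinct daily users by distinct calendar-month users; a trailing 30-day window, or audit-log-based activity, would be more precise.

    -- Daily stickiness = DAU / MAU, using query activity as a proxy (illustrative sketch)
    WITH daily AS (
      SELECT DATE(start_time) AS activity_date,
             COUNT(DISTINCT executed_by) AS dau
      FROM system.query.history
      GROUP BY DATE(start_time)
    ),
    monthly AS (
      SELECT DATE_TRUNC('MONTH', start_time) AS activity_month,
             COUNT(DISTINCT executed_by) AS mau
      FROM system.query.history
      GROUP BY DATE_TRUNC('MONTH', start_time)
    )
    SELECT
      d.activity_date,
      d.dau,
      m.mau,
      ROUND(d.dau / m.mau, 3) AS stickiness
    FROM daily d
    JOIN monthly m
      ON DATE_TRUNC('MONTH', d.activity_date) = m.activity_month
    ORDER BY d.activity_date DESC;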

Cost Optimization

Understanding usage is nice, but cutting costs is the ultimate goal. Here are a few useful queries to help you pinpoint where to focus your optimization efforts for the most impact.

What are the current compute costs associated with my Databricks workspace?


Keeping track of your Databricks compute costs is fundamental for budget management. With so many options (classic, serverless, DBSQL; APC vs. on-demand), it’s hard to know what to use when.

These insights help you:

  • Pinpoint high spend targets
  • Identify over- and under-used options
  • Detect unexpected spend or over-consumption

Go here for the query
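
A minimal sketch of a spend breakdown joins system.billing.usage to system.billing.list_prices; note that list prices ignore discounts, so treat the result as an approximation.

    -- Approximate 30-day list cost by SKU (illustrative sketch)
    WITH prices AS (
      SELECT sku_name, pricing.default AS unit_price
      FROM (
        SELECT *, ROW_NUMBER() OVER (PARTITION BY sku_name ORDER BY price_start_time DESC) AS rn
        FROM system.billing.list_prices
      ) lp WHERE rn = 1
    )
    SELECT
      u.sku_name,
      SUM(u.usage_quantity) AS dbus,
      ROUND(SUM(u.usage_quantity * p.unit_price), 2) AS approx_list_cost
    FROM system.billing.usage u
    LEFT JOIN prices p ON u.sku_name = p.sku_name
    WHERE u.usage_date >= date_sub(current_date(), 30)
    GROUP BY u.sku_name
    ORDER BY approx_list_cost DESC;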

What are the product costs associated with my Databricks workspace?


Knowing which Databricks products you are spending the most on allows you to implement targeted cost-saving measures to ensure you’re getting the most out of your investment.

These insights let you:

  • Identify high cost targets for optimization
  • Get a bird’s eye view of how the org spends on Databricks
  • Detect trends and changing patterns

Go here for the query

Which jobs are most likely over-provisioned?


Identifying the workloads with the lowest average CPU usage is a quick way to detect over-provisioned clusters. This information helps you:

  • Reduce spend from over-provisioned resources
  • Detect misconfigurations
  • Better allocate resources
  • Understand which jobs are the best candidates for Gradient to auto-optimize

This query is a great place to start your waste reduction efforts.

Go here for the query
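
One way to sketch this is to average CPU utilization per cluster from system.compute.node_timeline and tie it back to jobs through usage_metadata.job_id in system.billing.usage (DBUs stand in for cost here):

    -- Job clusters with the lowest average CPU utilization over the last 30 days (illustrative sketch)
    WITH cpu AS (
      SELECT cluster_id,
             AVG(cpu_user_percent + cpu_system_percent) AS avg_cpu_percent
      FROM system.compute.node_timeline
      WHERE start_time >= current_timestamp() - INTERVAL 30 DAYS
      GROUP BY cluster_id
    )
    SELECT
      u.usage_metadata.job_id AS job_id,
      c.cluster_id,
      ROUND(c.avg_cpu_percent, 1) AS avg_cpu_percent,
      SUM(u.usage_quantity) AS dbus
    FROM system.billing.usage u
    JOIN cpu c ON u.usage_metadata.cluster_id = c.cluster_id
    WHERE u.usage_metadata.job_id IS NOT NULL
      AND u.usage_date >= date_sub(current_date(), 30)
    GROUP BY u.usage_metadata.job_id, c.cluster_id, c.avg_cpu_percent
    ORDER BY avg_cpu_percent ASC
    LIMIT 20;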

Which jobs are most likely under-provisioned?


Identifying the data pipelines with the highest average memory utilization is an easy way to detect potential bottlenecks and under-provisioned resources. These insights enable you to:

  • Identify jobs with potential bottlenecks
  • Improve performance with increased memory allocation
  • Detect anomalies and trends over time
  • Understand which jobs are the best candidates for Gradient to auto-optimize

Detecting bottlenecks helps you improve performance by identifying the workloads that will benefit from additional resources.

Go here for the query

What are my most expensive jobs over the past 30 days?


Knowing which jobs are costing you the most is crucial for optimizing your data infrastructure. This information helps you:

  • Identify the best candidates for optimization
  • Implement cost-saving for high-spend workloads (or use Gradient for automated Jobs cluster tuning)
  • Better understand your Databricks workload spend

Keeping tabs on your most expensive jobs every month is a great way to anticipate spend and detect costly workloads before they incur too many expenses.

Go here for the query
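
For illustration, a sketch of job-level cost attribution might join usage_metadata.job_id to list prices and to system.lakeflow.jobs for job names (schemas assumed from the standard system tables; list prices ignore discounts):

    -- Most expensive jobs by approximate list cost over the last 30 days (illustrative sketch)
    WITH prices AS (
      SELECT sku_name, pricing.default AS unit_price
      FROM (
        SELECT *, ROW_NUMBER() OVER (PARTITION BY sku_name ORDER BY price_start_time DESC) AS rn
        FROM system.billing.list_prices
      ) lp WHERE rn = 1
    ),
    jobs AS (
      -- system.lakeflow.jobs keeps a row per job change; keep the latest name
      SELECT workspace_id, job_id, name
      FROM (
        SELECT *, ROW_NUMBER() OVER (PARTITION BY workspace_id, job_id ORDER BY change_time DESC) AS rn
        FROM system.lakeflow.jobs
      ) j WHERE rn = 1
    )
    SELECT
      u.usage_metadata.job_id AS job_id,
      j.name AS job_name,
      ROUND(SUM(u.usage_quantity * p.unit_price), 2) AS approx_cost
    FROM system.billing.usage u
    LEFT JOIN prices p ON u.sku_name = p.sku_name
    LEFT JOIN jobs j ON u.workspace_id = j.workspace_id AND u.usage_metadata.job_id = j.job_id
    WHERE u.usage_metadata.job_id IS NOT NULL
      AND u.usage_date >= date_sub(current_date(), 30)
    GROUP BY u.usage_metadata.job_id, j.name
    ORDER BY approx_cost DESC
    LIMIT 20;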

What jobs are growing the most in costs week-over-week?


Workload costs can grow gradually over time. Understanding which of your workloads are growing the most in cost each week enables you to:

  • Be the first to identify jobs that are getting increasingly expensive
  • Pinpoint workloads for optimization
  • Detect anomalies and potential problems

By following week-over-week increases in spend, you can identify jobs that are becoming more expensive, even when spend grows slowly over time.

Go here for the query
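
A minimal sketch uses a window function to compare each job’s weekly DBU consumption to the prior week (DBUs stand in for cost here; multiply by list prices for dollar figures):

    -- Jobs with the largest week-over-week DBU growth (illustrative sketch)
    WITH weekly AS (
      SELECT
        usage_metadata.job_id AS job_id,
        DATE_TRUNC('WEEK', usage_date) AS week_start,
        SUM(usage_quantity) AS dbus
      FROM system.billing.usage
      WHERE usage_metadata.job_id IS NOT NULL
      GROUP BY usage_metadata.job_id, DATE_TRUNC('WEEK', usage_date)
    ),
    with_change AS (
      SELECT
        job_id,
        week_start,
        dbus,
        dbus - LAG(dbus) OVER (PARTITION BY job_id ORDER BY week_start) AS wow_dbu_change
      FROM weekly
    )
    SELECT *
    FROM with_change
    WHERE week_start = DATE_TRUNC('WEEK', current_date())   -- current (partial) week; adjust as needed
    ORDER BY wow_dbu_change DESC
    LIMIT 20;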

What are the most expensive SQL warehouses for the last 30 days?


Pinpointing which warehouses are driving up costs over the last 30 days is a great way to attribute costs to users and teams. This information helps you to:

  • Tie expensive warehouses to owners
  • Simplify chargebacks and cost attribution
  • Identify anomalies and trends over time

This query should be your first step when looking to optimize warehouse costs. It will ensure you focus your efforts on high-impact targets and give you everything you need to plan your next steps.


Go here for the query

What are the most expensive SQL queries over the past 30 days?


Query costs can grow gradually with every run. Use this query to clear out the noise and focus on your most expensive queries by total cost for the last 30 days. This information enables you to:

  • Pinpoint top candidates for optimization
  • Detect anomalies and potential issues
  • Investigate cost increases before they become substantial

Go here for the query

This query helps you understand which queries to investigate further. The following query adds much-needed information to help you take your next steps.

Why are these queries so expensive, and who owns them?


Pinpointing the queries that are costing you the most is a great first step for optimization. But it is just a first step. This query adds granular information about each expensive query. This information helps you:

  • Find queries that require optimization
  • Understand per run and overall costs
  • Better allocate resources

Go here for the query

What are the most costly APC clusters over the past 30 days?


Understanding which APC clusters are costing you the most can help you cut costs by pinpointing candidates for optimization. This information lets you:

  • Focus optimization effort on high-value targets
  • Identify potential candidates for on-demand clusters
  • Better allocate resources

Use this query to pinpoint the clusters that are prime for further investigation.

Go here for the query

Which APC clusters are likely under-utilized?


Identifying your underutilized APC clusters in the past 30 days, along with their relative costs, is a great way to facilitate cost savings and resource optimization. These insights enable you to:

  • Create a short list of clusters for downsizing
  • Cross reference usage with cost to identify issues
  • Detect trends over time

Use this query to detect under-utilized APC clusters and connect the dots between those clusters and the users who created them.

Go here for the query
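
A sketch that connects utilization to spend might join average CPU from system.compute.node_timeline to all-purpose usage in system.billing.usage; the SKU filter and the 30% threshold below are assumptions to adjust for your environment.

    -- Under-utilized all-purpose clusters: low average CPU but non-trivial spend (illustrative sketch)
    WITH cpu AS (
      SELECT cluster_id,
             AVG(cpu_user_percent + cpu_system_percent) AS avg_cpu_percent
      FROM system.compute.node_timeline
      WHERE start_time >= current_timestamp() - INTERVAL 30 DAYS
      GROUP BY cluster_id
    ),
    apc_usage AS (
      SELECT usage_metadata.cluster_id AS cluster_id,
             SUM(usage_quantity) AS dbus
      FROM system.billing.usage
      WHERE sku_name LIKE '%ALL_PURPOSE%'          -- assumption: all-purpose SKUs
        AND usage_date >= date_sub(current_date(), 30)
      GROUP BY usage_metadata.cluster_id
    )
    SELECT
      a.cluster_id,
      ROUND(c.avg_cpu_percent, 1) AS avg_cpu_percent,
      a.dbus
    FROM apc_usage a
    JOIN cpu c ON a.cluster_id = c.cluster_id
    WHERE c.avg_cpu_percent < 30                   -- assumption: utilization threshold
    ORDER BY a.dbus DESC
    LIMIT 20;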

What are the most expensive DLT clusters over the past 30 days?


Knowing which Delta Live Tables (DLT) clusters are costing you the most is the first step to optimization. This information lets you:

  • Pinpoint DLT clusters for optimization
  • Identify tables for further analysis
  • Detect anomalies and trends over time

Go here for the query

Performance Management

Two queries to help optimize the performance of your Databricks ecosystem.

Which notebooks are consuming the most DBUs over time?


Determining which notebooks consume the most DBUs will help target your optimization efforts for maximum effect. This information helps you:

  • Identify notebooks for optimization
  • Find problems and misuse
  • Charge costs based on notebook owner

Go here for the query

What are my longest running queries?


Identifying performance bottlenecks is key to keeping your data infrastructure efficient and responsive. This information helps you:

  • Pinpoint queries that might be slowing down your system
  • Detect which queries need more resources vs code optimization
  • Identify patterns in slow queries that could indicate underlying issues

By optimizing these queries, you can make a big difference in the overall responsiveness and efficiency of your Databricks workspace.

Go here for the query
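
A minimal sketch against system.query.history ranks finished statements by total duration:

    -- Longest running queries over the last 30 days (illustrative sketch)
    SELECT
      statement_id,
      executed_by,
      LEFT(statement_text, 100) AS statement_preview,
      ROUND(total_duration_ms / 1000.0, 1) AS duration_seconds
    FROM system.query.history
    WHERE start_time >= current_timestamp() - INTERVAL 30 DAYS
      AND execution_status = 'FINISHED'
    ORDER BY total_duration_ms DESC
    LIMIT 20;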

Data and Workflow Management

A couple of queries to help you manage your data and workflows on Databricks.

What are my most commonly used datasets? Where are they stored?


Knowing which of your datasets is used the most is crucial for optimizing data access and storage. This insight helps you:

  • Prioritize optimization efforts by dataset to help lower storage costs
  • Apply caching strategies for frequently accessed data
  • Make informed decisions about data placement and replication

Optimizing your most-used datasets may help improve overall performance and reduce data transfer cost across your Databricks environment.

Go here for the query
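
As a starting point, read frequency can be approximated from lineage events in system.access.table_lineage by counting how often each table appears as a source; joining the results to system.information_schema.tables can then surface where each table lives.

    -- Most frequently read tables over the last 30 days, based on lineage events (illustrative sketch)
    SELECT
      source_table_full_name,
      COUNT(*) AS read_events,
      COUNT(DISTINCT created_by) AS distinct_users
    FROM system.access.table_lineage
    WHERE source_table_full_name IS NOT NULL
      AND event_date >= date_sub(current_date(), 30)
    GROUP BY source_table_full_name
    ORDER BY read_events DESC
    LIMIT 20;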

How do I track lineage for data transformations in the workspace?


Visualize all the data lineage in your workspace. By mapping the relationships between source and target tables, you gain clarity into how data flows through the workspace.

This information helps you understand:

  • Dependencies between datasets and transformations
  • Where the data used in critical workflows originates from
  • Upstream or downstream effects of changes

Go here for the query
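
A minimal sketch of a lineage map pulls distinct source-to-target pairs from system.access.table_lineage:

    -- Source -> target table relationships observed in the workspace (illustrative sketch)
    SELECT
      source_table_full_name,
      target_table_full_name,
      COUNT(*) AS lineage_events,
      MAX(event_time) AS last_seen
    FROM system.access.table_lineage
    WHERE source_table_full_name IS NOT NULL
      AND target_table_full_name IS NOT NULL
    GROUP BY source_table_full_name, target_table_full_name
    ORDER BY lineage_events DESC;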

Get the queries

For the SQL queries to answer these questions and to access a comprehensive 1-click dashboard visualizing all these insights, go directly to the Databricks SQL Query Toolkit page.

This toolkit provides you with ready-to-use queries and a powerful dashboard to help you gain deep insights into your Databricks environment quickly and easily. Paste these queries into your Databricks SQL Editor to see the results, or fill out the form to get a comprehensive dashboard with answers to all these questions.

Get the comprehensive dashboard


If you’d like all of the queries with corresponding plots in one convenient dashboard, request the comprehensive dashboard here.

Conclusion

By regularly asking and answering questions like these, you’ll be well-equipped to optimize your Databricks workspace, reduce costs, and improve overall performance. Remember, the key to effective data engineering is not just in processing data, but in understanding and optimizing the environment in which that processing occurs.

We’ll be adding additional queries to the Databricks SQL toolkit and 1-click dashboard, so stay tuned!

Do you think these queries are useful? Are we missing any? We’d love to hear what you think! Feel free to drop us a line here.