Databricks Performance Checklist | Data Science with Raghav

This is a starting checklist you can customize as you write more detailed technical posts.

1. Understand the symptom

Is the job slow, expensive, failing, skewed, or producing too many small files?

Start in the Spark UI: look at stages, tasks, shuffle read/write, spill, executor time, and data skew.
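
One way to make the data skew check concrete is to compare the slowest task in a stage with the median task. The sketch below is plain Python and assumes you have copied per-task durations (in seconds) out of the stage detail view. The function names and the threshold of 5.0 are illustrative assumptions, not Spark or Databricks defaults.

```python
from statistics import median


def skew_ratio(task_durations_s):
    """Return the slowest task duration divided by the median task duration."""
    if not task_durations_s:
        raise ValueError("need at least one task duration")
    mid = median(task_durations_s)
    if mid == 0:
        # All-zero durations are treated as perfectly even.
        return float("inf") if max(task_durations_s) > 0 else 1.0
    return max(task_durations_s) / mid


def is_skewed(task_durations_s, threshold=5.0):
    """Flag a stage as skewed when the ratio meets the (assumed) threshold."""
    return skew_ratio(task_durations_s) >= threshold


# Hypothetical durations for one stage: four similar tasks and one straggler.
durations = [12.0, 11.5, 13.0, 12.5, 95.0]
print(skew_ratio(durations))
print(is_skewed(durations))
```

A high ratio means one partition is doing most of the work, so it is worth checking the join or grouping key for that stage before changing cluster size.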

Review partitioning, file sizes, table statistics, clustering, and whether filters can skip data.
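
The file size review can be sketched as a small summary over a list of file sizes. This is a minimal example in plain Python. It assumes you have already collected the sizes in bytes (for example from a directory listing), and the 32 MiB cutoff for a "small" file is an illustrative assumption rather than a recommended value.

```python
MIB = 1024 * 1024


def summarize_file_sizes(sizes_bytes, small_threshold_bytes=32 * MIB):
    """Count files below the threshold and report the average file size."""
    total = len(sizes_bytes)
    if total == 0:
        return {"files": 0, "small_files": 0, "small_fraction": 0.0, "avg_bytes": 0.0}
    small = sum(1 for size in sizes_bytes if size < small_threshold_bytes)
    return {
        "files": total,
        "small_files": small,
        "small_fraction": small / total,
        "avg_bytes": sum(sizes_bytes) / total,
    }


# Hypothetical table with three tiny files and one large file.
sizes = [1 * MIB, 2 * MIB, 200 * MIB, 4 * MIB]
summary = summarize_file_sizes(sizes)
print(summary["small_files"], "of", summary["files"], "files are small")
```

A large fraction of small files suggests that compaction or a different write pattern is worth testing before other tuning.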

Avoid random tuning. Change one thing at a time, measure the effect, and keep notes.
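
A lightweight way to keep those notes is to record each change next to its measured run time. The sketch below is plain Python. `TuningLog` and `run_job` are hypothetical names, and `run_job` is only a stand-in for whatever triggers your actual workload.

```python
import time
from dataclasses import dataclass, field


@dataclass
class TuningLog:
    """Records one entry per experiment: what changed and how long the run took."""
    entries: list = field(default_factory=list)

    def measure(self, change, fn, *args, **kwargs):
        """Run fn, time it, and append a note describing the single change tested."""
        start = time.perf_counter()
        result = fn(*args, **kwargs)
        elapsed = time.perf_counter() - start
        self.entries.append({"change": change, "seconds": elapsed})
        return result

    def best(self):
        """Return the entry with the shortest measured run time."""
        return min(self.entries, key=lambda entry: entry["seconds"])


def run_job(rows):
    # Hypothetical stand-in for the real workload.
    return sum(range(rows))


log = TuningLog()
log.measure("baseline", run_job, 100_000)
log.measure("one change: fewer rows", run_job, 10_000)
for entry in log.entries:
    print(entry["change"], round(entry["seconds"], 6))
```

Because each entry names exactly one change, a regression can be traced to the run that introduced it.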