Beyond Pandas: Scaling Data Processing with Polars
| Speaker | Caleb Jephuneh |
|---|---|
| Track | Data Science and Engineering |
| Type | Regular talk (45 minutes) |
Abstract
We all love Pandas, but once a dataset grows to gigabytes, its performance bottlenecks can become a nightmare. This talk is your introduction to Polars, a blazing-fast DataFrame library that is changing the game for data scientists. We will compare Pandas and Polars directly on a large dataset, showing the speed Polars gains from lazy evaluation and multi-core processing. You'll learn the key differences in syntax and API, understand when Polars is the right choice for performance, and discover how to transition your data pipelines to it. Come prepared to unlock a new level of performance in your data analysis workflows.
