External Table
Luca's blog on data engineering, data frameworks, and performance.
Links
(Move to ...)
Luca's Home Page
Luca's Twitter
Luca's GitHub
Blog of the database services at CERN
▼
Friday, April 26, 2024
Building an Apache Spark Performance Lab: Tools and Techniques for Spark Optimization
›
Apache Spark is renowned for its speed and efficiency in handling large-scale data processing. However, optimizing Spark to achieve maximum ...
Tuesday, January 30, 2024
Enhancing Apache Spark and Parquet Efficiency: A Deep Dive into Column Indexes and Bloom Filters
›
In the ever-evolving landscape of big data, Apache Spark and Apache Parquet continue to introduce game-changing features. Their latest updat...
›
Home
View web version