External Table
Luca's blog on data engineering, data platforms, and performance.
Links
(Move to ...)
Luca's Home Page
Luca's Twitter
Luca's GitHub
Blog of the database services at CERN
▼
Thursday, February 23, 2023
Introduction to Spark APIs for Data Processing
›
Introduction to Apache Spark APIs for Data Processing This is a self-paced and open introduction course to Apache Spark. Theory and demos co...
Monday, May 23, 2022
Making histograms with Apache Spark and other SQL engines
›
Topic: This post will show you how to generate histograms using Apache Spark. You will find examples using the Spark DataFrame API and with...
Thursday, March 10, 2022
Can High Energy Physics Analysis Profit from Apache Spark APIs?
›
We are in a golden age for distributed data processing, with an abundance of tools and solutions emerging from industry and open source. Hig...
Friday, August 28, 2020
Apache Spark 3.0 Memory Monitoring Improvements
›
TLDR; Apache Spark 3.0 comes with many improvements, including new features for memory monitoring. This can help you troubleshooting memo...
Thursday, March 26, 2020
Distributed Deep Learning for Physics with TensorFlow and Kubernetes
›
Summary: This post details a solution for distributed deep learning training for a High Energy Physics use case, deployed using cloud resou...
‹
›
Home
View web version