External Table

Luca's blog on data engineering, data platforms, and performance.

Thursday, February 23, 2023

Introduction to Spark APIs for Data Processing

This is a self-paced and open introductory course on Apache Spark. Theory and demos co...
Monday, May 23, 2022

Making histograms with Apache Spark and other SQL engines

Topic: This post will show you how to generate histograms using Apache Spark. You will find examples using the Spark DataFrame API and with...
Thursday, March 10, 2022

Can High Energy Physics Analysis Profit from Apache Spark APIs?

We are in a golden age for distributed data processing, with an abundance of tools and solutions emerging from industry and open source. Hig...
Friday, August 28, 2020

Apache Spark 3.0 Memory Monitoring Improvements

TL;DR: Apache Spark 3.0 comes with many improvements, including new features for memory monitoring. This can help you troubleshoot memo...
Thursday, March 26, 2020

Distributed Deep Learning for Physics with TensorFlow and Kubernetes

Summary: This post details a solution for distributed deep learning training for a High Energy Physics use case, deployed using cloud resou...

About Me

Luca Canali
Geneva, Switzerland
@LucaCanaliDB
