External Table
Luca's blog on data engineering, data platforms, and performance.
Links
(Move to ...)
Luca's Home Page
Luca's Twitter
Luca's GitHub
Blog of the database services at CERN
▼
Friday, August 28, 2020
Apache Spark 3.0 Memory Monitoring Improvements
›
TLDR; Apache Spark 3.0 comes with many improvements, including new features for memory monitoring. This can help you troubleshooting memo...
Thursday, March 26, 2020
Distributed Deep Learning for Physics with TensorFlow and Kubernetes
›
Summary: This post details a solution for distributed deep learning training for a High Energy Physics use case, deployed using cloud resou...
Thursday, April 25, 2019
Machine Learning Pipelines for High Energy Physics Using Apache Spark with BigDL and Analytics Zoo
›
Topic: This post describes a data pipeline for a machine learning task of interest in high energy physics: building a particle classifier t...
Tuesday, February 19, 2019
A Performance Dashboard for Apache Spark
›
Topic: This post dives into the steps for deploying a performance dashboard for Apache Spark, using Spark metrics system instrumentation, I...
‹
›
Home
View web version