External Table

Luca's blog on data engineering, data platforms, and performance.

Thursday, April 25, 2019

Machine Learning Pipelines for High Energy Physics Using Apache Spark with BigDL and Analytics Zoo

Topic: This post describes a data pipeline for a machine learning task of interest in high energy physics: building a particle classifier t...
Tuesday, February 19, 2019

A Performance Dashboard for Apache Spark

Topic: This post dives into the steps for deploying a performance dashboard for Apache Spark, using Spark metrics system instrumentation, I...
Friday, August 24, 2018

SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads

SparkMeasure simplifies the collection and analysis of Spark task metrics data. It is also intended as a working example...
Friday, September 29, 2017

Performance Analysis of a CPU-Intensive Workload in Apache Spark

Topic: This post is about techniques and tools for measuring and understanding CPU-bound and memory-bound workloads in Apache Spark. You ...

About Me

Luca Canali
Geneva, Switzerland
@LucaCanaliDB
