External Table
Luca's blog on data engineering, data platforms, and performance.
Showing posts with label tools.
Monday, September 8, 2025
Troubleshoot I/O & Wait Latency with OraLatencyMap and PyLatencyMap
I recently chased an Oracle performance issue where most reads were sub-millisecond (cache), but a thin band around ~10 ms (spindles) domi...
Friday, April 26, 2024
Building an Apache Spark Performance Lab: Tools and Techniques for Spark Optimization
Apache Spark is renowned for its speed and efficiency in handling large-scale data processing. However, optimizing Spark to achieve maximum ...
Friday, August 28, 2020
Apache Spark 3.0 Memory Monitoring Improvements
TL;DR: Apache Spark 3.0 comes with many improvements, including new features for memory monitoring. This can help you troubleshoot memo...
Thursday, March 26, 2020
Distributed Deep Learning for Physics with TensorFlow and Kubernetes
Summary: This post details a solution for distributed deep learning training for a High Energy Physics use case, deployed using cloud resou...
Friday, August 24, 2018
SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads
SparkMeasure simplifies the collection and analysis of Spark task metrics data. It is also intended as a working example...
Friday, September 29, 2017
Performance Analysis of a CPU-Intensive Workload in Apache Spark
Topic: This post is about techniques and tools for measuring and understanding CPU-bound and memory-bound workloads in Apache Spark. You ...