Edson Hiroshi AokiinTowards Data ScienceOvercoming Apache Spark’s biggest pain pointsAn advanced guide to the most challenging aspects of Spark and how data scientists and engineers can overcome them11 min read·Oct 10, 2020--5--5
Edson Hiroshi AokiinTowards Data ScienceTraining multiple ML models and running data tasks in parallel via YARN+Spark+multithreadingHarness large scale computational resources to allow a single data scientist to perform hundreds of Big data tasks in parallel24 min read·Nov 11, 2019--2--2