Saving Apache Spark Big Data Processing Costs on Google Cloud Dataproc

Originally published at: https://developer.nvidia.com/blog/saving-apache-spark-big-data-processing-costs-on-google-cloud-dataproc/

According to IDC, the volume of data generated each year is growing exponentially.  IDC’s Global DataSphere projects that the world will generate 221 ZB of data by 2026. This data holds fantastic information. But as the volume of data grows, so does the processing cost. As a data scientist or engineer, you’ve certainly felt the…