Improving Apache Spark Performance and Reducing Costs with Amazon EMR and NVIDIA

Originally published at: https://developer.nvidia.com/blog/improving-apache-spark-performance-and-reducing-costs-with-amazon-emr-and-nvidia/

Apache Spark has emerged as the standard framework for large-scale, distributed, data analytics processing. NVIDIA worked with the Apache Spark community to accelerate the world’s most popular data analytics framework and to offer revolutionary GPU acceleration on several leading platforms, including Google Cloud, Databricks, and Cloudera. Now, Amazon EMR joins the list of leading platforms, making it easy and…