Posts by Matt Ahrens
Data Science
Apr 03, 2025
Accelerating Apache Parquet Scans on Apache Spark with GPUs
As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...
8 MIN READ
Data Science
Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ