Posts by Yunsong Wang
Data Science
Nov 28, 2024
Supercharging Deduplication in pandas Using RAPIDS cuDF
A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
12 MIN READ
Data Science
Mar 06, 2023
Maximizing Performance with Massively Parallel Hash Maps on GPUs
Decades of computer science history have been devoted to devising solutions for efficient storage and retrieval of information. Hash maps (or hash tables) are a...
19 MIN READ