Bradley Dice – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2024-12-12T19:38:34Z http://www.open-lab.net/blog/feed/ Bradley Dice <![CDATA[Supercharging Deduplication in pandas Using RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=92703 2024-12-12T19:38:34Z 2024-11-28T14:00:00Z A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...]]>

]]>
Bradley Dice <![CDATA[Streamline ETL Workflows with Nested Data Types in RAPIDS libcudf]]> http://www.open-lab.net/blog/?p=75553 2024-01-22T21:35:40Z 2023-12-15T21:16:55Z Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform,...]]>

Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform, load (ETL) workloads in business intelligence, recommender systems, cybersecurity, geospatial, and other applications. List types can be used to easily attach multiple transactions to a user without creating a new lookup table…
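As a rough illustration of the list-type idea, the sketch below stores each user's transactions as one list-valued column instead of a separate lookup table. It uses the Arrow-style layout of a flat values array plus row offsets. The user names, amounts, and the helper names `to_list_column` and `get_row` are hypothetical and chosen only for illustration. This is plain Python, not the libcudf API.

```python
from itertools import accumulate

# Hypothetical data: each user has zero or more transaction amounts.
users = ["alice", "bob", "carol"]
transactions = [[12.5, 3.0], [], [40.0, 7.25, 1.0]]


def to_list_column(rows):
    """Flatten nested rows into (offsets, values), an Arrow-style list layout."""
    # offsets[i] marks where row i starts in the flat values array.
    offsets = [0, *accumulate(len(row) for row in rows)]
    values = [item for row in rows for item in row]
    return offsets, values


def get_row(offsets, values, i):
    """Return the list stored at row i by slicing the flat values array."""
    return values[offsets[i]:offsets[i + 1]]


offsets, values = to_list_column(transactions)

# Each user's transactions are recovered by row position, with no join needed.
per_user = {user: get_row(offsets, values, i) for i, user in enumerate(users)}
```

Here row `i` of the list column lines up with row `i` of the `users` column, so the one-to-many relationship lives in the column itself. An empty list is simply a row whose start and end offsets are equal.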

]]>