Karthikeyan Natarajan – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-02-25T20:19:32Z http://www.open-lab.net/blog/feed/ Karthikeyan Natarajan <![CDATA[JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF]]> http://www.open-lab.net/blog/?p=95970 2025-02-25T20:19:32Z 2025-02-20T17:00:00Z JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...]]>

JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models (LLMs). While the JSON format is human-readable, it is complex to process with data science and data engineering tools. JSON data often takes the form of newline-delimited JSON Lines (also known as NDJSON) to represent multiple records…

Source

]]>
Karthikeyan Natarajan <![CDATA[Streamline ETL Workflows with Nested Data Types in RAPIDS libcudf]]> http://www.open-lab.net/blog/?p=75553 2024-01-22T21:35:40Z 2023-12-15T21:16:55Z Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform,...]]>

Nested data types are a convenient way to represent hierarchical relationships within columnar data. They are frequently used as part of extract, transform, load (ETL) workloads in business intelligence, recommender systems, cybersecurity, geospatial, and other applications. List types can be used to easily attach multiple transactions to a user without creating a new lookup table…

Source

]]>
2
Karthikeyan Natarajan <![CDATA[GPU-Accelerated JSON Data Processing with RAPIDS]]> http://www.open-lab.net/blog/?p=60657 2023-11-20T23:12:50Z 2023-02-09T17:30:00Z JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications. While the JSON format is...]]>

JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications. While the JSON format is human-readable, it is complex to process with data science and data engineering tools. To bridge that gap, RAPIDS cuDF provides a GPU-accelerated JSON reader (cudf.read_json) that is efficient and robust for many JSON data structures.

Source

]]>
0
���˳���97caoporen����