Data Science – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2025-04-29T22:43:09Z http://www.open-lab.net/blog/feed/

Joseph Lucas <![CDATA[Structuring Applications to Secure the KV Cache]]> http://www.open-lab.net/blog/?p=99425 2025-04-29T22:43:09Z 2025-04-29T22:43:01Z

When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...

]]>

Jenn Yonemitsu <![CDATA[Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers]]> http://www.open-lab.net/blog/?p=99350 2025-04-29T17:23:06Z 2025-04-29T17:22:59Z

Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year's Google Cloud Next...

]]>

Bo Dong <![CDATA[NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support]]> http://www.open-lab.net/blog/?p=99089 2025-04-23T19:26:15Z 2025-04-23T19:26:07Z

NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...

]]>

Chris Deotte https://www.kaggle.com/cdeotte <![CDATA[Grandmaster Pro Tip: Winning First Place in Kaggle Competition with Feature Engineering using NVIDIA cuDF-pandas]]> http://www.open-lab.net/blog/?p=98938 2025-04-22T23:45:10Z 2025-04-17T23:03:20Z

Feature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer...

]]>

Ziyue Xu <![CDATA[Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming]]> http://www.open-lab.net/blog/?p=98553 2025-04-17T19:35:24Z 2025-04-16T16:00:00Z

Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....

]]>

1 Nirmal Kumar Juluru <![CDATA[NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy]]> http://www.open-lab.net/blog/?p=98855 2025-04-22T23:49:11Z 2025-04-15T18:00:00Z

AI is no longer just about generating text or images; it's about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...

]]>

Ziyue Xu <![CDATA[Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch]]> http://www.open-lab.net/blog/?p=98560 2025-04-17T19:35:27Z 2025-04-11T18:37:54Z

NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...

]]>

1 Shai Shen-Orr <![CDATA[Curating Biological Findings from Scientific Literature with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=98526 2025-04-28T23:18:36Z 2025-04-10T18:30:00Z

Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...

]]>

Prem Sagar Gali <![CDATA[Efficiently Scaling Polars GPU Parquet Reader]]> http://www.open-lab.net/blog/?p=98435 2025-04-22T23:52:25Z 2025-04-10T16:30:00Z

When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...

]]>

Vinay Raman <![CDATA[Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data]]> http://www.open-lab.net/blog/?p=97927 2025-04-17T19:35:37Z 2025-04-07T18:39:06Z

As large language models (LLMs) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...

]]>

Sama Bali <![CDATA[Event: HP & NVIDIA Developer Challenge]]> http://www.open-lab.net/blog/?p=98487 2025-04-17T19:35:39Z 2025-04-07T17:54:00Z

Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.

]]>

Matt Ahrens <![CDATA[Accelerating Apache Parquet Scans on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=98350 2025-04-22T23:57:50Z 2025-04-03T16:18:03Z

As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...

]]>

1 Ronen Dar <![CDATA[NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration]]> http://www.open-lab.net/blog/?p=98094 2025-04-22T23:59:16Z 2025-04-01T09:00:00Z

Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....

]]>

Brian Shi <![CDATA[Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases]]> http://www.open-lab.net/blog/?p=97900 2025-04-03T18:46:06Z 2025-03-26T21:41:08Z

Large language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to...

]]>

Cole Swain <![CDATA[Spotlight: Tomorrow.io Transforms Global Weather Resilience with NVIDIA AI]]> http://www.open-lab.net/blog/?p=98023 2025-04-03T18:46:17Z 2025-03-26T21:19:34Z

From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...

]]>

John Ashcroft <![CDATA[Powering Flood Risk Assessment with NVIDIA Earth-2]]> http://www.open-lab.net/blog/?p=97974 2025-04-23T00:01:57Z 2025-03-25T20:59:12Z

Inland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in...

]]>

Xavier Renard <![CDATA[Spotlight: AXA Explores AI-Driven Hurricane Risk Assessment]]> http://www.open-lab.net/blog/?p=98096 2025-04-23T00:05:46Z 2025-03-25T17:47:06Z

Large ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of...

]]>

Holger Roth <![CDATA[Supercharging the Federated Learning Ecosystem by Integrating Flower and NVIDIA FLARE]]> http://www.open-lab.net/blog/?p=94045 2025-04-23T00:05:05Z 2025-03-24T16:00:00Z

In recent years, open-source systems like Flower and NVIDIA FLARE have emerged as pivotal tools in the federated learning (FL) landscape, each with its unique...

]]>

1 Kyle Tretina <![CDATA[Guiding Generative Molecular Design with Experimental Feedback Using Oracles]]> http://www.open-lab.net/blog/?p=96966 2025-03-25T17:23:57Z 2025-03-19T15:00:00Z

Generative chemistry with AI has the potential to revolutionize how scientists approach drug discovery and development, health, and materials science and...

]]>

TJ Chen <![CDATA[Shrink Genomics and Single-Cell Analysis Time to Minutes with NVIDIA Parabricks and NVIDIA AI Blueprints]]> http://www.open-lab.net/blog/?p=96979 2025-03-20T18:33:12Z 2025-03-19T15:00:00Z

NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...

]]>

Siddharth Sharma <![CDATA[NVIDIA cuML Brings Zero Code Change Acceleration to scikit-learn]]> http://www.open-lab.net/blog/?p=97091 2025-04-23T00:22:52Z 2025-03-18T17:42:25Z

Scikit-learn, the most widely used ML library, is popular for processing tabular data because of its simple API, diversity of algorithms, and compatibility with...

]]>

Erik Ordentlich <![CDATA[Accelerate Apache Spark ML on NVIDIA GPUs with Zero Code Change]]> http://www.open-lab.net/blog/?p=96768 2025-04-23T00:36:38Z 2025-03-06T19:49:16Z

The NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It...

]]>

Mark J. Bennett <![CDATA[GPU-Accelerate Algorithmic Trading Simulations by over 100x with Numba]]> http://www.open-lab.net/blog/?p=96652 2025-03-10T23:13:45Z 2025-03-04T21:44:01Z

Quantitative developers need to run back-testing simulations to see how financial algorithms perform from a profit and loss (P&L) standpoint. Statistical...

]]>

Douglas Moore <![CDATA[Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI]]> http://www.open-lab.net/blog/?p=96530 2025-04-23T02:39:52Z 2025-02-28T18:11:50Z

According to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various...

]]>

Tom Augspurger <![CDATA[High-Performance Remote IO With NVIDIA KvikIO]]> http://www.open-lab.net/blog/?p=96582 2025-03-06T19:26:42Z 2025-02-27T17:55:52Z

Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...

]]>

1 Karthikeyan Natarajan <![CDATA[JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF]]> http://www.open-lab.net/blog/?p=95970 2025-04-23T02:44:00Z 2025-02-20T17:00:00Z

JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...

]]>

Kyle Tretina <![CDATA[Understanding the Language of Life's Biomolecules Across Evolution at a New Scale with Evo 2]]> http://www.open-lab.net/blog/?p=95589 2025-04-23T02:44:28Z 2025-02-19T17:14:51Z

AI has evolved from an experimental curiosity to a driving force within biological research. The convergence of deep learning algorithms, massive omics...

]]>

Brad Nemire <![CDATA[Featured Sessions for Students at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=96181 2025-02-20T15:52:32Z 2025-02-15T02:00:58Z

Learn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.

]]>

Rick Ratzel <![CDATA[Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie]]> http://www.open-lab.net/blog/?p=95820 2025-04-23T02:45:37Z 2025-02-13T17:00:00Z

As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....

]]>

Jesus Alvarez <![CDATA[NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat]]> http://www.open-lab.net/blog/?p=95069 2025-04-23T02:52:36Z 2025-02-10T17:48:26Z

NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...

]]>

3 Allison Ding <![CDATA[Get Started with GPU Acceleration for Data Science]]> http://www.open-lab.net/blog/?p=95894 2025-04-23T02:52:30Z 2025-02-06T23:07:48Z

In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...

]]>

Brad Nemire <![CDATA[Featured Researcher and Educator Sessions at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=95817 2025-02-06T19:33:45Z 2025-02-05T23:03:06Z

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.

]]>

Michelle Horton <![CDATA[AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment]]> http://www.open-lab.net/blog/?p=95722 2025-04-23T02:48:13Z 2025-02-04T17:16:54Z

A new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...

]]>

1 Jonathan Bentz <![CDATA[CUDA Toolkit Now Available for NVIDIA Blackwell]]> http://www.open-lab.net/blog/?p=95358 2025-04-23T14:58:16Z 2025-01-31T19:17:12Z

The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...

]]>

Prem Sagar Gali <![CDATA[Mastering the cudf.pandas Profiler for GPU Acceleration]]> http://www.open-lab.net/blog/?p=95351 2025-04-23T15:00:07Z 2025-01-30T17:00:00Z

In the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...

]]>

Matt Ahrens <![CDATA[Accelerating JSON Processing on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=95298 2025-04-23T15:01:08Z 2025-01-29T22:10:22Z

JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...

]]>

Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Evaluation]]> http://www.open-lab.net/blog/?p=95447 2025-04-23T15:01:33Z 2025-01-29T20:44:06Z

Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...

]]>

Juana Nakfour <![CDATA[Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes]]> http://www.open-lab.net/blog/?p=94972 2025-04-23T15:02:12Z 2025-01-22T17:34:51Z

NVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it's important to understand the...

]]>

2 Elias Wolfberg <![CDATA[AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells]]> http://www.open-lab.net/blog/?p=95106 2025-04-23T15:03:07Z 2025-01-16T19:09:15Z

With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL) have developed an AI...

]]>

Brian Tepera <![CDATA[Accelerating Time Series Forecasting with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=95127 2025-01-23T19:54:21Z 2025-01-16T17:20:10Z

Time series forecasting is a powerful data science technique used to predict future values based on data points from the past. Open source Python libraries like...

]]>

Brad Nemire <![CDATA[Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine]]> http://www.open-lab.net/blog/?p=94968 2025-01-23T19:54:27Z 2025-01-13T17:17:47Z

In the webinar on January 28th, you'll get an inside look at the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...

]]>

Nirmal Kumar Juluru <![CDATA[Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=94263 2025-01-23T19:54:27Z 2025-01-13T17:00:00Z

In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...

]]>

Kyle Tretina <![CDATA[Evaluating GenMol as a Generalist Foundation Model for Molecular Generation]]> http://www.open-lab.net/blog/?p=94836 2025-01-23T19:54:29Z 2025-01-13T14:00:00Z

Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....

]]>

Kyle Tretina <![CDATA[Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design]]> http://www.open-lab.net/blog/?p=94851 2025-01-23T19:54:28Z 2025-01-13T14:00:00Z

Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...

]]>

Peter Entschev <![CDATA[Accelerating GPU Analytics Using RAPIDS and Ray]]> http://www.open-lab.net/blog/?p=94495 2024-12-20T21:13:45Z 2024-12-20T21:13:42Z

RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...

]]>

Jenn Yonemitsu <![CDATA[NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows]]> http://www.open-lab.net/blog/?p=94393 2025-01-22T18:31:27Z 2024-12-20T18:00:00Z

Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...

]]>

Tom Balough <![CDATA[Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models]]> http://www.open-lab.net/blog/?p=94447 2024-12-19T23:08:12Z 2024-12-19T23:08:08Z

Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...

]]>

Nick Becker <![CDATA[RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs]]> http://www.open-lab.net/blog/?p=94415 2024-12-19T21:46:07Z 2024-12-19T21:21:42Z

RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...

]]>

Ziyue Xu <![CDATA[Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost]]> http://www.open-lab.net/blog/?p=93870 2024-12-17T19:33:44Z 2024-12-18T21:30:00Z

XGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...

]]>

Michelle Horton <![CDATA[Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization]]> http://www.open-lab.net/blog/?p=93566 2024-12-16T18:34:16Z 2024-12-16T18:34:14Z

2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...

]]>

0 Joe Bungo <![CDATA[NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators]]> https://news.www.open-lab.net/?p=19371 2024-12-12T19:35:13Z 2024-12-12T17:11:19Z

As data grows in volume, velocity, and complexity, the data science field is booming. There's an ever-increasing demand for talent and skill sets to...

]]>

Nick Becker <![CDATA[Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=93575 2024-12-12T19:17:22Z 2024-12-12T16:55:40Z

Modern classification workflows often require classifying individual records and data points into multiple categories instead of just assigning a single label....

]]>

Prem Sagar Gali <![CDATA[Unified Virtual Memory Supercharges pandas with RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=93438 2024-12-12T19:35:20Z 2024-12-05T19:07:07Z

cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements, up to 50x...

]]>

Vega Shah <![CDATA[In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics]]> http://www.open-lab.net/blog/?p=92757 2024-12-12T19:38:30Z 2024-12-03T18:00:00Z

Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...

]]>

Bradley Dice <![CDATA[Supercharging Deduplication in pandas Using RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=92703 2024-12-12T19:38:34Z 2024-11-28T14:00:00Z

A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...

]]>

Ben Zaitlen https://www.linkedin.com/in/benjamin-zaitlen-62ab7b4/ <![CDATA[Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask]]> http://www.open-lab.net/blog/?p=92480 2024-12-12T19:38:40Z 2024-11-21T19:02:03Z

As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth, multi-GPU training and analysis...

]]>

Mario Geiger <![CDATA[Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance]]> http://www.open-lab.net/blog/?p=91896 2024-11-18T22:58:58Z 2024-11-18T18:30:00Z

AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...

]]>

1 Wen Jie Ong <![CDATA[Revolutionizing AI-Driven Material Discovery Using NVIDIA ALCHEMI]]> http://www.open-lab.net/blog/?p=91999 2024-11-18T22:57:30Z 2024-11-18T18:30:00Z

AI has proven to be a force multiplier, helping to create a future where scientists can design entirely new materials, while engineers seamlessly transform...

]]>

Wonchan Lee <![CDATA[Effortlessly Scale NumPy from Laptops to Supercomputers with NVIDIA cuPyNumeric]]> http://www.open-lab.net/blog/?p=91682 2025-04-10T23:02:00Z 2024-11-18T17:00:00Z

Python is the most common programming language for data science, machine learning, and numerical computing. It continues to grow in popularity among scientists...

]]>

1 Nick Becker <![CDATA[Faster Causal Inference on Large Datasets with NVIDIA RAPIDS]]> http://www.open-lab.net/blog/?p=91854 2024-11-18T20:15:01Z 2024-11-14T16:00:00Z

As consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...

]]>

Nick Becker <![CDATA[NVIDIA RAPIDS 24.10 Introduces Accelerated NetworkX with Zero Code Change, Updates for UMAP and cuDF-Pandas]]> http://www.open-lab.net/blog/?p=91788 2024-11-14T17:10:34Z 2024-11-13T22:37:14Z

The RAPIDS v24.10 release takes another step forward in bringing accelerated computing to data scientists and developers with a seamless user experience. This...

]]>

Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Text Data Processing]]> http://www.open-lab.net/blog/?p=91738 2025-04-01T19:02:02Z 2024-11-13T18:05:06Z

Training and customizing LLMs for high accuracy is fraught with challenges, primarily due to their dependency on high-quality data. Poor data quality and...

]]>

Kyle Tretina <![CDATA[Boost Alphafold2 Protein Structure Prediction with GPU-Accelerated MMseqs2]]> http://www.open-lab.net/blog/?p=91623 2024-11-14T17:10:35Z 2024-11-13T17:00:00Z

The ability to compare the sequences of multiple related proteins is a foundational task for many life science researchers. This is often done in the form of a...

]]>

Michelle Horton <![CDATA[AI That "Hears" Heart Disease May Help Vets Diagnose Dogs]]> http://www.open-lab.net/blog/?p=91619 2024-11-14T17:10:40Z 2024-11-12T15:49:17Z

A new machine-learning algorithm that listens to digital heartbeat data could help veterinarians diagnose murmurs and early-stage heart disease in dogs....

]]>

Amr Elmeleegy <![CDATA[5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse]]> http://www.open-lab.net/blog/?p=91625 2024-11-14T17:10:41Z 2024-11-08T23:55:43Z

In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...

]]>

Chelsea Gomatam <![CDATA[Discover New Biological Insights with Accelerated Pangenome Alignment in NVIDIA Parabricks]]> http://www.open-lab.net/blog/?p=91220 2024-11-14T17:10:48Z 2024-11-04T17:39:18Z

NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...

]]>

1 Tyler Whitehouse <![CDATA[Frictionless Collaboration and Rapid Prototyping in Hybrid Environments with NVIDIA AI Workbench]]> http://www.open-lab.net/blog/?p=91234 2024-11-14T17:10:49Z 2024-11-04T17:30:00Z

NVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The...

]]>

Jinsol Park <![CDATA[Even Faster and More Scalable UMAP on the GPU with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=91198 2024-11-14T17:10:53Z 2024-10-31T20:24:07Z

UMAP is a popular dimension reduction algorithm used in fields like bioinformatics, NLP topic modeling, and ML preprocessing. It works by creating a k-nearest...

]]>

2 Summer Liu <![CDATA[Supercharging Fraud Detection in Financial Services with Graph Neural Networks]]> http://www.open-lab.net/blog/?p=90877 2024-10-31T18:36:06Z 2024-10-28T15:30:00Z

Fraud in financial services is a massive problem. According to NASDAQ, in 2023, banks faced $442 billion in projected losses from payments, checks, and credit...

]]>

Michael Yh Wang <![CDATA[Bridging the CUDA C++ Ecosystem and Python Developers with Numbast]]> http://www.open-lab.net/blog/?p=90086 2024-10-31T16:26:15Z 2024-10-24T16:30:00Z

By enabling CUDA kernels to be written in Python similar to how they can be implemented within C++, Numba bridges the gap between the Python ecosystem and the...

]]>

Michelle Horton <![CDATA[Optimizing Drug Discovery with CUDA Graphs, Coroutines, and GPU Workflows]]> http://www.open-lab.net/blog/?p=90780 2024-10-31T16:21:20Z 2024-10-23T17:28:49Z

Pharmaceutical research demands fast, efficient simulations to predict how molecules interact, speeding up drug discovery. Jiqun Tu, a senior developer...

]]>

Rick Ratzel <![CDATA[NetworkX Introduces Zero Code Change Acceleration Using NVIDIA cuGraph]]> http://www.open-lab.net/blog/?p=90753 2024-10-31T16:21:22Z 2024-10-22T18:00:00Z

NetworkX accelerated by NVIDIA cuGraph is a newly released backend co-developed with the NetworkX team. NVIDIA cuGraph provides GPU acceleration for popular...

]]>

Michelle Horton <![CDATA[AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead]]> http://www.open-lab.net/blog/?p=90546 2024-10-31T16:21:26Z 2024-10-21T16:00:00Z

New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...

]]>

Charlie Huang <![CDATA[Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=90198 2024-10-30T18:57:03Z 2024-10-16T16:30:00Z

The rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI,...

]]>

Nirmal Kumar Juluru <![CDATA[Train Highly Accurate LLMs with the Zyda-2 Open 5T-Token Dataset Processed with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=89677 2024-10-18T20:10:29Z 2024-10-15T18:00:00Z

Open-source datasets have significantly democratized access to high-quality data, lowering the barriers of entry for developers and researchers to train...

]]>

Michelle Horton <![CDATA[AI Research Revs Up EV Charging for Large-Scale Optimization, Speed, and Savings]]> http://www.open-lab.net/blog/?p=90119 2024-10-21T16:29:21Z 2024-10-14T15:54:39Z

Electric vehicle (EV) charging is getting a jolt with an innovative new AI algorithm that boosts efficiency, reduces cost, and keeps the grid from...

]]>

Nicolas Blin <![CDATA[Accelerate Large Linear Programming Problems with NVIDIA cuOpt]]> http://www.open-lab.net/blog/?p=89885 2024-10-17T18:19:09Z 2024-10-08T15:00:00Z

The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...

]]>

1 Nick Becker <![CDATA[NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library]]> http://www.open-lab.net/blog/?p=89963 2024-10-17T18:19:09Z 2024-10-08T15:00:00Z

Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...

]]>

Nirmal Kumar Juluru <![CDATA[Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation]]> http://www.open-lab.net/blog/?p=89756 2024-10-18T20:10:53Z 2024-10-04T16:00:00Z

NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.

]]>

Corey Nolet <![CDATA[Event: Community Over Code]]> http://www.open-lab.net/blog/?p=89692 2024-10-17T19:06:59Z 2024-10-03T20:00:00Z

Learn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.

]]>

Melody Tu <![CDATA[AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues]]> http://www.open-lab.net/blog/?p=89792 2024-10-23T23:36:01Z 2024-10-03T16:24:50Z

Antarctica plays a crucial role in regulating Earth's climate. Most climate research into the world's coldest, most windswept continent focuses on the...

]]>

Moon Chung <![CDATA[Event: NVIDIA cuOpt at INFORMS 2024]]> http://www.open-lab.net/blog/?p=89753 2024-10-17T19:07:01Z 2024-10-03T16:00:00Z

Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.

]]>

Tanya Lenz <![CDATA[Webinar: Accelerating Python with GPUs]]> http://www.open-lab.net/blog/?p=89659 2024-10-17T19:07:02Z 2024-10-02T18:00:00Z

Join us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.

]]>

Ville Tuulos <![CDATA[Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds]]> http://www.open-lab.net/blog/?p=89552 2024-10-17T19:07:03Z 2024-10-02T17:00:00Z

With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...

]]>

Michelle Horton <![CDATA[AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases]]> http://www.open-lab.net/blog/?p=89672 2024-10-17T19:07:03Z 2024-10-02T16:25:36Z

A groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....

]]>

Elias Wolfberg <![CDATA[AI Chatbot Delivers Multilingual Support to African Farmers]]> http://www.open-lab.net/blog/?p=89513 2024-10-17T19:07:10Z 2024-09-27T18:10:11Z

Some of Africa's most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...

]]>

Summer Liu <![CDATA[Harnessing Data with AI to Boost Zero Trust Cyber Defense]]> http://www.open-lab.net/blog/?p=89214 2024-10-28T21:54:29Z 2024-09-26T16:35:55Z

Modern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...

]]>

Jochen Papenbrock <![CDATA[Event: Developer Day for Financial Services]]> http://www.open-lab.net/blog/?p=89179 2024-09-19T19:28:59Z 2024-09-18T18:06:44Z

Join this virtual developer day to learn how AI and Machine Learning can revolutionize fraud detection and financial crime prevention.

]]>

Jamil Semaan <![CDATA[Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta]]> http://www.open-lab.net/blog/?p=89052 2024-12-12T22:32:12Z 2024-09-17T14:00:00Z

Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...

]]>

1 Michał Szołucha <![CDATA[Improved Data Loading with Threads]]> http://www.open-lab.net/blog/?p=88657 2024-09-19T19:30:59Z 2024-09-13T16:00:00Z

Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...

]]>

Gregory Kimball <![CDATA[Scaling Up to One Billion Rows of Data in pandas using RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=88761 2024-09-25T17:26:00Z 2024-09-11T16:54:53Z

The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...

]]>

Michelle Horton <![CDATA[Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA]]> http://www.open-lab.net/blog/?p=88069 2024-09-19T19:31:59Z 2024-09-11T16:25:00Z

Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...

]]>

1 Mehran Maghoumi <![CDATA[Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=87876 2024-10-18T20:11:21Z 2024-09-10T16:30:00Z

Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...

]]>

Anthony Mahanna <![CDATA[Accelerated, Production-Ready Graph Analytics for NetworkX Users]]> http://www.open-lab.net/blog/?p=88512 2024-09-09T21:06:55Z 2024-09-04T19:40:27Z

NetworkX is a popular, easy-to-use Python library for graph analytics. However, its performance and scalability may be unsatisfactory for medium-to-large-sized...

]]>

4 Tianna Nguy <![CDATA[Hands-On Training at NVIDIA AI Summit in Washington, DC]]> http://www.open-lab.net/blog/?p=88598 2024-09-05T17:57:08Z 2024-09-04T17:47:42Z

Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.

]]>

Amarnath Mohan <![CDATA[Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI]]> http://www.open-lab.net/blog/?p=87334 2024-09-05T17:57:10Z 2024-08-30T15:58:23Z

The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...

]]>

Oscar Javier Aldana <![CDATA[Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt]]> http://www.open-lab.net/blog/?p=88363 2024-09-05T17:57:11Z 2024-08-29T22:18:14Z

Driven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...

]]>

Michelle Horton <![CDATA[Boosting CUDA Efficiency with Essential Techniques for New Developers]]> http://www.open-lab.net/blog/?p=87823 2024-09-05T17:57:12Z 2024-08-29T17:00:00Z

To fully harness the capabilities of NVIDIA GPUs, optimizing NVIDIA CUDA performance is essential, particularly for developers new to GPU programming. This talk...

]]>

1 Prachi Goel <![CDATA[Just Released: RAPIDS 24.08]]> http://www.open-lab.net/blog/?p=88370 2024-09-05T17:57:13Z 2024-08-29T16:00:58Z

RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.

]]>

Amr Elmeleegy <![CDATA[NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks]]> http://www.open-lab.net/blog/?p=87970 2024-09-05T18:37:49Z 2024-08-28T16:00:00Z

Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...

]]>
