Data Science – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-04-29T22:43:09Z http://www.open-lab.net/blog/feed/ Joseph Lucas <![CDATA[Structuring Applications to Secure the KV Cache]]> http://www.open-lab.net/blog/?p=99425 2025-04-29T22:43:09Z 2025-04-29T22:43:01Z When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...

Source

]]>
Jenn Yonemitsu <![CDATA[Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers]]> http://www.open-lab.net/blog/?p=99350 2025-04-29T17:23:06Z 2025-04-29T17:22:59Z Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year��s Google Cloud Next...

Source

]]>
Bo Dong <![CDATA[NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support]]> http://www.open-lab.net/blog/?p=99089 2025-04-23T19:26:15Z 2025-04-23T19:26:07Z NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...

Source

]]>
Chris Deotte https://www.kaggle.com/cdeotte <![CDATA[Grandmaster Pro Tip: Winning First Place in Kaggle Competition with Feature Engineering using NVIDIA cuDF-pandas]]> http://www.open-lab.net/blog/?p=98938 2025-04-22T23:45:10Z 2025-04-17T23:03:20Z Feature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer...

Source

]]>
Ziyue Xu <![CDATA[Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming]]> http://www.open-lab.net/blog/?p=98553 2025-04-17T19:35:24Z 2025-04-16T16:00:00Z Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....

Source

]]>
1
Nirmal Kumar Juluru <![CDATA[NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy]]> http://www.open-lab.net/blog/?p=98855 2025-04-22T23:49:11Z 2025-04-15T18:00:00Z AI is no longer just about generating text or images��it��s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...

Source

]]>
Ziyue Xu <![CDATA[Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch]]> http://www.open-lab.net/blog/?p=98560 2025-04-17T19:35:27Z 2025-04-11T18:37:54Z NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...

Source

]]>
1
Shai Shen-Orr <![CDATA[Curating Biological Findings from Scientific Literature with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=98526 2025-04-28T23:18:36Z 2025-04-10T18:30:00Z Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...

Source

]]>
Prem Sagar Gali <![CDATA[Efficiently Scaling Polars GPU Parquet Reader]]> http://www.open-lab.net/blog/?p=98435 2025-04-22T23:52:25Z 2025-04-10T16:30:00Z When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...

Source

]]>
Vinay Raman <![CDATA[Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data?]]> http://www.open-lab.net/blog/?p=97927 2025-04-17T19:35:37Z 2025-04-07T18:39:06Z As large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...

Source

]]>
Sama Bali <![CDATA[Event: HP & NVIDIA Developer Challenge]]> http://www.open-lab.net/blog/?p=98487 2025-04-17T19:35:39Z 2025-04-07T17:54:00Z Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.

Source

]]>
Matt Ahrens <![CDATA[Accelerating Apache Parquet Scans on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=98350 2025-04-22T23:57:50Z 2025-04-03T16:18:03Z As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...

Source

]]>
1
Ronen Dar <![CDATA[NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration]]> http://www.open-lab.net/blog/?p=98094 2025-04-22T23:59:16Z 2025-04-01T09:00:00Z Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....

Source

]]>
Brian Shi <![CDATA[Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases]]> http://www.open-lab.net/blog/?p=97900 2025-04-03T18:46:06Z 2025-03-26T21:41:08Z Large language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to...

Source

]]>
Cole Swain <![CDATA[Spotlight: Tomorrow.io?Transforms Global Weather Resilience with NVIDIA AI]]> http://www.open-lab.net/blog/?p=98023 2025-04-03T18:46:17Z 2025-03-26T21:19:34Z From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...

Source

]]>
John Ashcroft <![CDATA[Powering Flood Risk Assessment with NVIDIA Earth-2]]> http://www.open-lab.net/blog/?p=97974 2025-04-23T00:01:57Z 2025-03-25T20:59:12Z Inland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in...

Source

]]>
Xavier Renard <![CDATA[Spotlight: AXA Explores AI-Driven Hurricane Risk Assessment]]> http://www.open-lab.net/blog/?p=98096 2025-04-23T00:05:46Z 2025-03-25T17:47:06Z Large ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of...

Source

]]>
Holger Roth <![CDATA[Supercharging the Federated Learning Ecosystem by Integrating Flower and NVIDIA FLARE]]> http://www.open-lab.net/blog/?p=94045 2025-04-23T00:05:05Z 2025-03-24T16:00:00Z In recent years, open-source systems like Flower and NVIDIA FLARE have emerged as pivotal tools in the federated learning (FL) landscape, each with its unique...

Source

]]>
1
Kyle Tretina <![CDATA[Guiding Generative Molecular Design with Experimental Feedback Using Oracles]]> http://www.open-lab.net/blog/?p=96966 2025-03-25T17:23:57Z 2025-03-19T15:00:00Z Generative chemistry with AI has the potential to revolutionize how scientists approach drug discovery and development, health, and materials science and...

Source

]]>
TJ Chen <![CDATA[Shrink Genomics and Single-Cell Analysis Time to Minutes with NVIDIA Parabricks and NVIDIA AI Blueprints]]> http://www.open-lab.net/blog/?p=96979 2025-03-20T18:33:12Z 2025-03-19T15:00:00Z NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...

Source

]]>
Siddharth Sharma <![CDATA[NVIDIA cuML Brings Zero Code Change Acceleration to scikit-learn]]> http://www.open-lab.net/blog/?p=97091 2025-04-23T00:22:52Z 2025-03-18T17:42:25Z Scikit-learn, the most widely used ML library, is popular for processing tabular data because of its simple API, diversity of algorithms, and compatibility with...

Source

]]>
Erik Ordentlich <![CDATA[Accelerate Apache Spark ML on NVIDIA GPUs with Zero Code Change]]> http://www.open-lab.net/blog/?p=96768 2025-04-23T00:36:38Z 2025-03-06T19:49:16Z The NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It...

Source

]]>
Mark J. Bennett <![CDATA[GPU-Accelerate Algorithmic Trading Simulations by over 100x with Numba]]> http://www.open-lab.net/blog/?p=96652 2025-03-10T23:13:45Z 2025-03-04T21:44:01Z Quantitative developers need to run back-testing simulations to see how financial algorithms perform from a profit and loss (P&L) standpoint. Statistical...

Source

]]>
Douglas Moore <![CDATA[Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI]]> http://www.open-lab.net/blog/?p=96530 2025-04-23T02:39:52Z 2025-02-28T18:11:50Z According to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various...

Source

]]>
Tom Augspurger <![CDATA[High-Performance Remote IO With NVIDIA KvikIO]]> http://www.open-lab.net/blog/?p=96582 2025-03-06T19:26:42Z 2025-02-27T17:55:52Z Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...

Source

]]>
1
Karthikeyan Natarajan <![CDATA[JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF]]> http://www.open-lab.net/blog/?p=95970 2025-04-23T02:44:00Z 2025-02-20T17:00:00Z JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...

Source

]]>
Kyle Tretina <![CDATA[Understanding the Language of Life��s Biomolecules Across Evolution at a New Scale with Evo 2]]> http://www.open-lab.net/blog/?p=95589 2025-04-23T02:44:28Z 2025-02-19T17:14:51Z AI has evolved from an experimental curiosity to a driving force within biological research. The convergence of deep learning algorithms, massive omics...

Source

]]>
Brad Nemire <![CDATA[Featured Sessions for Students at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=96181 2025-02-20T15:52:32Z 2025-02-15T02:00:58Z Learn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.

Source

]]>
Rick Ratzel <![CDATA[Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie]]> http://www.open-lab.net/blog/?p=95820 2025-04-23T02:45:37Z 2025-02-13T17:00:00Z As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....

Source

]]>
Jesus Alvarez <![CDATA[NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat]]> http://www.open-lab.net/blog/?p=95069 2025-04-23T02:52:36Z 2025-02-10T17:48:26Z NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...

Source

]]>
3
Allison Ding <![CDATA[Get Started with GPU Acceleration for Data Science]]> http://www.open-lab.net/blog/?p=95894 2025-04-23T02:52:30Z 2025-02-06T23:07:48Z In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...

Source

]]>
Brad Nemire <![CDATA[Featured Researcher and Educator Sessions at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=95817 2025-02-06T19:33:45Z 2025-02-05T23:03:06Z Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.

Source

]]>
Michelle Horton <![CDATA[AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment]]> http://www.open-lab.net/blog/?p=95722 2025-04-23T02:48:13Z 2025-02-04T17:16:54Z A new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...

Source

]]>
1
Jonathan Bentz <![CDATA[CUDA Toolkit Now Available for NVIDIA Blackwell?]]> http://www.open-lab.net/blog/?p=95358 2025-04-23T14:58:16Z 2025-01-31T19:17:12Z The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...

Source

]]>
Prem Sagar Gali <![CDATA[Mastering the cudf.pandas Profiler for GPU Acceleration]]> http://www.open-lab.net/blog/?p=95351 2025-04-23T15:00:07Z 2025-01-30T17:00:00Z In the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...

Source

]]>
Matt Ahrens <![CDATA[Accelerating JSON Processing on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=95298 2025-04-23T15:01:08Z 2025-01-29T22:10:22Z JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...

Source

]]>
Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Evaluation]]> http://www.open-lab.net/blog/?p=95447 2025-04-23T15:01:33Z 2025-01-29T20:44:06Z Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...

Source

]]>
Juana Nakfour <![CDATA[Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes]]> http://www.open-lab.net/blog/?p=94972 2025-04-23T15:02:12Z 2025-01-22T17:34:51Z NVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it��s important to understand the...

Source

]]>
2
Elias Wolfberg <![CDATA[AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells]]> http://www.open-lab.net/blog/?p=95106 2025-04-23T15:03:07Z 2025-01-16T19:09:15Z With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...

Source

]]>
Brian Tepera <![CDATA[Accelerating Time Series Forecasting with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=95127 2025-01-23T19:54:21Z 2025-01-16T17:20:10Z Time series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...

Source

]]>
Brad Nemire <![CDATA[Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine]]> http://www.open-lab.net/blog/?p=94968 2025-01-23T19:54:27Z 2025-01-13T17:17:47Z In the webinar on January 28th, you'll get an inside look of the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...

Source

]]>
Nirmal Kumar Juluru <![CDATA[Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=94263 2025-01-23T19:54:27Z 2025-01-13T17:00:00Z In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...

Source

]]>
Kyle Tretina <![CDATA[Evaluating GenMol as a Generalist Foundation Model for Molecular Generation]]> http://www.open-lab.net/blog/?p=94836 2025-01-23T19:54:29Z 2025-01-13T14:00:00Z Traditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....

Source

]]>
Kyle Tretina <![CDATA[Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design]]> http://www.open-lab.net/blog/?p=94851 2025-01-23T19:54:28Z 2025-01-13T14:00:00Z Designing a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...

Source

]]>
Peter Entschev <![CDATA[Accelerating GPU Analytics Using RAPIDS and Ray]]> http://www.open-lab.net/blog/?p=94495 2024-12-20T21:13:45Z 2024-12-20T21:13:42Z RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...

Source

]]>
Jenn Yonemitsu <![CDATA[NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows]]> http://www.open-lab.net/blog/?p=94393 2025-01-22T18:31:27Z 2024-12-20T18:00:00Z Approximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...

Source

]]>
Tom Balough <![CDATA[Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models]]> http://www.open-lab.net/blog/?p=94447 2024-12-19T23:08:12Z 2024-12-19T23:08:08Z Classifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...

Source

]]>
Nick Becker <![CDATA[RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs]]> http://www.open-lab.net/blog/?p=94415 2024-12-19T21:46:07Z 2024-12-19T21:21:42Z RAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...

Source

]]>
Ziyue Xu <![CDATA[Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost]]> http://www.open-lab.net/blog/?p=93870 2024-12-17T19:33:44Z 2024-12-18T21:30:00Z XGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...

Source

]]>
Michelle Horton <![CDATA[Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization]]> http://www.open-lab.net/blog/?p=93566 2024-12-16T18:34:16Z 2024-12-16T18:34:14Z 2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...

Source

]]>
0
Joe Bungo <![CDATA[NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators]]> https://news.www.open-lab.net/?p=19371 2024-12-12T19:35:13Z 2024-12-12T17:11:19Z As data grows in volume, velocity, and complexity, the data science field is booming.  There��s an ever-increasing demand for talent and skill sets to...

Source

]]>
Nick Becker <![CDATA[Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=93575 2024-12-12T19:17:22Z 2024-12-12T16:55:40Z Modern classification workflows often require classifying individual records and data points into multiple categories instead of just assigning a single label....

Source

]]>
Prem Sagar Gali <![CDATA[Unified Virtual Memory Supercharges pandas with RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=93438 2024-12-12T19:35:20Z 2024-12-05T19:07:07Z cuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements��up to 50x...

Source

]]>
Vega Shah <![CDATA[In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics]]> http://www.open-lab.net/blog/?p=92757 2024-12-12T19:38:30Z 2024-12-03T18:00:00Z Antibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...

Source

]]>
Bradley Dice <![CDATA[Supercharging Deduplication in pandas Using RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=92703 2024-12-12T19:38:34Z 2024-11-28T14:00:00Z A common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...

Source

]]>
Ben Zaitlen https://www.linkedin.com/in/benjamin-zaitlen-62ab7b4/ <![CDATA[Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask]]> http://www.open-lab.net/blog/?p=92480 2024-12-12T19:38:40Z 2024-11-21T19:02:03Z As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth��multi-gpu training and analysis...

Source

]]>
Mario Geiger <![CDATA[Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance]]> http://www.open-lab.net/blog/?p=91896 2024-11-18T22:58:58Z 2024-11-18T18:30:00Z AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...

Source

]]>
1
Wen Jie Ong <![CDATA[Revolutionizing AI-Driven Material Discovery Using NVIDIA ALCHEMI]]> http://www.open-lab.net/blog/?p=91999 2024-11-18T22:57:30Z 2024-11-18T18:30:00Z AI has proven to be a force multiplier, helping to create a future where scientists can design entirely new materials, while engineers seamlessly transform...

Source

]]>
Wonchan Lee <![CDATA[Effortlessly Scale NumPy from Laptops to Supercomputers with NVIDIA cuPyNumeric]]> http://www.open-lab.net/blog/?p=91682 2025-04-10T23:02:00Z 2024-11-18T17:00:00Z Python is the most common programming language for data science, machine learning, and numerical computing. It continues to grow in popularity among scientists...

Source

]]>
1
Nick Becker <![CDATA[Faster Causal Inference on Large Datasets with NVIDIA RAPIDS]]> http://www.open-lab.net/blog/?p=91854 2024-11-18T20:15:01Z 2024-11-14T16:00:00Z As consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...

Source

]]>
Nick Becker <![CDATA[NVIDIA RAPIDS 24.10 Introduces Accelerated NetworkX with Zero Code Change, Updates for UMAP and cuDF-Pandas]]> http://www.open-lab.net/blog/?p=91788 2024-11-14T17:10:34Z 2024-11-13T22:37:14Z The RAPIDS v24.10 release takes another step forward in bringing accelerated computing to data scientists and developers with a seamless user experience. This...

Source

]]>
Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Text Data Processing]]> http://www.open-lab.net/blog/?p=91738 2025-04-01T19:02:02Z 2024-11-13T18:05:06Z Training and customizing LLMs for high accuracy is fraught with challenges, primarily due to their dependency on high-quality data. Poor data quality and...

Source

]]>
Kyle Tretina <![CDATA[Boost Alphafold2 Protein Structure Prediction with GPU-Accelerated MMseqs2]]> http://www.open-lab.net/blog/?p=91623 2024-11-14T17:10:35Z 2024-11-13T17:00:00Z The ability to compare the sequences of multiple related proteins is a foundational task for many life science researchers. This is often done in the form of a...

Source

]]>
Michelle Horton <![CDATA[AI That ��Hears�� Heart Disease May Help Vets Diagnose Dogs]]> http://www.open-lab.net/blog/?p=91619 2024-11-14T17:10:40Z 2024-11-12T15:49:17Z A new machine-learning algorithm that listens to digital heartbeat data could help veterinarians diagnose murmurs and early-stage heart disease in dogs....

Source

]]>
Amr Elmeleegy <![CDATA[5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse]]> http://www.open-lab.net/blog/?p=91625 2024-11-14T17:10:41Z 2024-11-08T23:55:43Z In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...

Source

]]>
Chelsea Gomatam <![CDATA[Discover New Biological Insights with Accelerated Pangenome Alignment in NVIDIA Parabricks]]> http://www.open-lab.net/blog/?p=91220 2024-11-14T17:10:48Z 2024-11-04T17:39:18Z NVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...

Source

]]>
1
Tyler Whitehouse <![CDATA[Frictionless Collaboration and Rapid Prototyping in Hybrid Environments with NVIDIA AI Workbench]]> http://www.open-lab.net/blog/?p=91234 2024-11-14T17:10:49Z 2024-11-04T17:30:00Z NVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The...

Source

]]>
Jinsol Park <![CDATA[Even Faster and More Scalable UMAP on the GPU with RAPIDS cuML]]> http://www.open-lab.net/blog/?p=91198 2024-11-14T17:10:53Z 2024-10-31T20:24:07Z UMAP is a popular dimension reduction algorithm used in fields like bioinformatics, NLP topic modeling, and ML preprocessing. It works by creating a k-nearest...

Source

]]>
2
Summer Liu <![CDATA[Supercharging Fraud Detection in Financial Services with Graph Neural Networks]]> http://www.open-lab.net/blog/?p=90877 2024-10-31T18:36:06Z 2024-10-28T15:30:00Z Fraud in financial services is a massive problem. According to NASDAQ, in 2023, banks faced $442 billion in projected losses from payments, checks, and credit...

Source

]]>
Michael Yh Wang <![CDATA[Bridging the CUDA C++ Ecosystem and Python Developers with Numbast]]> http://www.open-lab.net/blog/?p=90086 2024-10-31T16:26:15Z 2024-10-24T16:30:00Z By enabling CUDA kernels to be written in Python similar to how they can be implemented within C++, Numba bridges the gap between the Python ecosystem and the...

Source

]]>
Michelle Horton <![CDATA[Optimizing Drug Discovery with CUDA Graphs, Coroutines, and GPU Workflows]]> http://www.open-lab.net/blog/?p=90780 2024-10-31T16:21:20Z 2024-10-23T17:28:49Z Pharmaceutical research demands fast, efficient simulations to predict how molecules interact, speeding up drug discovery. Jiqun Tu, a senior developer...

Source

]]>
Rick Ratzel <![CDATA[NetworkX Introduces Zero Code Change Acceleration Using NVIDIA cuGraph]]> http://www.open-lab.net/blog/?p=90753 2024-10-31T16:21:22Z 2024-10-22T18:00:00Z NetworkX accelerated by NVIDIA cuGraph is a newly released backend co-developed with the NetworkX team. NVIDIA cuGraph provides GPU acceleration for popular...

Source

]]>
Michelle Horton <![CDATA[AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead]]> http://www.open-lab.net/blog/?p=90546 2024-10-31T16:21:26Z 2024-10-21T16:00:00Z New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...

Source

]]>
Charlie Huang <![CDATA[Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=90198 2024-10-30T18:57:03Z 2024-10-16T16:30:00Z The rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI,...

Source

]]>
Nirmal Kumar Juluru <![CDATA[Train Highly Accurate LLMs with the Zyda-2 Open 5T-Token Dataset Processed with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=89677 2024-10-18T20:10:29Z 2024-10-15T18:00:00Z Open-source datasets have significantly democratized access to high-quality data, lowering the barriers of entry for developers and researchers to train...

Source

]]>
Michelle Horton <![CDATA[AI Research Revs Up EV Charging for Large-Scale Optimization, Speed, and Savings]]> http://www.open-lab.net/blog/?p=90119 2024-10-21T16:29:21Z 2024-10-14T15:54:39Z Electric vehicle (EV) charging is getting a jolt with an innovative new AI algorithm that boosts efficiency, reduces cost, and keeps the grid from...

Source

]]>
Nicolas Blin <![CDATA[Accelerate Large Linear Programming Problems with NVIDIA cuOpt]]> http://www.open-lab.net/blog/?p=89885 2024-10-17T18:19:09Z 2024-10-08T15:00:00Z The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...

Source

]]>
1
Nick Becker <![CDATA[NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library]]> http://www.open-lab.net/blog/?p=89963 2024-10-17T18:19:09Z 2024-10-08T15:00:00Z Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...

Source

]]>
Nirmal Kumar Juluru <![CDATA[Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation]]> http://www.open-lab.net/blog/?p=89756 2024-10-18T20:10:53Z 2024-10-04T16:00:00Z NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.

Source

]]>
Corey Nolet <![CDATA[Event: Community Over Code]]> http://www.open-lab.net/blog/?p=89692 2024-10-17T19:06:59Z 2024-10-03T20:00:00Z Learn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.

Source

]]>
Melody Tu <![CDATA[AI Investigates Antarctica��s Disappearing Moss to Uncover Climate Change Clues]]> http://www.open-lab.net/blog/?p=89792 2024-10-23T23:36:01Z 2024-10-03T16:24:50Z Antarctica plays a crucial role in regulating ?Earth��s climate. Most climate research into the world��s coldest, most windswept continent focuses on the...

Source

]]>
Moon Chung <![CDATA[Event: NVIDIA cuOpt at INFORMS 2024]]> http://www.open-lab.net/blog/?p=89753 2024-10-17T19:07:01Z 2024-10-03T16:00:00Z Join NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.

Source

]]>
Tanya Lenz <![CDATA[Webinar: Accelerating Python with GPUs]]> http://www.open-lab.net/blog/?p=89659 2024-10-17T19:07:02Z 2024-10-02T18:00:00Z Join us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.

Source

]]>
Ville Tuulos <![CDATA[Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds]]> http://www.open-lab.net/blog/?p=89552 2024-10-17T19:07:03Z 2024-10-02T17:00:00Z With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...

Source

]]>
Michelle Horton <![CDATA[AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases]]> http://www.open-lab.net/blog/?p=89672 2024-10-17T19:07:03Z 2024-10-02T16:25:36Z A groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....

Source

]]>
Elias Wolfberg <![CDATA[AI Chatbot Delivers Multilingual Support to African Farmers]]> http://www.open-lab.net/blog/?p=89513 2024-10-17T19:07:10Z 2024-09-27T18:10:11Z Some of Africa��s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot?that gives detailed...

Source

]]>
Summer Liu <![CDATA[Harnessing Data with AI to Boost Zero Trust Cyber Defense]]> http://www.open-lab.net/blog/?p=89214 2024-10-28T21:54:29Z 2024-09-26T16:35:55Z Modern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...

Source

]]>
Jochen Papenbrock <![CDATA[Event: Developer Day for Financial Services]]> http://www.open-lab.net/blog/?p=89179 2024-09-19T19:28:59Z 2024-09-18T18:06:44Z Join this virtual developer day to learn how AI and Machine Learning can revolutionize fraud detection and financial crime prevention.

Source

]]>
Jamil Semaan <![CDATA[Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta]]> http://www.open-lab.net/blog/?p=89052 2024-12-12T22:32:12Z 2024-09-17T14:00:00Z Today, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...

Source

]]>
1
Micha? Szo?ucha <![CDATA[Improved Data Loading with Threads]]> http://www.open-lab.net/blog/?p=88657 2024-09-19T19:30:59Z 2024-09-13T16:00:00Z Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...

Source

]]>
Gregory Kimball <![CDATA[Scaling Up to One Billion Rows of Data in pandas using RAPIDS cuDF]]> http://www.open-lab.net/blog/?p=88761 2024-09-25T17:26:00Z 2024-09-11T16:54:53Z The One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...

Source

]]>
Michelle Horton <![CDATA[Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA]]> http://www.open-lab.net/blog/?p=88069 2024-09-19T19:31:59Z 2024-09-11T16:25:00Z Stephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...

Source

]]>
1
Mehran Maghoumi <![CDATA[Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=87876 2024-10-18T20:11:21Z 2024-09-10T16:30:00Z Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...

Source

]]>
Anthony Mahanna <![CDATA[Accelerated, Production-Ready Graph Analytics for NetworkX Users]]> http://www.open-lab.net/blog/?p=88512 2024-09-09T21:06:55Z 2024-09-04T19:40:27Z NetworkX is a popular, easy-to-use Python library for graph analytics. However, its performance and scalability may be unsatisfactory for medium-to-large-sized...

Source

]]>
4
Tianna Nguy <![CDATA[Hands-On Training at NVIDIA AI Summit in Washington, DC]]> http://www.open-lab.net/blog/?p=88598 2024-09-05T17:57:08Z 2024-09-04T17:47:42Z Immerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.

Source

]]>
Amarnath Mohan <![CDATA[Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI]]> http://www.open-lab.net/blog/?p=87334 2024-09-05T17:57:10Z 2024-08-30T15:58:23Z The International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...

Source

]]>
Oscar Javier Aldana <![CDATA[Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt]]> http://www.open-lab.net/blog/?p=88363 2024-09-05T17:57:11Z 2024-08-29T22:18:14Z Driven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...

Source

]]>
Michelle Horton <![CDATA[Boosting CUDA Efficiency with Essential Techniques for New Developers]]> http://www.open-lab.net/blog/?p=87823 2024-09-05T17:57:12Z 2024-08-29T17:00:00Z To fully harness the capabilities of NVIDIA GPUs, optimizing NVIDIA CUDA performance is essential, particularly for developers new to GPU programming. This talk...

Source

]]>
1
Prachi Goel <![CDATA[Just Released: RAPIDS 24.08]]> http://www.open-lab.net/blog/?p=88370 2024-09-05T17:57:13Z 2024-08-29T16:00:58Z RAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.

Source

]]>
Amr Elmeleegy <![CDATA[NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks]]> http://www.open-lab.net/blog/?p=87970 2024-09-05T18:37:49Z 2024-08-28T16:00:00Z Six years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...

Source

]]>
���˳���97caoporen����