Data Science – NVIDIA Technical BlogNews and tutorials for developers, data scientists, and IT admins2025-04-29T22:43:09Zhttp://www.open-lab.net/blog/feed/Joseph Lucas<![CDATA[Structuring Applications to Secure the KV Cache]]>http://www.open-lab.net/blog/?p=994252025-04-29T22:43:09Z2025-04-29T22:43:01ZWhen interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...
]]>Jenn Yonemitsu<![CDATA[Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers]]>http://www.open-lab.net/blog/?p=993502025-04-29T17:23:06Z2025-04-29T17:22:59ZKaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year��s Google Cloud Next...
]]>Bo Dong<![CDATA[NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support]]>http://www.open-lab.net/blog/?p=990892025-04-23T19:26:15Z2025-04-23T19:26:07ZNVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...
]]>Chris Deottehttps://www.kaggle.com/cdeotte<![CDATA[Grandmaster Pro Tip: Winning First Place in Kaggle Competition with Feature Engineering using NVIDIA cuDF-pandas]]>http://www.open-lab.net/blog/?p=989382025-04-22T23:45:10Z2025-04-17T23:03:20ZFeature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer...
]]>Ziyue Xu<![CDATA[Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming]]>http://www.open-lab.net/blog/?p=985532025-04-17T19:35:24Z2025-04-16T16:00:00ZFederated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....
]]>1Nirmal Kumar Juluru<![CDATA[NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy]]>http://www.open-lab.net/blog/?p=988552025-04-22T23:49:11Z2025-04-15T18:00:00ZAI is no longer just about generating text or images��it��s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...
]]>Ziyue Xu<![CDATA[Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch]]>http://www.open-lab.net/blog/?p=985602025-04-17T19:35:27Z2025-04-11T18:37:54ZNVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...
]]>1Shai Shen-Orr<![CDATA[Curating Biological Findings from Scientific Literature with NVIDIA NIM]]>http://www.open-lab.net/blog/?p=985262025-04-28T23:18:36Z2025-04-10T18:30:00ZScientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...
]]>Prem Sagar Gali<![CDATA[Efficiently Scaling Polars GPU Parquet Reader]]>http://www.open-lab.net/blog/?p=984352025-04-22T23:52:25Z2025-04-10T16:30:00ZWhen working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...
]]>Vinay Raman<![CDATA[Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data?]]>http://www.open-lab.net/blog/?p=979272025-04-17T19:35:37Z2025-04-07T18:39:06ZAs large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...
]]>Sama Bali<![CDATA[Event: HP & NVIDIA Developer Challenge]]>http://www.open-lab.net/blog/?p=984872025-04-17T19:35:39Z2025-04-07T17:54:00ZJoin the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.
]]>Matt Ahrens<![CDATA[Accelerating Apache Parquet Scans on Apache Spark with GPUs]]>http://www.open-lab.net/blog/?p=983502025-04-22T23:57:50Z2025-04-03T16:18:03ZAs data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...
]]>1Ronen Dar<![CDATA[NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration]]>http://www.open-lab.net/blog/?p=980942025-04-22T23:59:16Z2025-04-01T09:00:00ZToday, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....
]]>Brian Shi<![CDATA[Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases]]>http://www.open-lab.net/blog/?p=979002025-04-03T18:46:06Z2025-03-26T21:41:08ZLarge language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to...
]]>Cole Swain<![CDATA[Spotlight: Tomorrow.io?Transforms Global Weather Resilience with NVIDIA AI]]>http://www.open-lab.net/blog/?p=980232025-04-03T18:46:17Z2025-03-26T21:19:34ZFrom hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...
]]>John Ashcroft<![CDATA[Powering Flood Risk Assessment with NVIDIA Earth-2]]>http://www.open-lab.net/blog/?p=979742025-04-23T00:01:57Z2025-03-25T20:59:12ZInland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in...
]]>Xavier Renard<![CDATA[Spotlight: AXA Explores AI-Driven Hurricane Risk Assessment]]>http://www.open-lab.net/blog/?p=980962025-04-23T00:05:46Z2025-03-25T17:47:06ZLarge ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of...
]]>Holger Roth<![CDATA[Supercharging the Federated Learning Ecosystem by Integrating Flower and NVIDIA FLARE]]>http://www.open-lab.net/blog/?p=940452025-04-23T00:05:05Z2025-03-24T16:00:00ZIn recent years, open-source systems like Flower and NVIDIA FLARE have emerged as pivotal tools in the federated learning (FL) landscape, each with its unique...
]]>1Kyle Tretina<![CDATA[Guiding Generative Molecular Design with Experimental Feedback Using Oracles]]>http://www.open-lab.net/blog/?p=969662025-03-25T17:23:57Z2025-03-19T15:00:00ZGenerative chemistry with AI has the potential to revolutionize how scientists approach drug discovery and development, health, and materials science and...
]]>TJ Chen<![CDATA[Shrink Genomics and Single-Cell Analysis Time to Minutes with NVIDIA Parabricks and NVIDIA AI Blueprints]]>http://www.open-lab.net/blog/?p=969792025-03-20T18:33:12Z2025-03-19T15:00:00ZNVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
]]>Siddharth Sharma<![CDATA[NVIDIA cuML Brings Zero Code Change Acceleration to scikit-learn]]>http://www.open-lab.net/blog/?p=970912025-04-23T00:22:52Z2025-03-18T17:42:25ZScikit-learn, the most widely used ML library, is popular for processing tabular data because of its simple API, diversity of algorithms, and compatibility with...
]]>Erik Ordentlich<![CDATA[Accelerate Apache Spark ML on NVIDIA GPUs with Zero Code Change]]>http://www.open-lab.net/blog/?p=967682025-04-23T00:36:38Z2025-03-06T19:49:16ZThe NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It...
]]>Mark J. Bennett<![CDATA[GPU-Accelerate Algorithmic Trading Simulations by over 100x with Numba]]>http://www.open-lab.net/blog/?p=966522025-03-10T23:13:45Z2025-03-04T21:44:01ZQuantitative developers need to run back-testing simulations to see how financial algorithms perform from a profit and loss (P&L) standpoint. Statistical...
]]>Douglas Moore<![CDATA[Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI]]>http://www.open-lab.net/blog/?p=965302025-04-23T02:39:52Z2025-02-28T18:11:50ZAccording to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various...
]]>Tom Augspurger<![CDATA[High-Performance Remote IO With NVIDIA KvikIO]]>http://www.open-lab.net/blog/?p=965822025-03-06T19:26:42Z2025-02-27T17:55:52ZWorkloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
]]>1Karthikeyan Natarajan<![CDATA[JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF]]>http://www.open-lab.net/blog/?p=959702025-04-23T02:44:00Z2025-02-20T17:00:00ZJSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...
]]>Kyle Tretina<![CDATA[Understanding the Language of Life��s Biomolecules Across Evolution at a New Scale with Evo 2]]>http://www.open-lab.net/blog/?p=955892025-04-23T02:44:28Z2025-02-19T17:14:51ZAI has evolved from an experimental curiosity to a driving force within biological research. The convergence of deep learning algorithms, massive omics...
]]>Brad Nemire<![CDATA[Featured Sessions for Students at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=961812025-02-20T15:52:32Z2025-02-15T02:00:58ZLearn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.
]]>Rick Ratzel<![CDATA[Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie]]>http://www.open-lab.net/blog/?p=958202025-04-23T02:45:37Z2025-02-13T17:00:00ZAs the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....
]]>Jesus Alvarez<![CDATA[NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat]]>http://www.open-lab.net/blog/?p=950692025-04-23T02:52:36Z2025-02-10T17:48:26ZNVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
]]>3Allison Ding<![CDATA[Get Started with GPU Acceleration for Data Science]]>http://www.open-lab.net/blog/?p=958942025-04-23T02:52:30Z2025-02-06T23:07:48ZIn data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...
]]>Brad Nemire<![CDATA[Featured Researcher and Educator Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=958172025-02-06T19:33:45Z2025-02-05T23:03:06ZExplore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
]]>Michelle Horton<![CDATA[AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment]]>http://www.open-lab.net/blog/?p=957222025-04-23T02:48:13Z2025-02-04T17:16:54ZA new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
]]>1Jonathan Bentz<![CDATA[CUDA Toolkit Now Available for NVIDIA Blackwell?]]>http://www.open-lab.net/blog/?p=953582025-04-23T14:58:16Z2025-01-31T19:17:12ZThe latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...
]]>Prem Sagar Gali<![CDATA[Mastering the cudf.pandas Profiler for GPU Acceleration]]>http://www.open-lab.net/blog/?p=953512025-04-23T15:00:07Z2025-01-30T17:00:00ZIn the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...
]]>Matt Ahrens<![CDATA[Accelerating JSON Processing on Apache Spark with GPUs]]>http://www.open-lab.net/blog/?p=952982025-04-23T15:01:08Z2025-01-29T22:10:22ZJSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
]]>Amit Bleiweiss<![CDATA[Mastering LLM Techniques: Evaluation]]>http://www.open-lab.net/blog/?p=954472025-04-23T15:01:33Z2025-01-29T20:44:06ZEvaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...
]]>Juana Nakfour<![CDATA[Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes]]>http://www.open-lab.net/blog/?p=949722025-04-23T15:02:12Z2025-01-22T17:34:51ZNVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it��s important to understand the...
]]>2Elias Wolfberg<![CDATA[AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells]]>http://www.open-lab.net/blog/?p=951062025-04-23T15:03:07Z2025-01-16T19:09:15ZWith as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...
]]>Brian Tepera<![CDATA[Accelerating Time Series Forecasting with RAPIDS cuML]]>http://www.open-lab.net/blog/?p=951272025-01-23T19:54:21Z2025-01-16T17:20:10ZTime series forecasting is a powerful data science technique used to predict future values based on data points from the past Open source Python libraries like...
]]>Brad Nemire<![CDATA[Upcoming Webinar: Inside the RAPIDS-Accelerated Polars GPU Engine]]>http://www.open-lab.net/blog/?p=949682025-01-23T19:54:27Z2025-01-13T17:17:47ZIn the webinar on January 28th, you'll get an inside look of the new GPU engine to learn how Polars' declarative API and query optimizer enable seamless GPU...
]]>Nirmal Kumar Juluru<![CDATA[Enhancing Generative AI Model Accuracy with NVIDIA NeMo Curator]]>http://www.open-lab.net/blog/?p=942632025-01-23T19:54:27Z2025-01-13T17:00:00ZIn the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that...
]]>Kyle Tretina<![CDATA[Evaluating GenMol as a Generalist Foundation Model for Molecular Generation]]>http://www.open-lab.net/blog/?p=948362025-01-23T19:54:29Z2025-01-13T14:00:00ZTraditional computational drug discovery relies almost exclusively on highly task-specific computational models for hit identification and lead optimization....
]]>Kyle Tretina<![CDATA[Accelerate Protein Engineering with the NVIDIA BioNeMo Blueprint for Generative Protein Binder Design]]>http://www.open-lab.net/blog/?p=948512025-01-23T19:54:28Z2025-01-13T14:00:00ZDesigning a therapeutic protein that specifically binds its target in drug discovery is a staggering challenge. Traditional workflows are often a painstaking...
]]>Peter Entschev<![CDATA[Accelerating GPU Analytics Using RAPIDS and Ray]]>http://www.open-lab.net/blog/?p=944952024-12-20T21:13:45Z2024-12-20T21:13:42ZRAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...
]]>Jenn Yonemitsu<![CDATA[NVIDIA Hackathon Winners Share Strategies for RAPIDS-Accelerated ML Workflows]]>http://www.open-lab.net/blog/?p=943932025-01-22T18:31:27Z2024-12-20T18:00:00ZApproximately 220 teams gathered at the Open Data Science Conference (ODSC) West this year to compete in the NVIDIA hackathon, a 24-hour machine learning (ML)...
]]>Tom Balough<![CDATA[Enhance Your Training Data with New NVIDIA NeMo Curator Classifier Models]]>http://www.open-lab.net/blog/?p=944472024-12-19T23:08:12Z2024-12-19T23:08:08ZClassifier models are specialized in categorizing data into predefined groups or classes, playing a crucial role in optimizing data processing pipelines for...
]]>Nick Becker<![CDATA[RAPIDS 24.12 Introduces cuDF on PyPI, CUDA Unified Memory for Polars, and Faster GNNs]]>http://www.open-lab.net/blog/?p=944152024-12-19T21:46:07Z2024-12-19T21:21:42ZRAPIDS 24.12 introduces cuDF packages to PyPI, speeds up groupby aggregations and reading files from AWS S3, enables larger-than-GPU memory queries in the...
]]>Ziyue Xu<![CDATA[Security for Data Privacy in Federated Learning with CUDA-Accelerated Homomorphic Encryption in XGBoost]]>http://www.open-lab.net/blog/?p=938702024-12-17T19:33:44Z2024-12-18T21:30:00ZXGBoost is a machine learning algorithm widely used for tabular data modeling. To expand the XGBoost model from single-site learning to multisite collaborative...
]]>Michelle Horton<![CDATA[Top Posts of 2024 Highlight NVIDIA NIM, LLM Breakthroughs, and Data Science Optimization]]>http://www.open-lab.net/blog/?p=935662024-12-16T18:34:16Z2024-12-16T18:34:14Z2024 was another landmark year for developers, researchers, and innovators working with NVIDIA technologies. From groundbreaking developments in AI inference to...
]]>0Joe Bungo<![CDATA[NVIDIA Deep Learning Institute Releases New Data Science Teaching Kit for Educators]]>https://news.www.open-lab.net/?p=193712024-12-12T19:35:13Z2024-12-12T17:11:19ZAs data grows in volume, velocity, and complexity, the data science field is booming. There��s an ever-increasing demand for talent and skill sets to...
]]>Nick Becker<![CDATA[Harnessing GPU Acceleration for Multi-Label Classification with RAPIDS cuML]]>http://www.open-lab.net/blog/?p=935752024-12-12T19:17:22Z2024-12-12T16:55:40ZModern classification workflows often require classifying individual records and data points into multiple categories instead of just assigning a single label....
]]>Prem Sagar Gali<![CDATA[Unified Virtual Memory Supercharges pandas with RAPIDS cuDF]]>http://www.open-lab.net/blog/?p=934382024-12-12T19:35:20Z2024-12-05T19:07:07ZcuDF-pandas, introduced in a previous post, is a GPU-accelerated library that accelerates pandas to deliver significant performance improvements��up to 50x...
]]>Vega Shah<![CDATA[In-Silico Antibody Development with AlphaBind Using NVIDIA BioNeMo and AWS HealthOmics]]>http://www.open-lab.net/blog/?p=927572024-12-12T19:38:30Z2024-12-03T18:00:00ZAntibodies have become the most prevalent class of therapeutics, primarily due to their ability to target specific antigens, enabling them to treat a wide range...
]]>Bradley Dice<![CDATA[Supercharging Deduplication in pandas Using RAPIDS cuDF]]>http://www.open-lab.net/blog/?p=927032024-12-12T19:38:34Z2024-11-28T14:00:00ZA common operation in data analytics is to drop duplicate rows. Deduplication is critical in Extract, Transform, Load (ETL) workflows, where you might want to...
]]>Ben Zaitlenhttps://www.linkedin.com/in/benjamin-zaitlen-62ab7b4/<![CDATA[Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask]]>http://www.open-lab.net/blog/?p=924802024-12-12T19:38:40Z2024-11-21T19:02:03ZAs we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth��multi-gpu training and analysis...
]]>Mario Geiger<![CDATA[Accelerate Drug and Material Discovery with New Math Library NVIDIA cuEquivariance]]>http://www.open-lab.net/blog/?p=918962024-11-18T22:58:58Z2024-11-18T18:30:00ZAI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of...
]]>1Wen Jie Ong<![CDATA[Revolutionizing AI-Driven Material Discovery Using NVIDIA ALCHEMI]]>http://www.open-lab.net/blog/?p=919992024-11-18T22:57:30Z2024-11-18T18:30:00ZAI has proven to be a force multiplier, helping to create a future where scientists can design entirely new materials, while engineers seamlessly transform...
]]>Wonchan Lee<![CDATA[Effortlessly Scale NumPy from Laptops to Supercomputers with NVIDIA cuPyNumeric]]>http://www.open-lab.net/blog/?p=916822025-04-10T23:02:00Z2024-11-18T17:00:00ZPython is the most common programming language for data science, machine learning, and numerical computing. It continues to grow in popularity among scientists...
]]>1Nick Becker<![CDATA[Faster Causal Inference on Large Datasets with NVIDIA RAPIDS]]>http://www.open-lab.net/blog/?p=918542024-11-18T20:15:01Z2024-11-14T16:00:00ZAs consumer applications generate more data than ever before, enterprises are turning to causal inference methods for observational data to help shed light on...
]]>Nick Becker<![CDATA[NVIDIA RAPIDS 24.10 Introduces Accelerated NetworkX with Zero Code Change, Updates for UMAP and cuDF-Pandas]]>http://www.open-lab.net/blog/?p=917882024-11-14T17:10:34Z2024-11-13T22:37:14ZThe RAPIDS v24.10 release takes another step forward in bringing accelerated computing to data scientists and developers with a seamless user experience. This...
]]>Amit Bleiweiss<![CDATA[Mastering LLM Techniques: Text Data Processing]]>http://www.open-lab.net/blog/?p=917382025-04-01T19:02:02Z2024-11-13T18:05:06ZTraining and customizing LLMs for high accuracy is fraught with challenges, primarily due to their dependency on high-quality data. Poor data quality and...
]]>Kyle Tretina<![CDATA[Boost Alphafold2 Protein Structure Prediction with GPU-Accelerated MMseqs2]]>http://www.open-lab.net/blog/?p=916232024-11-14T17:10:35Z2024-11-13T17:00:00ZThe ability to compare the sequences of multiple related proteins is a foundational task for many life science researchers. This is often done in the form of a...
]]>Michelle Horton<![CDATA[AI That ��Hears�� Heart Disease May Help Vets Diagnose Dogs]]>http://www.open-lab.net/blog/?p=916192024-11-14T17:10:40Z2024-11-12T15:49:17ZA new machine-learning algorithm that listens to digital heartbeat data could help veterinarians diagnose murmurs and early-stage heart disease in dogs....
]]>Amr Elmeleegy<![CDATA[5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse]]>http://www.open-lab.net/blog/?p=916252024-11-14T17:10:41Z2024-11-08T23:55:43ZIn our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
]]>Chelsea Gomatam<![CDATA[Discover New Biological Insights with Accelerated Pangenome Alignment in NVIDIA Parabricks]]>http://www.open-lab.net/blog/?p=912202024-11-14T17:10:48Z2024-11-04T17:39:18ZNVIDIA Parabricks is a scalable genomics analysis software suite that solves omics challenges with accelerated computing and deep learning to unlock new...
]]>1Tyler Whitehouse<![CDATA[Frictionless Collaboration and Rapid Prototyping in Hybrid Environments with NVIDIA AI Workbench]]>http://www.open-lab.net/blog/?p=912342024-11-14T17:10:49Z2024-11-04T17:30:00ZNVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The...
]]>Jinsol Park<![CDATA[Even Faster and More Scalable UMAP on the GPU with RAPIDS cuML]]>http://www.open-lab.net/blog/?p=911982024-11-14T17:10:53Z2024-10-31T20:24:07ZUMAP is a popular dimension reduction algorithm used in fields like bioinformatics, NLP topic modeling, and ML preprocessing. It works by creating a k-nearest...
]]>2Summer Liu<![CDATA[Supercharging Fraud Detection in Financial Services with Graph Neural Networks]]>http://www.open-lab.net/blog/?p=908772024-10-31T18:36:06Z2024-10-28T15:30:00ZFraud in financial services is a massive problem. According to NASDAQ, in 2023, banks faced $442 billion in projected losses from payments, checks, and credit...
]]>Michael Yh Wang<![CDATA[Bridging the CUDA C++ Ecosystem and Python Developers with Numbast]]>http://www.open-lab.net/blog/?p=900862024-10-31T16:26:15Z2024-10-24T16:30:00ZBy enabling CUDA kernels to be written in Python similar to how they can be implemented within C++, Numba bridges the gap between the Python ecosystem and the...
]]>Michelle Horton<![CDATA[Optimizing Drug Discovery with CUDA Graphs, Coroutines, and GPU Workflows]]>http://www.open-lab.net/blog/?p=907802024-10-31T16:21:20Z2024-10-23T17:28:49ZPharmaceutical research demands fast, efficient simulations to predict how molecules interact, speeding up drug discovery. Jiqun Tu, a senior developer...
]]>Rick Ratzel<![CDATA[NetworkX Introduces Zero Code Change Acceleration Using NVIDIA cuGraph]]>http://www.open-lab.net/blog/?p=907532024-10-31T16:21:22Z2024-10-22T18:00:00ZNetworkX accelerated by NVIDIA cuGraph is a newly released backend co-developed with the NetworkX team. NVIDIA cuGraph provides GPU acceleration for popular...
]]>Michelle Horton<![CDATA[AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead]]>http://www.open-lab.net/blog/?p=905462024-10-31T16:21:26Z2024-10-21T16:00:00ZNew research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
]]>Charlie Huang<![CDATA[Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM]]>http://www.open-lab.net/blog/?p=901982024-10-30T18:57:03Z2024-10-16T16:30:00ZThe rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI,...
]]>Nirmal Kumar Juluru<![CDATA[Train Highly Accurate LLMs with the Zyda-2 Open 5T-Token Dataset Processed with NVIDIA NeMo Curator]]>http://www.open-lab.net/blog/?p=896772024-10-18T20:10:29Z2024-10-15T18:00:00ZOpen-source datasets have significantly democratized access to high-quality data, lowering the barriers of entry for developers and researchers to train...
]]>Michelle Horton<![CDATA[AI Research Revs Up EV Charging for Large-Scale Optimization, Speed, and Savings]]>http://www.open-lab.net/blog/?p=901192024-10-21T16:29:21Z2024-10-14T15:54:39ZElectric vehicle (EV) charging is getting a jolt with an innovative new AI algorithm that boosts efficiency, reduces cost, and keeps the grid from...
]]>Nicolas Blin<![CDATA[Accelerate Large Linear Programming Problems with NVIDIA cuOpt]]>http://www.open-lab.net/blog/?p=898852024-10-17T18:19:09Z2024-10-08T15:00:00ZThe evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...
]]>1Nick Becker<![CDATA[NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library]]>http://www.open-lab.net/blog/?p=899632024-10-17T18:19:09Z2024-10-08T15:00:00ZPolars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...
]]>Nirmal Kumar Juluru<![CDATA[Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation]]>http://www.open-lab.net/blog/?p=897562024-10-18T20:10:53Z2024-10-04T16:00:00ZNeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
]]>Corey Nolet<![CDATA[Event: Community Over Code]]>http://www.open-lab.net/blog/?p=896922024-10-17T19:06:59Z2024-10-03T20:00:00ZLearn about accelerating vector search with NVIDIA cuVS and Apache Solr on October 10 at Community Over Code.
]]>Melody Tu<![CDATA[AI Investigates Antarctica��s Disappearing Moss to Uncover Climate Change Clues]]>http://www.open-lab.net/blog/?p=897922024-10-23T23:36:01Z2024-10-03T16:24:50ZAntarctica plays a crucial role in regulating ?Earth��s climate. Most climate research into the world��s coldest, most windswept continent focuses on the...
]]>Moon Chung<![CDATA[Event: NVIDIA cuOpt at INFORMS 2024]]>http://www.open-lab.net/blog/?p=897532024-10-17T19:07:01Z2024-10-03T16:00:00ZJoin NVIDIA cuOpt engineers at INFORMS 2024 on October 22-23 to learn how to revolutionize accelerated computing.
]]>Tanya Lenz<![CDATA[Webinar: Accelerating Python with GPUs]]>http://www.open-lab.net/blog/?p=896592024-10-17T19:07:02Z2024-10-02T18:00:00ZJoin us on October 9 to learn how your applications can benefit from NVIDIA CUDA Python software initiatives.
]]>Ville Tuulos<![CDATA[Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds]]>http://www.open-lab.net/blog/?p=895522024-10-17T19:07:03Z2024-10-02T17:00:00ZWith the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
]]>Michelle Horton<![CDATA[AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases]]>http://www.open-lab.net/blog/?p=896722024-10-17T19:07:03Z2024-10-02T16:25:36ZA groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....
]]>Elias Wolfberg<![CDATA[AI Chatbot Delivers Multilingual Support to African Farmers]]>http://www.open-lab.net/blog/?p=895132024-10-17T19:07:10Z2024-09-27T18:10:11ZSome of Africa��s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot?that gives detailed...
]]>Summer Liu<![CDATA[Harnessing Data with AI to Boost Zero Trust Cyber Defense]]>http://www.open-lab.net/blog/?p=892142024-10-28T21:54:29Z2024-09-26T16:35:55ZModern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...
]]>Jochen Papenbrock<![CDATA[Event: Developer Day for Financial Services]]>http://www.open-lab.net/blog/?p=891792024-09-19T19:28:59Z2024-09-18T18:06:44ZJoin this virtual developer day to learn how AI and Machine Learning can revolutionize fraud detection and financial crime prevention.
]]>Jamil Semaan<![CDATA[Polars GPU Engine Powered by RAPIDS cuDF Now Available in Open Beta]]>http://www.open-lab.net/blog/?p=890522024-12-12T22:32:12Z2024-09-17T14:00:00ZToday, Polars released a new GPU engine powered by RAPIDS cuDF that accelerates Polars workflows up to 13x on NVIDIA GPUs, allowing data scientists to process...
]]>1Micha? Szo?ucha<![CDATA[Improved Data Loading with Threads]]>http://www.open-lab.net/blog/?p=886572024-09-19T19:30:59Z2024-09-13T16:00:00ZData loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
]]>Gregory Kimball<![CDATA[Scaling Up to One Billion Rows of Data in pandas using RAPIDS cuDF]]>http://www.open-lab.net/blog/?p=887612024-09-25T17:26:00Z2024-09-11T16:54:53ZThe One Billion Row Challenge is a fun benchmark to showcase basic data processing operations. It was originally launched as a pure-Java competition, and has...
]]>Michelle Horton<![CDATA[Advanced Strategies for High-Performance GPU Programming with NVIDIA CUDA]]>http://www.open-lab.net/blog/?p=880692024-09-19T19:31:59Z2024-09-11T16:25:00ZStephen Jones, a leading expert and distinguished NVIDIA CUDA architect, offers his guidance and insights with a deep dive into the complexities of mapping...
]]>1Mehran Maghoumi<![CDATA[Streamlining Data Processing for Domain Adaptive Pretraining with NVIDIA NeMo Curator]]>http://www.open-lab.net/blog/?p=878762024-10-18T20:11:21Z2024-09-10T16:30:00ZDomain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate...
]]>Anthony Mahanna<![CDATA[Accelerated, Production-Ready Graph Analytics for NetworkX Users]]>http://www.open-lab.net/blog/?p=885122024-09-09T21:06:55Z2024-09-04T19:40:27ZNetworkX is a popular, easy-to-use Python library for graph analytics. However, its performance and scalability may be unsatisfactory for medium-to-large-sized...
]]>4Tianna Nguy<![CDATA[Hands-On Training at NVIDIA AI Summit in Washington, DC]]>http://www.open-lab.net/blog/?p=885982024-09-05T17:57:08Z2024-09-04T17:47:42ZImmerse yourself in NVIDIA technology with our full-day, hands-on technical workshops at our AI Summit in Washington D.C. on October 7, 2024.
]]>Amarnath Mohan<![CDATA[Accelerating Predictive Maintenance in Manufacturing with RAPIDS AI]]>http://www.open-lab.net/blog/?p=873342024-09-05T17:57:10Z2024-08-30T15:58:23ZThe International Society of Automation (ISA) reports that 5% of plant production is lost annually due to downtime. Putting that into a different context,...
]]>Oscar Javier Aldana<![CDATA[Spotlight: clicOH Accelerates Last-Mile Delivery 20x with NVIDIA cuOpt]]>http://www.open-lab.net/blog/?p=883632024-09-05T17:57:11Z2024-08-29T22:18:14ZDriven by shifts in consumer behavior and the pandemic, e-commerce continues its explosive growth and transformation. As a result, logistics and transportation...
]]>Michelle Horton<![CDATA[Boosting CUDA Efficiency with Essential Techniques for New Developers]]>http://www.open-lab.net/blog/?p=878232024-09-05T17:57:12Z2024-08-29T17:00:00ZTo fully harness the capabilities of NVIDIA GPUs, optimizing NVIDIA CUDA performance is essential, particularly for developers new to GPU programming. This talk...
]]>1Prachi Goel<![CDATA[Just Released: RAPIDS 24.08]]>http://www.open-lab.net/blog/?p=883702024-09-05T17:57:13Z2024-08-29T16:00:58ZRAPIDS 24.08 is now available with significant updates geared towards processing larger workloads and seamless CPU/GPU interoperability.
]]>Amr Elmeleegy<![CDATA[NVIDIA Triton Inference Server Achieves Outstanding Performance in MLPerf Inference 4.1 Benchmarks]]>http://www.open-lab.net/blog/?p=879702024-09-05T18:37:49Z2024-08-28T16:00:00ZSix years ago, we embarked on a journey to develop an AI inference serving solution specifically designed for high-throughput and time-sensitive production use...