To achieve state-of-the-art machine learning (ML) solutions, data scientists often build complex ML models. However, these techniques are computationally expensive and, until recently, required extensive background knowledge, experience, and human effort. At GTC 21, AWS Senior Data Scientist Nick Erickson gave a session sharing how the combination of AutoGluon, RAPIDS…
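As a rough illustration of the kind of workflow the session covers, the sketch below shows AutoGluon's tabular API training an ensemble on a tabular dataset with a few lines of code. The file path and label column name are hypothetical, and the session's exact setup (including its RAPIDS integration) may differ.

```python
# Minimal AutoGluon tabular sketch; "train.csv" and the "target" column are
# illustrative assumptions, not taken from the session.
from autogluon.tabular import TabularDataset, TabularPredictor

train_data = TabularDataset("train.csv")                       # load a tabular training set
predictor = TabularPredictor(label="target").fit(train_data)   # train an ensemble of models
print(predictor.leaderboard())                                  # compare the trained models
```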
Recommender systems (RecSys) have become a key component in many online services, such as e-commerce, social media, news services, and online video streaming. However, with their growth in importance, the increasing scale of industry datasets, and more sophisticated models, the bar has been raised for the computational resources required by recommendation systems. To meet the computational demands…
Recommender systems (RecSys) have become a key component in many online services, such as e-commerce, social media, news services, and online video streaming. However, with their growth in importance, the increasing scale of industry datasets, and more sophisticated models, the bar has been raised for the computational resources required by recommendation systems. After NVIDIA introduced Merlin…
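For context, the sketch below shows a minimal preprocessing workflow with NVTabular, the feature-engineering component of NVIDIA Merlin, which runs recommender ETL on the GPU and out of core. The column names and file paths are hypothetical assumptions for illustration.

```python
# Minimal NVTabular (NVIDIA Merlin) preprocessing sketch; columns and paths are illustrative.
import nvtabular as nvt
from nvtabular import ops

cat_features = ["user_id", "item_id"] >> ops.Categorify()   # map IDs to contiguous integers
cont_features = ["price"] >> ops.Normalize()                 # standardize continuous columns

workflow = nvt.Workflow(cat_features + cont_features)
dataset = nvt.Dataset("interactions.parquet")                # GPU-backed, out-of-core dataset

workflow.fit(dataset)                                        # compute category maps and statistics
workflow.transform(dataset).to_parquet("processed/")         # write the transformed data
```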
With the growing interest in deep learning (DL), more and more users are running DL in production environments. Because DL requires intensive computational power, developers are leveraging GPUs to run their training and inference jobs. Recently, as part of a major Apache Spark initiative to better unify DL and data processing on Spark, GPUs became a schedulable resource in Apache Spark 3.
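To make this concrete, the sketch below requests GPUs as a schedulable resource when building a Spark 3 session. The amounts and the discovery-script path are illustrative assumptions; the actual values depend on the cluster manager and hardware.

```python
# Sketch of GPU scheduling in Spark 3; values and the discovery-script path are assumptions.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("gpu-scheduling-example")
    .config("spark.executor.resource.gpu.amount", "1")      # one GPU per executor
    .config("spark.task.resource.gpu.amount", "1")          # one GPU per task
    .config("spark.executor.resource.gpu.discoveryScript",
            "/opt/spark/getGpusResources.sh")               # script reporting GPUs visible to the executor
    .getOrCreate()
)
```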
Kaggle is an online community that allows data scientists and machine learning engineers to find and publish datasets, learn, explore, build models, and collaborate with their peers. Members also enter competitions to solve data science challenges. Kaggle members progress through the following performance tiers as they achieve more: Novice, Contributor, Expert, Master, and Grandmaster. The quality and quantity of…
Apache Spark has emerged as the standard framework for large-scale, distributed data analytics processing. NVIDIA worked with the Apache Spark community to accelerate the world’s most popular data analytics framework and to offer revolutionary GPU acceleration on several leading platforms, including Google Cloud, Databricks, and Cloudera. Now, Amazon EMR joins the list of leading platforms…
At GTC Spring 2020, Adobe, Verizon Media, and Uber each discussed how they used Spark 3.0 with GPUs to accelerate and scale ML big data preprocessing, training, and tuning pipelines. There are multiple challenges when it comes to the performance of large-scale machine learning (ML) solutions: huge datasets, complex data preprocessing and feature engineering pipelines…
Apache Spark continued the effort to analyze big data that Apache Hadoop started over 15 years ago and has become the leading framework for large-scale distributed data processing. Today, hundreds of thousands of data engineers and scientists are working with Spark across 16,000+ enterprises and organizations. One reason Spark has taken the torch from Hadoop is that it can process data…
Given the parallel nature of many data processing tasks, it’s only natural that the massively parallel architecture of a GPU should be able to parallelize and accelerate Apache Spark data processing queries, in the same way that a GPU accelerates deep learning (DL) in artificial intelligence (AI). NVIDIA has worked with the Apache Spark community to implement GPU acceleration through the…
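As a rough sketch of what this looks like in practice, the snippet below enables the RAPIDS Accelerator plugin for Apache Spark so that supported SQL and DataFrame operations run on the GPU. It assumes the RAPIDS Accelerator jar is already on the classpath; the dataset path and resource amounts are illustrative.

```python
# Sketch of enabling the RAPIDS Accelerator for Apache Spark; paths and values are assumptions.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("rapids-accelerated-sql")
    .config("spark.plugins", "com.nvidia.spark.SQLPlugin")   # load the RAPIDS SQL plugin
    .config("spark.rapids.sql.enabled", "true")              # run supported operators on the GPU
    .config("spark.executor.resource.gpu.amount", "1")
    .config("spark.task.resource.gpu.amount", "1")
    .getOrCreate()
)

# Supported DataFrame/SQL operations in this session now execute on the GPU.
df = spark.read.parquet("events.parquet")                    # hypothetical dataset
df.groupBy("user_id").count().show()
```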