Meghana Ravikumar – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2022-08-21T23:40:31Z http://www.open-lab.net/blog/feed/ Meghana Ravikumar <![CDATA[Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 3]]> http://www.open-lab.net/blog/?p=19520 2022-08-21T23:40:31Z 2020-08-18T17:35:00Z This is the third post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part...]]>

This is the third post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part 2 discusses the setup for the Bayesian optimization. In my previous posts, I discussed the importance of BERT for transfer learning in NLP, and established the foundations of this experiment’s design. In this post, I discuss the model…

Source

]]>
0
Meghana Ravikumar <![CDATA[Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 2]]> http://www.open-lab.net/blog/?p=19510 2022-08-21T23:40:31Z 2020-08-18T17:30:00Z This is the second post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part...]]>

This is the second post in this series about distilling BERT with multimetric Bayesian optimization. Part 1 discusses the background for the experiment and Part 3 discusses the results. In my previous post, I discussed the importance of the BERT architecture in making transfer learning accessible in NLP. BERT allows a variety of problems to share off-the-shelf, pretrained models and moves NLP…

Source

]]>
0
Meghana Ravikumar <![CDATA[Efficient BERT: Finding Your Optimal Model with Multimetric Bayesian Optimization, Part 1]]> http://www.open-lab.net/blog/?p=19499 2022-08-21T23:40:28Z 2020-08-18T17:25:00Z This is the first post in a series about distilling BERT with multimetric Bayesian optimization. Part 2 discusses the set up for the Bayesian experiment, and...]]>

This is the first post in a series about distilling BERT with multimetric Bayesian optimization. Part 2 discusses the set up for the Bayesian experiment, and Part 3 discusses the results. You’ve all heard of BERT: Ernie’s partner in crime. Just kidding! I mean the natural language processing (NLP) architecture developed by Google in 2018. That’s much less exciting, I know. However…

Source

]]>
0
Meghana Ravikumar <![CDATA[Optimizing End-to-End Memory Networks Using SigOpt and GPUs]]> http://www.open-lab.net/blog/?p=13572 2022-08-21T23:39:19Z 2019-02-21T14:00:45Z Natural language systems have become the go-between for humans and AI-assisted digital services. Digital assistants, chatbots, and automated HR systems all rely...]]>

Natural language systems have become the go-between for humans and AI-assisted digital services. Digital assistants, chatbots, and automated HR systems all rely on understanding language, working in the space of question answering. So what are question answering (QA) systems and why do they matter? In general, QA systems take some sort of context in the form of natural language and retrieve…

Source

]]>
0
���˳���97caoporen����