Eryk Lewinson – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2023-08-10T17:11:24Z http://www.open-lab.net/blog/feed/ Eryk Lewinson <![CDATA[A Comprehensive Guide on Interaction Terms in Time Series Forecasting]]> http://www.open-lab.net/blog/?p=68209 2023-08-10T17:11:24Z 2023-07-20T17:00:00Z Modeling time series data can be challenging (and fascinating) due to its inherent complexity and unpredictability. Long-term trends in time series can change...]]>

Modeling time series data can be challenging (and fascinating) due to its inherent complexity and unpredictability. Long-term trends in time series can change drastically due to certain events, for example. Recall the beginning of the global pandemic, when businesses such as airlines or brick-and-mortar shops saw a quick decline in the number of customers and sales. In contrast…

Source

]]>
0
Eryk Lewinson <![CDATA[A Comprehensive Guide to Interaction Terms in Linear Regression]]> http://www.open-lab.net/blog/?p=63780 2023-06-09T22:27:33Z 2023-04-26T17:00:00Z Linear regression is a powerful statistical tool used to model the relationship between a dependent variable and one or more independent variables (features)....]]>

Linear regression is a powerful statistical tool used to model the relationship between a dependent variable and one or more independent variables (features). An important, and often forgotten, concept in regression analysis is that of interaction terms. In short, interaction terms enable you to examine whether the relationship between the target and the independent variable changes depending on…

Source

]]>
1
Eryk Lewinson <![CDATA[A Comprehensive Overview of Regression Evaluation Metrics]]> http://www.open-lab.net/blog/?p=63623 2023-07-11T23:19:05Z 2023-04-20T15:00:00Z As a data scientist, evaluating machine learning model performance is a crucial aspect of your work. To do so effectively, you have a wide range of statistical...]]>

As a data scientist, evaluating machine learning model performance is a crucial aspect of your work. To do so effectively, you have a wide range of statistical metrics at your disposal, each with its own unique strengths and weaknesses. By developing a solid understanding of these metrics, you are not only better equipped to choose the best one for optimizing your model but also to explain your…

Source

]]>
1
Eryk Lewinson <![CDATA[Dealing with Outliers Using Three Robust Linear Regression Models]]> http://www.open-lab.net/blog/?p=49692 2023-06-12T09:23:51Z 2022-07-20T16:30:00Z Linear regression is one of the simplest machine learning models out there. It is often the starting point not only for learning about data science but also for...]]>

Linear regression is one of the simplest machine learning models out there. It is often the starting point not only for learning about data science but also for building quick and simple minimum viable products (MVPs), which then serve as benchmarks for more complex algorithms. In general, linear regression fits a line (in two dimensions) or a hyperplane (in three and more dimensions) that…

Source

]]>
0
Eryk Lewinson <![CDATA[Three Approaches to Encoding Time Information as Features for ML Models]]> http://www.open-lab.net/blog/?p=44228 2022-08-21T23:53:25Z 2022-02-17T16:00:00Z Imagine you have just started a new data science project. The goal is to build a model predicting Y, the target variable. You have already received some data...]]>

Imagine you have just started a new data science project. The goal is to build a model predicting Y, the target variable. You have already received some data from the stakeholders/data engineers, did a thorough EDA, and selected some variables you believe are relevant for the problem at hand. Then you finally built your first model. The score is acceptable, but you believe you can do much better.

Source

]]>
0
���˳���97caoporen����