Lasso regression: From explanation to prediction
摘要: 传统的最小二乘回归法关注于对当前数据集的准确估计,容易导致模型的过拟合,影响模型结论的可重复性。随着方法学领域的发展,涌现出的新兴统计工具可以弥补传统方法的局限,从过度关注回归系数值的解释转向提升研究结果的预测能力也愈加成为心理学领域重要的发展趋势。Lasso方法通过在模型估计中引入惩罚项的方式,可以获得更高的预测准确度和模型概化能力,同时也可以有效地处理过拟合和多重共线性问题,有助于心理学理论的构建和完善。
Abstract: Psychological researches focus on describing, explaining and predicting behavior, and having a good understanding of the association between variables is an essential part of this process. Regression analysis, a method to evaluate the relationship between variables, is widely used in psychological studies. However, due to its highly focus on the interpretation of sample data, the traditional ordinary least squares regression has several drawbacks, such as over-fitting problem and limitation on dealing with multicollinearity, which may undermine the generalizability of the model. These drawbacks have an inevitable influence on the promotion and prediction of the model conclusion. With the rapid development of methodology, Least absolute shrinkage and selection operator (Lasso) regression has been emerged to better compensate for the limitations of traditional methods. By introducing a penalty term in the model and shrinking the regression coefficients to zero, Lasso regression can achieve a higher accuracy of model prediction and model generalizability with the cost of a certain estimation bias. Besides, Lasso regression can also effectively deal with the multicollinearity problem. Therefore, it has been widely used in medicine, economics, neuroscience and other fields. In psychology, due to the limitations of computer computing power, researchers used to mainly rely on hypothesis testing to understand the association among variables to verify theories. Now, with the rapid development of machine learning, a shift from focusing on interpretation of the regression coefficients to improving the prediction of the model has emerged and become more and more important. Therefore, based on fundamental theories and real data analysis, the aim of this paper is to introduce the principles, implementation steps and advantages of the Lasso regression. With the help of statistic science, it is promising that more and more applied researchers will be called upon to focus on the emerging statistical tools to promote the development of psychology.
[V1] | 2020-05-14 10:39:28 | ChinaXiv:202005.00044V1 | 下载全文 |
1. 系统合理信念维护心理健康:缓和作用与拓展 | 2023-09-23 |
2. The intersubjective interaction in psychoanalysis: Enlightenment from Martin Buber’s philosophy of dialogue | 2023-09-19 |
3. “惩前毖后”与“率先垂范”:第三方干预行为的影响效应 | 2023-09-18 |
4. 贝叶斯方差分析在JASP中的实现 | 2023-09-18 |
5. 加工需求驱动下词汇阅读神经通路的动态协作机制 | 2023-09-14 |
6. 第三方惩罚行为的认知神经机制 | 2023-09-13 |
7. 身份尴尬与身份辩护:劳务派遣员工组织身份发展过程 | 2023-09-12 |
8. 自我参照的神经成像认知本体论数据集 | 2023-09-12 |
9. 中文词语远距离联想测验的编制及初步探索 | 2023-09-10 |
10. 社会概念表征和整合的神经基础 | 2023-09-09 |