ChinaXiv.org 中国科学院科技论文预发布平台

按提交时间

2022
1

按主题分类

计算机科学的集成理论
1

按作者

按机构

Artificial Intelligence Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
1
Electrical and Electronic Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
1
Information Systems Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
1
Research Centre for AI and IoT, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
1
Software Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
1

当前资源共 1条

隐藏摘要

点击量

时间

下载量

1. ChinaXiv:202211.00424
下载全文

Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems

分类：计算机科学 >> 计算机科学的集成理论提交时间： 2022-11-28 合作期刊: 《数据智能（英文）》

Boran, Sekeroglu Yoney, Kirsal Ever Kamil, Dimililer Fadi, Al-Turjman

摘要： Artificial intelligence and machine learning applications are of significant importance almost in every field of human life to solve problems or support human experts. However, the determination of the machine learning model to achieve a superior result for a particular problem within the wide real-life application areas is still a challenging task for researchers. The success of a model could be affected by several factors such as dataset characteristics, training strategy and model responses. Therefore, a comprehensive analysis is required to determine model ability and the efficiency of the considered strategies. This study implemented ten benchmark machine learning models on seventeen varied datasets. Experiments are performed using four different training strategies 60:40, 70:30, and 80:20 hold-out and five-fold cross-validation techniques. We used three evaluation metrics to evaluate the experimental results: mean squared error, mean absolute error, and coefficient of determination (R2 score). The considered models are analyzed, and each model's advantages, disadvantages, and data dependencies are indicated. As a result of performed excess number of experiments, the deep Long-Short Term Memory (LSTM) neural network outperformed other considered models, namely, decision tree, linear regression, support vector regression with a linear and radial basis function kernels, random forest, gradient boosting, extreme gradient boosting, shallow neural network, and deep neural network. It has also been shown that cross-validation has a tremendous impact on the results of the experiments and should be considered for the model evaluation in regression studies where data mining or selection is not performed.

点击量 2488 下载量 346 评论

Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems