Model selection in multivariate adaptive regressions splines (MARS) using alternative information criteria

dc.contributor.authorBekar Adıgüzel, Meryem
dc.contributor.authorCengiz, Mehmet Ali
dc.date.accessioned2023-09-28T05:44:38Z
dc.date.available2023-09-28T05:44:38Z
dc.date.issued2023
dc.departmentOrtaköy Meslek Yüksekokulu
dc.description.abstractMultivariate Adaptive Regression Splines (MARS) is a useful non-parametric regression analysis method that can be used for model selection in high-dimensional data. Since MARS can identify and model complex, non-linear relationships between the dependent variable and independent variables without requiring any assumptions, it has advantage over simple linear regression techniques. Also, for simplifying the model building process and preventing overfitting, MARS can select automatically the variables to be included in the model, which is useful for datasets with many variables. While MARS is a flexible non-parametric regression method, generalized cross validation (GCV) technique is used within the MARS framework to avoid overfitting and to select the best model. GCV criterion is widely used and can be effective in many situations, however it has some criticism. These criticism are the arbitrary value of the smoothing parameter used in the algorithm of the GCV criterion and the models obtained using this criterion are high-dimensional. In this paper, it is aimed to obtain the barest model that best explains the relationship between the dependent variable and independent variables by using alternative information criteria (Akaike information criterion (AIC), Schwarz Bayesian criterion (SBC) and information complexity criterion (ICOMP(IFIM)PEU)) instead of the use of smoothing parameters in order to put an end to the criticism. To achieve this goal, a simulation study was first conducted with a data set composed of variables that do and do not contribute to the dependent variable to test the success of the information criteria. As a consequence of this simulation work, when variables (which do not contribute to the dependent variable) are not included in the regression model, it demonstrates the success of the criteria in model selection. As a real data set, the reasons for loan defaults were investigated between the years 2005–2019 by utilizing data from 18 banks operating in Türkiye. The results obtained reveal the success of ICOMP(IFIM)PEU criterion in model selection.
dc.identifier.doi10.1016/j.heliyon.2023.e19964
dc.identifier.issn2405-8440
dc.identifier.issue9en_US
dc.identifier.pmid37809827
dc.identifier.scopusqualityQ1
dc.identifier.urihttps:/dx.doi.org10.1016/j.heliyon.2023.e19964
dc.identifier.urihttps://hdl.handle.net/20.500.12451/10989
dc.identifier.volume9en_US
dc.identifier.wosWOS:001079469600001
dc.identifier.wosqualityQ1
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakPubMed
dc.language.isoen
dc.publisherElsevier Ltd
dc.relation.ispartofHeliyon
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectAkaike Information Criterion
dc.subjectInformation Complexity Criterion
dc.subjectModel Selection
dc.subjectMultivariate Adaptive Regression Splines
dc.subjectSchwarz Bayesian Information Criterion
dc.titleModel selection in multivariate adaptive regressions splines (MARS) using alternative information criteria
dc.typeArticle

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
bekar adiguzel-meryem-2023.pdf
Boyut:
747.1 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam Metin / Full Text
Lisans paketi
Listeleniyor 1 - 1 / 1
[ X ]
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: