Skip to main content
Applied Soft ComputingVolume 149, December 2023, Article number 110955

Improving audit opinion prediction accuracy using metaheuristics-tuned XGBoost algorithm with interpretable results through SHAP value analysis(Article)

  Save all to author list
  • aFaculty of Business Economics, Singidunum University, Danijelova 32, Belgrade, 11000, Serbia
  • bFaculty of Informatics and Computing, Singidunum University, Danijelova 32, Belgrade, 11000, Serbia
  • cUniversity of Belgrade, Faculty of Transport and Traffic Engineering, Vojvode Stepe 305, Belgrade, 11010, Serbia
  • dYuan Ze University, College of Engineering, Department of Industrial Engineering and Management, Taoyuan City, 320315, Taiwan
  • eDepartment of Industrial Engineering, Istinye University, Istanbul, 34396, Turkey
  • fDepartment of Industrial and Mechanical Engineering, Lebanese American University, Byblos, Lebanon

Abstract

This study aims to create a machine learning model that can predict opinions in external audits and surpass the benchmark set in a prior study from the literature. This tool could reduce audit risk, which is a crucial task in external audits. Previous studies have shown that it is possible to create models that can predict the audit opinion a company will receive. In these studies, authors used statistics and machine learning models, and both non-financial (e.g. audit lag) and financial data (e.g. financial ratios, or absolute value items available from financial statements) to make predictions. In this study, the performance of the XGBoost model optimized by metaheuristics algorithms is examined and evaluated. This study compares the performance of six different metaheuristic algorithms used to tune the XGBoost model in two separate scenarios. The first scenario represents a realistic client portfolio, where a majority of the clients are known, while the second scenario simulates a new clients-only portfolio, a more difficult scenario where prior information such as audit lag is not available. The study uses a dataset of 12,690 observations of Serbian companies and their audit opinions from 2016 to 2019. The findings indicate an improvement over the benchmark due to a more optimized hyperparameter tuning process and the use of the iterative sine-cosine algorithm for the XGBoost model. © 2023 Elsevier B.V.

Author keywords

Audit opinion predictionMachine learningSHAP value analysisSine cosine algorithmXGBoost

Indexed keywords

Engineering controlled terms:FinanceHeuristic algorithmsIterative methodsMachine learningOptimizationValue engineering
Engineering uncontrolled termsAudit opinion predictionMachine learning modelsMachine-learningMeta-heuristics algorithmsMetaheuristicPerformancePrediction accuracySHAP value analyseSine-cosine algorithmXgboost
Engineering main heading:Forecasting
  • ISSN: 15684946
  • Source Type: Journal
  • Original language: English
  • DOI: 10.1016/j.asoc.2023.110955
  • Document Type: Article
  • Publisher: Elsevier Ltd

  Tirkolaee, E.B.; Department of Industrial Engineering, Istinye University, Istanbul, Turkey;
© Copyright 2023 Elsevier B.V., All rights reserved.

Cited by 21 documents

Surbhi , Chauhan, N.R. , Dahiya, N.
Optimizing XGBoost hyperparameters using the dragonfly algorithm for enhanced cyber attack detection in the internet of healthcare things (IoHT)
(2025) Cluster Computing
Li, S. , Huang, M. , Ma, M.
Interpretable artificial intelligence for advancing slope stability assessment techniques with Technosols
(2025) Soil Use and Management
Fan, C. , Ding, Y. , Zheng, Y.
Bond strength and failure mode prediction model for recycled aggregate concrete based on intelligent algorithm optimized support vector machine
(2025) Structures
View details of all 21 citations
{"topic":{"name":"Swarm Intelligence; Genetic Algorithm; Mathematical Optimization","id":42340,"uri":"Topic/42340","prominencePercentile":91.424446,"prominencePercentileString":"91.424","overallScholarlyOutput":0},"dig":"90be018df30376aa9ddab80e620678a148ecd1e31c5ddbe3a977c42f85713ebd"}

SciVal Topic Prominence

Topic:
Prominence percentile: