More transparent and explainable machine learning algorithms are required to provide enhanced and sustainable dataset understanding

Bibliographic Details
Published in	Ecological modelling Vol. 498; p. 110898
Main Author	Wood, David A.
Format	Journal Article
Language	English
Published	Elsevier B.V. 01.12.2024
Subjects	artificial intelligence; data collection; Dataset interrogation; ethics; Forensic dataset interpretability; Optimized data matching; politics; prediction; Prediction explainability; Python coded TOB; Transparent open box (TOB) algorithms
ISSN	0304-3800
DOI	10.1016/j.ecolmodel.2024.110898

Summary:
• The need for transparent and explainable machine learning algorithms is justified.
• Few such algorithms exist, leading to mistrust of the decisions they influence.
• Transparent open box (TOB) configures optimized data matching for transparency.
• TOB has recently been upgraded to be executable more flexibly in open-source Python code.
• Interpretability of complex ecological datasets is enhanced by applying TOB.

For detailed dataset interrogation and auditing purposes, the lack of dataset explainability/transparency of the majority of available machine-learning (ML) models poses limitations. There is a tendency for ML models to focus on prediction speed and accuracy at the expense of transparently revealing dataset relationships. A case is made here to broaden that focus and for ML models to offer alternative configurations tailored to provide more explanations about how individual predictions are derived. Indeed, those striving to achieve sustainable objectives should not rely on opaque ML models and should seek transparency as a fundamental objective of good modelling practice (GMP). Doing so tends to boost trust and confidence in the outputs of models relating to complex socio-environmental systems (SES), particularly those being used to potentially justify controversial social, political and ethical decisions. Currently, the transparent open box (TOB) algorithms are the only ML algorithms available that are configured specifically to routinely provide detailed data record relationships for each of their predictions. This study describes the data mining benefits of the Python-coded optimized data-matching TOB algorithms generally, and when applied to environmental datasets characterized by complex non-linear relationships involving many variables.