More transparent and explainable machine learning algorithms are required to provide enhanced and sustainable dataset understanding
•The need for transparent and explainable machine learning algorithms is justified.•Few such algorithms exist leading to mistrust of the decisions hey influence.•Transparent open box (TOB) configures optimized data matching for transparency.•TOB is recently upgraded to be executable more flexibly in...
Saved in:
| Published in | Ecological modelling Vol. 498; p. 110898 |
|---|---|
| Main Author | |
| Format | Journal Article |
| Language | English |
| Published |
Elsevier B.V
01.12.2024
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0304-3800 |
| DOI | 10.1016/j.ecolmodel.2024.110898 |
Cover
| Summary: | •The need for transparent and explainable machine learning algorithms is justified.•Few such algorithms exist leading to mistrust of the decisions hey influence.•Transparent open box (TOB) configures optimized data matching for transparency.•TOB is recently upgraded to be executable more flexibly in open-source Python code.•Interpretability of complex ecological datasets are enhanced by applying TOB.
For detailed dataset interrogation and auditing purposes the lack of dataset explainability/transparency of the majority of available machine-learning (ML) models poses limitations. There is a tendency for ML models to focus on prediction speed and accuracy at the expense of transparently revealing dataset relationships. A case is made here to broaden that focus and for ML models to offer alternative configurations tailored to provide more explanations about how individual predictions are derived. Indeed, those striving to achieve sustainable objectives should not rely on opaque ML models and seek transparency as a fundamental objective of good modelling practice (GMP). Doing so tends to boost trust and confidence in the outputs of models relating to complex socio-environmental systems (SES), particularly those being used to potentially justify controversial social, political and ethical decisions. Currently, the transparent open box algorithms (TOB) are the only ML algorithms available that are configured specifically to routinely provide detailed data record relationships for each of their predictions. This study describes the data mining benefits of the Python-coded optimized data-matching TOB algorithms generally, and when applied to environmental datasets characterized by complex non-linear relationships involving many variables. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0304-3800 |
| DOI: | 10.1016/j.ecolmodel.2024.110898 |