A hybrid fault detection and diagnosis method in server rooms' cooling systems

Data centers as all complex systems are prone to faults, and cost of them can be very high. This paper is focused on detecting the faults in the cooling systems, in particular on local fans level. In the paper, a hybrid approach is proposed. In the approach a model is used as substitute of the real...

Full description

Saved in:
Bibliographic Details
Published inIEEE International Conference on Industrial Informatics (INDIN) Vol. 1; pp. 1405 - 1410
Main Authors Berezovskaya, Yulia, Yang, Chen-Wei, Mousavi, Arash, Zhang, Xiaojing, Vyatkin, Valeriy
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.07.2019
Subjects
Online AccessGet full text
ISSN2378-363X
DOI10.1109/INDIN41052.2019.8971959

Cover

More Information
Summary:Data centers as all complex systems are prone to faults, and cost of them can be very high. This paper is focused on detecting the faults in the cooling systems, in particular on local fans level. In the paper, a hybrid approach is proposed. In the approach a model is used as substitute of the real system to generate dataset containing records of both normal and fault cases. On the generated data, machine learning algorithm or ensemble of algorithms are selected and trained to detect the faults. To demonstrate the approach, the rack model of real data center is created, and reliability of the model is shown. Using the model, the dataset with normal as well as abnormal records of data is generated. To detect faults of local fans, simple classifiers are built for all pairs: a local fan - a processor unit. Classifiers are trained on one part of generated data (training data), and then their accuracy is estimated on another part of generated data (test data). A real-time fault detection system is built based on the classifiers. The rack model is used as the substitute of the real plant to check operability of the system.
ISSN:2378-363X
DOI:10.1109/INDIN41052.2019.8971959