Advances, Systems and Applications
From: Cloud failure prediction based on traditional machine learning and deep learning
Article | Data source | Prediction scope | Feature studied | SML | TML | DL |
---|---|---|---|---|---|---|
Guan et al. [37] | Private Data | System Failure | CPU usage, Memory Usage, Swap Space Utilization Page Faults, Interrupts, Network Activity, I/O and Data Transfer, Power Management | - | Bayesian Network, DT | - |
Adamu et al. [38] | NERSC | Component Failure | Failed Component, Failure Time | - | LR, SVM | - |
Pitakrat et al. [39] | Private Data | System Failure | Resource Usage, Failure Information, Failure Time | - | RF | - |
Zhang et al. [40] | Public Dataset | Switch Failure | Message Template Sequence, Frequency, Seasonality, Surge | HSMM | RF, SKSVM | - |
Lin et al. [41] | Private Cloud | Node Failure | Resource Usage, Group Policy, Domain Group, Rack Location | - | LR, SVM, RF | LSTM |
Han et al. [42] | Alibaba’s Cloud, Backblaze SMART | Disk Failure | SMART Log, SysLog, Trouble Ticket | - | LGBM | - |
Mohammed et al. [43] | NERSC | Component Failure and Service Failure | Multiple Sources of Failure | ARIMA | LDA, CART, RF, SVM, KNN | - |
Chen et al. [44] | Microsoft Cloud System | Outage Prediction | Storage Location, Physical Networking, Storage Streaming Component | - | SVM, PLR, Bayesian Network, XGBoost | - |
Li et al. [13] | System X | Node Failure | Resource Usage, Group Policy, Domain Group, Rack Location | - | RF | LSTM |
Rawat et al. [45] | Private Cloud | VM Failure | Number of VM Used | ARIMA | - | - |
Yu et al. [46] | Alibaba Cloud System | DRAM Failure | System Kernel Log, MCA Data Log | - | XGBoost | - |