Controller Fault Resulting in Performance Degradation
This example shows a very peculiar behavior in CPU temperature. At some point, while under max load, the CPU temperature actually begins to decrease. One might think that the fan is kicking in, but the fan is not able to reduce CPU temperature at this dramatic rate. Examining the right figures, it is seen that the fan indeed has increased to its maximum value. However that happened at around the same time as the CPU maxed out, and as mentioned before, we know from our fan models, that the fan cannot change the CPU temperature at that rate. The answer lies in the top right figure. As the temperature got into a critical state, the CPU underclocked itself to save it from potential overheating. This underclocking, results in this drastic reduction in temperature. The CPU Power State model output shows how the power state should be kept at maximum, since the computational power is needed. This means the system experiences a performance degradation without the users knowledge. VARYC's Model Based Hardware Fault Detection System lets the user know that the system is not running optimally and the critical increase in temperature might be due to faulty fan, change in ambient temperature, or change in the environment around or inside the system.