Fault Tolerant Systems

Learn how to make your systems work under unexpected and faulty conditions.


This course introduces the basics of fault tolerance. You will learn the concepts behind the design and implementation of fault tolerance mechanisms in general systems, and you’ll get to know the quantitative and qualitative methods used to evaluate specific fault tolerance principles. After the course, you will be able to understand and perform key alterations to system functions, components, or mechanisms. You will also be able to add an appropriate level of redundancy to achieve the expected system reliability despite residual faults in the system design.


Course topics:

  • Main system dependability concepts and their relation to system safety and functional safety;
  • System integrity and safety integrity concepts;
  • Threats to system dependability (such as faults, errors, or failures);
  • Basic analysis of high-level composite system diagrams and identification of key areas for dependability improvement;
  • Stages of a fault tolerance mechanism (error detection, damage assessment, error recovery, fault treatment);
  • Static, dynamic, and hybrid redundancy calculations;
  • Fault-tolerant information exchange;
  • Fault tolerance mechanisms, such as channel and error models, detection/correction codes, and retry mechanisms;
  • Fault tolerance and system repair.


Software: Chrome browser.

Hardware: Computer with an Internet connection, working speakers, and microphone.

Prior knowledge: Students should have previous knowledge of selected mathematics topics, such as basic differential equations, matrix operations, Galois fields, vector spaces, and Boolean algebra. Students should also have basic knowledge of systems and safety engineering and of system reliability concepts, such as failure probability, reliability, failure rate, constant failure rate, MTTF, and FIT.
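
As a quick illustration of how the reliability concepts listed above relate to one another, here is a minimal Python sketch. It assumes a constant failure rate λ (the exponential model), under which reliability is R(t) = exp(-λt) and MTTF = 1/λ. It also uses the usual definition of 1 FIT as one failure per 10^9 device-hours. The function names and the example value of 1000 FIT are illustrative assumptions, not part of the course material.

```python
import math

# Assumption: 1 FIT = one failure per 10^9 device-hours.
HOURS_PER_FIT_UNIT = 1e9


def fit_to_failure_rate(fit: float) -> float:
    """Convert a FIT value to a failure rate in failures per hour."""
    return fit / HOURS_PER_FIT_UNIT


def mttf(failure_rate: float) -> float:
    """Mean time to failure in hours; valid only for a constant failure rate."""
    return 1.0 / failure_rate


def reliability(failure_rate: float, hours: float) -> float:
    """Probability of surviving `hours` without failure: R(t) = exp(-lambda * t)."""
    return math.exp(-failure_rate * hours)


def failure_probability(failure_rate: float, hours: float) -> float:
    """Probability of at least one failure within `hours`: F(t) = 1 - R(t)."""
    return 1.0 - reliability(failure_rate, hours)


if __name__ == "__main__":
    rate = fit_to_failure_rate(1000.0)  # hypothetical component rated at 1000 FIT
    one_year = 8760.0                   # hours in a non-leap year
    print(f"failure rate: {rate:.2e} per hour")
    print(f"MTTF: {mttf(rate):.0f} hours")
    print(f"R(1 year): {reliability(rate, one_year):.4f}")
    print(f"F(1 year): {failure_probability(rate, one_year):.4f}")
```

Under this model the reliability at t = MTTF is exp(-1), about 0.37, so a component is more likely than not to have failed by the time it reaches its MTTF.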