Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Fault tolerant software architecture

I'm looking for some good articles on fault tolerant software architectures. Could I please have some recommendations.

like image 455
macleojw Avatar asked Mar 04 '09 11:03

macleojw


People also ask

What is fault tolerance architecture?

Fault tolerance refers to the ability of a system (computer, network, cloud cluster, etc.) to continue operating without interruption when one or more of its components fail.

How a software application can be tolerant to faults?

Fault-tolerant software assures system reliability by using protective redundancy at the software level. There are two basic techniques for obtaining fault-tolerant software: RB scheme and NVP. Both schemes are based on software redundancy assuming that the events of coincidental software failures are rare.

What is a good example of fault tolerance?

A highly fault-tolerant system might continue at the same level of performance even though one or more components have failed. For example, a building with a backup electrical generator will provide the same voltage to wall outlets even if the grid power fails.

Which computing technique is fault tolerant?

BIST stands for Build in Self Test. System carries out the test of itself after a certain period of time again and again, that is BIST technique for hardware fault-tolerance.


2 Answers

This article about Software Fault Handling techniques covers the following topics:

  • Timeouts
  • Audits
  • Exception Handling
  • Task Rollback
  • Incremental Reboot
  • Voting
like image 123
Sandeep Avatar answered Oct 24 '22 09:10

Sandeep


Handbook of Software Reliability Engineering you can read it in pdf. One of the main principles of software reliability is fault tolerance.

Take a look at chapter 14 Fault-Tolerant software.

like image 38
Mark Avatar answered Oct 24 '22 09:10

Mark