Home

Specifications

Schema

Commentary

Mark Wahl


Web Design by
Kristen Lanum

Commentary by Mark Wahl, CISA

Organizing principles for systems:
Data Sharing and fault tolerance (20070909)

One topic which has not seen as wide discussion in the context of the (wiki) has been the ability for data sharing to help provide the users with fault tolerance for social networking services they rely upon. This is a problem worth addressing as currently a single hosting center outage can shut down multiple independently-operated social network services. Furthermore, that outage shut down an OpenID identity provider (OP), and thus the users of that OP were no longer able to use their OpenIDs to log into services elsewhere which were still online.

In a fault tolerant distributed system, the system as a whole continues to operate, perhaps in a degraded mode, even when one or more of the components of the system have failed. Some of the failure modes might include:

Some of the techniques worth considering would include: