Implementing a Multi-agent Organization that Changes Its Fault Tolerance Policy at Run-Time

  • Sebnem Bora
  • Oguz Dikenelli
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3963)


In this paper, we present an approach that supports simultaneously applying different fault tolerance policies in multi-agent organizations. The main strategy of our approach is to implement fault tolerance policies as reusable agent plans using HTN (Hierarchical Task Network) formalism. In this way, different fault tolerance policies such as static and adaptive ones can be implemented as different plans. In a static fault tolerance policy, all parameters related to the fault tolerance are set by a programmer before run-time. However, an adaptive fault tolerance policy requires dynamically adapting resource allocation and replication mechanisms by monitoring the system. Monitoring of a system brings some cost to the system. If all agents in an organization apply the adaptive fault tolerance policy, the monitoring cost will become an important factor for the system performance. Hence by applying our approach, the adaptive policy can be applied only to the critical agents whose criticalities can be observed during the organization’s lifetime and the static one can be applied to the remaining agents. This reduces the monitoring cost and increases the overall organization performance. A case study has been implemented to show the effectiveness of our approach.


Fault Tolerance Failure Detector Replication Strategy Replication Service Hierarchical Task Network 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Paolucci, M., et al.: A planning component for RETSINA Agents. In: Jennings, N.R., Lesperance, Y. (eds.) ATAL 1999. LNCS (LNAI), vol. 1757. Springer, Heidelberg (2000)Google Scholar
  2. 2.
  3. 3.
    Dikenelli, O., Erdur, R.C., Gumus, O., Ekinci, E.E., Gurcan, O., Kardas, G., Seylan, I., Tiryaki, A.M.: SEAGENT: A Platform for Developing Semantic Web Based Multi Agent Systems. In: AAMAS 2005, p. 1270 (2005)Google Scholar
  4. 4.
    Dikenelli, O., Gumus, O., Tiryaki, A.M., Kardas, G.: Engineering a Multi Agent Platform with Dynamic Semantic Service Discovery and Invocation Capability. In: Eymann, T., Klügl, F., Lamersdorf, W., Klusch, M., Huhns, M.N. (eds.) MATES 2005. LNCS, vol. 3550, pp. 141–152. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  5. 5.
    Kumar, S., Cohen, P.R., Levesque, H.J.: The adaptive agent architecture: Achieving fault-tolerance using persistent broker teams. In: Proceedings, Fourth International Conference on Multi-Agent Systems (July 2000)Google Scholar
  6. 6.
    Klein, M., Dallarocas, C.: Exception handling in agent systems. In: Etzioni, O., Muller, J.P., Bradshaw, J.M. (eds.) Proceedings of the Third International Conference on Agents (Agents 1999), Seattle, WA, pp. 62–68 (1999)Google Scholar
  7. 7.
    Hägg, S.: A sentinel approach to fault handling in multi-agent systems. In: Foo, N.Y., Göbel, R. (eds.) PRICAI 1996. LNCS, vol. 1114. Springer, Heidelberg (1996)Google Scholar
  8. 8.
    Fedoruk, A., Deters, R.: Improving fault-tolerance by replicating agents. In: Proceedings of 1st International Joint Conference on Autonomous Agents and Multi-Agent Systems, Bologna, Italy (2002)Google Scholar
  9. 9.
    Guessoum, Z., Ziane, M., Faci, N.: Monitoring and organizational-level adaptation of multi-agent systems. In: Kudenko, D., Kazakov, D., Alonso, E. (eds.) AAMAS 2004. LNCS, vol. 3394, pp. 514–522. Springer, Heidelberg (2005)Google Scholar
  10. 10.
    Guessoum, Z., Briot, J.-P., Charpentier, Z., Aknine, S., Marin, O., Sens, P.: Dynamic and Adaptative Replication for Large-Scale Reliable Multi-Agent Systems. In: Proc. ICSE 2002 First International Workshop on Software Engineering for Large-Scale Multi-Agent Systems (SELMAS 2002). ACM, Orlando (2002)Google Scholar
  11. 11.
    Guessoum, Z., Briot, J.P.: From active objects to autonomous agents. IEEE Concurrency 7(3), 68–76 (1999)Google Scholar
  12. 12.
    Guessoum, Z., Briot, J.P., Sens, P., Marin, O.: Toward fault-tolerant multi-agent systems. In: MAAMAW 2001, Annecy, France (2001)Google Scholar
  13. 13.
    FIPA. FIPA Specifications,
  14. 14.
    Tanenbaum, A.S., van Steen, M.: Distributed Systems: Principles and Paradigms. Prentice-Hall, Englewood Cliffs (2002)zbMATHGoogle Scholar
  15. 15.
    Chockler, V.G., Keidar, I., Vitenberg, R.: Group Communication Specifications: A Comprehensive Study. ACM Computing Surveys 33(4), 1–43 (2001)CrossRefGoogle Scholar
  16. 16.
  17. 17.
  18. 18.
    Shehory, O., Sycara, K., Chalasani, P., Jha, S.: Agent Cloning: An Approach to Agent Mobility and Resource Allocation. IEEE Communications 36(7), 58–67 (1998)CrossRefGoogle Scholar
  19. 19.
    Decker, K., Sycara, K., Williamson, M.: Cloning for Intelligent Adaptive Information Agents. In: Rao, A., Singh, M.P., Wooldridge, M.J. (eds.) ATAL 1997. LNCS (LNAI), vol. 1365, pp. 63–75. Springer, Heidelberg (1998)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Sebnem Bora
    • 1
  • Oguz Dikenelli
    • 1
  1. 1.Computer Engineering DepartmentEge UniversityIzmirTurkey

Personalised recommendations