DOI:
https://doi.org/10.64539/sjcs.v1i2.2025.327Keywords:
Multi-Agent Systems, Agent Orchestration, Communication Backbone, Scalable AI, System Observability, Architectural PatternAbstract
As multi-agent AI systems evolve from prototypes to production-grade enterprise applications, the need for a robust and scalable communication architecture has become an operational imperative. However, traditional approaches like linear chaining or ad-hoc peer-to-peer messaging result in brittle, unmanageable systems that lack observability and fail to handle the non-deterministic nature of AI agents. To address this architectural deficiencies, this paper introduces the Message, Context, and Protocol (MCP) framework, an architectural pattern designed to serve as a communication backbone and "central nervous system" for complex AI systems by decoupling agent intent from execution. Performance evaluations under simulated enterprise load demonstrate that by decoupling agents through a central message bus and a stateful orchestrator, MCP maintains system resilience and prevents catastrophic failure even under high load (500 req/sec), although the orchestrator itself is identified as a primary bottleneck requiring horizontal scaling. These results underscore that centralized state management is not merely an option but a necessity for enterprise AI, providing the modularity and fault tolerance required to transition agentic workflows from experimental concepts to reliable business solutions.
References
[1] Z. Xi, et al., “The Rise and Potential of Large Language Model Based Agents: A Survey,” arXiv preprint arXiv:2309.07864 pp. 1–86, 2023, Available: https://arxiv.org/abs/2309.07864.
[2] L. Wang, et al., “A Survey on Large Language Model based Autonomous Agents,” Frontiers of Computer Science, vol. 18, no. 6, p. 186345, 2024. Available: https://arxiv.org/abs/2308.11432.
[3] Y. Talebirad and A. Nadiri, “Multi-Agent Collaboration: Harnessing the Power of Intelligent LLC Agents,” arXiv preprint arXiv:2306.03314, 2023. Available: https://arxiv.org/abs/2306.03314.
[4] J. Lewis and M. Fowler, “Microservices: a definition of this new architectural term,” martinfowler.com, 2014. Available: https://martinfowler.com/articles/microservices.html.
[5] N. Dragoni, S. Giallorenzo, A. L. Lafuente, M. Mazzara, F. Montesi, R. Mustafin, and L. Safina, “Microservices: Yesterday, Today, and Tomorrow,” in Present and Ulterior Software Engineering, pp. 195–216, 2017. https://doi.org/10.1007/978-3-319-67425-4_12.
[6] Q. Wu, et al., “AutoGen: Enabling Next-Gen LLM Applications,” arXiv preprint arXiv:2308.08155, 2023. Available: https://arxiv.org/abs/2308.08155.
[7] C. Richardson, “Microservices Patterns: With Examples in Java,” Manning Publications, pp. 1-520, 2018. Available: https://books.google.co.id/books/about/Microservices_Patterns.html?hl=id&id=UeK1swEACAAJ.
[8] S. Newman, “Building Microservices: Designing Fine-Grained Systems.” O'Reilly Media, 2021. Available: https://www.oreilly.com/library/view/building-microservices-2nd/9781492034018/.
[9] R. T. Fielding, “Architectural Styles and the Design of Network-based Software Architectures,” Ph.D. dissertation, Univ. of California, Irvine, 2000. Available: https://www.bibsonomy.org/bibtex/833ffb7e240120036017a40097125555.
[10] J. Willard and J. Hutson, “The Evolution and Future of Microservices Architecture with AI-Driven Enhancements,” International Journal of Recent Engineering Science, vol. 12, no. 1, pp. 16-22, 2025. https://doi.org/10.14445/23497157/IJRES-V12I1P103.
[11] S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. Narasimhan, and Y. Cao, “ReAct: Synergizing Reasoning and Acting in Language Models,” arXiv preprint arXiv:2210.03629, 2022. Available: https://arxiv.org/abs/2210.03629.
[12] T. Schick, J. Dwivedi-Yu, R. Dessì, R. Raileanu, M. Lomeli, L. Zettlemoyer, N. Cancedda, T. Scialom, “Toolformer: Language Models Can Teach Themselves to Use Tools,” arXiv preprint arXiv:2302.04761, 2023. Available: https://arxiv.org/abs/2302.04761.
[13] Amazon Web Services, “Service Discovery,” AWS Architecture Center, 2023. Available: https://docs.aws.amazon.com/whitepapers/latest/microservices-on-aws/service-discovery.html.
[14] HashiCorp, “Consul: Service Discovery and Service Mesh,” HashiCorp Documentation, 2024. Available: https://developer.hashicorp.com/consul/docs/intro.
[15] H. P. Nii, “Blackboard systems: The blackboard model of problem solving and the evolution of blackboard architectures,” AI Magazine, vol. 7, no. 2, pp. 38–53, 1986. https://doi.org/10.1609/aimag.v7i2.537.
[16] J. Kreps, N. Narkhede, and J. Rao, “Kafka: A Distributed Messaging System for Log Processing,” in Proc. NetDB, pp. 1–7, 2011. Available: https://www.microsoft.com/en-us/research/wp-content/uploads/2017/09/Kafka.pdf.
[17] P. T. Eugster, P. A. Felber, R. Guerraoui, and A.-M. Kermarrec, “The many faces of publish/subscribe,” ACM Computing Surveys, vol. 35, no. 2, pp. 114–131, 2003. Available: https://doi.org/10.1145/857076.857078.
[18] Amazon Web Services, “What is Event-Driven Architecture?,” AWS Concepts, 2024. Available: https://aws.amazon.com/event-driven-architecture/.
[19] Temporal Technologies, “Temporal Application Development Guide,” Temporal Documentation, 2024. Available: https://docs.temporal.io/dev-guide.
[20] J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, D. Zhou, “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models,” arXiv preprint arXiv:2201.11903, vol. 35, pp. 1–43, 2022. Available: https://arxiv.org/abs/2201.11903.
[21] LangChain, “LangGraph: A Graph-Based Orchestration Framework,” LangChain Blog, 2024. Available: https://blog.langchain.dev/langgraph/.
[22] OpenTelemetry Authors, “OpenTelemetry Concepts: Distributed Tracing,” OpenTelemetry.io, 2024. Available: https://opentelemetry.io/docs/concepts/signals/traces/.
[23] B. H. Sigelman, L. A. Barroso, M. Burrows, P. Stephenson, M. Plakal, D. Beaver, S. Jaspan C. Shanbhag, “Dapper, a Large-Scale Distributed Systems Tracing Infrastructure,” Google Technical Report, 2010. Available: https://research.google/pubs/pub36356/.
[24] H. Garcia-Molina and K. Salem, “Sagas,” in SIGMOD '87: Proceedings of the 1987 ACM SIGMOD international conference on Management of data, pp. 249–259, 1987. https://doi.org/10.1145/38713.38742.
[25] V. Vernon, “Implementing Domain-Driven Design”. Addison-Wesley, 2013. Available: https://books.google.co.id/books/about/Implementing_Domain_driven_Design.html?id=aVJsAQAAQBAJ.
[26] OpenAI, “Safety Best Practices,” OpenAI Documentation, 2024. Available: https://platform.openai.com/docs/guides/safety-best-practices.
[27] Y. Liu, G. Deng, Y. Li, K. Wang, Z. Wang, X. Wang, T. Zhang, Y. Liu, H. Wang, Y. Zheng, Y. Liu, “Prompt Injection Attack Against LLM-Integrated Applications,” arXiv preprint arXiv:2306.05499, 2023. Available: https://arxiv.org/abs/2306.05499.
[28] M. Shapiro, N. Preguiça, C. Baquero, and M. Zawirski, “Conflict-Free Replicated Data Types,” in Proc. 13th Intl. Symp. Stabilization, Safety, and Security of Distributed Systems, 2011. Available: https://hal.inria.fr/inria-00609399/document.
[29] J. Carlson, Redis in Action. Manning Publications, 2013. Available: https://www.manning.com/books/redis-in-action.
[30] G. Li, H. A. A. K. Hammoud, H. Itani, D. Khizbullin, B. Ghanem, “CAMEL: Communicative Agents for ‘Mind’ Exploration of Large Language Model Society,” arXiv preprint arXiv:2303.17760, 2023. Available: https://arxiv.org/abs/2303.17760.
[31] K. Zhang, Z. Yang, and T. Başar, “Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms,” in Handbook of Reinforcement Learning and Control, vol. 325, 2021. https://doi.org/10.1007/978-3-030-60990-0_12.
[32] P. Lewis, et al., “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks,” in NIPS'20: Proceedings of the 34th International Conference on Neural Information Processing Systems, pp. 9459–9474, 2020. Available: https://dl.acm.org/doi/abs/10.5555/3495724.3496517.
[33] R. M. Monarch, Human-in-the-Loop Machine Learning. Manning Publications, 2021. Available: https://www.manning.com/books/human-in-the-loop-machine-learning.
[34] M. Kleppmann, Designing Data-Intensive Applications. O’Reilly Media, 2017. Available: https://www.oreilly.com/library/view/designing-data-intensive-applications/9781491903063/.
[35] B. Gates, “The Age of AI Has Begun,” GatesNotes, 2023. Available: https://www.gatesnotes.com/The-Age-of-AI-Has-Begun.

