For citation:

Silinskaya A. A., Bogomolov A. S., Kushnikov V. A. A mathematical model of social group evacuation from buildings with multiple exits. Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2026, vol. 26, iss. 2, pp. 302-311. DOI: 10.18500/1816-9791-2026-26-2-302-311, EDN: UEEHGG

This is an open access article distributed under the terms of Creative Commons Attribution 4.0 International License (CC-BY 4.0).

Published online:

01.06.2026

Full text:

download

(downloads: 81)

Language:

Russian

Heading:

Computer Sciences

Article type:

Article

UDC:

001.891.573

DOI:

10.18500/1816-9791-2026-26-2-302-311

EDN:

UEEHGG

A mathematical model of social group evacuation from buildings with multiple exits

Autors:

Silinskaya Anna A., Saratov State University

Bogomolov Aleksey S., Saratov State University

Kushnikov Vadim Alexeevich, Saratov State University

Abstract:

This paper introduces a computational model for simulating multi-agent evacuation dynamics based on the Multi-Agent Proximal Policy Optimization (MAPPO) algorithm. The proposed framework incorporates multiple evacuation exits with varying opening times, heterogeneous agent types, panic-induced behavioral modifications, and social interactions of the leader – follower type. A hybrid action space is employed, combining discrete exit selection with continuous movement control. Training is performed under a curriculum learning paradigm, gradually increasing the number of agents to enhance generalization and adaptability to different population sizes. Several methodological refinements were implemented to improve training stability and efficiency: dropout layers to mitigate overfitting, exponential exploration decay to enable a smooth shift toward precise actions, and reward normalization to stabilize policy updates. Simulations were conducted in a $15\times20$ m environment with three exits (each 1.5 m wide, opening sequentially every 6 seconds). The model also incorporates mechanisms of information dissemination: leaders are aware of all exits from the start of the simulation, while individual agents detect exits within a 5 m radius and propagate this knowledge to neighbors within 2 m. Social groups follow predefined behavioral rules, such as granting elderly agents a speed adjustment and assigning leaders strategic decision-making roles. Computational experiments with scenarios involving 50 agents demonstrated that the presence of social groups and leaders significantly accelerates evacuation, particularly benefiting elderly agents. Optimal performance was observed in settings with two leaders, whereas scenarios with a single leader led to bottlenecks, longer evacuation times, and higher levels of panic. These findings highlight the potential of reinforcement learning–based approaches for analyzing and optimizing evacuation processes in complex indoor environments. The developed mathematical model is intended for use in the creation of digital twins for simulating and optimizing human flow processes, as well as for conducting computational experiments to calculate efficient evacuation times and routes.

Key words:

multi-agent model

evacuation

reinforcement learning

Acknowledgments:

This work was supported by the Ministry of Science and Higher Education of the Russian Federation within the framework of the state assignment (project No. FREM-2026-0006).

References:

Kotkova E. A., Matveev A. V., Nefedev S. A., Tarantsev A. A. Agent modeling of the process of people evacuation during fire in buildings: A review of approaches and research. Modern High Technologies, 2023, iss. 10, pp. 55–62. DOI: https://doi.org/10.17513/snt.39791, EDN: CZHEJY
Zia K., Ferscha A. An agent-based model of crowd evacuation: combining individual, social and technological aspects. Proceedings of the 2020 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation. New York, Association for Computing Machinery, 2020, pp. 129–140. DOI: https://doi.org/10.1145/3384441.3395973
Sukhanov V. O., Kuzmin A. I., Skorokhodov D. V. Geoinformation system support decision-making on evacuation of the population. Sovremennye tekhnologii obespecheniya grazhdanskoy oborony i likvidatsii posledstviy chrezvychaynykh situatsiy [Modern Technologies for Civil Defense and Emergency Response], 2019, iss. 1 (10), pp. 411–413 (in Russian). EDN: WNCRRV
Tsvirkun A. D., Rezchikov A. F., Samartsev А. A., Bogomolov A. S., Ivashchenko V. A., Kushnikov V. A., Filimonyuk L. Yu. Integrated model of the fire dangerous factors dynamics in premises and the evacuation. Herald of Computer and Information Technologies. Scientific, Technical and Production Monthly Journal, 2019, iss. 2 (176), pp. 47–54 (in Russian). DOI: https://doi.org/10.14489/vkit.2019.02.pp.047-054, EDN: ZACMDZ
Tsvirkun A. D., Rezchikov A. F., Filimonyuk L. Y., Samartsev A. A., Ivashchenko V. A., Bogomolov A. S., Kushnikov V. A. System of integrated simulation of spread of hazardous factors of fire and evacuation of people from indoors. Automation and Remote Control, 2022, vol. 83, iss. 5, pp. 692–705. DOI: https://doi.org/10.1134/S0005117922050034, EDN: POZSHW
Samartsev A., Ivaschenko V., Rezchikov A., Kushnikov V., Filimonyuk L., Bogomolov A. Multiagent model of people evacuation from premises while emergency. Advances in Systems Science and Applications, 2019, vol. 19, iss. 1, pp. 98–115. DOI: https://doi.org/10.25728/assa.2019.19.1.558, EDN: JJSDFW
Gamayunova V. O., Bogomolov A. S., Kushnikov V. A., Ivashchenko V. A. Multi-agent modeling of evacuation from premises with consideration of agent collisions. Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2025, vol. 25, iss. 1, pp. 106–115 (in Russian). DOI: https: //doi.org/10.18500/1816-9791-2025-25-1-106-115, EDN: TLQGGD
Rosa A. C., Falqueiro M. C., Bonacin R., De Mendonça F. L. L., Filho G. P. R., Gonçalves V. P. EvacuAI: An analysis of escape routes in indoor environments with the aid of reinforcement learning. Sensors, 2023, vol. 23, iss. 21, art. 8892. DOI: https://doi.org/10.3390/s23218892
Ünal A. E., Gezer C., Pak B. K., Güngör V. Ç. Generating emergency evacuation route directions based on crowd simulations with reinforcement learning. 2022 Innovations in Intelligent Systems and Applications Conference (ASYU). Antalya, IEEE, 2022, pp. 1–6. DOI: https://doi.org/10.1109/ASYU56188.2022.9925560
Xu D., Huang X., Mango J., Li X., Li Z. Simulating multi-exit evacuation using deep reinforcement learning. Transactions in GIS, 2021, vol. 25, iss. 3, pp. 1542–1564. DOI: https://doi.org/10.1111/tgis.12738
Sinpan N., Sasithong P., Chaudhary S., Poomrittigul S., Leelawat N., Wuttisittikulkij L. Simulative investigations of crowd evacuation by incorporating reinforcement learning scheme. ICACS ’22: Proceedings of the 6th International Conference on Algorithms, Computing and Systems. New York, Association for Computing Machinery, 2022, pp. 1–5. DOI: https://doi.org/10.1145/3564982.3564983
Yu C., Velu A., Vinitsky E., Gao J., Wang Y., Bayen A., Wu Y. The surprising effectiveness of PPO in cooperative multi-agent games. Advances in Neural Information Processing Systems, 2022, vol. 35, pp. 24611–24624.
Xiong J., Wang Q., Yang Z., Sun P., Han L., Zheng Y., Fu H., Zhang T., Liu J., Liu H. Parametrized deep Q-networks learning: Reinforcement learning with discrete-continuous hybrid action space. arXiv preprint arXiv:1810.06394, 2018. DOI: https://doi.org/10.48550/arxiv.1810.06394
Srivastava N., Hinton G., Krizhevsky A., Sutskever I., Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 2014, vol. 15, pp. 1929–1958.
Gal Y., Ghahramani Z. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. Proceedings of the 33rd International Conference on Machine Learning. New York, USA, PMLR, 2016, pp. 1050–1059.
Sutton R. S., Barto A. Reinforcement learning: An introduction. Cambridge, The MIT Press, 2020. 552 p.
Lillicrap T. P., Hunt J. J., Pritzel A., Heess N., Erez T., Tassa Y., Silver D., Wierstra D. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015. DOI: https://doi.org/10.48550/arXiv.1509.02971
Schulman J., Wolski F., Dhariwal P., Radford A., Klimov O. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017. DOI: https://doi.org/10.48550/arXiv.1707.06347
Narvekar S., Peng B., Leonetti M., Sinapov J., Taylor M. E., Stone P. Curriculum learning for reinforcement learning domains: A framework and survey. Journal of Machine Learning Research, 2020, vol. 21, iss. 1, pp. 181:1–181:50.
Wang L., Zheng J.-H., Zhang X.-S., Zhang J.-L., Wang Q.-Z., Zhang Q.-Zh. Pedestrians’ behavior in emergency evacuation: Modeling and simulation. Chinese Physics B, 2016, vol. 25, iss. 11, art. 118901. DOI: https://doi.org/10.1088/1674-1056/25/11/118901
Trivedi A., Rao S. Agent-based modeling of emergency evacuations considering human panic behavior. IEEE Transactions on Computational Social Systems, 2018, vol. 5, iss. 1, pp. 277–288. DOI: https://doi.org/10.1109/TCSS.2017.2783332
Ding N., Sun C. Experimental study of leader-and-follower behaviours during emergency evacuation. Fire Safety Journal, 2020, vol. 117, art. 103189. DOI: https://doi.org/10.1016/j.firesaf.2020.103189

Received:

12.11.2025

Accepted:

29.11.2025

Published:

01.06.2026

Journal issue:

Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2026, vol. 26, iss. 2

335 reads

Headings

For citation:

A mathematical model of social group evacuation from buildings with multiple exits

User login