Izvestiya of Saratov University.

Mathematics. Mechanics. Informatics

ISSN 1816-9791 (Print)
ISSN 2541-9005 (Online)


For citation:

Силинская А. А., Bogomolov A. S., Kushnikov V. A. A mathematical model of social group evacuation from buildings with multiple exits. Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2026, vol. 26, iss. 2, pp. 302-311. DOI: 10.18500/1816-9791-2026-26-2-302-311, EDN: UEEHGG

This is an open access article distributed under the terms of Creative Commons Attribution 4.0 International License (CC-BY 4.0).
Full text:
(downloads: 3)
Language: 
Russian
Heading: 
Article type: 
Article
UDC: 
001.891.573
EDN: 
UEEHGG

A mathematical model of social group evacuation from buildings with multiple exits

Autors: 
Bogomolov Aleksey S., Federal Research Center “Saratov Scientific Centre of the Russian Academy of Science”
Kushnikov Vadim Alexeevich, Federal Research Center “Saratov Scientific Centre of the Russian Academy of Science”
Abstract: 

This paper introduces a computational model for simulating multi-agent evacuation dynamics based on the Multi-Agent Proximal Policy Optimization (MAPPO) algorithm. The proposed framework incorporates multiple evacuation exits with varying opening times, heterogeneous agent types, panic-induced behavioral modifications, and social interactions of the leader – follower type. A hybrid action space is employed, combining discrete exit selection with continuous movement control. Training is performed under a curriculum learning paradigm, gradually increasing the number of agents to enhance generalization and adaptability to different population sizes. Several methodological refinements were implemented to improve training stability and efficiency: dropout layers to mitigate overfitting, exponential exploration decay to enable a smooth shift toward precise actions, and reward normalization to stabilize policy updates. Simulations were conducted in a $15\times20$ m environment with three exits (each 1.5 m wide, opening sequentially every 6 seconds). The model also incorporates mechanisms of information dissemination: leaders are aware of all exits from the start of the simulation, while individual agents detect exits within a 5 m radius and propagate this knowledge to neighbors within 2 m. Social groups follow predefined behavioral rules, such as granting elderly agents a speed adjustment and assigning leaders strategic decision-making roles. Computational experiments with scenarios involving 50 agents demonstrated that the presence of social groups and leaders significantly accelerates evacuation, particularly benefiting elderly agents. Optimal performance was observed in settings with two leaders, whereas scenarios with a single leader led to bottlenecks, longer evacuation times, and higher levels of panic. These findings highlight the potential of reinforcement learning–based approaches for analyzing and optimizing evacuation processes in complex indoor environments. The developed mathematical model is intended for use in the creation of digital twins for simulating and optimizing human flow processes, as well as for conducting computational experiments to calculate efficient evacuation times and routes.

Acknowledgments: 
This work was supported by the Ministry of Science and Higher Education of the Russian Federation within the framework of the state assignment (project No. FREM-2026-0006).
References: 
  1. Kotkova E. A., Matveev A. V., Nefedev S. A., Tarantsev A. A. Agent modeling of the process of people evacuation during fire in buildings: A review of approaches and research. Modern High Technologies, 2023, iss. 10, pp. 55–62. DOI: https://doi.org/10.17513/snt.39791, EDN: CZHEJY
  2. Zia K., Ferscha A. An agent-based model of crowd evacuation: combining individual, social and technological aspects. Proceedings of the 2020 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation. New York, Association for Computing Machinery, 2020, pp. 129–140. DOI: https://doi.org/10.1145/3384441.3395973
  3. Sukhanov V. O., Kuzmin A. I., Skorokhodov D. V. Geoinformation system support decision-making on evacuation of the population. Sovremennye tekhnologii obespecheniya grazhdanskoy oborony i likvidatsii posledstviy chrezvychaynykh situatsiy [Modern Technologies for Civil Defense and Emergency Response], 2019, iss. 1 (10), pp. 411–413 (in Russian). EDN: WNCRRV
  4. Tsvirkun A. D., Rezchikov A. F., Samartsev А. A., Bogomolov A. S., Ivashchenko V. A., Kushnikov V. A., Filimonyuk L. Yu. Integrated model of the fire dangerous factors dynamics in premises and the evacuation. Herald of Computer and Information Technologies. Scientific, Technical and Production Monthly Journal, 2019, iss. 2 (176), pp. 47–54 (in Russian). DOI: https://doi.org/10.14489/vkit.2019.02.pp.047-054, EDN: ZACMDZ
  5. Tsvirkun A. D., Rezchikov A. F., Filimonyuk L. Y., Samartsev A. A., Ivashchenko V. A., Bogomolov A. S., Kushnikov V. A. System of integrated simulation of spread of hazardous factors of fire and evacuation of people from indoors. Automation and Remote Control, 2022, vol. 83, iss. 5, pp. 692–705. DOI: https://doi.org/10.1134/S0005117922050034, EDN: POZSHW
  6. Samartsev A., Ivaschenko V., Rezchikov A., Kushnikov V., Filimonyuk L., Bogomolov A. Multiagent model of people evacuation from premises while emergency. Advances in Systems Science and Applications, 2019, vol. 19, iss. 1, pp. 98–115. DOI: https://doi.org/10.25728/assa.2019.19.1.558, EDN: JJSDFW
  7. Gamayunova V. O., Bogomolov A. S., Kushnikov V. A., Ivashchenko V. A. Multi-agent modeling of evacuation from premises with consideration of agent collisions. Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2025, vol. 25, iss. 1, pp. 106–115 (in Russian). DOI: https: //doi.org/10.18500/1816-9791-2025-25-1-106-115, EDN: TLQGGD
  8. Rosa A. C., Falqueiro M. C., Bonacin R., De Mendonça F. L. L., Filho G. P. R., Gonçalves V. P. EvacuAI: An analysis of escape routes in indoor environments with the aid of reinforcement learning. Sensors, 2023, vol. 23, iss. 21, art. 8892. DOI: https://doi.org/10.3390/s23218892
  9. Ünal A. E., Gezer C., Pak B. K., Güngör V. Ç. Generating emergency evacuation route directions based on crowd simulations with reinforcement learning. 2022 Innovations in Intelligent Systems and Applications Conference (ASYU). Antalya, IEEE, 2022, pp. 1–6. DOI: https://doi.org/10.1109/ASYU56188.2022.9925560
  10. Xu D., Huang X., Mango J., Li X., Li Z. Simulating multi-exit evacuation using deep reinforcement learning. Transactions in GIS, 2021, vol. 25, iss. 3, pp. 1542–1564. DOI: https://doi.org/10.1111/tgis.12738
  11. Sinpan N., Sasithong P., Chaudhary S., Poomrittigul S., Leelawat N., Wuttisittikulkij L. Simulative investigations of crowd evacuation by incorporating reinforcement learning scheme. ICACS ’22: Proceedings of the 6th International Conference on Algorithms, Computing and Systems. New York, Association for Computing Machinery, 2022, pp. 1–5. DOI: https://doi.org/10.1145/3564982.3564983
  12. Yu C., Velu A., Vinitsky E., Gao J., Wang Y., Bayen A., Wu Y. The surprising effectiveness of PPO in cooperative multi-agent games. Advances in Neural Information Processing Systems, 2022, vol. 35, pp. 24611–24624.
  13. Xiong J., Wang Q., Yang Z., Sun P., Han L., Zheng Y., Fu H., Zhang T., Liu J., Liu H. Parametrized deep Q-networks learning: Reinforcement learning with discrete-continuous hybrid action space. arXiv preprint arXiv:1810.06394, 2018. DOI: https://doi.org/10.48550/arxiv.1810.06394
  14. Srivastava N., Hinton G., Krizhevsky A., Sutskever I., Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 2014, vol. 15, pp. 1929–1958.
  15. Gal Y., Ghahramani Z. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. Proceedings of the 33rd International Conference on Machine Learning. New York, USA, PMLR, 2016, pp. 1050–1059.
  16. Sutton R. S., Barto A. Reinforcement learning: An introduction. Cambridge, The MIT Press, 2020. 552 p.
  17. Lillicrap T. P., Hunt J. J., Pritzel A., Heess N., Erez T., Tassa Y., Silver D., Wierstra D. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015. DOI: https://doi.org/10.48550/arXiv.1509.02971
  18. Schulman J., Wolski F., Dhariwal P., Radford A., Klimov O. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017. DOI: https://doi.org/10.48550/arXiv.1707.06347
  19. Narvekar S., Peng B., Leonetti M., Sinapov J., Taylor M. E., Stone P. Curriculum learning for reinforcement learning domains: A framework and survey. Journal of Machine Learning Research, 2020, vol. 21, iss. 1, pp. 181:1–181:50.
  20. Wang L., Zheng J.-H., Zhang X.-S., Zhang J.-L., Wang Q.-Z., Zhang Q.-Zh. Pedestrians’ behavior in emergency evacuation: Modeling and simulation. Chinese Physics B, 2016, vol. 25, iss. 11, art. 118901. DOI: https://doi.org/10.1088/1674-1056/25/11/118901
  21. Trivedi A., Rao S. Agent-based modeling of emergency evacuations considering human panic behavior. IEEE Transactions on Computational Social Systems, 2018, vol. 5, iss. 1, pp. 277–288. DOI: https://doi.org/10.1109/TCSS.2017.2783332
  22. Ding N., Sun C. Experimental study of leader-and-follower behaviours during emergency evacuation. Fire Safety Journal, 2020, vol. 117, art. 103189. DOI: https://doi.org/10.1016/j.firesaf.2020.103189
Received: 
12.11.2025
Accepted: 
29.11.2025
Published: 
01.06.2026