RL-OPRA: Reinforcement Learning for Online and Proactive Resource Allocation of crowdsourced live videos
Author | Baccour, Emna |
Author | Erbad, Aiman |
Author | Mohamed, Amr |
Author | Haouari, Fatima |
Author | Guizani, Mohsen |
Author | Hamdi, Mounir |
Available date | 2020-08-16T07:33:58Z |
Publication Date | 2020-11-01 |
Publication Name | Future Generation Computer Systems |
Identifier | http://dx.doi.org/10.1016/j.future.2020.06.038 |
Citation | Baccour, E., Erbad, A., Mohamed, A., Haouari, F., Guizani, M., & Hamdi, M. (2020). RL-OPRA: Reinforcement Learning for Online and Proactive Resource Allocation of crowdsourced live videos. Future Generation Computer Systems. |
ISSN | 0167-739X |
Abstract | © 2020 Elsevier B.V. With the advancement of rich media generating devices, the proliferation of live Content Providers (CP), and the availability of convenient internet access, crowdsourced live streaming services have witnessed unexpected growth. To ensure a better Quality of Experience (QoE), higher availability, and lower costs, large live streaming CPs are migrating their services to geo-distributed cloud infrastructure. However, because of the dynamics of live broadcasting and the wide geo-distribution of viewers and broadcasters, it is still challenging to satisfy all requests with reasonable resources. To overcome this challenge, we introduce in this paper a prediction driven approach that estimates the potential number of viewers near different cloud sites at the instant of broadcasting. This online and instant prediction of distributed popularity distinguishes our work from previous efforts that provision constant resources or alter their allocation as the popularity of the content changes. Based on the derived predictions, we formulate an Integer-Linear Program (ILP) to proactively and dynamically choose the right data center to allocate exact resources and serve potential viewers, while minimizing the perceived delays. As the optimization is not adequate for online serving, we propose a real-time approach based on Reinforcement Learning (RL), namely RL-OPRA, which adaptively learns to optimize the allocation and serving decisions by interacting with the network environment. Extensive simulation and comparison with the ILP have shown that our RL-based approach is able to present optimal results compared to heuristic-based approaches. |
Sponsor | This work was supported by the Qatar Foundation . |
Language | en |
Publisher | Elsevier |
Subject | Geo-distributed clouds Live streaming Machine and reinforcement learning QoE |
Type | Article |
Pagination | 982-995 |
Volume Number | 112 |
Check access options
Files in this item
This item appears in the following Collection(s)
-
Computer Science & Engineering [2402 items ]