RL-OPRA: Reinforcement Learning for Online and Proactive Resource Allocation of crowdsourced live videos

Baccour, Emna; Erbad, Aiman; Mohamed, Amr; Haouari, Fatima; Guizani, Mohsen; Hamdi, Mounir

Author	Baccour, Emna
Author	Erbad, Aiman
Author	Mohamed, Amr
Author	Haouari, Fatima
Author	Guizani, Mohsen
Author	Hamdi, Mounir
Available date	2020-08-16T07:33:58Z
Publication Date	2020-11-01
Publication Name	Future Generation Computer Systems
Identifier	http://dx.doi.org/10.1016/j.future.2020.06.038
Citation	Baccour, E., Erbad, A., Mohamed, A., Haouari, F., Guizani, M., & Hamdi, M. (2020). RL-OPRA: Reinforcement Learning for Online and Proactive Resource Allocation of crowdsourced live videos. Future Generation Computer Systems.
ISSN	0167-739X
URI	http://hdl.handle.net/10576/15529
Abstract	© 2020 Elsevier B.V. With the advancement of rich media generating devices, the proliferation of live Content Providers (CP), and the availability of convenient internet access, crowdsourced live streaming services have witnessed unexpected growth. To ensure a better Quality of Experience (QoE), higher availability, and lower costs, large live streaming CPs are migrating their services to geo-distributed cloud infrastructure. However, because of the dynamics of live broadcasting and the wide geo-distribution of viewers and broadcasters, it is still challenging to satisfy all requests with reasonable resources. To overcome this challenge, we introduce in this paper a prediction-driven approach that estimates the potential number of viewers near different cloud sites at the instant of broadcasting. This online and instant prediction of distributed popularity distinguishes our work from previous efforts that provision constant resources or alter their allocation as the popularity of the content changes. Based on the derived predictions, we formulate an Integer-Linear Program (ILP) to proactively and dynamically choose the right data center to allocate exact resources and serve potential viewers, while minimizing the perceived delays. As the optimization is not adequate for online serving, we propose a real-time approach based on Reinforcement Learning (RL), namely RL-OPRA, which adaptively learns to optimize the allocation and serving decisions by interacting with the network environment. Extensive simulation and comparison with the ILP have shown that our RL-based approach is able to present optimal results compared to heuristic-based approaches.
Sponsor	This work was supported by the Qatar Foundation.
Language	en
Publisher	Elsevier
Subject	Geo-distributed clouds; Live streaming; Machine and reinforcement learning; QoE
Title	RL-OPRA: Reinforcement Learning for Online and Proactive Resource Allocation of crowdsourced live videos
Type	Article
Pagination	982-995
Volume Number	112
dc.accessType	Open Access

Files in this item

Name: 1-s2.0-S0167739X20306269-main.pdf
Size: 2.695Mb
Format: PDF

This item appears in the following Collection(s)

Computer Science & Engineering [2484 items]
