Deep Reinforcement Learning-Based Policy for Baseband Function Placement and Routing of RAN in 5G and Beyond

Zhengguang Gao, Shuangyi Yan, Jiawei Zhang*, Bingtao Han, Yongcheng Wang, Yuming Xiao, Dimitra Simeonidou, Yuefeng Ji

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

10 Citations (Scopus)
151 Downloads (Pure)


In this paper, we propose a deep reinforcement learning (DRL)-based algorithm to generate policies of Baseband Function (BBF) placement and routing. In order to explore the performance of the proposed algorithm in practical systems, the online scenario with the completely random requests is used in the simulation considering C-RAN and NG-RAN architectures. Besides, an Integer Linear Programming (ILP) model is formulated to generate the optimal solution as the benchmark. The simulation results show that DRL-based algorithm converges in a short time, and its performance closes to the optimal benchmark obtained by ILP in terms of latency and bandwidth for the online scenarios. In addition, the performance of the generated policies based on DRL is compared with a classic heuristic algorithm, i.e., first-fit algorithm. The performance of DRL-based algorithm is superior to the first-fit algorithm from above two perspectives. The fast convergence and the near-optimal performance prove that the DRL-based algorithm is a promising approach for the BBF placement and routing of RAN in 5G and Beyond.
Original languageEnglish
Number of pages1
JournalJournal of Lightwave Technology
Early online date9 Sept 2021
Publication statusE-pub ahead of print - 9 Sept 2021

Bibliographical note

Publisher Copyright:


  • 5G and Beyond
  • 5G mobile communication
  • Bandwidth
  • Baseband
  • Baseband Function placement and routing
  • Benchmark testing
  • Computer architecture
  • Deep reinforcement learning
  • Heuristic algorithms
  • Routing


Dive into the research topics of 'Deep Reinforcement Learning-Based Policy for Baseband Function Placement and Routing of RAN in 5G and Beyond'. Together they form a unique fingerprint.

Cite this