Online self-organizing network control with time averaged weighted throughput objective (Q1727088): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Methods for removing links in a network to minimize the spread of infections / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Maximizing lifetime in wireless sensor networks with multiple sensor families / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Combining simulated annealing with Lagrangian relaxation and weighted Dantzig-Wolfe decomposition for integrated design decisions in wireless sensor networks / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Sensor deployment optimization methods to achieve both coverage and connectivity in wireless sensor networks / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Practical scheduling schemes with throughput guarantees for multi-hop wireless networks / rank | |||
Normal rank |
Revision as of 07:43, 18 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Online self-organizing network control with time averaged weighted throughput objective |
scientific article |
Statements
Online self-organizing network control with time averaged weighted throughput objective (English)
0 references
20 February 2019
0 references
Summary: We study an online multisource multisink queueing network control problem characterized with self-organizing network structure and self-organizing job routing. We decompose the self-organizing queueing network control problem into a series of interrelated Markov Decision Processes and construct a control decision model for them based on the coupled reinforcement learning (RL) architecture. To maximize the mean time averaged weighted throughput of the jobs through the network, we propose a reinforcement learning algorithm with time averaged reward to deal with the control decision model and obtain a control policy integrating the jobs routing selection strategy and the jobs sequencing strategy. Computational experiments verify the learning ability and the effectiveness of the proposed reinforcement learning algorithm applied in the investigated self-organizing network control problem.
0 references
0 references