Reinforcement learning ramp metering without complete information (Q763169): Difference between revisions

Summary: This paper develops a model of reinforcement learning ramp metering (RLRM) without complete information, which is applied to alleviate traffic congestions on ramps. RLRM consists of prediction tools depending on traffic flow simulation and optimal choice model based on reinforcement learning theories. Moreover, it is also a dynamic process with abilities of automaticity, memory and performance feedback. Numerical cases are given in this study to demonstrate RLRM such as calculating outflow rate, density, average speed, and travel time compared to no control and fixed-time control. Results indicate that the greater is the inflow, the more is the effect. In addition, the stability of RLRM is better than fixed-time control.

0 references

zbMATH Keywords

reinforcement learning ramp metering (RLRM)

0 references

traffic flow simulation

0 references

optimal choice model

0 references

reinforcement learning theories

0 references

stability of RLRM

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1155/2012/208456

0 references

cites work

Q4517722

0 references

Micro- and macro-simulation of freeway traffic

0 references

Synchronized flow as a new traffic phase and related problems for traffic flow modelling

0 references

Identifiers

zbMATH Open document ID

1235.93269

0 references

DOI

10.1155/2012/208456

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:763169

@@ Property / Wikidata QID @@
+Q58907923
@@ Property / Wikidata QID: Q58907923 / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1155/2012/208456
+Normal rank
@@ Property / OpenAlex ID @@
+W1976365331
@@ Property / OpenAlex ID: W1976365331 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4517722
@@ Property / cites work: Q4517722 / rank @@
+Normal rank
@@ Property / cites work @@
+Micro- and macro-simulation of freeway traffic
@@ Property / cites work: Micro- and macro-simulation of freeway traffic / rank @@
+Normal rank
@@ Property / cites work @@
+Synchronized flow as a new traffic phase and related problems for traffic flow modelling
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:763169