A semi-Markov decision model for recognizing the destination of a maneuvering agent in real time strategy games (Q1792771)
From MaRDI portal
Property / full work available at URL: https://doi.org/10.1155/2016/1907971
Property / OpenAlex ID: W2236759984
Property / cites work: Q4411146
Property / cites work: Learning hierarchical task network domains from partially observed plan traces
Property / cites work: Q3655273
Property / cites work: Location-Based Reasoning about Complex Multi-Agent Behavior
Latest revision as of 21:26, 16 July 2024
scientific article
Language: English
Label: A semi-Markov decision model for recognizing the destination of a maneuvering agent in real time strategy games
Description: scientific article
Statements
A semi-Markov decision model for recognizing the destination of a maneuvering agent in real time strategy games (English)
12 October 2018
Summary: Recognizing the destination of a maneuvering agent is important in real time strategy games. Because finding a path in an uncertain environment is essentially a sequential decision problem, the maneuvering process can be modeled as a Markov decision process (MDP). However, the MDP does not define an action duration. In this paper, we propose a novel semi-Markov decision model (SMDM). In the SMDM, the destination is regarded as a hidden state, which affects the selection of an action; each action is associated with a duration variable, which indicates whether the action has been completed. We also exploit a Rao-Blackwellised particle filter (RBPF) for inference under the dynamic Bayesian network structure of the SMDM. In experiments, we simulate agents maneuvering in a combat field and use the agents' traces to evaluate the performance of our method. The results show that the SMDM outperforms another extension of the MDP in terms of precision, recall, and \(F\)-measure. Our method recognizes destinations efficiently, whether or not they change during maneuvering. Additionally, the RBPF infers destinations with smaller variance and in less time than the SPF, and its average failure rates are lower when the number of particles is insufficient.