{"entities":{"Q581259":{"pageid":583026,"ns":120,"title":"Item:Q581259","lastrevid":62927889,"modified":"2026-04-11T09:02:11Z","type":"item","id":"Q581259","labels":{"en":{"language":"en","value":"On the properties of \\(\\epsilon\\) (\\(\\geq 0)\\) optimal policies in discounted unbounded return model"}},"descriptions":{"en":{"language":"en","value":"scientific article; zbMATH DE number 4018805"}},"aliases":{},"claims":{"P31":[{"mainsnak":{"snaktype":"value","property":"P31","hash":"fd5912e4dab4b881a8eb0eb27e7893fef55176ad","datavalue":{"value":{"entity-type":"item","numeric-id":56887,"id":"Q56887"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$9AD3318C-BE12-4286-8CFC-70156C28E55B","rank":"normal"}],"P159":[{"mainsnak":{"snaktype":"value","property":"P159","hash":"e4fef2f8c1bfca350333990d927c02884081d9c8","datavalue":{"value":{"text":"On the properties of \\(\\epsilon\\) (\\(\\geq 0)\\) optimal policies in discounted unbounded return model","language":"en"},"type":"monolingualtext"},"datatype":"monolingualtext"},"type":"statement","id":"Q581259$3823E803-8809-463D-82D0-4CFB38AD3868","rank":"normal"}],"P225":[{"mainsnak":{"snaktype":"value","property":"P225","hash":"86a0926f98bbcc1b80d5b7279271318c301e810d","datavalue":{"value":"0626.90093","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q581259$2D073030-2129-4945-B388-AF98D8AB094F","rank":"normal"}],"P27":[{"mainsnak":{"snaktype":"value","property":"P27","hash":"477e474916d7a61035c9c3b8540d6fcc95f00a80","datavalue":{"value":"10.1007/BF02112641","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q581259$224E08B9-AB99-4D66-B8F5-D92373ABCAED","rank":"normal"}],"P16":[{"mainsnak":{"snaktype":"value","property":"P16","hash":"5f0bd258652c08ba0bc76a744c16712278ff901e","datavalue":{"value":{"entity-type":"item","numeric-id":581258,"id":"Q581258"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$1A0A67F1-F11B-49C0-A7FE-65FCF71CE11E","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"ed3d1d5556642329b4705024951c29bed03b6dd8","datavalue":{"value":{"entity-type":"item","numeric-id":281232,"id":"Q281232"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$3928DECC-DAFB-4CCF-8567-DFC204A61714","rank":"normal"}],"P200":[{"mainsnak":{"snaktype":"value","property":"P200","hash":"bd8a7678534b4c4a434a737b8e34eefad997d7a4","datavalue":{"value":{"entity-type":"item","numeric-id":176689,"id":"Q176689"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$2E830010-A5BF-4E53-A18D-94776E86129E","rank":"normal"}],"P28":[{"mainsnak":{"snaktype":"value","property":"P28","hash":"5ae48c61eed19d1e1e1f33f9255d5b329362d064","datavalue":{"value":{"time":"+1987-00-00T00:00:00Z","timezone":0,"before":0,"after":0,"precision":9,"calendarmodel":"http://www.wikidata.org/entity/Q1985727"},"type":"time"},"datatype":"time"},"type":"statement","id":"Q581259$86BDF980-7383-48B8-A0BD-C1D056ADD78C","rank":"normal"}],"P1448":[{"mainsnak":{"snaktype":"value","property":"P1448","hash":"2f574513335a7ec9036c0840091716ff873565f9","datavalue":{"value":"This paper investigates the properties of \\(\\epsilon\\) (\\(\\geq 0)\\) optimal policies in the model of \\textit{Guo Shizhen} [Math. Economics 1, 109-120 (1984) (Chinese)]. It is shown that, if \\(\\pi^*=(\\pi_ 0^*\\), \\(\\pi_ 1^*\\), \\(\\cdot \\cdot \\cdot\\), \\(\\pi^*_ n\\), \\(\\pi^*_{n+1}\\), \\(\\cdot \\cdot \\cdot)\\) is a \\(\\beta\\)-discounted optimal policy, then \\((\\pi^*_ 0\\), \\(\\pi^*_ 1\\), \\(\\cdot \\cdot \\cdot\\), \\(\\pi^*_ n)^{\\infty}\\) for all \\(n\\geq 0\\) is also a \\(\\beta\\)-discounted optimal policy. Under some conditions we prove that a stochastic stationary policy \\(\\pi_ n^{*\\infty}\\) corresponding to the decision rule \\(\\pi^*_ n\\) is also optimal for the same discounting factor \\(\\beta\\). We have also shown that each \\(\\beta\\)-optimal stochastic stationary policy \\(\\pi_ 0^{*\\infty}\\), \\(\\pi_ 0^{*\\infty}\\) can be decomposed into several decision rules to which the corresponding stationary policies are also \\(\\beta\\)-optimal separately; and conversely, a proper convex combination of these decision rules is identified with the former \\(\\pi^*_ 0\\). We have further proved that for any (\\(\\epsilon\\),\\(\\beta)\\)-optimal policy, say \\(\\pi^*=(\\pi^*_ 0,\\pi^*_ 1,...\\), \\(\\pi^*_ n,\\pi^*_{n+1}\\), \\(\\cdot \\cdot \\cdot)\\), \\((\\pi^*_ 0\\), \\(\\pi^*_ 1\\), \\(\\cdot \\cdot \\cdot\\), \\(\\pi^*_{n-1})^{\\infty}\\) is \\(((1-\\beta^ n)^{- 1}\\epsilon,\\beta)\\) optimal for \\(n>0\\). At the end of this paper we mention that the results about convex combinations and decompositions of optimal policies given by \\textit{Luo Handong}, \\textit{Liu Jiwei} and \\textit{Xia Zhihao} [J. Huazhong (Central China) Univ. of Sci. and Technol. 14, No.4 (1986)] can be extended to our case.","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$28DBA86C-207C-4ECC-991F-E7873C270A1F","rank":"normal"}],"P226":[{"mainsnak":{"snaktype":"value","property":"P226","hash":"377d3ab03372cff12915e0de0374438ff70c3716","datavalue":{"value":"90C40","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q581259$038F950C-BB25-4ABE-9683-D8D1FB152D29","rank":"normal"}],"P1451":[{"mainsnak":{"snaktype":"value","property":"P1451","hash":"4d6f1360d4942a33709d839e2fa7ecbe069c42fc","datavalue":{"value":"4018805","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q581259$09C81AB1-D8D6-4082-A40F-A3E9FE71105F","rank":"normal"}],"P1450":[{"mainsnak":{"snaktype":"value","property":"P1450","hash":"55fe584ba4d20ee46fa2be1c61986408442a9c53","datavalue":{"value":"\\(\\epsilon \\)-optimal policy","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$1DA93DC3-B3A4-4C14-AB29-CADD8F6A8B96","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1450","hash":"fbebc2a091534ab4f1535eb9f1a37d50865155ce","datavalue":{"value":"\\(\\beta \\)-discounted optimal policy","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$49A96519-E414-4FF1-8B86-C006ABE5FA59","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1450","hash":"e10189e8b0f73f79ab2789c05f7ef1842d64ece6","datavalue":{"value":"stochastic stationary policy","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$4696CDDB-8681-436E-88D0-E727C27CCC5E","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1450","hash":"2e8ec30cfc7f0a576fc99ef7f7db079e0642007a","datavalue":{"value":"convex combinations","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$348A00C9-E9FE-4AD3-9580-73A06C95DFCE","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1450","hash":"d65d96e26ba7f502206a0841e88ddfd329fc1d64","datavalue":{"value":"decompositions","type":"string"},"datatype":"string"},"type":"statement","id":"Q581259$F0A0B29A-1066-4899-8205-C17B15DCCED0","rank":"normal"}],"P1460":[{"mainsnak":{"snaktype":"value","property":"P1460","hash":"57f7fea50d2ce1b39b695c4a1313582eed405e38","datavalue":{"value":{"entity-type":"item","numeric-id":5976449,"id":"Q5976449"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$7187D0CE-E962-4E03-ABBA-0CEFAA789219","rank":"normal"}],"P223":[{"mainsnak":{"snaktype":"value","property":"P223","hash":"ffc7b9f0a56e3f8d512d4a80810f18faec1bb6fb","datavalue":{"value":{"entity-type":"item","numeric-id":3770312,"id":"Q3770312"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$3D800421-7564-4890-922D-50276FF7FAD9","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P223","hash":"ba6f8ab4850a3a743a52d3fea829c62511f9b151","datavalue":{"value":{"entity-type":"item","numeric-id":3677538,"id":"Q3677538"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$9B1D1C98-0D83-4A14-AEC4-7F4A864884BA","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P223","hash":"8bacdf1ae81fb1e07ee8bdf31120c42458a8b694","datavalue":{"value":{"entity-type":"item","numeric-id":4086981,"id":"Q4086981"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$452CD40C-A166-49C2-BC19-9A54AEBD4C2D","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P223","hash":"3dfa6a01ee7b7fb035b90442d1f9d50b847ad009","datavalue":{"value":{"entity-type":"item","numeric-id":5678679,"id":"Q5678679"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$CF1E8E85-89C1-40E5-B96B-7EFC87A588BF","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P223","hash":"02a6d9cf2f968ce77376997b7426feae021809fd","datavalue":{"value":{"entity-type":"item","numeric-id":3335551,"id":"Q3335551"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q581259$2CD4BC7F-07D3-4E41-A1CC-4B00F406060C","rank":"normal"}],"P205":[{"mainsnak":{"snaktype":"value","property":"P205","hash":"e2d26e20f2900e9ee20b6fb210163a7cdf02fe3f","datavalue":{"value":"https://doi.org/10.1007/bf02112641","type":"string"},"datatype":"url"},"type":"statement","id":"Q581259$18A6938B-B5FA-4DA5-8D25-7816EBF6EE17","rank":"normal"}],"P388":[{"mainsnak":{"snaktype":"value","property":"P388","hash":"4d530f01acc0968159b0c6d407df8919e51e1904","datavalue":{"value":"W2006036413","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q581259$64B0C796-066F-4E83-87CF-0A493595486A","rank":"normal"}],"P1643":[{"mainsnak":{"snaktype":"value","property":"P1643","hash":"658da287f97ed9d341705a734b072d7e164b67dd","datavalue":{"value":{"entity-type":"item","numeric-id":3770312,"id":"Q3770312"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","qualifiers":{"P1659":[{"snaktype":"value","property":"P1659","hash":"42ab4c330369c42df00b0e5f6d9f62779b19fa14","datavalue":{"value":{"amount":"+0.9126907587051392","unit":"1"},"type":"quantity"},"datatype":"quantity"}],"P1660":[{"snaktype":"value","property":"P1660","hash":"a327a09ea0305e98d5cf33bd4036320e19f2aed0","datavalue":{"value":{"entity-type":"item","numeric-id":6821328,"id":"Q6821328"},"type":"wikibase-entityid"},"datatype":"wikibase-item"}]},"qualifiers-order":["P1659","P1660"],"id":"Q581259$09100131-7FEE-42A9-8F1E-C1F05F115538","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1643","hash":"3a56c93567559b97ae48b8dc0c0916ff15886597","datavalue":{"value":{"entity-type":"item","numeric-id":3496174,"id":"Q3496174"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","qualifiers":{"P1659":[{"snaktype":"value","property":"P1659","hash":"89ff77e38d10fc2d92f8843ec6a1c4d78258b0b8","datavalue":{"value":{"amount":"+0.7996224761009216","unit":"1"},"type":"quantity"},"datatype":"quantity"}],"P1660":[{"snaktype":"value","property":"P1660","hash":"a327a09ea0305e98d5cf33bd4036320e19f2aed0","datavalue":{"value":{"entity-type":"item","numeric-id":6821328,"id":"Q6821328"},"type":"wikibase-entityid"},"datatype":"wikibase-item"}]},"qualifiers-order":["P1659","P1660"],"id":"Q581259$43CD0BD6-5F0E-4CF5-A647-903CF3AC8B8C","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1643","hash":"5ed310f47b3b9224a5508b53969fede6b65caaa1","datavalue":{"value":{"entity-type":"item","numeric-id":3486379,"id":"Q3486379"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","qualifiers":{"P1659":[{"snaktype":"value","property":"P1659","hash":"de757653b45f2ab023f404a1d25cc9bac2ff0a1a","datavalue":{"value":{"amount":"+0.7988448143005371","unit":"1"},"type":"quantity"},"datatype":"quantity"}],"P1660":[{"snaktype":"value","property":"P1660","hash":"a327a09ea0305e98d5cf33bd4036320e19f2aed0","datavalue":{"value":{"entity-type":"item","numeric-id":6821328,"id":"Q6821328"},"type":"wikibase-entityid"},"datatype":"wikibase-item"}]},"qualifiers-order":["P1659","P1660"],"id":"Q581259$D829204D-5361-4696-9D63-FCA4B6448471","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1643","hash":"52c4f1bb151ee76d1bfb7f1cc6f503b9b32d8364","datavalue":{"value":{"entity-type":"item","numeric-id":4320257,"id":"Q4320257"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","qualifiers":{"P1659":[{"snaktype":"value","property":"P1659","hash":"e769dc2b185d6ce4016dd57d9996f9fef1f5fe2f","datavalue":{"value":{"amount":"+0.794649064540863","unit":"1"},"type":"quantity"},"datatype":"quantity"}],"P1660":[{"snaktype":"value","property":"P1660","hash":"a327a09ea0305e98d5cf33bd4036320e19f2aed0","datavalue":{"value":{"entity-type":"item","numeric-id":6821328,"id":"Q6821328"},"type":"wikibase-entityid"},"datatype":"wikibase-item"}]},"qualifiers-order":["P1659","P1660"],"id":"Q581259$54159887-4ABC-4C14-B749-636760BCA53C","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P1643","hash":"664d6cfdc5df6a8100f5eff2894cb0421e87348b","datavalue":{"value":{"entity-type":"item","numeric-id":3677538,"id":"Q3677538"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","qualifiers":{"P1659":[{"snaktype":"value","property":"P1659","hash":"749207da949fa9c205a94a944fc7582264fb2464","datavalue":{"value":{"amount":"+0.7936649322509766","unit":"1"},"type":"quantity"},"datatype":"quantity"}],"P1660":[{"snaktype":"value","property":"P1660","hash":"a327a09ea0305e98d5cf33bd4036320e19f2aed0","datavalue":{"value":{"entity-type":"item","numeric-id":6821328,"id":"Q6821328"},"type":"wikibase-entityid"},"datatype":"wikibase-item"}]},"qualifiers-order":["P1659","P1660"],"id":"Q581259$462F24DF-F6B1-4ECD-8F1C-E1517F8E75F1","rank":"normal"}]},"sitelinks":{"mardi":{"site":"mardi","title":"On the properties of \\(\\epsilon\\) (\\(\\geq 0)\\) optimal policies in discounted unbounded return model","badges":[],"url":"https://portal.mardi4nfdi.de/wiki/On_the_properties_of_%5C(%5Cepsilon%5C)_(%5C(%5Cgeq_0)%5C)_optimal_policies_in_discounted_unbounded_return_model"}}}}}