Display Settings:

Format

Send to:

Choose Destination
We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
    Adv Neural Inf Process Syst. 2009;21:1065-1073.

    MDPs with Non-Deterministic Policies.

    Source

    School of Computer Science, McGill University, Montreal, Canada, mmilan1@cs.mcgill.ca.

    Abstract

    Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for problems modelled as MDPs. Although finding the optimal policy is sufficient in many domains, in certain applications such as decision support systems where the policy is executed by a human (rather than a machine), finding all possible near-optimal policies might be useful as it provides more flexibility to the person executing the policy. In this paper we introduce the new concept of non-deterministic MDP policies, and address the question of finding near-optimal non-deterministic policies. We propose two solutions to this problem, one based on a Mixed Integer Program and the other one based on a search algorithm. We include experimental results obtained from applying this framework to optimize treatment choices in the context of a medical decision support system.

    PMID:
    21625292
    [PubMed]
    PMCID:
    PMC3103230
    Free PMC Article

    Images from this publication.See all images (6)Free text

    Figure 1
    Figure 3
    Figure 5
    Figure 2
    Figure 4
    Figure 6

      Supplemental Content

      Icon for PubMed Central

      Save items

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk