研究目的
Investigating the application of Monte Carlo tree search (MCTS) for controlling the Pac-Man character in the real-time game Ms Pac-Man, with enhancements to adapt MCTS to the real-time domain.
研究成果
The MCTS agent and enhancements described were shown to be suitable and effective for Ms Pac-Man, competing with strong opponents and achieving high rankings in competitions. The simulation strategy was responsible for the most considerable increase in score, and tree search ensured high scores versus both strong and weak ghost teams. Information reuse with continuous decay improved performance without adding specific domain knowledge. The research contributes to the understanding of applying MCTS in real-time domains.
研究不足
The experiments were conducted with parameters manually tuned against the LEGACY2 ghost team, which may not be optimal for other ghost teams. The real-time nature of the game imposes strict computational constraints, limiting the number of simulations per move.
1:Experimental Design and Method Selection:
The methodology involves using MCTS to find an optimal path for the Pac-Man agent at each turn, based on randomized simulations. Enhancements include a variable-depth tree, simulation strategies for the ghost team and Pac-Man, including long-term goals in scoring, and reusing the search tree for several moves with a decay factor.
2:Sample Selection and Data Sources:
The experiments were conducted using the Ms Pac-Man Versus Ghosts Competition framework, with the agent competing against different ghost teams.
3:List of Experimental Equipment and Materials:
The implementation was done using the framework provided by the Ms Pac-Man Versus Ghosts Competition, running on an AMD64 Opteron 2.4-GHz processor.
4:4-GHz processor.
Experimental Procedures and Operational Workflow:
4. Experimental Procedures and Operational Workflow: For each experiment, 400 games were played, allowing the agents the official 40 ms to compute a move each turn. The performance was evaluated based on average score, average number of lives remaining, and the average maze reached at the end of each game.
5:Data Analysis Methods:
The results were analyzed to determine the influence of the proposed enhancements by comparing agents with single enhancements disabled to the baseline performance.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容