Task Decomposition and Role Sharing for Real-time Human-AI Swarm Collaboration

  • Sotaro Karakama Mitsubishi Heavy Industries, Ltd.
  • Natsuki Matsunami Mitsubishi Heavy Industries, Ltd.
  • Masayuki Ito Mitsubishi Heavy Industries, Ltd.
Keywords: multi-agent, task decomposition, human-swarm interaction

Abstract

In spite of the impressive advances in artificial intelligence (AI), close collaboration between humans and AI systems is still difficult to achieve. To overcome this problem, we designed AI agents with a behavior tree that enables us to know what they are trying to do, and by using a consensus building algorithm, that is, a contract net protocol, a human and a group of AI agents were put together as one team. Taking advantage of this architecture, we designed an approach to decomposing cooperative tasks into appropriate roles. The effectiveness and feasibility of this approach were evaluated with teams in a simulated Tail Tag game. Matches were held with up to 29 AI agents and 1 person on one team and 30 people on the other team. The results indicate that our approach works almost evenly with human-human collaboration by sharing roles between a human and AI swarm. By understanding the roles of AI agents, a person can immediately understand the role that he/she should take. For further improvement, we also identified that it is necessary for a person to be able to give concise and global instructions.

References

D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, et al., “Mastering the game of Go without human knowledge,” Nature vol. 550, 2017, pp. 354–359.

N. Brown and T. Sandholm, “Superhuman AI for heads-up no-limit poker: Libratus beats top professionals,” Science, 2017.

C. Berner, G. Brockman, B. Chan, V. Cheung, P. Debiak, C. Dennison, et al., “Dota 2 with large scale deep reinforcement learning,” arXiv preprint arXiv:1912.06680, 2019.

D. Wang, J. D. Weisz, M. Muller, P. Ram, W. Geyer, C. Dugan, et al., “Human-AI collaboration in data science: Exploring data scientists’ perceptions of automated AI,” Proceedings of the ACM on Human-Computer Interaction, 3(CSCW):1–24, 2019.

J. Zhu, A. Liapis, S. Risi, R. Bidarra, and G. M. Youngblood, ‘‘Explainable AI for designers: A human-centered perspective on mixed-initiative co-creation,’’ in Proc. IEEE Conference on Computational Intelligence and Games, 2018, pp. 458–465.

N. Bard, J. N. Foerster, S. Chandar, N. Burch, M. Lanctot, H. F. Song, et al., “The Hanabi challenge: A new frontier for AI research,” arXiv preprint arXiv:1902.00506, 2019.

G. Kasparov, “The chess master and the computer,” The New York Review of Books, Vol. 57, No. 2, 2010.

D. C. Dennett, “Cognitive Wheels: The Frame Problem of AI,” Language and Thought, vol. 3, 2005.

D. Gunning, “Explainable artificial intelligence (XAI),” Defense Advanced Re-search Projects Agency (DARPA), 2017.

A. Stentz, C. Dima, C. Wellington, H, Herman, and D. Stager, “A system for semi-autonomous tractor operations in Autonomous robots,” Autonomous Robots, Vol. 13, 2002, pp. 87-104.

S. Balakirsky, S. Carpin, A. Kleiner, M. Lewis, A. Visser, J. Wang, and V. A. Ziparo, “Towards heterogeneous robot teams for disaster mitigation: Results and performance metrics from robocup rescue,” Journal of Field Robotics, 24, 2007, pp. 943-967.

J. Li, and H. Liu, “Design Optimization of Amazon Robotics. Automation,” Control and Intelligent Systems, Vol. 4, No. 2, 2016, pp. 48-52.

A. Kolling, P. Walker, N. Chakraborty, K. Sycara, and M. Lewis, “Human interaction with robot swarms: A survey,” IEEE Transactions on Human-Machine Systems, Vol. 46, No. 1, 2016, pp. 9-26.

A, B and C (anonymity for a blind review), “Architecture and interface for collaborating with a group of agents in an adversarial game,” Proceeding of a conference, 2020. (published)

M. Colledanchise, and P. Ogren, “Use of BTs in robotics and AI, in Behavior Trees in robotics and AI: An introduction,”, 1st Ed., CRC Press, 2018.

R.G. Smith, “The Contract Net Protocol: High-level communication and control in a distributed problem solver,” In IEEE Transactions on Computers, Vol. C-29, No. 12, December 1980, pp. 1104-1113.

M. Tambe, “Towards flexible teamwork,” Journal of Artificial Intelligence Research, Vol 7, 1997, pp. 83-124.

R. Nair, M. Tambe, and S. Marsella, “Role allocation and reallocation in multiagent teams: towards a practical analysis,” Proceedings of the second international joint conference on Autonomous agents and multiagent systems, 2003, pp. 552-559.

P. Stone and M. Veloso, “Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork,” Artificial Intelligence, Vol. 110, Issue 2, 1999, pp. 241-273.

M. B. Dias, R. Zlot, N. Kalra and A. Stentz, “Market-based multirobot coordination: A survey and analysis,” Proceedings of the IEEE, Vol. 94, No. 7, 2006, pp. 1257-1270.

D.V. Pynadath and M Tambe, “Multiagent teamwork: analyzing the optimality and complexity of key theories and models,” Proceedings of the second international joint conference on Autonomous agents and multiagent systems, 2002.

C. J. Cai, J. Jongejan, and J. Holbrook, “The effects of example-based explanations in a machine learning interface,” Proceedings of the 24th International Conference on Intelligent User Interfaces, 2019, pp. 258–262.

P. Stone, R. S. Sutton, and G. Kuhlmann, “Reinforcement learning for RoboCup-soccer keepaway,” Adaptive Behavior, Vol. 13, 2005, pp.165-188.

S. Ontañon, G. Synnaeve, A. Uriarte, F. Richoux, D. Churchill, and M. Preuss, “A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft,” IEEE Transactions on Computational Intelligence and AI in games, IEEE Computational Intelligence Society, 2013, pp. 1-19.

M. Jaderberg, W. M. Czarnecki, I. Dunning, L. Marris, G. Lever, A. G. Castañeda, et al., “Human-level performance in first-person multiplayer games with population-based deep reinforcement learning,” Science 364, 2019, pp. 859-865.

H. Huang, W. Zhang, J. Ding, D. Stipanovic, and C. Tomlin, “Guaranteed decentralized pursuit-evasion in the plane with multiple pursuers,” Proceedings of IEEE Conference on Decision and Control, 2011, pp. 4835–4840.

Published
2021-10-31
Section
Technical Papers