
REVIEW article

Front. Comput. Sci., 06 January 2025
Sec. Human-Media Interaction

Human-AI collaboration is not very collaborative yet: a taxonomy of interaction patterns in AI-assisted decision making from a systematic review

  • Department of Computer Science, Johns Hopkins University, Baltimore, MD, United States

Leveraging Artificial Intelligence (AI) in decision support systems has disproportionately focused on technological advancements, often overlooking the alignment between algorithmic outputs and human expectations. A human-centered perspective attempts to alleviate this concern by designing AI solutions for seamless integration with existing processes. Determining what information AI should provide to aid humans is vital, a concept underscored by explainable AI's efforts to justify AI predictions. However, how the information is presented, e.g., the sequence of recommendations and solicitation of interpretations, is equally crucial as complex interactions may emerge between humans and AI. While empirical studies have evaluated human-AI dynamics across domains, a common vocabulary for human-AI interaction protocols is lacking. To promote more deliberate consideration of interaction designs, we introduce a taxonomy of interaction patterns that delineate various modes of human-AI interactivity. We summarize the results of a systematic review of AI-assisted decision making literature and identify trends and opportunities in existing interactions across application domains from 105 articles. We find that current interactions are dominated by simplistic collaboration paradigms, leading to little support for truly interactive functionality. Our taxonomy offers a tool to understand interactivity with AI in decision-making and foster interaction designs for achieving clear communication, trustworthiness, and collaboration.

1 Introduction

Advances in Artificial Intelligence (AI) open new possibilities for supporting human decision making across a wide variety of applications. Decision-making tasks in a broad range of applications share a common process: evidence is presented, a decision is made among discrete choices, and that decision usually has follow-up effects. Within this framework, the decision-making process emerges as a scenario for human-AI teamwork where at a minimum two parties, i.e., the human and the AI, factor into finding a solution to the decision problem. The exact dynamics of how this collaboration occurs can vary from one situation to another, leading to multiple interaction options that range from simple recommendations to involved exchanges (Bansal et al., 2019; Lai et al., 2023; Bertrand et al., 2023). To bridge algorithmic suggestions and human expectations, embracing a human-centered approach in designing AI solutions is crucial to identify what information AI should provide to aid humans while ensuring safe and transparent operation. It is equally crucial to understand how and when to best communicate that information in order to design successful human-AI interactions. Explainable AI (XAI) expands model capabilities by providing not just so-called black-box recommendations but also justifications tailored to end users' needs. The design of justifications from a human-centered perspective informs what AI models should deliver, and it naturally involves careful thought on how to present this support, although it may not specifically address the ensuing interactions. The presentation style (how) and the strategic timing (when) for providing AI-generated insights are closely related to the type and sequence of interactions between humans and AI, which are ultimately enabled by the affordances of certain assistance elements (Lai et al., 2023). Despite the importance of and opportunities in interaction design within AI, there is currently no common vocabulary to describe and differentiate these interactions.

Interactivity is a familiar concept to humans and is widely studied in more specialized domains such as information visualization, interface and software design (Yi et al., 2007), and human-human interactions from a social perspective (Magnusson, 2018). Understanding interactions requires delving into multiple dimensions that involve the subjects, modes, and purposes of interaction, and the context in which they take place (Schleidgen et al., 2023). Among these, interaction patterns emerge as sequences of behaviors that occur more often than by chance between agents and systems or artifacts. We have learned from these disciplines the importance of deliberate choices in selecting the interaction types needed to achieve specific goals, rather than imbuing unnecessarily high levels of interactivity that do not result in better products (Sims, 1997). Likewise, finding the right balance of interactivity between humans and AI systems is not just a matter of enhancing user experience but is essential for achieving clear communication, trustworthiness, and meaningful collaboration. While current AI systems excel in offering problem-solving capabilities, there is often a disproportionate emphasis on the technological advancements, overlooking the critical aspects of user interface and experience. This oversight is apparent in many empirical studies in diverse domains and decision tasks (Lai et al., 2023; Bertrand et al., 2023), where interactions with AI agents are typically reduced to basic actions like menu selections or button clicks. However, the specific configurations needed to evaluate the effect of different AI assistance elements (or other context-related factors) on humans interacting with AI can lead to multiple forms of collaboration, i.e., the actual interactions afforded.

Which forms of interactivity to incorporate in human-AI interactions is an open question and may depend on the overall context, emphasizing the need to deliberately study interaction patterns between humans and AI that can guide the development of better solutions. Psychology and the social sciences have extensively studied human-human interactions, revealing patterns of reciprocal actions, such as “Question-Answer” and “Request-Offer” (Magnusson, 2018), which can also be applied in a setup where humans interact with AI. Likewise, the field of Human-Robot Interaction (HRI) has studied how humans and robots communicate, collaborate, and engage with each other, often through the analysis of observable patterns in their interactions (David et al., 2022; Sauppé and Mutlu, 2014; Ma and Cao, 2019). These patterns cover aspects of social interactions like greetings, attention, feedback, turn-taking, social cues, and farewell. All these elements foster a more natural and efficient communication experience, which can take place in AI technologies that incorporate natural language abilities. Other domains such as information visualization and user interface design have also developed taxonomies and libraries of interaction patterns based on different criteria such as user intent, purpose, scope, abstraction level, and granularity (Yi et al., 2007; Silva-Rodríguez et al., 2020). These categorizations serve as a base on which to build more comprehensive taxonomies that are grounded in evidence from empirical studies and incorporate knowledge from interaction design in other disciplines. In the context of AI, descriptions of possible interactions between humans and machines or AI have grouped them by user control and initiative (Van Berkel et al., 2021; Cheng et al., 2022), task nature (Parasuraman et al., 2000), and level of automation (Mackeprang et al., 2019). Existing classifications of human-AI interactions are often domain-specific, lacking a comprehensive framework that spans multiple domains. Without a shared vocabulary to describe human-AI interactions in decision-making tasks, it becomes difficult to compare approaches, aggregate findings, and synthesize knowledge across diverse studies, ultimately limiting the exploration of new interaction paradigms. This highlights the importance of a structured framework for categorizing different interaction patterns, helping designers understand and improve human-AI collaboration, and facilitating informed decisions in interaction design to enhance partnerships between humans and AI.

To address this gap, we first conducted an extensive systematic review of the human-AI interactions that have been reported in human-AI decision making scenarios. We searched for relevant articles from five databases that cover Human-Computer Interaction (HCI) studies and related disciplines and selected 105 to conduct a detailed coding and analysis of the sequences of interactions that exist between humans and AI. Furthermore, we considered the task context and AI system involved in these empirical studies. Recent review works have characterized the design space in human-subject studies with AI for decision-making, providing insights about AI assistance elements (Lai et al., 2023) and methodologies to empirically evaluate human factors (Vereschak et al., 2021). XAI has been a main subject of reviews as well, not only to structure existing techniques (Sheu and Pardeshi, 2022), but also to investigate human factors that arise during decision making, including cognitive biases (Bertrand et al., 2022) and cognitive processes supported by interactive explanations (Bertrand et al., 2023). For our review, we selected empirical studies involving decision tasks where users are actively aware of the interaction and engaged in the decision-making process, as this focus was crucial for studying the dynamics of human-AI interactions. Grounded in the trends from our systematic review, we propose a taxonomy comprising seven interaction patterns that arise between humans and AI. To the best of our knowledge, there is not yet a comprehensive and structured classification of existing interaction patterns between humans and AI. We propose our taxonomy of interactions as a tool to better understand existing interaction patterns across domains and applications, allowing us to identify the occurrences of common interactions across multiple domains. We envision the use of the taxonomy to foster dialogue on interactivity in AI-assisted decision making, encourage refinement and evaluation of novel patterns, and ultimately design better and more user-centered AI-based solutions.

2 Methods

2.1 Search strategy and selection criteria

This survey focuses on Human-AI interaction paradigms for explicit decision-making tasks, in contrast to proxy tasks where users are asked to simulate the AI's outputs. Therefore, we aim to understand and evaluate works that study human-AI interactions during decision-making tasks under AI assistance, rather than improvements to the model itself. Our survey covers studies conducted between 2013 and June 2023. Specifically, we searched within five databases: ACM Digital Library, IEEE Xplore, Compendex, Scopus, and PubMed. The first four have extensive coverage of relevant studies in HCI across conference proceedings and journal publications (Compendex and Scopus include papers from additional subject areas), while PubMed allowed us to capture research specifically related to medical applications of Human-AI interaction and decision-making. We were particularly interested in empirical studies with AI in the healthcare domain because of the need for human-subjects research in AI's development cycle and the importance of aligning algorithms with user needs and clinical workflows (Chen et al., 2022; van de Sande et al., 2024). The study of existing interactions between humans and AI support can suggest new directions and opportunities to enhance the experience and outcomes of using AI to support decision making in healthcare. We defined search terms covering four dimensions: use of AI systems, human-AI interaction or collaboration studies, decision-making tasks, and interaction design. We included the last term because we wanted to focus on articles that evaluate interactions with AI systems during decision making tasks. The complete set of keywords used in our search can be found in Appendix A. We defined the following inclusion criteria:

• The tasks in consideration are those related to decision-making, and in particular, we limit the study selection to those that implement complete decision making processes and not only evaluations of decision makers' perceptions, such as understanding, preferences, or judgments of AI's advice.

• The paper shows an implementation of the interface that was presented to human users to interact with AI.

• The modes of interaction encompass screen-based interfaces, virtual agents, and non-embodied setups.

• We have restricted our selection solely to papers featuring empirical user studies.

In addition to the inclusion criteria above, our search excluded studies in robotics and gaming by filtering out these keywords in the title and abstract. We excluded studies that involve robots because physical embodiment enables additional dimensions of an interaction, and those involving gaming scenarios because they are more complex, with fewer constraints for studying how humans can interact with AI assistance. However, studies that implement decision making tasks through gamified tasks were included. Other survey papers or comments were also excluded by filtering out keywords in the title and abstract.

2.2 Study selection

The initial search returned 3,770 papers, and 358 duplicates were found and deleted automatically. This left us with a total of 3,412 papers to screen. They were assigned to two authors to first go through title and abstract screening, followed by full-text screening. The screening phase was oriented toward the exclusion of papers that were not focused on human-AI interaction, i.e., papers that were limited to technical contributions, did not involve a complete decision-making task, were short papers (< 8 pages), involved gaming or robotics, had not been peer-reviewed, or were review or survey papers. We did not constrain our selection to works that directly manipulate the type of interaction between the human and AI. Our main interest was in the existing/available ways for humans to interact with AI agents in evaluations of human-AI decision making. The title and abstract screening excluded 2,893 papers, leaving 519 for full-text review, at which stage 363 papers were filtered out. The remaining 156 were considered for the information extraction stage, of which 51 were removed because a more detailed reading allowed us to identify that they did not satisfy the inclusion criterion of supporting actual AI-assisted decision making tasks. Appendix B summarizes the complete paper selection process. In the end, 105 articles were included in our review.

2.3 Data extraction strategy

2.3.1 Analysis process

The data extraction template was developed by all authors and informed by previous surveys of empirical studies in human-AI interaction (Bertrand et al., 2023; Lai et al., 2023). Two authors divided the final selected articles between them and coded their assigned articles independently. Then, one author checked the individual reports for each article to ensure consistency in the final extraction. Further discussions with the other authors took place to clarify discrepancies in the interactions or ambiguous cases. For the analysis of the interaction patterns, the authors reviewed the sequences of interactions and discussed how to group them into the recurring design patterns presented in this work. We iterated over the definition of each pattern to refine the actual components that constitute the interaction.

2.3.2 Coding of the papers

The following components comprised the extraction template:

2.3.2.1 Context

We identified general information about the context in which the decision making task takes place. This includes the domain, for which we adapted the categories initially proposed in a prior survey of AI-assisted decision making (Lai et al., 2023). Furthermore, we specified the decision making task to be completed by the human (e.g., detection of hate speech, sleep stage classification, price estimation, among others) and the level of expertise required to successfully complete the task.

2.3.2.2 AI system

As we are interested in humans' interactions with AI agents, we retrieved the original goal of incorporating AI assistance in the decision making task and briefly characterized the AI system used in the study. In particular, we extracted the technique supporting the AI's recommendations (whether a real model was used or the outcomes were simulated), its performance (if any evaluation metric was reported), and the terminology used to introduce the AI agent to participants in the user study. Further details, such as data type, data source, and output type, were not reported, as the main focus of this survey is on the interactions rather than the type of AI methods, which were previously surveyed in Lai et al. (2023).

2.3.2.3 Interaction building blocks

In the context of this survey, an interaction involves a reciprocal action or influence between two agents (Schleidgen et al., 2023). To characterize this, we defined two elements: the action undertaken and the resulting output of that action. These two elements constitute the interaction building blocks that can be integrated into more complex interactions. In the definition of our taxonomy of human-AI interactions, we considered these building blocks as the main elements that constitute the interaction patterns. Our familiarity with prior studies on human-AI interactions served as an initial reference to ideate the (action - output) pairs available for the agents involved in the interaction, namely the human and the AI. We further drew inspiration from existing interaction design patterns developed for prototyping human-robot interaction (Sauppé and Mutlu, 2014) and a taxonomy of interactivity techniques in XAI (Bertrand et al., 2023). We considered the actions listed in these works to characterize interactions and the functionality of explanations, respectively, and how they could be extended to agents involved in decision-making tasks. These building blocks primarily focus on the AI assisting the user in decision-making processes; they are grouped by the main action and specify the agent that can execute it. It is important to highlight that these building blocks can also occur sequentially, where one may be triggered in response to another. However, we have chosen to maintain a granular level of detail in the following descriptions to better capture the nuances of individual actions that later compose the interaction patterns.

• Predict - Outcome: The agent produces a solution to the primary decision-making task after receiving task information. This action is observed to be executed by either the AI or the user independently.

• Decide - Outcome: The agent integrates the assistance they received with the task-related information available to finalize the decision outcome. This action is typically observed to be executed by the user.

• Provide - Options: The agent offers solutions for a secondary task that, while not directly resolving the primary decision-making task, are still informative. This action is typically observed to be executed by the AI.

• Display - Information: The agent presents supplementary evidence (e.g., explanations, uncertainty values, alternate solutions) supporting a solution to the primary decision-making task. This action is observed to be predominantly executed by the AI.

• Request - Outcome/Information: The agent actively seeks information or solutions from its counterpart. This action is observed to be either mandatory or optional and is typically executed by the AI when it requires user inputs, or by the user when they seek a direct solution or supplementary information from the AI.

• Collect - Inputs: The agent gathers task-related information and provides it to the other agent. This action is observed to be typically executed by the user when their input is needed for the AI to provide a solution to the decision-making task.

• Modify - Outcome/Information: The agent makes changes to the solutions or supplementary information provided by its counterpart. This action is typically observed to be optional and executed by the user.

• Delegate - Decision: The agent decides whether to retain responsibility for the task or transfer it to its counterpart. This action is observed to be executed by either the AI or the user. Events after delegation can differ, ranging from complete surrender of agency to opportunities for supervising the other agent's decision-making process.

• Other: any action that does not fit the previous types.

We refined and iterated on these concepts as we reviewed more works, since new actions could be supported for either agent, while trying to maintain the generalizability of the blocks across multiple studies. For each paper, we first identified all possible (action - output) pairs as the building blocks for each agent involved in the decision-making task and included a brief description in free-text form. Then, we defined a sequence of interactions considering the order in which these events take place and the agent in charge. The sequences were not preset in the extraction template, since we wanted to discover the interaction patterns from the data. We note that depending on the experimental manipulation of the user study, different modes of interactivity with the user could be plausible, and we separated these into different sequences.
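To make the coding scheme concrete, the building blocks and coded sequences can be expressed as simple data structures. The sketch below is only illustrative: the enum values mirror the building blocks listed above, but the class names and the example sequence are hypothetical and not part of our extraction template.

```python
from dataclasses import dataclass
from enum import Enum


class Agent(Enum):
    HUMAN = "human"
    AI = "ai"


class Action(Enum):
    PREDICT = "predict"      # produce a solution to the primary task
    DECIDE = "decide"        # finalize the decision outcome
    PROVIDE = "provide"      # offer options for a secondary task
    DISPLAY = "display"      # present supplementary evidence
    REQUEST = "request"      # seek information or solutions from the counterpart
    COLLECT = "collect"      # gather task-related inputs
    MODIFY = "modify"        # change solutions or supplementary information
    DELEGATE = "delegate"    # retain or transfer responsibility
    OTHER = "other"


@dataclass
class BuildingBlock:
    agent: Agent        # who executes the action
    action: Action      # what they do
    output: str         # free-text output, e.g., "outcome" or "information"
    optional: bool = False


# Hypothetical coded sequence: the AI predicts first, optionally shows an
# explanation, and the user then finalizes the decision.
sequence = [
    BuildingBlock(Agent.AI, Action.PREDICT, "outcome"),
    BuildingBlock(Agent.AI, Action.DISPLAY, "information", optional=True),
    BuildingBlock(Agent.HUMAN, Action.DECIDE, "outcome"),
]
```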

3 Results

3.1 Taxonomy of interaction patterns for AI-assisted decision making

We present seven categories of interaction patterns that we have identified in our corpus, illustrated in Figure 1. Interactions involve changes over time, and we attempted to capture this evolution/progression in the interaction patterns presented below and in the diagrams that illustrate them. To formulate the taxonomy, we began by reviewing previous literature containing taxonomies of interaction techniques in other domains, such as information visualization (Yi et al., 2007), human-robot interaction (Sauppé and Mutlu, 2014), multi-agent systems (Cabri et al., 2002), and educational technologies (Sims, 1997). Contrasting the concepts in these taxonomies with a sample of studies in human-AI interaction that we were familiar with, we identified ways in which the previously described interaction types could apply to humans interacting with AI during decision making. As we analyzed the articles included in our review, we iteratively refined our taxonomy's definitions and structure, merging similar interaction types and creating new categories when existing ones did not adequately capture the observed patterns. In each pattern formulation, we chose an appropriate level of abstraction so that the patterns capture multiple actions of the agents and generalize over the various studies included in the systematic review. The interaction patterns defined in the taxonomy are composed of the interaction building blocks and can borrow elements from other pattern categories. Furthermore, we included an “Other patterns” category to present the interaction patterns that did not exactly fit into the main categories. In the following descriptions, we assume that users already have some background knowledge or intuitions that can be used during the decision making task. Lastly, the interaction patterns are not mutually exclusive; they are elements that can be consolidated and combined. After finalizing the taxonomy, we quantified the frequency of each interaction pattern across our corpus of 105 articles using a combination of an automated search for certain action-output pairs present in the sequences of interactions characterized in the data extraction stage of the review, followed by manual inspection and verification. We then analyzed the application domains where these interactions were studied.


Figure 1. Taxonomy of interaction patterns identified in AI-assisted decision making. The user (human) and the AI are represented as separate agents, and the temporal evolution of the interactions is illustrated from top to bottom. The boxes contain the building block (action-output) pairs that compose each pattern. The direction of the arrows denotes the agent who started the action. Dashed lines represent optional operations.

3.1.1 AI-first assistance

This pattern manifests when the decision-making problem and the AI-predicted outcome are simultaneously displayed to the user. As the ultimate decision maker, the user can choose to incorporate the AI's advice into their final decision or opt to disregard it. When task-related stimuli (e.g., images or case details) are presented alongside the AI-predicted outcome, the user is provided with a more comprehensive set of information to consider. In addition, the AI's outcome can be accompanied by support information, as captured in the Display - Information building block. This pattern has been previously observed and referred to as the “concurrent paradigm” (Tejeda et al., 2022) or “one-step workflow” (Fogliato et al., 2022).

3.1.2 AI-follow assistance

This pattern begins with the user forming an independent preliminary prediction given the decision-making problem. Following this initial judgment, the AI's predicted outcome is presented and may be accompanied by support information. This procedure provides the user with a reference (their initial assessment) to compare against the AI's advice, and an opportunity to reassess their initial judgment. This approach has been identified as the “sequential paradigm” (Tejeda et al., 2022) or “two-step workflow” (Fogliato et al., 2022) and has been commonly used to evaluate the human's reliance on the AI's advice.

3.1.3 Secondary assistance

In this pattern, the AI offers information that does not serve as a direct solution to the decision-making problem. The user must interpret this supplementary information as an auxiliary task, determine its relevance, and decide how to incorporate it into their primary decision-making process. We distinguish this as a unique interaction pattern because users may respond differently to direct assistance compared to more secondary assistance in their decision-making process. For example, a machine learning model can predict risk values associated with certain profile information, while the human's decision problem is to make an investment decision (Dikmen and Burns, 2022).

3.1.4 Request-driven AI assistance

In this interaction pattern, the user has to actively seek information or solutions from the AI. Rather than the AI's inferences being automatically presented, the user can control when they want to receive the AI assistance. Meanwhile, the user can spend more time deliberating about the problem, a strategy known as cognitive forcing (Buçinca et al., 2021; Park et al., 2019). This pattern can be perceived as less intrusive to the user, as it empowers the user to “ask” for information or solutions from the AI, and allows the user to anticipate the AI's assistance in the decision-making process.

3.1.5 AI-guided dialogic user engagement

Within this interaction pattern, the AI facilitates a dialogue-like engagement with the user. Guided by the AI's instructions, the user responds by providing pertinent information. The iterative exchange continues until the AI's instruction requirements are satisfied, and is followed by the presentation of the AI's predicted outcome for the decision-making task. This responsive exchange not only involves retrieving and sharing information in line with the task but also ensures that the users recognize the influence of their inputs on the AI's predicted outcome. While this pattern has been commonly observed in humans interacting with conversational agents (Jiang et al., 2022; Gupta et al., 2022), it is not only limited to traditional conversational interfaces (Gomez et al., 2023).

3.1.6 User-guided interactive adjustments

Inspired by the taxonomy of interactive explanations recently proposed from a scoping review (Bertrand et al., 2023), we included an interaction pattern where humans can modify the outcome space of the AI agent. Typically, information flows from the AI to the user. However, in this pattern, the direction of flow is reversed, with humans providing the AI with feedback, corrections, or information to shape its inferences. While a detailed classification of potential modifications is beyond the scope of this survey, we distinguished cases in which the changes are merely visual updates in the interface or considered as feedback to improve the underlying AI models, as in interactive machine learning (Amershi et al., 2014).

3.1.7 Delegation

In this interaction pattern, both the user and the AI leverage their unique strengths and capabilities to optimize the decision-making outcome. Delegation can be a strategic choice when one agent assesses its counterpart as better equipped for a particular task (Fügener et al., 2022). On the other hand, if an agent feels confident in their ability to complete the task, they will take the lead. Studies highlight that the complementary abilities of humans and AI when synergized properly, can enhance the decision-making outcome (Zhang et al., 2022).

3.1.8 Others

We included within this category those patterns that involve a combination of the main interaction blocks and did not fit into the patterns described before. More complex interactions emerge when the decision making problem involves multiple individual decisions, more than two agents, or continuous interactions with an AI agent.
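As a rough illustration of how the patterns relate to the building blocks, and of the kind of automated search over coded sequences described in Section 3.1, the sketch below encodes a few patterns as simplified (agent, action) signatures and checks whether they appear in a coded sequence. The signatures are deliberately reduced (they ignore outputs and optional blocks) and are an assumption for illustration, not the exact rules used in our analysis.

```python
# Minimal sketch: matching coded interaction sequences against simplified
# pattern signatures. Signatures ignore outputs and optional blocks.

SIGNATURES = {
    "AI-first assistance": [("ai", "predict"), ("human", "decide")],
    "AI-follow assistance": [("human", "predict"), ("ai", "predict"), ("human", "decide")],
    "Request-driven AI assistance": [("human", "request"), ("ai", "predict"), ("human", "decide")],
    "AI-guided dialogic user engagement": [("ai", "request"), ("human", "collect"),
                                           ("ai", "predict"), ("human", "decide")],
}


def contains_subsequence(sequence, signature):
    """True if all signature steps appear in the sequence in order (not necessarily contiguously)."""
    remaining = iter(sequence)
    return all(step in remaining for step in signature)


def match_patterns(sequence):
    """Names of all pattern signatures found in a coded sequence."""
    return [name for name, sig in SIGNATURES.items() if contains_subsequence(sequence, sig)]


# Hypothetical coded sequence: the user requests the AI's prediction, reviews it
# together with an explanation, and then makes the final call.
coded = [("human", "request"), ("ai", "predict"), ("ai", "display"), ("human", "decide")]
print(match_patterns(coded))
# ['AI-first assistance', 'Request-driven AI assistance']: with these simplified
# signatures the two overlap, reflecting that patterns are not mutually exclusive.
```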

3.2 Identification of interaction patterns in AI-assisted decision making studies

We identified the different categories of interaction patterns in the selected 105 articles. If a study included more than one interaction sequence, i.e., the experimental manipulation resulted in different ways in which users could interact with the AI, we considered them separately and counted patterns in each one. In total, we analyzed 131 sequences. The most common interaction pattern during empirical evaluations of AI-assisted decision making tasks was AI-first assistance (n = 67), followed by AI-follow assistance (n = 28). Furthermore, the AI's solutions to the decision making task were presented along with additional information in the majority of cases, 81% and 68% during AI-first and AI-follow interactions, respectively. For instance, in the AI-first pattern for predicting student outcomes, the AI takes student-related features and predicts pass or fail with a confidence value (Rastogi et al., 2022). Participants then review this AI prediction alongside the student data to make their final decision. Meanwhile, in the AI-follow pattern for another binary task, participants make an initial prediction regarding the outcome of speed dating events, and then a final one after seeing the AI's prediction. We observed 16 instances of the Secondary assistance pattern. In particular, we noticed that most of these decision making tasks required expertise (11/16). As an example, in the task of predicting gestational diabetes mellitus (GDM), healthcare professionals are presented with various descriptive features such as a history of diabetes mellitus, age, body weight, etc. (Du et al., 2022). The AI system processes this information and outputs a categorized risk of GDM, labeling it as either low or high. Rather than offering a direct solution, the AI presents this risk category alongside an explanation, thereby serving as a form of secondary assistance.

Sometimes during the decision making task, users had to actively seek support from the AI agent, as specified in Request-driven AI assistance (n = 25). More specifically, the requests could be for a direct solution to the decision making task (n = 14) or for the presentation of support information (n = 13). In the former, in only three cases was the request for the AI's solution optional (Kumar et al., 2021; Tolmeijer et al., 2021; Baudel et al., 2021), meaning that users could come up with a solution to the decision making task on their own. As illustrated in a house search scenario, users could choose to use an AI system to help them find a house that satisfies certain requirements, with the option to directly submit the suggested house or verify it first (Tolmeijer et al., 2021). Meanwhile, support information at the user's discretion was identified in eight cases (Calisto et al., 2022; Vössing et al., 2022; Suresh et al., 2020; van der Waa et al., 2021; Liu et al., 2021; Molina and Sundar, 2022; Prabhudesai et al., 2023), in particular for requesting explanations. In the AI-first assistance pattern, where the user may not have an opportunity to form an independent assessment of the decision making problem, we observed cases (n = 10) in which users were given the ability to control when they wanted to receive the AI assistance via a request (Khadpe et al., 2020; Mackeprang et al., 2019; Baudel et al., 2021; Molina and Sundar, 2022; Gomez et al., 2023; Buçinca et al., 2021).

To a lesser extent, we found interaction patterns that involve more exchange between the human and the AI agent. For AI-guided dialogic user engagement, as the name suggests, five out of the six interactions were supported via conversations with the AI agent. Through conversational interfaces, users had to provide some constraints given in the decision making problem for the AI to propose a candidate solution (Khadpe et al., 2020; Jiang et al., 2022; Gupta et al., 2022). However, such an exchange of information does not necessarily rely on a conversational interface, as demonstrated in the evaluation of an AI system that could constantly provide guidance on a decision sub-problem for identifying bird categories (Gomez et al., 2023). The two-way interaction occurs as the AI requests and suggests bird attributes for description, culminating in a bird category suggestion. Users can actively engage by processing the attributes, considering the AI's input, and making decisions regarding the bird's attributes. Ultimately, they verify the AI's suggested bird category. Interactivity also supports adjustment of the AI's outcome space, and we found User-guided interactive adjustments in nine cases that differed in the observed effect of the adjustment. For instance, manipulations of the inputs result in new AI outcome computations for exploratory purposes (Liu et al., 2021; Gu et al., 2023; Nguyen et al., 2018; Zytek et al., 2022; Suresh et al., 2022). Such functionality can be incorporated into interactive explanations, where users can manipulate input values of a specific instance and observe the change in recidivism predictions (Liu et al., 2021). When adjustments did not directly translate into an updated AI outcome, users' feedback was considered for future improvement of the model (Ashktorab et al., 2021; Molina and Sundar, 2022; Smith-Renner et al., 2020; Lee et al., 2021). Manual labeling after observing AI predictions can be leveraged to identify incorrect intent classifications from textual samples and re-train the models (Ashktorab et al., 2021).

Opportunities to delegate decisions were observed in nine sequences of interactions, though with differences in the conditions for delegation. For instance, in some cases the users “blindly” delegate the decision to the AI, without having access to its outcome and without being able to supervise it later (Chiang and Yin, 2021; Maier et al., 2022; Zhang et al., 2020; Fügener et al., 2022). For example, in stock investment decisions, people can choose to invest directly in specific stocks or delegate a portion of their funds to the AI for future investment decisions (Maier et al., 2022). In others, the AI agent makes the decision to delegate, and the user is assigned some of the decision making tasks to complete on their own (Hemmer et al., 2023; Fügener et al., 2022), or is presented with the AI's outcome as support, resulting in the AI-first pattern (Bondi et al., 2022). In addition, the user can object to the delegation decision of the AI and take charge of the decision if considered appropriate (van der Waa et al., 2021).

Lastly, the articles that we included in the “Others” category of interaction patterns can be separated into three groups. First, decision making tasks that involved more than one decision outcome (Porat et al., 2019; M. A. Rahman et al., 2021) and corresponding support from the AI agent. Second, decision making problems where multiple instances of decision tasks can take place and the interaction with the AI agent is continuous (Van Berkel et al., 2022; Fan et al., 2022; Nourani et al., 2021; Lindvall et al., 2021; Reverberi et al., 2022). Third, interactions that involved a third agent (Wu et al., 2022; Brachman et al., 2022; Banas et al., 2022). In addition, a different case arises when independent solutions to the decision making problem, from the human and the AI agent, are averaged as the final verdict (Xiong et al., 2023).

3.3 Landscape over domains evaluated in AI-advised human decision making

3.3.1 What domains have been defined as contexts to evaluate AI-assisted decision making?

The selected articles included in this survey cover a broad range of domain categories previously identified (Lai et al., 2023). Articles that included more than one experimental decision making task were counted toward more than one domain. Table 1 presents a summary of the major domains and the different decision making tasks evaluated. Overall, the majority of the studies conducted human-AI interaction evaluations in real-world applications, with less than 15% formulating artificial tasks. We included medical-related databases in our search strategy, which contributes to the large representation of decision making tasks in the healthcare domain (26/108). In addition to healthcare, decision making tasks that may involve high stakes outside of an experimental setup were identified in the finance and business (15/108) and law domains (6/108). The second most common domain was the context of generic tasks (20/108) that are low-effort processing for humans but have mostly been used to develop AI benchmarks and demonstrate the technical feasibility of algorithms. Other domains that typically include tasks targeted to non-expert users are social media (8), labeling (8), and leisure (1). We assigned tasks with unique applications to the Other domain (6/108). Even though we identified multiple decision making tasks in applications that require a specialized population, where recruiting large numbers of participants is often challenging, the majority of human-AI interactions have been evaluated with non-expert users (60/108). The types of AI systems behind the interactions with users in the studies that covered these decision-making tasks were distributed among three categories: simulated models or Wizard of Oz experiments (39/105), deep learning-based models (34/105), and shallow models (35/105).


Table 1. Domains and corresponding decision tasks used to study AI-assisted decision making.

3.3.2 In what contexts were the interaction patterns observed during AI-assisted decision making processes?

To better understand the existence and availability of the interaction patterns in different domains, we quantified the occurrence of patterns per domain and provide an overview describing the trends. Figure 2 shows the distribution of interaction patterns within our taxonomy for different domains. Values equal to zero mean that a certain interaction pattern was not observed in the studies included in this survey for a specific domain. Tables in Appendix C provide more details on the interaction patterns for each paper included in this review.


Figure 2. Percentage of interaction patterns observed in each domain of AI-assisted decision making tasks included in this review. The numbers in the cells denote the percentage values (e.g., 17% of the patterns identified in the healthcare domain correspond to Secondary assistance). One study can include multiple sequences of interaction and interaction patterns are not mutually exclusive.
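For readers who wish to reproduce this kind of summary from their own coded data, the percentages in Figure 2 amount to a per-domain normalization of pattern counts. The sketch below shows the aggregation with a few hypothetical records; the record values are illustrative and not taken from our corpus.

```python
from collections import Counter, defaultdict

# Hypothetical coded records: one (domain, pattern) pair per pattern occurrence.
records = [
    ("healthcare", "AI-first assistance"),
    ("healthcare", "AI-first assistance"),
    ("healthcare", "Secondary assistance"),
    ("finance and business", "AI-follow assistance"),
    ("finance and business", "Delegation"),
]

counts = defaultdict(Counter)
for domain, pattern in records:
    counts[domain][pattern] += 1

# Percentage of each pattern within its domain (one row of Figure 2 per domain).
for domain, pattern_counts in counts.items():
    total = sum(pattern_counts.values())
    row = {pattern: round(100 * c / total) for pattern, c in pattern_counts.items()}
    print(domain, row)
```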

Human-AI interactions in AI-advised decision making in the healthcare domain mostly adhered to the AI-first assistance pattern (n = 14), followed by Secondary assistance (n = 7). Request-driven AI assistance was observed in a few cases (n = 4), as was AI-follow assistance (n = 4). We identified one interaction that supported AI-guided dialogic user engagement, three interactions with User-guided interactive adjustments, and one in which delegation was an option. Five sequences of interactions were in the “Others” category due to the higher complexity of the interactions. In the domain of finance and business, AI-first assistance was the most common type of interaction (n = 6), followed by AI-follow assistance (n = 4) and Secondary assistance (n = 4). Some interactions supported Request-driven AI assistance (n = 3), AI-guided dialogic user engagement (n = 2), and delegation (n = 3). We did not observe support for User-guided interactive adjustments. We identified four types of interaction patterns in the law and civic domain: AI-first assistance (n = 4), AI-follow assistance (n = 2), Request-driven AI assistance (n = 2), and User-guided interactive adjustments (n = 2). Decision-making tasks that involve professional topics mostly followed AI-first assistance (n = 4). Request-driven AI assistance and User-guided interactive adjustments were observed in one case each; the other interaction patterns were not observed. Interaction patterns during decision making tasks in social media contexts were mostly of the AI-first (n = 4), AI-follow (n = 4), and Request-driven AI assistance (n = 4) types. We identified Secondary assistance in one case, and two cases in which User-guided interactive adjustments were enabled, in particular for updating relevant terms for content moderation purposes. No Delegation or AI-guided dialogic user engagement was observed, and two cases fell into the “Others” patterns category. During decision making tasks in generic applications, most of the interactions were dominated by AI-first assistance (n = 21), while only two cases involved the AI-follow assistance interaction. Regarding the more interactive patterns, two cases supported Request-driven AI assistance, two AI-guided dialogic user engagement, and one User-guided interactive adjustments. In only one case was the type of assistance secondary. Delegation was featured in five cases. Labeling tasks mostly included AI-first assistance patterns (n = 6), followed by Request-driven AI assistance (n = 4) and Secondary assistance (n = 3), mostly for clustering similar data points. We further identified AI-follow assistance and User-guided interactive adjustments once each, and two types of interactions in the “Others” patterns category. Artificial decision making tasks were mostly dominated by AI-follow (n = 8) and AI-first assistance (n = 7). Request-driven AI assistance was featured in four cases, and four interactions were in the “Others” patterns category. Lastly, AI-guided dialogic user engagement was the type of interaction in the leisure-related task, and AI-first assistance in the task within the education domain. Interactions during decision-making tasks that belong to other domains mainly contained AI-first (n = 4) and AI-follow (n = 3) assistance types, or patterns in the “Others” category (n = 3). Further, one case supported Request-driven AI assistance.

4 Discussion

In establishing the interaction patterns presented in this paper, we drew from our observations of the Human-AI interactions used in AI-advised decision-making scenarios in prior empirical studies. Constructing a taxonomy is inherently challenging due to the wide array of potential approaches that can be adopted. Specifically, in the domain of Human-AI interaction, interactions can be examined through various lenses (e.g., system-centric, oriented around user goal/task, distinguished by varying levels of granularity in interaction techniques). In this work, we have taken a preliminary step to structure an approach by integrating our perspectives with observations of interaction paradigms used in existing studies. Below, we discuss the findings from our systematic review in combination with the taxonomy, showcasing how it allowed us to identify trends and opportunities for the study of human-AI interactions.

4.1 Trends in existing Human-AI interactions

Using our taxonomy, we characterized existing interactions adopted in empirical studies. In general, while the most common patterns were AI-first, AI-follow, or Secondary assistance, in which the human role was limited to supervising the AI predictions, we also noted the presence of more dynamic interactions (e.g., Request-driven AI assistance, AI-guided dialogic user engagement, User-guided interactive adjustments), although less frequently. Interactive elements can enhance the communication of explanations, allowing users to interpret AI predictions through selection, mutation, and dialogue, as suggested by Bertrand et al. (2023). Likewise, supporting human input and review in the field of Interactive Machine Learning requires the design of interface elements for sample review, model inspection, or feedback assignment (Dudley and Kristensson, 2018). Beyond supervising AI outputs, a new paradigm arises when humans and AI co-create solutions, blending human intuition and expertise with AI's computational strengths for more robust outcomes. Conversational interfaces play a key role in enabling this two-way interaction. Moreover, the concept of distributing decision-making responsibilities among different agents, as seen in delegation patterns, extends the assistance beyond individual decisions (Lai et al., 2022). This leads to diverse collaborative strategies, ranging from working in parallel, as exemplified in delegation scenarios where AI operates autonomously but in alignment with human intent, to more coordinated efforts, such as the turn-taking dynamic inherent in conversational AI.

A finer analysis per domain revealed the limited use of and support for diverse interaction patterns, represented by most cell values being equal to zero in Figure 2. However, it is worth noticing that for high-stakes domains, such as healthcare and finance, multiple interaction patterns have been explored when AI provides decision support. The choice of specific interaction patterns can be influenced by several factors, including both design choices made by researchers and the intrinsic nature of the problems being addressed. In the former, the research intent and available resources may affect design choices. In the latter, ethical and legal considerations play an important role, especially in high-stakes domains. In more specialized fields, domain experts have their unique set of capabilities that can directly influence the choice and efficacy of interactions with the AI. For these experts, Secondary assistance can be beneficial, since they have the insights to effectively use the supplementary AI information for the primary decision-making task. However, for non-experts, the most suitable and beneficial choice of interaction pattern is unclear. There remain questions about the universality of certain interaction patterns across varied user groups and task scenarios. Furthermore, with the common usage of the AI-first and AI-follow approaches, understanding their pros and cons is crucial in developing AI systems that align with human cognitive processes and decision-making styles.

4.2 Challenges of different interaction patterns

The popularity of the AI-first assistance pattern can be attributed to the straightforwardness of demonstrating the effects of incorporating AI assistance into a decision-making task. However, it presents challenges, notably the difficulty of measuring the actual influence of AI assistance on user decision-making. Since the AI's solution is revealed before the user has had the opportunity to process the task independently, it can be convenient for the user to either dismiss or follow the AI's recommendation without sufficient reflection. Interaction patterns may engender different types of biases, and knowing them in advance may help guard against them. For instance, the AI-first interaction can make the user susceptible to the “anchoring bias,” a phenomenon where a person's judgment is biased by initial information. This bias can be avoided through the use of Secondary assistance, where the user must interpret supplementary information, determine its relevance, and decide how to incorporate it into their primary decision-making process. Directly presenting a solution to the decision making problem can result in over-reliance (Nourani et al., 2020), whereas Secondary assistance can avoid anchoring effects but may not satisfy user needs. In addition, direct presentation of AI inferences can lead to a lack of “sense of agency” for the user, which refers to the subjective feeling of controlling one's actions and influencing external events through them (Wen and Imamizu, 2022). Request-driven AI assistance can empower the user with the choice to view AI inferences and foster a sense of agency; however, it may also introduce risks of confirmation bias or anchoring bias, especially when users seek explanations for decision verification or knowledge acquisition (Barda et al., 2020).

In contrast, in the AI-follow assistance pattern, the user is given a chance to solve the problem on their own, thus potentially minimizing anchoring bias. Yet, whether users actually restart their decision making process is open to question. One study found that participants in this “two-step” workflow rarely revised their provisional diagnoses when the AI inferences differed from their earlier assessment (Fogliato et al., 2022). This hints at confirmation bias, a person's tendency to seek supporting evidence for their current hypothesis. In case the user does re-evaluate their prior assessment, the cognitive costs increase. The cognitive costs of re-examination when new information becomes available can be viewed as analogous to interruption of and recovery on the initial task (Fogliato et al., 2022). As the second most common pattern, the prevalence of AI-follow likely arises from a strong interest in disentangling the influence of AI advice on the human's decision (Vereschak et al., 2021). We primarily focused on biases in the most common interaction patterns, as the higher number of studies aids in identifying them. However, exploring potential biases in less frequent paradigms is also encouraged.

4.3 Gaps and opportunities for the design of interactions

From our thorough exploration of interaction patterns in a sample of over 100 articles, several gaps and opportunities emerge for advancing the design of human-AI interactions. First, most studies focus on single-user and single-AI interactions, overlooking the potential of multi-agent collaboration, which could unlock new dynamics and enhance teamwork in complex tasks. Moreover, as interaction patterns grow more complex, existing frameworks like the Delegation pattern, where the agents can reassign decisions, could be expanded to explore how agents coordinate and allocate tasks among themselves. Lastly, much of the HCI literature on AI assistance has concentrated on intermittent scenarios (i.e., turn-taking). This is in contrast to continuous user interaction scenarios, where user input is sustained and can receive AI feedback at any given moment for a more realistic and organic setup. Likewise, many of the human-AI interactions took place in artificial tasks that may not capture real-world complexities. For example, most studies end after decisions are made, without exposing decision-makers to the full consequences of their choices (Kirkebøen et al., 2013), which can influence user engagement even in scenarios labeled as high stakes. We must carefully consider differences in how users behave in experimental tasks and in equivalent real-life scenarios to assess whether AI assistance truly adds value to decision problems and capitalize on the findings of experimental evaluations.

4.4 Value of a taxonomy of interaction patterns

The design of Human-AI interactions requires deliberate choices informed by cross-disciplinary expertise. By establishing a shared terminology, our taxonomy can facilitate conversations and collaboration between researchers in AI, HCI, human factors engineering, and domain experts. This common ground can ensure consistency in methodology and foster innovation in research, development, and user experiences, ultimately leading to more informed and participative decision-making processes. The taxonomy captures reusable interaction components and their relationships, providing insights to guide concrete choices about enabling interactions between the agents involved and the corresponding interface elements needed (van Bekkum et al., 2021). We illustrate how the taxonomy serves as a framework to aggregate knowledge about the interactions present in empirical studies, revealing trends about how humans and AI collaborate in decision-making processes and identifying unexplored opportunities in human-AI collaboration. From a design perspective, it provides a collection of repeatable solutions to problem types that can be incorporated into prototypes for faster iterations. As the field of human-AI interaction continues to evolve, this taxonomy can be expanded and refined to incorporate new interaction patterns and ideas from emerging research.

4.5 Limitations

The works included in this survey are limited to published manuscripts that conducted empirical evaluations of human-AI interactions. This focus, while intentional, could have introduced certain limitations. The terminology used in the search could have excluded relevant work if the interaction design was not explicitly mentioned in the title or abstract, or even in the body of the text. Moreover, publication bias may have resulted in the exclusion of works relevant to this review. Since our search included only a healthcare-focused database alongside more general ones, the frequency of interactions we observed across various domains where AI was used for decision-making may not remain consistent if additional domain-specific databases were included in our search. We encourage further investigations in other domains of interest to researchers to build a more comprehensive understanding of human-AI interactions. We also constrained our analysis to screen-based interfaces for AI assistance, acknowledging that embodied AI might support additional interactions. Our strict selection criteria centered on studies encompassing complete decision-making tasks to ensure actual human-AI interactions. Given the diversity of experimental designs and factors in the papers reviewed in this survey, we abstracted the interactions to discern patterns across the varied studies. While many of these studies effectively described their interfaces and user study procedures, there were instances where the information provided was not sufficient for a complete recreation of the interactions. Consequently, we had to carefully interpret and encode the interactions from these papers to the best of our abilities. Even though we demonstrate a use case of the taxonomy to characterize the AI-assisted decision making literature, further use case evaluations can help in assessing the utility of the taxonomy and refining its definitions.

5 Conclusion

In this paper, we presented a systematic review of human-AI interactions in AI-advised decision making tasks that informed and grounded the formulation of a taxonomy of interaction patterns. Our proposed taxonomy of interaction patterns provides a structured foundation for understanding and designing these crucial interactions. It reveals that current practices often lean toward AI-driven or human-led decision processes, with limited emphasis on fostering interactive functionalities throughout the interactions. Recognizing the significance of interaction design, we advocate for deliberate choices in system development to enhance collaboration between humans and AI. Moving forward, the taxonomy presented here serves as a valuable resource to inform the design and development of AI-based decision support systems, ultimately fostering more productive, engaging, and user-centered collaborations.

Author contributions

CG: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing. SC: Formal analysis, Visualization, Writing – original draft, Writing – review & editing, Investigation. SK: Data curation, Formal analysis, Investigation, Methodology, Writing – original draft. C-MH: Conceptualization, Investigation, Methodology, Supervision, Writing – review & editing. MU: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The authors declare financial support was received for the research, authorship, and/or publication of this article. This research was supported in part by Johns Hopkins Internal Funds.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that generative AI was used in the creation of this manuscript. Generative AI (ChatGPT 4.0) was used to process the text originally produced by the author(s). This processing was done to improve the clarity and structure of the sentences, not to create new content. All generated outputs were validated before being used and adapted for the manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcomp.2024.1521066/full#supplementary-material

References

Allan, K., Oren, N., Hutchison, J., and Martin, D. (2021). In search of a goldilocks zone for credible AI. Sci. Rep. 11:13687. doi: 10.1038/s41598-021-93109-8

Alufaisan, Y., Marusich, L. R., Bakdash, J. Z., Zhou, Y., and Kantarcioglu, M. (2021). “Does explainable artificial intelligence improve human Decision-Making?” in Proceedings of the AAAI Conference on Artificial Intelligence, 6618–6626. doi: 10.1609/aaai.v35i8.16819

Amershi, S., Cakmak, M., Knox, W. B., and Kulesza, T. (2014). Power to the people: the role of humans in interactive machine learning. AI Mag. 35, 105–120. doi: 10.1609/aimag.v35i4.2513

Appelganc, K., Rieger, T., Roesler, E., and Manzey, D. (2022). How much reliability is enough? A Context-Specific view on human interaction with (artificial) agents from different perspectives. J. Cogn. Eng. Decis. Mak. 16, 207–221. doi: 10.1177/15553434221104615

Ashktorab, Z., Desmond, M., Andres, J., Muller, M., Joshi, N. N., Brachman, M., et al. (2021). “AI-Assisted human labeling: Batching for efficiency without overreliance,” in Proceedings of the ACM on Human-Computer Interaction, 5. doi: 10.1145/3449163

Banas, J. A., Palomares, N. A., Richards, A. S., Keating, D. M., Joyce, N., and Rains, S. A. (2022). When machine and bandwagon heuristics compete: understanding users' response to conflicting AI and crowdsourced fact-checking. Hum. Commun. Res. 48, 430–461. doi: 10.1093/hcr/hqac010

Bansal, G., Nushi, B., Kamar, E., Lasecki, W. S., Weld, D. S., and Horvitz, E. (2019). “Beyond accuracy: the role of mental models in human-ai team performance,” in Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 2–11. doi: 10.1609/hcomp.v7i1.5285

Bansal, G., Wu, T., Zhou, J., Fok, R., Nushi, B., Kamar, E., et al. (2021). “Does the whole exceed its parts? The effect of AI explanations on complementary team performance,” in Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, 1–16. doi: 10.1145/3411764.3445717

Barda, A. J., Horvat, C. M., and Hochheiser, H. (2020). A qualitative research framework for the design of user-centered displays of explanations for machine learning model predictions in healthcare. BMC Med. Inform. Decis. Mak. 20:257. doi: 10.1186/s12911-020-01276-x

Baudel, T., Verbockhaven, M., Cousergue, V., Roy, G., and Laarach, R. (2021). “ObjectivAIze: Measuring performance and biases in augmented business decision systems,” in Human-Computer Interaction INTERACT 2021: 18th IFIP TC 13 International Conference, Bari, Italy, August 30-September 3, 2021, Proceedings, Part III 18 (Springer International Publishing), 300–320. doi: 10.1007/978-3-030-85613-7_22

Bernard, J., Hutter, M., Zeppelzauer, M., Fellner, D., and Sedlmair, M. (2018). Comparing Visual-Interactive labeling with active learning: an experimental study. IEEE Trans. Vis. Comput. Graph. 24, 298–308. doi: 10.1109/TVCG.2017.2744818

Bertrand, A., Belloum, R., Eagan, J. R., and Maxwell, W. (2022). “How cognitive biases affect XAI-assisted decision-making: a systematic review,” in Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 78–91. doi: 10.1145/3514094.3534164

Bertrand, A., Viard, T., Belloum, R., Eagan, J. R., and Maxwell, W. (2023). “On selective, mutable and dialogic XAI: a review of what users say about different types of interactive explanations,” in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 1–21. doi: 10.1145/3544548.3581314

Bhattacharya, A., Ooge, J., Stiglic, G., and Verbert, K. (2023). “Directive explanations for monitoring the risk of diabetes onset: Introducing directive Data-Centric explanations and combinations to support What-If explorations,” in Proceedings of the 28th International Conference on Intelligent User Interfaces, 204–219. doi: 10.1145/3581641.3584075

Bondi, E., Koster, R., Sheahan, H., Chadwick, M., Bachrach, Y., Cemgil, T., et al. (2022). “Role of Human-AI interaction in selective prediction,” in Proceedings of the AAAI Conference on Artificial Intelligence, 5286–5294. doi: 10.1609/aaai.v36i5.20465

Brachman, M., Ashktorab, Z., Desmond, M., Duesterwald, E., Dugan, C., Joshi, N. N., et al. (2022). “Reliance and automation for Human-AI collaborative data labeling conflict resolution,” in Proceedings of the ACM on Human-Computer Interaction, 6(CSCW2). doi: 10.1145/3555212

Buçinca, Z., Lin, P., Gajos, K. Z., and Glassman, E. L. (2020). “Proxy tasks and subjective measures can be misleading in evaluating explainable AI systems,” in Proceedings of the 25th International Conference on Intelligent User Interfaces, 454–464. doi: 10.1145/3377325.3377498

Buçinca, Z., Malaya, M. B., and Gajos, K. Z. (2021). “To trust or to think: cognitive forcing functions can reduce overreliance on AI in AI-assisted decision-making,” in Proceedings of the ACM on Human-Computer Interaction. doi: 10.1145/3449287

Bunde, E. (2021). “AI-assisted and explainable hate speech detection for social media moderators - a design science approach,” in Proceedings of the 54th Hawaii International Conference on System Sciences, 1264–1273. doi: 10.24251/HICSS.2021.154

Cabitza, F., Campagner, A., Natali, C., Parimbelli, E., Ronzio, L., and Cameli, M. (2023). Painting the black box white: experimental findings from applying XAI to an ECG reading setting. Mach. Learn. Knowl. Extr. 5, 269–286. doi: 10.3390/make5010017

Cabrera, Á. A., Perer, A., and Hong, J. I. (2023). “Improving Human-AI collaboration with descriptions of AI behavior,” in Proceedings of the ACM on Human-Computer Interaction, 1–21. doi: 10.1145/3579612

Cabri, G., Leonardi, L., Zambonelli, F., et al. (2002). “Modeling role-based interactions for agents,” in The Workshop on Agent-Oriented Methodologies at OOPSLA (Citeseer).

Calisto, F. M., Santiago, C., Nunes, N., and Nascimento, J. C. (2022). BreastScreening-AI: evaluating medical intelligent agents for human-AI interactions. Artif. Intell. Med. 127:102285. doi: 10.1016/j.artmed.2022.102285

Cao, S., and Huang, C.-M. (2022). “Understanding user reliance on AI in assisted Decision-Making,” in Proceedings of the ACM on Human-Computer Interaction, 6. doi: 10.1145/3555572

Cau, F. M., Hauptmann, H., Spano, L. D., and Tintarev, N. (2023). “Supporting high-uncertainty decisions through ai and logic-style explanations,” in Proceedings of the 28th International Conference on Intelligent User Interfaces, 251–263. doi: 10.1145/3581641.3584080

Chen, H., Gomez, C., Huang, C.-M., and Unberath, M. (2022). Explainable medical imaging ai needs human-centered design: guidelines and evidence from a systematic review. NPJ Digital Med. 5:156. doi: 10.1038/s41746-022-00699-2

Cheng, R., Smith-Renner, A., Zhang, K., Tetreault, J. R., and Jaimes, A. (2022). Mapping the design space of human-ai interaction in text summarization. arXiv preprint arXiv:2206.14863. doi: 10.18653/v1/2022.naacl-main.33

Chiang, C.-W., and Yin, M. (2021). “You'd better stop! understanding human reliance on machine learning models under covariate shift,” in Proceedings of the 13th ACM Web Science Conference 2021, 120–129. doi: 10.1145/3447535.3462487

Chiang, C.-W., and Yin, M. (2022). “Exploring the effects of machine learning literacy interventions on laypeople's reliance on machine learning models,” in Proceedings of the 27th International Conference on Intelligent User Interfaces, 148–161. doi: 10.1145/3490099.3511121

David, P., Cakmak, M., Sauppé, A., Albarghouthi, A., and Mutlu, B. (2022). “Interaction templates: a data-driven approach for authoring robot programs,” in PLATEAU: 12th Annual Workshop at the Intersection of PL and HCI.

Desmond, M., Muller, M., Ashktorab, Z., Dugan, C., Duesterwald, E., Brimijoin, K., et al. (2021). “Increasing the speed and accuracy of data labeling through an AI assisted interface,” in Proceedings of the 26th International Conference on Intelligent User Interfaces, 392–401. doi: 10.1145/3397481.3450698

Dikmen, M., and Burns, C. (2022). The effects of domain knowledge on trust in explainable AI and task performance: a case of peer-to-peer lending. Int. J. Hum. Comput. Stud. 162:102792. doi: 10.1016/j.ijhcs.2022.102792

Du, Y., Antoniadi, A. M., McNestry, C., McAuliffe, F. M., and Mooney, C. (2022). The role of XAI in Advice-Taking from a clinical decision support system: a comparative user study of feature Contribution-Based and Example-Based explanations. Appl. Sci. 12:10323. doi: 10.3390/app122010323

Dudley, J. J., and Kristensson, P. O. (2018). A review of user interface design for interactive machine learning. ACM Trans. Inter. Intell. Syst. 8, 1–37. doi: 10.1145/3185517

Fan, M., Yang, X., Yu, T., Liao, Q. V., and Zhao, J. (2022). “Human-AI collaboration for UX evaluation: effects of explanation and synchronization,” in Proceedings of the ACM on Human-Computer Interaction, 6. doi: 10.1145/3512943

Feng, S., and Boyd-Graber, J. (2022). “Learning to explain selectively: a case study on question answering,” in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 8372–8382. doi: 10.18653/v1/2022.emnlp-main.573

Fogliato, R., Chappidi, S., Lungren, M., Fisher, P., Wilson, D., Fitzke, M., et al. (2022). “Who goes first? Influences of human-AI workflow on decision making in clinical imaging,” in Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, 1362–1374. doi: 10.1145/3531146.3533193

Fügener, A., Grahl, J., Gupta, A., and Ketter, W. (2022). Cognitive challenges in Human-Artificial intelligence collaboration: investigating the path toward productive delegation. Inf. Syst. Res. 33, 678–696. doi: 10.1287/isre.2021.1079

Gaube, S., Suresh, H., Raue, M., Lermer, E., Koch, T. K., Hudecek, M. F. C., et al. (2023). Non-task expert physicians benefit from correct explainable AI advice when reviewing x-rays. Sci. Rep. 13:1383. doi: 10.1038/s41598-023-28633-w

Gomez, C., Unberath, M., and Huang, C.-M. (2023). Mitigating knowledge imbalance in AI-advised decision-making through collaborative user involvement. Int. J. Hum. Comput. Stud. 172:102977. doi: 10.1016/j.ijhcs.2022.102977

Grgic-Hlaca, N., Engel, C., and Gummadi, K. P. (2019). “Human decision making with machine advice: an experiment on bailing and jailing,” in Proceedings of the ACM on Human-Computer Interaction, 3(CSCW). doi: 10.2139/ssrn.3465622

Gu, H., Liang, Y., Xu, Y., Williams, C. K., Magaki, S., Khanlou, N., et al. (2023). Improving workflow integration with XPath: design and evaluation of a human-AI diagnosis system in pathology. ACM Trans. Comput.-Hum. Interact. 30, 1–37. doi: 10.1145/3577011

Gupta, A., Basu, D., Ghantasala, R., Qiu, S., and Gadiraju, U. (2022). “To trust or not to trust: how a conversational interface affects trust in a decision support system,” in Proceedings of the ACM Web Conference 2022, 3531–3540. doi: 10.1145/3485447.3512248

Hemmer, P., Westphal, M., Schemmer, M., Vetter, S., Vøssing, M., and Satzger, G. (2023). “Human-AI collaboration: the effect of AI delegation on human task performance and task satisfaction,” in Proceedings of the 28th International Conference on Intelligent User Interfaces, 453–463. doi: 10.1145/3581641.3584052

Hofeditz, L., Clausen, S., Rieß, A., Mirbabaie, M., and Stieglitz, S. (2022). Applying XAI to an AI-based system for candidate management to mitigate bias and discrimination in hiring. Electr. Markets 32, 2207–2233. doi: 10.1007/s12525-022-00600-9

Holstein, K., De-Arteaga, M., Tumati, L., and Cheng, Y. (2023). Toward supporting perceptual complementarity in Human-AI collaboration via reflection on unobservables. Proc. ACM Hum. -Comput. Interact. 7, 1–20. doi: 10.1145/3579628

Hou, Y. T.-Y., and Jung, M. F. (2021). Who is the expert? Reconciling algorithm aversion and algorithm appreciation in AI-Supported decision making. Proc. ACM Hum. -Comput. Interact. 5, 1–25. doi: 10.1145/3479864

Hwang, J., Lee, T., Lee, H., and Byun, S. (2022). A clinical decision support system for sleep staging tasks with explanations from artificial intelligence: User-Centered design and evaluation study. J. Med. Internet Res. 24:e28659. doi: 10.2196/28659

Jacobs, M., Pradier, M. F., McCoy, T. H., Perlis, R. H., Doshi-Velez, F., and Gajos, K. Z. (2021). How machine-learning recommendations influence clinician treatment selections: the example of the antidepressant selection. Transl. Psychiatry 11:108. doi: 10.1038/s41398-021-01224-x

Jakubik, J., Schöffer, J., Hoge, V., Vössing, M., and Kühl, N. (2023). “An empirical evaluation of predicted outcomes as explanations in human-ai decision-making,” in Joint European Conference on Machine Learning and Knowledge Discovery in Databases (Cham: Springer Nature Switzerland), 353–368. doi: 10.1007/978-3-031-23618-1_24

Jiang, J., Kahai, S., and Yang, M. (2022). Who needs explanation and when? Juggling explainable AI and user epistemic uncertainty. Int. J. Hum. Comput. Stud. 165:102839. doi: 10.1016/j.ijhcs.2022.102839

Kahr, P. K., Rooks, G., Willemsen, M. C., and Snijders, C. C. P. (2023). “It seems smart, but it acts stupid: development of trust in AI advice in a repeated legal Decision-Making task,” in Proceedings of the 28th International Conference on Intelligent User Interfaces, 528–539. doi: 10.1145/3581641.3584058

Khadpe, P., Krishna, R., Fei-Fei, L., Hancock, J. T., and Bernstein, M. S. (2020). “Conceptual metaphors impact perceptions of Human-AI collaboration,” in Proceedings of the ACM on Human-Computer Interaction, 4(CSCW2). doi: 10.1145/3415234

Kim, T., and Song, H. (2023). Communicating the limitations of AI: The effect of message framing and ownership on trust in artificial intelligence. Int. J. Hum. Comput. Interact. 39, 790–800. doi: 10.1080/10447318.2022.2049134

Kirkebøen, G., Vasaasen, E., and Halvor Teigen, K. (2013). Revisions and regret: the cost of changing your mind. J. Behav. Decis. Mak. 26, 1–12. doi: 10.1002/bdm.756

Kumar, A., Patel, T., Benjamin, A. S., and Steyvers, M. (2021). “Explaining algorithm aversion with metacognitive bandits,” in Proceedings of the Annual Meeting of the Cognitive Science Society, 2780–2786.

Lai, V., Carton, S., Bhatnagar, R., Liao, Q. V., Zhang, Y., and Tan, C. (2022). “Human-AI collaboration via conditional delegation: A case study of content moderation,” in Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1–18. doi: 10.1145/3491102.3501999

Lai, V., Chen, C., Smith-Renner, A., Liao, Q. V., and Tan, C. (2023). “Towards a science of human-AI decision making: An overview of design space in empirical human-subject studies,” in Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 1369–1385. doi: 10.1145/3593013.3594087

Lai, V., Liu, H., and Tan, C. (2020). “‘Why is “Chicago” deceptive?’ Towards building model-driven tutorials for humans,” in Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, CHI '20 (New York, NY, USA: Association for Computing Machinery), 1–13. doi: 10.1145/3313831.3376873

Lam Shin Cheung, J., Ali, A., Abdalla, M., and Fine, B. (2022). U“AI” testing: user interface and usability testing of a chest x-ray AI tool in a simulated Real-World workflow. Canadian Assoc. Radiol. J. 74, 314–325. doi: 10.1177/08465371221131200

Lee, M. H., Siewiorek, D. P., and Smailagic, A. (2021). “A human-ai collaborative approach for clinical decision making on rehabilitation assessment,” in Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, 1–14. doi: 10.1145/3411764.3445472

Leichtmann, B., Humer, C., Hinterreiter, A., Streit, M., and Mara, M. (2023). Effects of explainable artificial intelligence on trust and human behavior in a high-risk decision task. Comput. Human Behav. 139:107539. doi: 10.1016/j.chb.2022.107539

Liehner, G. L., Brauner, P., Schaar, A. K., and Ziefle, M. (2022). Delegation of moral tasks to automated Agents–The impact of risk and context on trusting a machine to perform a task. IEEE Trans. Technol. Soc. 3, 46–57. doi: 10.1109/TTS.2021.3118355

Lindvall, M., Lundström, C., and Löwgren, J. (2021). “Rapid assisted visual search: Supporting digital pathologists with imperfect AI,” in Proceedings of the 26th International Conference on Intelligent User Interfaces, 504–513. doi: 10.1145/3397481.3450681

Liu, H., Lai, V., and Tan, C. (2021). “Understanding the effect of out-of-distribution examples and interactive explanations on Human-AI decision making,” in Proceedings of the ACM on Human-Computer Interaction, 5(CSCW2). doi: 10.1145/3479552

Ma, K., and Cao, J. (2019). “Design pattern as a practical tool for designing adaptive interactions connecting human and social robots,” in Intelligent Human Systems Integration 2019: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, February 7-10, 2019, San Diego, California, USA (Springer International Publishing), 613–617. doi: 10.1007/978-3-030-11051-2_93

Mackeprang, M., Müller-Birn, C., and Stauss, M. (2019). “Discovering the sweet spot of human–computer configurations: a case study in information extraction,” in Proceedings of the ACM on Human-Computer Interaction, 3(CSCW). doi: 10.1145/3359297

Magnusson, M. S. (2018). Temporal Patterns in Interactions: T-Patterns and Their Detection with THEMETM. Cambridge: Cambridge University Press, 323–353. doi: 10.1017/9781316286302.017

Maier, T., Menold, J., and McComb, C. (2022). The relationship between performance and trust in AI in E-Finance. Front. Artif. Intell. 5:891529. doi: 10.3389/frai.2022.891529

Matthiesen, S., Diederichsen, S. Z., Hansen, M. K. H., Villumsen, C., Lassen, M. C. H., Jacobsen, P. K., et al. (2021). Clinician preimplementation perspectives of a decision-support tool for the prediction of cardiac arrhythmia based on machine learning: near-live feasibility and qualitative study. JMIR Hum. Factors 8:e26964. doi: 10.2196/26964

Molina, M. D., and Sundar, S. S. (2022). When AI moderates online content: effects of human collaboration and interactive transparency on user trust. J. Comput. Med. Commun. 27:zmac010. doi: 10.1093/jcmc/zmac010

Morrison, K., Shin, D., Holstein, K., and Perer, A. (2023). Evaluating the impact of human explanation strategies on human-AI visual decision-making. Proc. ACM Hum. -Comput. Interact. 7, 1–37. doi: 10.1145/3579481

Naiseh, M., Al-Thani, D., Jiang, N., and Ali, R. (2023). How the different explanation classes impact trust calibration: the case of clinical decision support systems. Int. J. Hum. Comput. Stud. 169:102941. doi: 10.1016/j.ijhcs.2022.102941

Nguyen, A. T., Kharosekar, A., Krishnan, S., Tate, E., Wallace, B. C., and Lease, M. (2018). “Believe it or not: Designing a human-ai partnership for mixed-initiative fact-checking,” in Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 189–199. doi: 10.1145/3242587.3242666

Nourani, M., King, J., and Ragan, E. (2020). “The role of domain expertise in user trust and the impact of first impressions with intelligent systems,” in Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 112–121. doi: 10.1609/hcomp.v8i1.7469

Nourani, M., Roy, C., Block, J. E., Honeycutt, D. R., Rahman, T., Ragan, E., et al. (2021). “Anchoring bias affects mental model formation and user reliance in explainable AI systems,” in Proceedings of the 26th International Conference on Intelligent User Interfaces, 340–350. doi: 10.1145/3397481.3450639

Panigutti, C., Beretta, A., Giannotti, F., and Pedreschi, D. (2022). “Understanding the impact of explanations on advice-taking: a user study for AI-based clinical decision support systems,” in Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1–9. doi: 10.1145/3491102.3502104

Parasuraman, R., Sheridan, T. B., and Wickens, C. D. (2000). A model for types and levels of human interaction with automation. IEEE Trans. Syst. Man Cybern. 30, 286–297. doi: 10.1109/3468.844354

Park, J. S., Barber, R., Kirlik, A., and Karahalios, K. (2019). A slow algorithm improves users' assessments of the algorithm's accuracy. Proc. ACM Hum. -Comput. Interact. 3, 1–15. doi: 10.1145/3359204

Peng, A., Nushi, B., Kiciman, E., Inkpen, K., and Kamar, E. (2022). “Investigations of performance and bias in Human-AI teamwork in hiring,” in Proceedings of the AAAI Conference on Artificial Intelligence, 12089–12097. doi: 10.1609/aaai.v36i11.21468

Porat, T., Marshall, I. J., Sadler, E., Vadillo, M. A., McKevitt, C., Wolfe, C. D. A., et al. (2019). Collaborative design of a decision aid for stroke survivors with multimorbidity: a qualitative study in the UK engaging key stakeholders. BMJ Open 9:e030385. doi: 10.1136/bmjopen-2019-030385

Prabhudesai, S., Yang, L., Asthana, S., Huan, X., Vera Liao, Q., and Banovic, N. (2023). “Understanding uncertainty: how lay decision-makers perceive and interpret uncertainty in Human-AI decision making,” in Proceedings of the 28th International Conference on Intelligent User Interfaces, 379–396. doi: 10.1145/3581641.3584033

Rahman, M. A., Sadat, S. N., Asyhari, A. T., Refat, N., Kabir, M. N., and Arshah, R. A. (2021). A secure and sustainable framework to mitigate hazardous activities in online social networks. IEEE Trans. Sustain. Comput. 6, 30–42. doi: 10.1109/TSUSC.2019.2911188

Rastogi, C., Zhang, Y., Wei, D., Varshney, K. R., Dhurandhar, A., and Tomsett, R. (2022). “Deciding fast and slow: the role of cognitive biases in AI-assisted decision-making,” in Proceedings of the ACM on Human-Computer Interaction, 6(CSCW1). doi: 10.1145/3512930

Rechkemmer, A., and Yin, M. (2022). “When confidence meets accuracy: exploring the effects of multiple performance indicators on trust in machine learning models,” in Proceedings of the 2022 Chi Conference on Human Factors in Computing Systems, 1–14. doi: 10.1145/3491102.3501967

Reverberi, C., Rigon, T., Solari, A., Hassan, C., Cherubini, P., and Cherubini, A. (2022). Experimental evidence of effective human-AI collaboration in medical decision-making. Sci. Rep. 12:14952. doi: 10.1038/s41598-022-18751-2

Riveiro, M., and Thill, S. (2021). “That's (not) the output I expected!” on the role of end user expectations in creating explanations of AI systems. Artif. Intell. 298:103507. doi: 10.1016/j.artint.2021.103507

Riveiro, M., and Thill, S. (2022). “The challenges of providing explanations of AI systems when they do not behave like users expect,” in Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization, 110–120. doi: 10.1145/3503252.3531306

Robbemond, V., Inel, O., and Gadiraju, U. (2022). “Understanding the role of explanation modality in AI-Assisted Decision-Making,” in Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization, 223–233. doi: 10.1145/3503252.3531311

Sauppé, A., and Mutlu, B. (2014). “Design patterns for exploring and prototyping human-robot interactions,” in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 1439–1448. doi: 10.1145/2556288.2557057

Schaekermann, M., Beaton, G., Sanoubari, E., Lim, A., Larson, K., and Law, E. (2020). “Ambiguity-aware AI assistants for medical data analysis,” in Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, 1–14. doi: 10.1145/3313831.3376506

Schemmer, M., Kuehl, N., Benz, C., Bartos, A., and Satzger, G. (2023). “Appropriate reliance on AI advice: Conceptualization and the effect of explanations,” in International Conference on Intelligent User Interfaces, inproceedings IUI, 410–422. doi: 10.1145/3581641.3584066

Schleidgen, S., Friedrich, O., Gerlek, S., Assadi, G., and Seifert, J. (2023). The concept of “interaction” in debates on human-machine interaction. Human. Soc. Sci. Commun. 10, 1–13. doi: 10.1057/s41599-023-02060-8

Schrills, T., and Franke, T. (2020). “Color for characters - effects of visual explanations of AI on trust and observability,” in Artificial Intelligence in HCI: First International Conference, AI-HCI 2020, Held as Part of the 22nd HCI International Conference, HCII 2020, Copenhagen, Denmark, July 19–24, 2020, Proceedings 22 (Springer International Publishing), 121–135. doi: 10.1007/978-3-030-50334-5_8

Sheu, R.-K., and Pardeshi, M. S. (2022). A survey on medical explainable AI (XAI): recent progress, explainability approach, human interaction and scoring system. Sensors 22:8068. doi: 10.3390/s22208068

Silva, A., Schrum, M., Hedlund-Botti, E., Gopalan, N., and Gombolay, M. (2023). Explainable artificial intelligence: evaluating the objective and subjective impacts of xAI on human-agent interaction. Int. J. Hum. Comput. Interact. 39, 1390–1404. doi: 10.1080/10447318.2022.2101698

Silva-Rodríguez, V., Nava-Muñoz, S. E., Castro, L. A., Martínez-Pérez, F. E., Pérez-González, H. G., and Torres-Reyes, F. (2020). Classifying design-level requirements using machine learning for a recommender of interaction design patterns. IET Softw. 14, 544–552. doi: 10.1049/iet-sen.2019.0291

Sims, R. (1997). Interactivity: a forgotten art? Comput. Human Behav. 13, 157–180. doi: 10.1016/S0747-5632(97)00004-6

Smith-Renner, A., Fan, R., Birchfield, M., Wu, T., Boyd-Graber, J., Weld, D. S., et al. (2020). “No explainability without accountability: an empirical study of explanations and feedback in interactive ML,” in Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, 1–13. doi: 10.1145/3313831.3376624

Stites, M. C., Nyre-Yu, M., Moss, B., Smutz, C., and Smith, M. R. (2021). “Sage advice? The impacts of explanations for machine learning models on human Decision-Making in spam detection,” in International Conference on Human-Computer Interaction (Cham: Springer International Publishing), 269–284. doi: 10.2172/1878725

Suresh, H., Lao, N., and Liccardi, I. (2020). “Misplaced trust: Measuring the interference of machine learning in human Decision-Making,” in Proceedings of the 12th ACM Conference on Web Science, 315–324. doi: 10.1145/3394231.3397922

Suresh, H., Lewis, K. M., Guttag, J., and Satyanarayan, A. (2022). “Intuitively assessing ML model reliability through Example-Based explanations and editing model inputs,” in Proceedings of the 27th International Conference on Intelligent User Interfaces, 767–781. doi: 10.1145/3490099.3511160

Tejeda, H., Kumar, A., Smyth, P., and Steyvers, M. (2022). Ai-assisted decision-making: a cognitive modeling approach to infer latent reliance strategies. Comput. Brain Behav. 5, 491–508. doi: 10.1007/s42113-022-00157-y

Tolmeijer, S., Christen, M., Kandul, S., Kneer, M., and Bernstein, A. (2022). “Capable but amoral? Comparing AI and human expert collaboration in ethical decision making,” in Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1–17. doi: 10.1145/3491102.3517732

Tolmeijer, S., Gadiraju, U., Ghantasala, R., Gupta, A., and Bernstein, A. (2021). “Second chance for a first impression? Trust development in intelligent system interaction,” in Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization, 77–87. doi: 10.1145/3450613.3456817

Tschandl, P., Rinner, C., Apalla, Z., Argenziano, G., Codella, N., Halpern, A., et al. (2020). Human-computer collaboration for skin cancer recognition. Nat. Med. 26, 1229–1234. doi: 10.1038/s41591-020-0942-0

Tutul, A. A., Nirjhar, E. H., and Chaspari, T. (2021). “Investigating trust in Human-Machine learning collaboration: a pilot study on estimating public anxiety from speech,” in Proceedings of the 2021 International Conference on Multimodal Interaction, 288–296. doi: 10.1145/3462244.3479926

van Bekkum, M., de Boer, M., van Harmelen, F., Meyer-Vitali, A., and Teije, A. t. (2021). Modular design patterns for hybrid learning and reasoning systems: a taxonomy, patterns and use cases. Appl. Intell. 51, 6528–6546. doi: 10.1007/s10489-021-02394-3

Van Berkel, N., Opie, J., Ahmad, O. F., Lovat, L., Stoyanov, D., and Blandford, A. (2022). “Initial responses to false positives in AI-Supported continuous interactions: a colonoscopy case study,” in ACM Transactions on Interactive Intelligent Systems, 12. doi: 10.1145/3480247

Van Berkel, N., Skov, M. B., and Kjeldskov, J. (2021). Human-AI interaction: intermittent, continuous, and proactive. Interactions 28, 67–71. doi: 10.1145/3486941

van de Sande, D., Chung, E. F. F., Oosterhoff, J., van Bommel, J., Gommers, D., and van Genderen, M. E. (2024). To warrant clinical adoption AI models require a multi-faceted implementation evaluation. NPJ Digital Med. 7:58. doi: 10.1038/s41746-024-01064-1

van den Brandt, A., Christopher, M., Zangwill, L. M., Rezapour, J., Bowd, C., Baxter, S. L., et al. (2020). “GLANCE: visual analytics for monitoring glaucoma progression,” in VCBM, 85–96.

van der Waa, J., Verdult, S., van den Bosch, K., van Diggelen, J., Haije, T., van der Stigchel, B., et al. (2021). Moral decision making in human-agent teams: human control and the role of explanations. Front. Robot. AI 8:640647. doi: 10.3389/frobt.2021.640647

Vereschak, O., Bailly, G., and Caramiaux, B. (2021). “How to evaluate trust in ai-assisted decision making? A survey of empirical methodologies,” in Proceedings of the ACM on Human-Computer Interaction, 1–39. doi: 10.1145/3476068

Vodrahalli, K., Daneshjou, R., Gerstenberg, T., and Zou, J. (2022). “Do humans trust advice more if it comes from AI? An analysis of human-AI interactions,” in Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 763–777. doi: 10.1145/3514094.3534150

Vössing, M., Kühl, N., Lind, M., and Satzger, G. (2022). Designing transparency for effective Human-AI collaboration. Inf. Syst. Front. 24, 877–895. doi: 10.1007/s10796-022-10284-3

Wang, R., Fu, G., Li, J., and Pei, Y. (2022). Diagnosis after zooming in: a multilabel classification model by imitating doctor reading habits to diagnose brain diseases. Med. Phys. 49, 7054–7070. doi: 10.1002/mp.15871

Wang, X., and Yin, M. (2021). “Are explanations helpful? A comparative study of the effects of explanations in AI-Assisted Decision-Making,” in Proceedings of the 26th International Conference on Intelligent User Interfaces, 318–328. doi: 10.1145/3397481.3450650

Wen, W., and Imamizu, H. (2022). The sense of agency in perception, behaviour and human-machine interactions. Nat. Rev. Psychol. 1, 211–222. doi: 10.1038/s44159-022-00030-6

Westphal, M., Vössing, M., Satzger, G., Yom-Tov, G. B., and Rafaeli, A. (2023). Decision control and explanations in human-AI collaboration: improving user perceptions and compliance. Comput. Human Behav. 144:107714. doi: 10.1016/j.chb.2023.107714

Wu, Y., Kim, K. J., and Mou, Y. (2022). Minority social influence and moral decision-making in human-AI interaction: the effects of identity and specialization cues. New Media Soc. 26, 5619–5637. doi: 10.1177/14614448221138072

Xiong, W., Wang, C., and Ma, L. (2023). Partner or subordinate? Sequential risky decision-making behaviors under human-machine collaboration contexts. Comput. Human Behav. 139:107556. doi: 10.1016/j.chb.2022.107556

Yi, J. S., ah Kang, Y., Stasko, J., and Jacko, J. A. (2007). Toward a deeper understanding of the role of interaction in information visualization. IEEE Trans. Vis. Comput. Graph. 13, 1224–1231. doi: 10.1109/TVCG.2007.70515

Yu, K., Berkovsky, S., Taib, R., Zhou, J., and Chen, F. (2019). “Do I trust my machine teammate? An investigation from perception to decision,” in Proceedings of the 24th International Conference on Intelligent User Interfaces, 460–468. doi: 10.1145/3301275.3302277

Zhang, Q., Lee, M. L., and Carter, S. (2022). “You complete me: Human-AI teams and complementary expertise,” in Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1–28. doi: 10.1145/3491102.3517791

Zhang, W., and Lim, B. Y. (2022). “Towards relatable explainable AI with the perceptual process,” in Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 1–24. doi: 10.1145/3491102.3501826

Zhang, Y., Liao, Q. V., and Bellamy, R. K. (2020). “Effect of confidence and explanation on accuracy and trust calibration in AI-assisted decision making,” in Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, 295–305. doi: 10.1145/3351095.3372852

Zhou, J., Arshad, S. Z., Luo, S., and Chen, F. (2017). “Effects of uncertainty and cognitive load on user trust in predictive decision making,” in Human-Computer Interaction INTERACT 2017: 16th IFIP TC 13 International Conference, Mumbai, India, September 25-29, 2017, Proceedings, Part IV 16 (Springer International Publishing), 23–39. doi: 10.1007/978-3-319-68059-0_2

Zytek, A., Liu, D., Vaithianathan, R., and Veeramachaneni, K. (2022). Sibyl: understanding and addressing the usability challenges of machine learning in High-Stakes decision making. IEEE Trans. Vis. Comput. Graph. 28, 1161–1171. doi: 10.1109/TVCG.2021.3114864

Keywords: artificial intelligence, human-AI interaction, decision-making, interaction patterns, interactivity

Citation: Gomez C, Cho SM, Ke S, Huang C-M and Unberath M (2025) Human-AI collaboration is not very collaborative yet: a taxonomy of interaction patterns in AI-assisted decision making from a systematic review. Front. Comput. Sci. 6:1521066. doi: 10.3389/fcomp.2024.1521066

Received: 01 November 2024; Accepted: 06 December 2024;
Published: 06 January 2025.

Edited by:

Stefan Kopp, Bielefeld University, Germany

Reviewed by:

Styliani Kleanthous, Open University of Cyprus, Cyprus
Miluše Balková, Institute of Technology and Business, Czechia

Copyright © 2025 Gomez, Cho, Ke, Huang and Unberath. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mathias Unberath, unberath@jhu.edu
