Trust in human-AI combat teams is not a nice-to-have psychological extra; it is the connective tissue that determines whether machines extend human agency or dilute moral responsibility. In front-line operations, trust governs the human decision to hand authority to an algorithm, to accept a recommendation, or to override a system. That decision is shaped by deep, well-studied psychological processes and by the design choices engineers make when they build autonomy and interfaces.

At a theoretical level we can borrow two complementary frames. First, organizational and interpersonal scholarship casts trust as a function of perceived trustworthiness: ability, benevolence, and integrity. That tripartite model explains why a Soldier might trust a human commander yet withhold trust from a black-box system that cannot communicate intent or moral purpose. Second, human factors work on automation reframes trust as a dynamic belief about a system’s competence and predictability that guides reliance. From this perspective trust is neither static nor purely cognitive; it is shaped by context, emotion, display design, and the evolving performance of the machine. Miscalibrated trust results in misuse when humans over-rely, and disuse when they under-rely, both of which degrade mission outcomes.

Empirical work in human-robot interaction offers a corrective to romantic or technophobic narratives. A careful meta-analysis of HRI studies shows that factors intrinsic to the machine, in particular observable performance and attribute signals, have the largest effect on trust formation. In short, how well an agent performs, and how transparently it signals its limitations, matter more than many human characteristics we instinctively blame. Designers therefore hold disproportionate power to enable properly calibrated trust through predictable behavior and intelligible feedback.

The military context amplifies these dynamics. Combat is a high-risk, time-compressed environment where operators face information overload and steep authority gradients. Research programs focused on human-autonomy teaming have repeatedly found transparency in the agent’s reasoning and status to be central for shared situation awareness and appropriate trust calibration. When an autonomous teammate can explain its assessment, reveal its confidence, or surface situational caveats, humans perform better and calibrate reliance more accurately. The implication is stark: opaque competence is a liability on the battlefield.

Defense research has also moved toward capability-focused remedies. DARPA's Competency-Aware Machine Learning program articulated a pressing requirement: machines must assess and communicate their own competence in real time, especially when competence fluctuates under changing operational conditions. If a perception stack performs well in daylight but fails in smoke, a competency-aware agent should signal that limitation rather than project false confidence. The strategic consequence of such signaling is better trust calibration and a reduced risk of catastrophic overtrust.
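
To make the idea concrete, here is a minimal sketch of what competency-aware self-assessment could look like in software. It is an assumed design, not the CAML program's implementation: the wrapped detector, the confidence thresholds, and the low-visibility flag are all illustrative placeholders.

    from dataclasses import dataclass, field
    from enum import Enum
    from statistics import mean
    from typing import Callable, List

    class CompetenceStatus(Enum):
        NOMINAL = "nominal"        # operate with normal reliance
        DEGRADED = "degraded"      # usable, but surface caveats to the operator
        WITHDRAWN = "withdrawn"    # defer to the human; do not act autonomously

    @dataclass
    class CompetenceReport:
        status: CompetenceStatus
        confidence: float          # rolling confidence over the recent window, 0..1
        caveats: List[str] = field(default_factory=list)

    class CompetencyAwareDetector:
        """Wraps a detector and reports an estimate of its own competence.

        detect_fn returns a confidence in [0, 1] for a single frame; the
        thresholds are illustrative values, not doctrine.
        """

        def __init__(self, detect_fn: Callable[[object], float],
                     degraded_threshold: float = 0.7,
                     withdrawn_threshold: float = 0.4,
                     window: int = 20):
            self.detect_fn = detect_fn
            self.degraded_threshold = degraded_threshold
            self.withdrawn_threshold = withdrawn_threshold
            self.window = window
            self.recent: List[float] = []

        def assess(self, frame: object, low_visibility: bool = False) -> CompetenceReport:
            confidence = self.detect_fn(frame)
            self.recent = (self.recent + [confidence])[-self.window:]
            rolling = mean(self.recent)

            caveats = []
            if low_visibility:
                # Known failure mode: perception degrades in smoke and low light.
                caveats.append("low-visibility conditions: detection reliability reduced")

            if rolling < self.withdrawn_threshold or (low_visibility and rolling < self.degraded_threshold):
                status = CompetenceStatus.WITHDRAWN
            elif rolling < self.degraded_threshold or low_visibility:
                status = CompetenceStatus.DEGRADED
            else:
                status = CompetenceStatus.NOMINAL
            return CompetenceReport(status=status, confidence=rolling, caveats=caveats)

    if __name__ == "__main__":
        # Stub detector standing in for a real perception model.
        detector = CompetencyAwareDetector(lambda frame: 0.55)
        report = detector.assess(frame=None, low_visibility=True)
        print(report.status.value, round(report.confidence, 2), report.caveats)

The design point is that the agent reports a rolling estimate plus explicit caveats rather than a bare per-frame score, so the operator sees a trend and its known causes rather than noise.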

Philosophically this problem forces us to confront what trust means when one partner is nonconscious. Trust in human relations carries moral expectations. We expect benevolence and reciprocity. With an AI partner the moral content must be supplied elsewhere: by designers, institutional checks, and transparent operational rules. If the machine cannot be said to possess benevolence, then systems and organizations must embody benevolence on the team-member’s behalf. Otherwise the human bears the ethical burden alone at the point of action.

Practically, the psychology of trust suggests several concrete prescriptions:

  • Design for transparency and relevance. Expose competence indicators that matter in context: confidence scores, recent performance summaries, and environmental caveats (see the sketch after this list). Poorly chosen or excessive information will overwhelm; the aim is intelligibility, not raw telemetry.
  • Train for calibrated reliance. Simulation and distributed training should present systems in degraded modes so operators learn when to trust and when to intervene. Calibration requires experience with both success and failure modes.
  • Build predictable, robust failure modes. Fail soft where possible and provide clear, time-sensitive cues when a system withdraws competence or defers authority. Predictability reduces the emotional shock of surprise and supports better decision-making.
  • Institutionalize accountability. Because human operators will necessarily remain decision authorities in many scenarios, doctrine must make clear lines of responsibility when autonomy acts or errs. This reduces ethical ambiguity and supports moral trust in the broader system.
  • Pursue competency-awareness. Encourage algorithmic self-assessment that can be communicated concisely in the human operator’s workflow. Such systems are not panaceas but they can materially improve trust calibration.
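
Following the same assumed conventions as the earlier sketch, the fragment below shows one way the indicators named in the first prescription could be compressed into a single operator-facing cue. The field names and formatting are illustrative, not a fielded display standard.

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class CompetenceSummary:
        """Minimal, context-relevant competence indicators for one subsystem."""
        subsystem: str          # e.g. "perception"
        status: str             # "nominal" | "degraded" | "withdrawn"
        confidence: float       # rolling confidence over the recent window, 0..1
        recent_hit_rate: float  # fraction of recent outputs later confirmed correct
        caveats: List[str] = field(default_factory=list)

    def operator_cue(s: CompetenceSummary) -> str:
        """Render one terse line for the operator display: status first, caveats last."""
        line = (f"{s.subsystem.upper()}: {s.status.upper()} "
                f"(conf {s.confidence:.2f}, recent accuracy {s.recent_hit_rate:.0%})")
        if s.caveats:
            line += " | " + "; ".join(s.caveats)
        return line

    if __name__ == "__main__":
        summary = CompetenceSummary(
            subsystem="perception",
            status="degraded",
            confidence=0.62,
            recent_hit_rate=0.71,
            caveats=["smoke in sector: thermal contrast reduced"],
        )
        print(operator_cue(summary))
        # PERCEPTION: DEGRADED (conf 0.62, recent accuracy 71%) | smoke in sector: thermal contrast reduced

Intelligibility is the criterion: status first, two numbers an operator can learn to calibrate against, caveats last, and nothing else.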

There are limits and paradoxes. Transparency can increase trust in a system while simultaneously revealing brittleness; a system that explains every hesitation may be judged more honest yet, because its limits are on display, trusted less for critical tasks. Too little transparency breeds complacency; too much creates cognitive burden. The art is balance, and reaching it requires empirical iteration inside the messy realities of operational practice.

Finally, a moral note. As autonomy becomes more capable we may be tempted to abdicate moral labor to machines under the guise of efficiency. That temptation will be strongest when systems are competent, but competence does not equal moral agency. Trust without institutional safeguards and human judgment is not a virtue; it is a risk. If we are to use machines to reduce human harm, we must preserve human moral responsibility through design, doctrine, and training. In the field the question is not whether machines can be trusted; the question is how we will choose to be trustworthy collaborators with our tools.