Introduction

Active listening is an important instrument of communication between psychotherapists and their clients (Robertson, 2005). Active listening includes various techniques for gathering information. Among them are the use of open questions, summarizing, paraphrasing or verbalization, and clarification (Simpson et al., 1991). An active listener is expected to avoid spontaneous reactions when actively listening, as these are thought to be disruptive to communication. In this way the client and his or her needs can be acknowledged and given the opportunity to express themselves actively. Consequently, the therapist’s communication with the client is facilitated by active listening, not only verbally, but also nonverbally (Robertson, 2005). Active listening is generally considered to be beneficial for the client, e.g., through a pleasant and quiet environment. An environment can be created in which clients can express themselves well and without fear, feeling understood, and accepted (Martin et al., 2017).

Several studies have examined the effects of active listening and communication between therapist and client on therapy success. Nugent and Halvorson (1995), for example, showed that active listening can reduce the symptoms of depression. Wanzer and colleagues (2004) further investigated client-centered communication and its effects on client satisfaction using a survey. It was found that client-centered communication had a positive influence on client satisfaction with treatment and communication. Martin and colleagues (2017), in turn, investigated active listening both as an attitude of the therapist and a technique to find out what influence this distinction has on the satisfaction of the therapist-client relationship. In their study, active listening was investigated with a changed mediation in therapist-client conversations. Three groups, 19 students in total, were recruited, one of which was trained in active listening as an attitude. The second group was trained with regard to technique, while the third group was a control group. The comparison of the two intervention groups showed that the group that received attitude-oriented training responded significantly better to clients’ feelings than the technique-oriented group. This reflects that active listening can express empathy and understanding.

Nonverbal communication in general can promote trusting relationships. A systematic review by Lorié and colleagues (2017) investigated how different cultures transmit nonverbal expressions of empathy and what effects this has on therapist-client relationships. Empathy is described as the ability to perceive verbal and nonverbal emotional cues of a counterpart and to react to them. Empathy can be expressed not only through verbal but also through purely nonverbal behavior (Riess & Kraft-Todd, 2014). The review concluded that the quality of communication is influenced by nonverbal expressions of empathy, which in turn influenced client satisfaction.

There are several ways in which a therapist can implement active listening; in turn, clients may differ in how they perceive certain active listening techniques. Investigating how active listening influences the perception of the therapist has important implications for counselling therapy as it is possibly associated with client satisfaction. Ultimately, whether a client perceives their therapist as empathic, understanding, and authentic (i.e., expressing oneself according to one’s own true emotions, needs, thoughts, and beliefs; Wirtz et al., 2020) can influence therapy success.

Active listening can be signaled through different channels, one of them being the use of affirmative nonlexical verbal signals such as “mmh” or “aha”, which serve as empathic continuers to show emotional involvement and understanding (Fitzgerald & Leudar, 2010). Similarly, empathic continuers serve as a “back-channel response” to keep the conversation going and signal attention to the client (Bodie et al., 2012). In addition, there are nonverbal signals of active listening, such as a simple nod, an open posture or eye contact. Pinto et al. (2012), for example, found a positive correlation between such interaction styles during a therapeutic session and the reported empathy of the therapists.

Other studies on verbal and nonverbal signals were less conclusive. Henry and colleagues (2012) investigated the effects of nonverbal communication between clients and therapists during clinical interactions in a meta-analysis. They concluded that both warmth and perceived listening on the therapist’s side had a positive impact on client satisfaction. However, the authors emphasized that further studies are needed to investigate the influence of nonverbal signals on client satisfaction (Henry et al., 2012). What remains unclear is how the use of different types of empathic continuers influences the relationship between therapist and client.

To fill in this research gap and extend the work of Martin et al. (2017) and Henry et al. (2012), the present study investigated the influence of the confirmatory nonlexical verbal signal “mmh” on the perceived authenticity, empathy, and understanding of a listener in a conversation (e.g., a therapist listening to a client). We expected that variations in the nonlexical verbal confirmation signal “mmh” expressed during a conversation would increase the perceived authenticity, understanding, and empathy of the listener. Based on the results of Martin et al. (2017), which documented some initial indications of possible effects of active listening on self-reported and observed satisfaction with an interaction, we developed the following hypotheses:

H1: In the 1x-mmh condition, participants perceive interviewers to be more authentic, empathic, and understanding compared to the interviewers in the 3x-mmh condition.

H2: In the control group (0x-mmh condition), interviewers are perceived to be more authentic, empathic, and understanding than in the 3x-mmh condition.

H3: In the 1x-mmh condition, interviewers are perceived to be more authentic, empathic, and understanding than interviewers in the control group.

Method

Sample

A power analysis using g*power (Faul et al., 2007) indicated a required sample size of N = 90 participants to reliably detect a medium sized effect (d = 0.4, power = 0.8). Our research team originally consisted of 10 undergraduate interviewers, who each collected data from nine participants, however data from one interviewer was lost in the process. Therefore, the remaining sample consisted of 81 participants (38 female and 43 male). Participants were recruited via handouts and social media in various German cities. Their average age was M = 30.28 years (SD = 12.07, range = 18–64). Exclusion criteria for participation were undergoing current medical or psychotherapeutic treatment or having any type of physiological condition that might have influenced participants’ ability to listen, talk or see.

Procedure

The study took place at nine different locations across Germany, with nine different interviewers. All interviewers were part of a research group of the author’s institution consisting of eight females and one male who were undergraduate students (Mage = 20.50, SD = 1.58). Specific communication training was not provided because interaction with participants was strictly prescribed by the study protocol. The rooms in each location were chosen based on prior defined characteristics, such as being an almost plain, private room with minimal distractions and no third-party listeners. This was done to minimize the risk of any geographic effect on the results. Additionally, all nine interviewers set up the room in a similar manner, minimizing experimental noise that may have resulted from the different locations. Data collection through different interviewers carries the risk of an experimenter effect. Nevertheless, this approach was chosen to increase the validity of the results. To minimize the influence of personal characteristics of an individual interviewer, each interviewer collected data in each condition. Prior to the experiment, participants were greeted and introduced to the experiment, including information about data protection and the possibility to withdraw participation at any time.

After reading the instructions and giving written informed consent, participants were asked to report on a difficult experience at work for five minutes. In the control group the interviewer listened in silence. In the 1x-mmh group, the interviewer voiced several monosyllabic confirmatory nonlexical verbal signals (“mmh”) while listening. In the 3x-mmh group the interviewer repeatedly said “mmh-mmh-mmh” while listening. After the report, participants completed a written questionnaire that captured the dependent variables, perceived authenticity, empathy, and understanding. Finally, participants were debriefed about the purpose of the experiment and had the opportunity to ask questions.

Participants were randomly assigned to the three groups, with each group consisting of n = 27 participants. The participants were assigned to an interviewer based on geographic proximity to the closest test location. The participants were randomized by the order of their appointments with the interviewer; the first participant was placed in the control group, the second in the 1x-mmh condition, the third in the 3x-mmh condition, and so on. Thus, each interviewer performed all conditions in a fixed order. During the participant’s verbal report the interviewer either said nothing (control group), or uttered the nonlexical signals several times “mmh” (1x-mmh condition) or “mmh-mmh-mmh” (3x-mmh condition). The interviewer attempted to interject the “mmh” whenever it appeared appropriate during the participant’s report (e.g., when there was a natural pause in the flow of the report). On average the nonlexical verbal communication signals were used roughly every 60 seconds (about five times during each verbal report). The exact number of the interjections was not recorded. In order to eliminate other nonverbal interferences, the behavior of the interviewer was strictly prescribed by the study protocol (e.g., body posture upright, neutral voice during greeting, eye contact during the experiment).

Measures

For the purpose of this study, we developed a new questionnaire based on previous work. Existing inventories measuring perceived authenticity, empathy, and understanding either include too many items regarding a personal relationship between the participant and the interviewer (Ganley, 1989; in our study, participants did not know the interviewers), have poor content and construct validity (Holz-Ebeling & Steinmetz, 1995), or are self-reporting measures (Kernis & Goldman, 2006; van den Bosch & Taris, 2014). The definitions (Wirtz et al., 2020) of the dependent variables were deductively operationalized to items, and existing questionnaires were used for additional guidance (Authenticity: Kernis & Goldman, 2006; van den Bosch & Taris, 2014; Empathy: Barrett-Lennard, 1962; Kim et al., 2004; Understanding: Barrett-Lennard, 1962). As the focus of our study was to investigate the perception of a listener with no prior relationship, only items that could be translated to perceptions of others were chosen and rephrased for our purposes.

The resulting questionnaire, called the socio-emotional perception scale (SEPS-I), measured the perceived authenticity (ten items), empathy (eight items), and understanding (five items; see supplementary material for an English version of the questionnaire, translated from German). Questions measuring the perceived authenticity included items such as “I think my listener was showing his/her real emotions”, while empathy was measured using items similar to “I think my listener put him-/herself in my shoes”. To measure understanding, the questionnaire consisted of items like “I think the listener understood my concerns”. In addition, social desirability was captured by items like “I never stole anything in my life”.

Each item was rated on a Likert scale ranging from 1 = fully disagree to 6 = fully agree. A score for perceived authenticity, empathy, and understanding was calculated by taking the average of the items for each construct. Higher values on each scale correspond to higher perceived authenticity, empathy, and understanding of the listener. For each scale, Cronbach’s alpha was calculated, yielding α = .90 for authenticity, α = .88 for empathy, and α = .81 for understanding, showing an overall high reliability.

Statistical Analysis

IBM SPSS Statistics 22 (IBM Corp; Armonk, NY) was used for statistical analysis. Univariate ANOVAs were used to test for significant differences. The procedure was chosen even though the data was not normally distributed, given its robustness against violation of this assumption. Least Significant Difference (LSD) tests were used for follow-up testing and the Bonferroni correction was applied where necessary. The significance level was set to .05 a priori.

Results

The demographic variables age (Kruskal-Wallis test, p = .09) as well as gender (p = .95) did not significantly differ between conditions. No other demographic variables were examined. Figure 1 illustrates the mean scores of the three dependent variables by group.

Figure 1
Figure 1.Means of perceived authenticity, empathy and understanding per condition

Note. Error bars represent SEM. The x-axis starts at two.

A univariate ANOVA was performed to test for significant differences between conditions. There was a significant main effect of condition on authenticity, F(2, 78) = 3.65, p = .03, ω = 0.06, but not on empathy, F(2, 78) = 1.97, p = .15, ω = 0.02, nor on understanding, F(2, 78) = 2.13, p = .13, ω = 0.03. Post-hoc tests were performed to further investigate the condition differences in perceived authenticity. An LSD-test showed significant differences in perceived authenticity between the control group and the 3x-mmh condition, t(78) = -2.476, p = .02 (pbonferroni = .046), d = 0.78, indicating that authenticity was higher in the control group (M = 5.30, SE = 0.13) than in the 3x-mmh condition (M = 4.78, SE = 0.19). After correcting for multiple comparisons (Bonferroni), no significant differences in authenticity were found between the control group (M = 5.30, SE = 0.13) and the 1x-mmh condition (M = 5.24, SE = 0.12), t(78) = -0.30, p = .77 (pbonferroni = 1), d = 0.10, nor between the 1x-mmh condition and the 3x-mmh condition, t(78) = -2.178, p = .03 (pbonferroni = .10), d = 0.47. Consequently, the data supported hypothesis 2, but hypotheses 1 and 3 had to be rejected.

Discussion

The objective of the present study was to investigate the effect of nonlexical verbal signals on the perceived authenticity, empathy, and understanding of a listener. In summary, the results show that the extensive use of confirmatory nonlexical verbal signals may decrease perceived authenticity of a listener in some cases. The perceived authenticity was lower if one used three consecutive “mmh” compared to no verbal signal, but there was no difference between three “mmh” and one “mmh” after correcting for multiple testing. Contrary to our expectations and previous work (Martin et al., 2017), we did not find evidence that the verbal signals used in conversations influenced perceived empathy or understanding. Our second hypothesis, stating that the control group perceived the interviewer as more authentic, empathetic, and understanding than the 3x-mmh condition, was partially supported, as only perceived authenticity was influenced by the manipulation. Our first and third hypotheses were rejected, as there were no differences between conditions.

There are several possible explanations for the results observed in the present study. For instance, we speculate that nonverbal communication through body language might have gained more importance in the control group where there was no verbal communication. Therefore, the control group might have been particularly influenced by cues like facial expressions or posture. This, however, needs to be tested in a future study. Additionally, it is possible that the 3x-mmh condition appeared artificial to participants, compared to no or monosyllabic continuers, resulting in reduced perceived authenticity.

Contrary to earlier work (Martin et al., 2017), we did not find an influence of verbal signals on the empathy or understanding of a listener, but we did find an effect on the perceived authenticity. These findings can serve as first insights into the importance of nonlexical verbal signals on the perceived authenticity of a listener, which in turn could possibly influence client satisfaction in therapy. Our study extends the findings by Martin et al. (2017) and Henry et al. (2012), who only showed an influence of active listening on understanding and patient satisfaction and confirms the importance of active listening signals in conversations. Additionally, previous research (Bodie et al., 2012) indicated that verbal signals can be associated with friendliness and responsiveness. Our results expand this work, showing that some forms of nonlexical verbal signals may be less effective than others in communicating the authenticity of a listener. Although future research is clearly needed to address the inconsistencies in results, practitioners using active listening need to be aware that not all techniques may be equally effective.

Limitations and Recommendations for Future Research

There are several limitations to the present study. First and foremost, the generalization of the results is limited. Due to the online recruitment of participants, the representativeness of the sample is not guaranteed, which is a common limitation in our field. Second, the questionnaire is not tested regarding its reliability and validity. Even though Cronbach’s alpha was high for the developed questionnaire, future work is needed to compare the present tool to an established psychometric survey instrument (e.g., Tscheulin & Splitter, 2001). Third, since there is no documentation of the exact timing and number of interjections in the 1x-mmh and 3x-mmh conditions, as well as the nonverbal behavior, it is difficult to gauge which aspects of the manipulation are responsible for the observed results. Fourth, the different locations and interviewers used may have resulted in slight experimental noise and an experimenter effect, although the best possible has been done to minimize those effects through the actions described above. Fifth, the initial (welcome and consent) and final interactions between participant and interviewer may have been remembered particularly vividly but were not part of the experiment itself (Primacy or Recency-Effect; Atkinson & Shiffrin, 1986). Finally, no objective measurements (e.g., videotaping) of the interviews were taken. It cannot be ruled out that the interviewers inadvertently expressed verbal or other behavior during the consenting phase and even the experiment that may have also influenced the perception. Future iterations of this study should include two experimenters, one responsible for instruction, study procedure, and assessment of the interviewer’s emitted nonverbal communication, and another taking on the role of the interviewer.

Practical limitations that need to be accounted for include the nonclinical experimental setting. The study scenario and sample, although attempting to mimic a clinical setting, differed from most clinical settings. This limits the generalizability of the present findings to clinical work, as the results ideally would need to be tested again, for example with clients in a psychotherapeutic setting.

In general, most participants across the control group and the two experimental conditions reported relatively high authenticity, empathy, and understanding scores. Future research should investigate situations in which participants would be expected to trust listeners less (e.g., between a suspect in a criminal investigation and the police) to test if active listening techniques can increase authenticity, empathy, or understanding in more challenging settings. Additionally, it would be interesting to investigate whether the effect of verbal signals on authenticity can be replicated in a clinical setting as this would make the results more generalizable. As emphasized above, body language and facial cues could be manipulated, in order to test whether some variability in the perceived authenticity, understanding, and empathy can be accounted for by either of these. Finally, as authenticity was the only variable influenced by the verbal signal’s manipulation, it would be interesting to focus on how authenticity is influenced by other verbal signals (e.g., “aha”) or signal properties (e.g., intonation; lexical vs. nonlexical signals), and how these can be used to increase the perceived authenticity in order to improve client satisfaction.

Implications for Practice

Our findings have important implications for Counseling Psychology in Europe. We showed that nonlexical verbal signals might influence the perception of a listener, indicating a possible impact on the relationship between client and therapist. Ultimately, our research can inform the European counseling community about the tools available to strengthen therapist-client relationships.

Summary and Conclusion

The present study investigated the effect of nonlexical verbal signals on the perceived authenticity, empathy, and understanding of a listener. The results showed that the extensive use of confirmatory nonlexical verbal signals may decrease perceived authenticity of a listener in some cases. We did not find evidence that the verbal signals used in conversations influenced perceived empathy or understanding. Future research is needed to confirm and further explore our results, but practitioners using active listening need to be aware that not all techniques may be equally effective.