Body Movement and Emotion: Investigating the Impact of Audiovisual Tempo Manipulations on Emotional Arousal and Valence

Körperbewegung und Emotion: Untersuchung der Auswirkungen von audiovisueller Tempomanipulationen auf emotionale Erregung und Valenz

Xinyue Wang*1 , Birgitta Burger1 , Clemens Wöllner2

Jahrbuch Musikpsychologie, 2024, Vol. 32, Article e191, https://doi.org/10.5964/jbdgm.191

Received: 2024-03-18. Accepted: 2024-07-15. Published (VoR): 2024-07-31.

Reviewed by: Friedrich Platz; Kathrin Schlemmer.

*Corresponding author at: Institute for Systematic Musicology, Universität Hamburg, Universität Hamburg, Mittelweg 177, 20148 Hamburg, Germany. E-Mail: philippapw@gmail.com

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License, CC BY 4.0, which permits unrestricted use, distribution, and reproduction, provided the original work is properly cited.

Abstract

The perception of emotions in humans moving can be influenced by several movement features such as fluidity, complexity, and tempo. Manipulations of movement tempo by acceleration or deceleration are widely employed in media, yet there has been limited research on how these affect perceived emotions. The current study examined how tempo-manipulated point-light displays (PLDs) of human dance-like movements, compared to their tempo-original counterparts, influence the perceived emotional arousal and valence by altering the tempo and hence the underlying movement characteristics. In an online perceptual experiment, we presented tempo-original and tempo-manipulated PLDs at three tempi, with and without synchronized drumbeats. Participants were asked to judge the PLDs based on their emotional arousal, valence, and naturalness. Results suggest that movements with higher fluidity were perceived as lower in arousal than those with low fluidity. Stimuli higher in naturalness were perceived to be more positive in valence than those low in naturalness. Audiovisual stimuli, including the drumbeats, received higher arousal but lower valence ratings than visual-only stimuli. Furthermore, decelerated movements were associated with increased fluidity compared to accelerated or tempo-original ones. Tempo deceleration was associated with lower naturalness compared to tempo-original movements. These findings indicate that tempo manipulation can be identified via kinematic feature changes, consequently altering emotional attributes perceived in the movements.

Keywords: tempo manipulation, fluidity, kinematic features, Point-Light Displays (PLDs), emotions, motions, naturalness

Zusammenfassung

Die Wahrnehmung von Emotionen in menschlichen Bewegungen kann durch verschiedene Bewegungsmerkmale wie Fluidität, Komplexität und Tempo beeinflusst werden. Es gibt jedoch nur wenige Untersuchungen darüber, wie sich die Manipulation des Tempos auf die wahrgenommenen Emotionen auswirkt, wenn das Präsentationstempo einer Bewegung beschleunigt oder verlangsamt wird. In der vorliegenden Studie sollte untersucht werden, wie tempomanipulierte Point-Light-Displays (PLDs) tänzerischer menschlicher Bewegungen im Vergleich zum Originaltempo die wahrgenommene emotionale Erregung und Valenz beeinflussen, wenn auch die zugrundeliegenden Bewegungsmerkmale durch das Tempo verändert sind. In einem Online-Wahrnehmungsexperiment präsentierten wir tempo-originale und tempo-manipulierte PLDs in drei Tempi, mit oder ohne synchronisierten Schlagzeugklängen. Die Teilnehmer beurteilten die emotionale Erregung, Valenz und Natürlichkeit der Bewegungen. Die Ergebnisse deuten darauf hin, dass Bewegungen mit höherer Fluidität als weniger erregend, und Bewegungen mit höherer Natürlichkeit als positiver in ihrer Valenz wahrgenommen wurden als diejenigen mit niedriger Natürlichkeit. Audiovisuelle Stimuli mit den Schlagzeugklängen wurden höher in der Erregung, aber niedrigerer in der Valenz bewertet als visuelle Stimuli. Darüberhinaus zeigen die Analysen, dass verlangsamte Bewegungen höhere Werte in der Fluidität erzielten als beschleunigte oder im ursprünglichen Tempo ausgeführte Bewegungen. Die Verlangsamung des Tempos wurde jedoch auch mit einer geringeren Natürlichkeit in Verbindung gebracht als das ursprüngliche Tempo. Diese Ergebnisse deuten darauf hin, dass die Manipulation des Tempos zu Veränderungen der kinematischen Merkmale führt, wodurch sich die in den Bewegungen wahrgenommenen emotionalen Attribute verändern.

Schlüsselwörter: Tempo-Manipulation, Fluidität, Kinematische Merkmale, Point-Light-Displays (PLDs), Emotionen, Bewegungen, Wahrnehmungsexperiment, Natürlichkeit

Since the advent of moving images and audiovisual media, tempo has been manipulated in various forms. Media consumers, be it for videos on social networks, commercial films, or video clips of musical performances, are increasingly confronted with displays of human motion that deviate from standard movement patterns. Given the high relevance of body movements and visual information on the perception of musical performances (Platz & Kopiez, 2012) and the vital link between tempo, arousal, and emotion (Droit-Volet et al., 2013), this study investigated how tempo manipulation in point-light recordings of dance movements, accompanied by drumbeats, were emotionally perceived in relation to original, un-manipulated movements at the same speeds.

Manipulating time by slowing down or speeding up has been a common technique for elevating emotions in films (Wöllner et al., 2018), commercial video marketing (Yin et al., 2021), or the replay of highlights in sports matches and games (Pan et al., 2001). In the film Matrix (Wachowski & Wachowski, 1999), when Neo bent backward to dodge the bullets in slow motion, film critics called this moment “bullet time” and referred to this moment as majestic and “incredibly cool to see, even after so many years since the film's release in 1999” (Dutta, 2022, para. 1). In contrast, accelerated visual scenes usually speed up the processes that otherwise span over hours and longer in standard time or create an amusing effect such as jittered human movements. Understanding the effects of tempo manipulation on emotions can help leverage such tools by acceleration and deceleration to capture attention (Compton et al., 2003), enhance memories (Levine & Pizarro, 2004), or change the perceived time (Droit-Volet & Berthon, 2017). In music and dance practices, by manipulating tempo, artists can shift the emotional expressions in their performances to create unique interpretations of the same work (Quinn & Watt, 2006). Moreover, tempo manipulation may also be useful in music therapy when assisting treatment (Brownlow, 2017) or regulating emotional experiences.

Relatively few studies have shed light on how tempo manipulations affect emotions. Decelerated slow-motion film scenes, ballet, and sports excerpts were perceived as lower in emotional arousal and more positive in valence than tempo-matched, real-time footage (Wöllner et al., 2018). These results were also present in the physiological responses of participants. Further analyses with tempo-adapted film scenes found smaller pupil dilation and higher fixation ratios with slow-motion than with the real-time videos (Hammerschmidt & Wöllner, 2018), suggesting a lower arousal level and increased attention to detail. The findings are consistent with research that found decreased arousal with slow music (Droit-Volet et al., 2013). Furthermore, manipulation of movement speed is frequently employed in popular culture to enhance the audience’s attention to the scenes' details and increase aesthetic pleasure (Nebens, 2021). However, to our knowledge, no studies have directly examined the effects of tempo manipulation on the two dimensions of emotions: Arousal and valence.

To examine the link between movement speeds and emotions, it is crucial to define how emotions are theorized and measured. Research has developed several systems to describe emotions (Mauss & Robinson, 2009). Among them, the circumplex model (Russell, 1980), which entails a two-dimensional system of valence and arousal, has been adopted to evaluate facial expressions (Calder et al., 2001) and human movements (Pollick et al., 2001). Following Pollick et al.’s (2001) research, several studies suggest that emotion inferences were also possible from static body postures (Coulson, 2004), gestures (Castillo & Neff, 2019), gait (Kang & Gross, 2016), social interaction (Clarke et al., 2005), movements in joint musical improvisation (Wöllner, 2020), and dance movements (Burger & Toiviainen, 2020a). Together, these findings indicate a connection between movement features and the two emotion dimensions.

Studies further investigated the link between emotions, kinematic features, and movement angularity, the latter referring to the perspectives and geometrical relations of movements in a three-dimensional space. Castro and Boone (2015) found a connection between the accuracy of emotion perception and movement angularity. In this case, happy and sad body postures, being located respectively in the circumplex model, were recognized more accurately when participants were more sensitive toward the angles of geometric line patterns. Kinematic features may be critical indicators of emotions when viewing full-body movements. Several kinematic features have been identified to relate to specific basic emotions. A further study on the motion-emotion link revealed that music-induced emotions, reflected in spontaneous dance movements, were characterized by a set of kinematic features (Burger et al., 2013). In this regard, movement fluidity provides an index quantifying the smoothness of the motion, whereas complexity relates to how many dimensions or directions of the movement are used, with higher dimensionality being more complex than movements of lower dimensionality. Burger et al.’s (2013) study indicated that high fluidity correlated with music-inducing emotions of low arousal yet moderately positive valence. In contrast, high movement complexity correlated with music that induced emotions of moderately high arousal and positive valence. This finding was in line with other studies that suggested similar features, and movement fluidity and complexity were reliable indicators of basic emotional states such as happiness or sadness when enacting (Van Dyck et al., 2013) and perceiving human motions (Camurri et al., 2003; Montepare et al., 1999).

Tempo changes influence kinematic features. A study with professional drummers performing in various tempi found that movement fluidity was higher with slow compared to fast tempo, while movement complexity was the highest with slow tempo when the amount of drummer’s movement peaks (Burger & Wöllner, 2023). The finding indicates the influence of tempo but not of tempo manipulation. It is yet to be found whether acceleration and deceleration of motion affect the kinematic features in visual displays. Motion capture with point-light displays (PLDs) is often used to derive kinematic features. PLDs of human movements, initially developed by Johansson (1973), entails the technique that extracts movement patterns by attaching reflective markers to human bodies and recording the movement of these markers using an optical motion capture system, receiving an accurate 3-dimensional representation of the movement (e.g., Burger & Toiviainen, 2020b). PLDs have been widely used in emotion research. A study investigating participants’ discrimination sensitivity towards fluctuations in emotional intensity with PLDs suggested that dynamic PLDs were associated with higher sensitivity than static PLDs on a level comparable to full-light displays (Atkinson et al., 2004). The studies showed the potential of accurate emotion recognition with dynamic PLDs. In addition, PLD does not carry confounding factors such as the age or clothing of the performer, making it ideal for experimental designs.

Parallel to kinematic features, movement tempo influences the perceived emotions. Temporal attributes of PLDs were found to strongly predict the actor's emotional states in motion-emotion research (Pollick & Paterson, 2008). Fast movements were often associated with happiness and anger, while slow movements were associated with sadness and a neutral mood (de Meijer, 1989; Montepare et al., 1999; Pollick et al., 2001; Roether et al., 2009). Taking this a step further, according to the circumplex model (Feldman Barrett & Russell, 1998), a fast tempo was related to a high arousal level and a slow tempo to low arousal. In contrast, slow and fast movements were found at both ends of the spectrum of the valence scale. While the link between tempo and arousal has often been observed, the correlation between movement tempo and emotional valence is less clear. It cannot be ruled out that a specific tempo represents multiple emotions regarding valence. In a passive viewing task, fast movements were rated angry or happy, while slow movements were rated sad or neutral (Montepare et al., 1999). The ambiguity of the tempo-valence relationship in identifying dance-conveyed emotions was echoed by studies adopting local movements such as knocking (Gross et al., 2010), drinking (Pollick et al., 2001), and walking (Roether et al., 2009). The findings, therefore, call upon further validations, which are part of the goals of the current study.

In addition to tempo, a movement's natural appearance may affect the perceived emotions. The tempo-original and -manipulated movements may be distinguishable from how natural they look. Despite few studies on this topic, naturalness has been frequently used to measure the validity of artificially generated motion in virtual reality (Knopp et al., 2019). Highly natural movements represent a crucial overlap between the temporal patterns of the virtual and realistic movements that match human prediction. Interestingly, Nilsson and colleagues (2015) found that the tempo thresholds for natural treadmill walking in virtual reality are higher than walking in place (WIP), suggesting expectations for the pace of perceptually natural walking depend highly on contexts (treadmill or WIP). The definition of natural walking tempo differs by whether a person is moving forward or not. The finding indicates the possibility that viewers who controlled the avatar for walking in virtual space also considered the temporal attributes of different types of motion. Similarly, when viewing real-time compared to decelerated or accelerated movements, naturalness may be tied to how much they fit with typical, realistic movement features at the corresponding speeds. However, research has yet to explore the connection between tempo and tempo manipulation and perceived naturalness. Although no direct evidence has been found, Chen and colleagues’ (2023) study revealed that prototypical (most representative of the motion type) walking was perceived as more natural and aesthetically pleasing than atypical walking. This study suggested that high naturalness mediates the effects of visual attributes on aesthetic pleasure. Such an effect might be extended to positive emotional valence. The current study aims to investigate whether tempo manipulation affects the perceived naturalness of body movements and, if yes, whether increases in naturalness affect emotional valence positively.

Similar to tempo, the sensory modality also affects the perceived emotions. Preferences for multimodal rather than unimodal information were found for an emotion recognition task, in which facial expressions of fear and disgust, with or without vocal sounds of the consistent emotions, were presented to viewers (Collignon et al., 2008). A significant improvement in emotion discrimination performances was observed when PLDs of human movements were integrated with voices that expressed affective states, such as anger and fear, compared to neutral voices (Jessen et al., 2012). Multimodal information also increases perceived arousal. A study investigated how visual kinematic features and auditory information contribute to emotion perception in musical performances (Vuoskoski et al., 2016). The findings suggest that the audiovisual condition was perceived higher in emotional arousal than the visual-only condition, while the latter was also rated less positive in valence. The impact of tempo (e.g., comparing 72 BPM to 184 BPM in Droit-Volet et al., 2013; Wöllner et al., 2018), particularly tempo-manipulated movements, is yet to be investigated. Tempo acceleration and deceleration are typically accompanied by auditory information in movies, commercials, or sports, giving rise to strong emotional responses (e.g., Wöllner et al., 2018; Yin et al., 2021). Understanding the effect of sensory modalities in tempo manipulation will shed light on the realistic use of movements in audiovisual media. Therefore, another goal of the current study is to compare visual and audiovisual presentations of the same movements in tempo-original and -manipulated conditions for their effects on the perceived emotions.

Taken together, the current study aims to investigate how real-time (original), decelerated, and accelerated (audio-) visual stimuli influence perceived emotions. With the results, we hope to provide insights into how the presentations of music and dance performances affect the viewers’ emotional experiences. Additionally, we hope to offer solutions to regulate one’s moods by managing the tempo of videos and music playlists in everyday life.

We predict that tempo manipulation affects the movement features, which in turn affects emotional valence and arousal. When movements are presented at decelerated speeds, fluidity should increase, and complexity decrease, leading to lower perceived arousal and higher perceived valence than the tempo-original condition (Burger et al.,2013). Furthermore, we hypothesize that tempo manipulation as an independent variable affects naturalness as a dependent variable: The larger the extent of manipulation (manipulated – original tempo), the less natural body movements are perceived. We further expect that naturalness acts as a mediator to the effect of tempo manipulation on perceived valence and arousal. The larger the extent of manipulation, the less natural the stimulus is perceived, and the more negative valence and the lower arousal should be perceived. Therefore, naturalness will act as an independent variable in this analysis to predict changes in emotional arousal and valence as dependent variables. According to Pollick et al. (2001), we hypothesize that a faster presentation tempo leads to higher perceived arousal. Finally, the presentation modality is expected to moderate the perceived emotions such that the presence of auditory drumbeats, synchronized with the visual movements, should lead to higher arousal and more positive valence.

Method

Participants

To determine the minimum sample required to test our hypotheses, a prior power analysis was conducted using G*Power Version 3.1.9 (Faul et al., 2009), suggesting a minimum of 53 participants for achieving 80% power, a medium effect size (f2 = 0.3) for linear multiple regression (fixed model; significance level α set at 0.05). An international sample of 62 participants was recruited for an online experiment using the platform SoSci Survey (Leiner, 2019) (29 females, one gender undisclosed; Mage = 29.23 years, SDage = 8.83). According to the Music Training dimension from the Goldsmith Music Sophistication Index (Müllensiefen et al., 2014), participants have been trained for an average of 3.31 years (SD = 4.06). The Dance Training dimension from the Goldsmith Dance Sophistication Index (Rose et al., 2020) suggested that participants, on average, have 0.79 years of active dance training (SD = 1.63). This suggests a low professional music and dance training prevalence in the sample population. The majority of the sample had achieved bachelor’s (N = 22) or master’s (N = 27) degrees, standing for a generally well-educated group (overall 79%). The study was approved by the Ethics Committee at the Faculty of Humanities, University of Hamburg, and participants provided their informed consent before the study. A lottery of two prizes worth € 30 was carried out at the end of data collection, including those who had opted to leave their email address.

Apparatus

The stimuli consisted of visual and audiovisual presentations of human movements at three tempi: 86 (slow), 130 (medium), and 195 BPM (fast). The visual stimuli, detailed in Allingham et al. (2021), were human movements recorded by an eleven-camera motion-capture system (Qualysis Oqus 700) at 200 frames per second (framerate). The performer (male, 32-year-old) jumped from one leg to the other while raising the arms parallel to the ground, flexing and extending the wrists ipsilaterally to the leg motions. The movements, each lasting 10 seconds, entailed bilateral hand flaps and left-right jumps (see Figure 1) and were recorded at the three above-mentioned original tempi. The MATLAB Motion Capture (MoCap) Toolbox (Burger & Toiviainen, 2013) was used to time-shift the original data and create animations that matched each original movement with the other two tempi; for example, the performance at 130 BPM was slowed down to match 86 BPM as well as sped up to 195 BPM (see Figure 2).

Figure 1

Depictions of the Biological Motions Used as Visual Stimuli

Note. The movements entail flexion and extension of the hands and lifting the left and right foot in turn.

Figure 2

Visualization of Tempo Manipulations With the Biological Motions

Note. The movements, recorded initially at 86 BPM, 130 BPM, and 195 BPM, were accelerated or decelerated to match the other two tempi. The same presentation tempo movements are marked in the same color (86 BPM: Blue, 130 BPM: Red, 195 BPM: Green).

The tempo transformation yielded nine video excerpts: slow-original, slow-to-medium, slow-to-fast, medium-original, medium-to-slow, medium-to-fast, fast-original, fast-to-slow, fast-to-medium. A slow-to-fast stimulus, for example, has an original tempo of 86 BPM and a presentation tempo of 195 BPM. Additionally, nine temporally synchronized audiovisual presentations were produced by aligning the visual material with an auditory drum beat (sequence of isochronous beats at the respective BPM synthesized in an online beat generator, https://drumbit.app, in Apple iFilm 10.1.12). In addition, the stimulus set included eight catch trials, which varied in duration to examine if participants paid attention to the displays. Half of the catch trials lasted 5 seconds, while the other half took 15 seconds. All of them were presented with synchronized drumbeats. In total, 18 experimental trials and eight catch trials were created.

Procedure

Invitations to the online experiment on SoSci Survey (https://www.soscisurvey.de) were distributed through email lists and social media sites. Participants were provided information about the experiment and asked to give their informed consent. Due to the restrictions of online experiments, no control was imposed on screen resolution, the distance to the screen, or the device through which sounds were played. However, at the beginning of the experiment, explicit instructions were given that participation should take place in a quiet environment on a computer with compatible browsers (Chrome or Firefox) and with head- or earphones. A 15-second music excerpt was presented to test the sound volume. Participants were instructed to adjust to a comfortable sound level and to keep the level consistent throughout the experiment. They were then asked to fill in demographic information and short questionnaires about their music training and active dance experience (Factor 3 Musical Training from the Goldsmith Musical Sophistication Index, Müllensiefen et al., 2014, and Factor 4 Dance Training from the Goldsmith Dance Sophistication Index, Rose et al., 2020).

In the experiment, participants were presented with two blocks of randomized stimuli, each consisting of 18 experimental trials, including the complete sets of audiovisual and visual-only presentations and four catch trials balanced in lengths (two for 5 s and two for 15 s). The stimulus (700 x 394-pixel resolution) appeared at the center of the screen. A repeated-measures design was used to control for within-subject variability and increase the experiment’s efficiency. Following each stimulus, participants were asked to rate emotional arousal from 1 calm to 7 excited, emotional valence from 1 negative to 7 positive, and naturalness from 1 unnatural to 7 natural. No time restriction was imposed, though the video could be watched only once. A test trial using a different dancer was presented at the beginning to familiarize the participants with the experiment. With regard to the catch trials, all participants differentiated reliably between the different durations of catch trials and experimental trials, F(2, 2481) = 68.46, p < .001. The effect size (ηp2 = 0.05) indicates a medium effect. Therefore, no participant was excluded.

Analyses

In the first analysis, the model was intended to determine whether tempo manipulation influenced the perceived naturalness, which, in turn, might influence emotional arousal and valence. A mixed linear regression was conducted. A group of independent variables was adopted to identify possible predictors of the perceived naturalness from a group of relevant variables: Movement complexity, fluidity, presentation tempo, tempo manipulation, and stimulus modality. The multilinear regression can be found in Equation I in the Appendix. The model was selected based on the lowest Akaike Information Criterion (AIC) values. To select the model of the highest goodness of fit for each dependent variable, maximum likelihood ratio (MLR) tests were conducted (see Table A4 in the Appendix). In the MLR tests, predictors were added one after another from the baseline model, in which only the random effects were present. The variances of participants and conditions were considered as random effects. While adding significantly to the previous model, models with the lowest AIC were considered the final models.

The dependent variable for Equation I is perceived naturalness, and the participants rated it on a 7-point scale in response to the question, “Please rate how natural the movement feels.” 1 represents the least natural movement, and 7 represents the most natural movement. The predictor variables were created in the same way as the variables for the second and third models (Equations II and III).

The second and third analyses also used a mixed linear regression to identify possible emotional arousal and valence predictors from relevant variables: Movement complexity, fluidity, perceived naturalness, presentation tempo, tempo manipulation, and stimulus modality. Post-hoc analyses with Bonferroni correction were conducted to follow up on significant main and interaction effects. The independent variables selected include movement complexity and fluidity. The models can be found in Equations II and III in the Appendix.

The dependent variables from the arousal and valence models include the following:

  • Perceived arousal: Ratings from 1 (calm) to 7 (excited).

  • Perceived valence: Ratings from 1 (negative) to 7 (positive) in response to the question, “Please rate whether the emotion of the video is negative or positive.”

The independent variables from all models above include the following:

  • Fluidity of body movement: The smoothness of the movement (the ratio between velocity and acceleration). This variable had been standardized as follows: 1 (least fluid) to 2 (most fluid) by the equation: (x – min(x)) + (max(x) – min(x))/(max(x) – min(x))

  • Movement complexity: This refers to the dimensionality of the movements, which is based on a Principal Component Analysis of the movement data. In this case, the movement is considered simple when the first five principal components can explain a large amount of the variance. The movement would be more complex when the first five components explain less variance. See Burger et al. (2013) for more explanation of the features. This variable has also been standardized from 1 (least complex) to 2 (most complex).

  • Modality: The modality (visual-only, audiovisual) of each stimulus.

  • Presentation tempo: The tempo presented to the participants after manipulation. For tempo-original stimuli, the presentation tempo equals the original tempo.

  • Tempo manipulation: The extent and direction of tempo manipulation, where -1 represents the largest deceleration, 0 represents no manipulation, and 1 represents the largest extent of acceleration. The index is calculated as follows: First, calculate the gap between the manipulated and original tempo as x, set the new anchor to -1 (new min), representing the largest gap in minus). 1 (new max) represents the largest gap above 0. Then run linear standardization of the value ((x – min(x)) / (max(x) – x)) * (new_max – new_min) + new_min.

  • Perceived naturalness: Ratings from 1 (least natural) to 7 (most natural) were selected as a control variable to disentangle the contributions of the movement features to changes in emotional valence and arousal.

Results

Two kinematic features were extracted from the motion capture data of the nine tempo-original and tempo-manipulated performances. Since original performance tempi were tempo-manipulated in two directions (acceleration, deceleration) or stayed the same, movements at the same presentation tempo may have different fluidity and complexity features (see Figure 3), depending on whether they were accelerated, decelerated, or performed initially at this speed.

Figure 3

Descriptive Plots of Movement Fluidity (Left Pane) and Complexity (Right Pane)

Note. Movement fluidity and complexity are normalized on a scale of 0 (least fluid/complex) to 1 (most fluid/complex) of tempo-manipulated and original movements in different original tempo conditions. Note that each bar represents one stimulus. For the presentation tempo, S = Slow, M = Medium, and F = Fast. An example of how to read this figure: When the original performance tempo is slow, and the presentation tempo is medium, the stimulus falls into the category “acceleration,” color-coded as grey.

Pearson correlation coefficients were computed to examine the linear relationship between presentation tempo and movement features. Negative correlations between tempo and fluidity (r = –0.89, p < .001) and between manipulation (acceleration) and fluidity r = –0.92, p < .001) were found. Thus, the faster the presentation tempo and the larger the extent of tempo acceleration, the less fluid the body movements. Positive correlations were found between tempo and complexity (r = 0.09, p < .001) and between acceleration and complexity (r = 0.70, p < .001), indicating that the faster the presentation tempo, the larger the extent of acceleration, the more complex the movements.

The Effects of Movement Features on Perceived Naturalness

The linear mixed models revealed significant main effects of tempo manipulation on the perceived naturalness. A significant two-way interaction between the presentation tempo and manipulation was found (see Table 1). Post-hoc comparison with Bonferroni correction suggested that, when the presentation tempo is medium, accelerated movements are perceived more natural (M = 4.57, SD = 1.39) than original (M = 4.40, SD = 1.39) and decelerated movements (M = 4.41, SD = 1.42), p < .001. The effect size of the model, as measured by Cohen’s f2, was f2 = 0.04, indicating a small effect. Please note that the comparison was not possible with the other tempo conditions as neither have all levels of manipulation.

Table 1

Summary of the Significant Results From the Multilinear Model Based on Equation I - Dependent Variable Perceived Naturalness

VariableBSE Btp
βmanipulation0.370.094.03< .001***
βtempo*manipulation–0.140.03-5.24< .001***

Note. For tempo manipulation, -1 to 0 represents deceleration, 0 stands for no manipulation, and 0–1 represents acceleration. For modality, 1 = Visual-only, 2 = Audiovisual. Values and definitions of the abbreviations are consistent with those in Table 1. For the full table, please refer to A2 in the Appendix.

***p < .001.

A one-way ANOVA was run to examine the overall effect of manipulation on the perceived naturalness across all presentation tempi, F(2, 2229) = 6.51, p = .002, with a small effect (ηp2 = 0.006). Significant main effects were found, suggesting that decelerated movements were perceived as significantly less natural than tempo-original ones (see Figure 4). No significant difference between accelerated and original movements was found.

Figure 4

Line Plots of Tempo Manipulation by Perceived Naturalness

Note. Mean naturalness with deceleration, original, and acceleration across all tempo levels (original scale from 1 to 7). The whiskers represent the standard errors.

***p < .001.

The Effects of Movement Features on Emotional Arousal

A mixed linear regression revealed significant main effects of presentation tempo, modality, and perceived naturalness on emotional arousal. The effect size of the model, as measured by Cohen’s f2, was f2 = 0.71, indicating a large effect. A faster tempo was associated with higher arousal. Audiovisual stimuli, including the drumbeats, are perceived higher in arousal than the visual-only ones. In addition, higher naturalness is associated with higher arousal (see Figure 5 lower pane, Table 2). A significant interaction between fluidity and presentation tempo was found. Post-hoc comparison with Bonferroni correction suggested that when tempo is medium, low fluidity (lower than 50 percentiles) was associated with higher arousal (M = 4.47, SD = 1.21) compared to high fluidity (M = 3.98, SD = 1.29), p < .001. Please note that the comparison was impossible with the other tempo conditions as neither has both fluidity levels.

Figure 5

Line Plots of Movement Fluidity and Perceived Naturalness by Arousal

Note. The upper pane shows mean arousal by low and high fluidity across all tempo levels (original scale from 1 to 7). The lower pane shows mean arousal by low, medium, and high naturalness. The whiskers represent the standard errors.

***p < .001.

Table 2

Summary of the Significant Results From the Multilinear Model Based on Equation II - Dependent Variable Emotional Arousal

VariableBSE Btp
α–2.910.99–2.94.003**
βnatural0.100.020.10< .001***
βmodality0.280.050.28< .001***
βtempo3.600.813.60< .001***
βfluid*tempo–1.740.58–3.03.002**

Note. Tempo = Presentation tempo. Fluid = Fluidity of body movements. Natural = Perceived naturalness. Modality = Sensory modality: 1 = visual-only; 2 = audiovisual. For the full table, please refer to Table A3 in the Appendix.

**p < .01. ***p < .001.

A one-way ANOVA was run to examine the overall effect of fluidity on emotional arousal across all tempo conditions. Fluidity was split into three levels: Low (lower than 33 quantiles), medium (33 to 67 quantiles), and high (higher than 67 quantiles) fluidity, and resulted in a significant effect on arousal, F(2, 2229) = 445.54, p < .001 (see Figure 5, upper pane). The effect size (ηp2 = 0.29) indicates a large effect. The significant main effect suggests that high fluidity was perceived to be significantly lower in arousal for all tempo conditions than medium and low fluidity.

The Effects of Movement Features on Emotional Valence

The mixed linear regression revealed significant main effects of modality and perceived naturalness (Table 3), while movement features, tempo or manipulation showed no significant effects. Increases in naturalness are linked with higher valence (Figure 6). The audiovisual stimuli were perceived to be less positive compared to the visual-only stimuli. The effect size of the model, as measured by Cohen’s f2, was f2 = 0.27, indicating a medium effect.

Table 3

Summary of the Significant Results From the Multilinear Model Based on Equation III - Dependent Variable Emotional Valence

VariableBSE Btp
βnatural0.370.0223.27< .001***
βmodality–0.090.04–2.16.03*

Note. Values and definitions of the abbreviations are consistent with that in Table 1. For the full table, please refer to Table A4 in the Appendix.

*p < .05. ***p < .001.

Figure 6

Line Plots of Perceived Naturalness by Emotional Valence

Note. Mean valence by low (lower than 33% percentiles), medium (33% to 67% percentiles), and high (higher than 67% percentiles) naturalness across all tempo levels (original scale from 1 to 7).

Discussion

The current study investigated the effects of tempo manipulation on perceived emotional arousal and valence. Variables investigated were movement fluidity and complexity, the tempo-manipulated and original movements' perceived naturalness, the presentation tempo, and the sensory modality. Results suggest that: 1) Arousal was influenced by movement fluidity and naturalness. The higher the movement fluidity and the lower the naturalness, the lower the arousal. 2) Naturalness was affected by tempo manipulation. At a medium presentation tempo, accelerated movements were rated more natural than decelerated or even original ones. Overall, decelerated movements were perceived to be the least natural. 3) Emotional valence was most strongly affected by naturalness rather than tempo or kinematic features, such that more natural movements were also rated more positive in valence. Taken together, changing the speed of movements in audiovisual presentations has manifold consequences for the perception of movement qualities and emotions.

The manipulation of presentation tempo, as is often undertaken in video clips and films for various emotional purposes (Wöllner et al., 2018), affected the fluidity of movements. Higher movement fluidity, in turn, was associated with lower emotional arousal, confirming the results of previous studies that found, for example, tender-related motions to be less jerky than anger-elicited ones (Burger et al., 2013). According to Dahl and Friberg (2007), anger was related to low movement smoothness and happiness with high smoothness—the latter was similar to anger in its high emotional arousal but is characterized by higher emotional valence. Similar results were found for dance (Camurri et al., 2003) and music-induced movements (Boone & Cunningham, 2001), such that happiness and anger which are both relatively high in the arousal dimension, induced more frequent tempo changes and higher jerkiness in movements than sadness. Dance movements with high fluidity were also correlated with tenderness and sadness (Burger & Toiviainen, 2020b).

In our study, given that tempo acceleration was strongly correlated with lower fluidity (see Figure 5), participants distinguished between the tempo manipulations in terms of fluidity differences in the tempo-original and tempo-manipulated stimuli. They also perceived the emotional arousal of the two types of movements accordingly. Fluidity might thus be a more salient indicator of tempo manipulation than complexity. It should be stated that movement complexity is inherently related to tempo for stimuli of the same movement types: The faster the tempo, the more movement samples are displayed in a fixed window of time, leading to an increase in complexity. Thus, a potential effect of complexity on arousal can also be partially attributed to changes in tempo. Since the movements in the current study were highly controlled PLD displays, it should be further investigated if such effects can be replicated with naturalistic stimuli such as movie scenes or sports video clips. On the other hand, fluidity affects arousal but not valence, which could be caused by the different salience of the two emotional dimensions in perceived movements. Movements with positive or negative associations may thus exhibit similar fluidity, while valence perception in movements could be more ambiguous (Gross et al., 2010; Pollick et al., 2001).

On the other hand, valence perception was significantly affected by naturalness: The more natural a stimulus is perceived, the more positive the valence (see Figure 6), and the tempo-original and accelerated movements were rated to be more natural than tempo-decelerated ones. As the perceived validity of the stimuli can define the naturalness (Knopp et al., 2019), the effects may be due to an incongruence between the participants’ expectations of a given movement and how the PLD movements appeared. The smaller the gap, the more pleasant it was perceived. Movements perceived more positively in valence may also possess a unique combination of features, also known as movement “fingerprints” (Van Vugt et al., 2013), that led to an elevated quality. Van Vugt and colleagues (2013) investigated the movement of “fingerprints” or individuality via professional pianists’ movements in unexpressive and muted performances. Their study found that the timing of the pianists’ movements is different for each individual. Participants may expect similar movement features with real-life motions that manipulated motions may not have. In this way, complex motions that are less smooth in their trajectories could be most plausible. Similarly, a study found a congruence effect such that audiovisual stimuli consistent in arousal and valence level induced significant psychophysiological responses compared to inconsistent ones (Christensen et al., 2014). However, the link between naturalness and valence should be explored with a more extensive variety of movements.

Not surprisingly, we found that faster presentation tempo was linked to higher arousal. The effect of tempo on emotional arousal has been shown in various studies showing that the faster the sequence, the higher the arousal (Droit-Volet et al., 2013; Sievers et al., 2013; Wöllner et al., 2018), both in music and human movements. Furthermore, the absence of a tempo effect on emotional valence is also consistent with previous findings. Hence, tempo did not predict the emotional valence but rather the arousal level (Khalfa et al., 2008). In addition, our results suggest that emotional arousal was significantly higher in audiovisual than in visual-only presentations, whereas emotional valence was more negative in audiovisual than in visual-only presentations. Audiovisual stimuli evoke stronger emotional arousal than unimodal stimuli (e.g., Vuoskoski et al., 2016; Wöllner et al., 2018). Our finding implies that the multisensory effect persists despite tempo manipulations with the visual inputs, thus shedding light on, for example, the usage of multimedia and slow-motion videos in real-world scenarios.

A limitation of the study lies in the generalizability of the current findings, which could be enhanced with a larger number of PLD movements than that of the current sample. In future research, movements could include more scenarios that cover, for instance, interpersonal interactions and a variety of emotions. Furthermore, a higher number of smaller tempo manipulation steps may allow more detailed comparisons between original and tempo-manipulated conditions. In our experiment, the movements are generated specifically for the experiment in order to match various performance and presentation tempo conditions in a discrete and recognizable way and could thus have been unusual in comparison with day-to-day activities. In Burger et al.’s (2013) and Burger and Toiviainen’s (2020b) studies, the PLD movements are extracted from humans moving or dancing naturally to music.

Conclusions

In this study, we investigated the impact of tempo manipulation on emotional arousal and valence. Our findings reveal that increased movement fluidity led to decreased perceived emotional arousal, and decreases in perceived naturalness with tempo-decelerated movements resulted in negative emotional valence. Thus, tempo manipulations influence both emotional dimensions, particularly when slowing down from the original tempo. The ratings from our study corroborated earlier research examining the emotional and peripheral physiological responses to slow-motion scenes from movies, sports, and dance (Hammerschmidt & Wöllner, 2018; Wöllner et al., 2018).

The findings also point to the possible mechanism of how tempo manipulations are experienced through the perception of movement features such as fluidity. Identifying the gap between manipulated vs. tempo-original movements may provide insights into how artificial movements could be produced with high plausibility (Chen et al., 2023), creating advertisements with high perceived arousal and positive valence (Yin et al., 2021), or simply understanding the emotional responses that movements could elicit in various scenarios. Our results shed light on the possibilities of shifting the emotional experiences of viewers with tempo-manipulated body movements through music/dance performances or day-to-day media consumption. Future research may investigate the extent to which fluidity affects the perceived emotions, which other movement features apart from tempo affect the perceived emotions, and whether the movement features play the same role in a set of naturalistic and manipulated scenes, for which perceived naturalness should be a key factor.

Funding

This research was supported by a Consolidator Grant from the European Research Council (Grant No. 725319) to the third author. The research is part of the 5-year project: “Slow motion: Transformations of musical time in perception and performance” (SloMo).

Competing Interests

The authors have declared that no competing interests exist.

Acknowledgments

The authors have no additional (i.e., non-financial) support to report.

Data Availability

The research data, including all perceptual ratings and the movement features for this article, are available on Zenodo (see Wang, 2024).

Supplementary Materials

For this article, the following supplementary Materials are available:

Index of Supplementary Materials

  • Wang, X. (2024). The dataset of body movement and emotion: Investigating the impact of audiovisual tempo manipulations on emotional arousal and valence [Data]. Zenodo. https://doi.org/ 10.5281/zenodo.12601004

  • Wang, X., Burger, B., & Wöllner, C. (2021). Biological motion and emotion: Investigating the impact of tempo manipulations [Video materials]. Zenodo. https://doi.org/ 10.5281/zenodo.5212232

References

  • Allingham, E., Hammerschmidt, D., & Wöllner, C. (2021). Time perception in human movement: Effects of speed and agency on duration estimation. Quarterly Journal of Experimental Psychology, 74(3), 559–572. https://doi.org/ 10.1177/1747021820979518

  • Atkinson, A. P., Dittrich, W., Germmell, A., & Young, A. (2004). Emotion perception from dynamic and static body expressions in point-light and full-light displays. Perception, 33(6), 717–746. https://doi.org/ 10.1068/p5096

  • Boone, R. T., & Cunningham, J. G. (2001). Children’s expression of emotional meaning in music through expressive body movement. Journal of Nonverbal Behavior, 25(1), 21–41. https://doi.org/ 10.1023/A:1006733123708

  • Brownlow, B. (2017). The effect of music tempo on the psychophysiological measures of stress. Continuum: The Spelman Undergraduate Research Journal, 1(1), 8–16.

  • Burger, B., Saarikallio, S., Luck, G., Thompson, M. R., & Toiviainen, P. (2013). Relationships between perceived emotions in music and music-induced movement. Music Perception, 30(5), 517–533. https://doi.org/ 10.1525/mp.2013.30.5.517

  • Burger, B., & Toiviainen, P. (2013). MoCap Toolbox-A Matlab toolbox for computational analysis of movement data. In R. Bresin (Ed.), Proceedings of the sound and music computing conference 2013 (pp. 172–178). Logos Verlag Berlin. http://urn.fi/URN:NBN:fi:jyu-201401211091

  • Burger, B., & Toiviainen, P. (2020a). Embodiment in electronic dance music: Effects of musical content and structure on body movement. Musicae Scientiae, 24(2), 186–205. https://doi.org/ 10.1177/1029864918792594

  • Burger, B., & Toiviainen, P. (2020b). See how it feels to move: Relationships between movement characteristics and perception of emotions in dance. Human Technology, 16(3), 233–256. https://doi.org/ 10.17011/ht/urn.202011256764

  • Burger, B., & Wöllner, C. (2023). Drumming action and perception: How the movements of a professional drummer influence experiences of tempo, time, and expressivity. Music & Science, 6, 1–17. https://doi.org/ 10.1177/20592043231186870

  • Calder, A. J., Burton, A. M., Miller, P., Young, A. W., & Akamatsu, S. (2001). A principal component analysis of facial expressions. Vision Research, 41(9), 1179–1208. https://doi.org/ 10.1016/S0042-6989(01)00002-5

  • Camurri, A., Lagerlöf, I., & Volpe, G. (2003). Recognizing emotion from dance movement: Comparison of spectator recognition and automated techniques. International Journal of Human-Computer Studies, 59(1–2), 213–225. https://doi.org/ 10.1016/S1071-5819(03)00050-8

  • Castillo, G., & Neff, M. (2019). What do we express without knowing? Emotion in gesture. In Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (pp. 702–710). International Foundation for Autonomous Agents and Multiagent Systems. https://dl.acm.org/doi/abs/10.5555/3306127.3331759

  • Castro, V. L., & Boone, R. T. (2015). Sensitivity to spatiotemporal percepts predicts the perception of emotion. Journal of Nonverbal Behavior, 39(3), 215–240. https://doi.org/ 10.1007/s10919-015-0208-6

  • Chen, Y.-C., Pollick, F., & Lu, H. (2023). Aesthetic preferences for prototypical movements in human actions. Cognitive Research: Principles and Implications, 8(1), Article 55. https://doi.org/ 10.1186/s41235-023-00510-0

  • Christensen, J. F., Gaigg, S. B., Gomila, A., Oke, P., & Calvo-Merino, B. (2014). Enhancing emotional experiences to dance through music: The role of valence and arousal in the cross-modal bias. Frontiers in Human Neuroscience, 8, Article 757. https://doi.org/ 10.3389/fnhum.2014.00757

  • Clarke, T. J., Bradshaw, M. F., Fieldô, D. T., Hampson, S. E., & Rose, D. (2005). The perception of emotion from body movement in point-light displays of interpersonal dialogue. Perception, 34(10), 1171–1180. https://doi.org/ 10.1068/p5203

  • Collignon, O., Girard, S., Gosselin, F., Roy, S., Saint-Amour, D., Lassonde, M., & Lepore, F. (2008). Audio-visual integration of emotion expression. Brain Research, 1242, 126–135. https://doi.org/ 10.1016/j.brainres.2008.04.023

  • Compton, R. J., Banich, M. T., Mohanty, A., Milham, M. P., Herrington, J., Miller, G. A., Scalf, P. E., Webb, A., & Heller, W. (2003). Paying attention to emotion: An fMRI investigation of cognitive and emotional Stroop tasks. Cognitive, Affective & Behavioral Neuroscience, 3(2), 81–96. https://doi.org/ 10.3758/CABN.3.2.81

  • Coulson, M. (2004). Attributing emotion to static body postures: Recognition accuracy, confusions, and viewpoint dependence. Journal of Nonverbal Behavior, 28(2), 117–139. https://doi.org/ 10.1023/B:JONB.0000023655.25550.be

  • Dahl, S., & Friberg, A. (2007). Visual perception of expressiveness in musicians’ body movements. Music Perception, 24(5), 433–454. https://doi.org/ 10.1525/mp.2007.24.5.433

  • de Meijer, M. (1989). The contribution of general features of body movement to the attribution of emotions. Journal of Nonverbal Behavior, 13(4), 247–268. https://doi.org/ 10.1007/BF00990296

  • Droit-Volet, S., & Berthon, M. (2017). Emotion and implicit timing: The arousal effect. Frontiers in Psychology, 8, Article 176. https://doi.org/ 10.3389/fpsyg.2017.00176

  • Droit-Volet, S., Ramos, D., Bueno, J. L. O., & Bigand, E. (2013). Music, emotion, and time perception: The influence of subjective emotional valence and arousal? Frontiers in Psychology, 4, Article 417. https://doi.org/ 10.3389/fpsyg.2013.00417

  • Dutta, D. (2022, July 5). The Matrix’s original bullet-time method was a little too risky to work. Slash Film. https://www.slashfilm.com/917171/the-matrixs-original-bullet-time-method-was-a-little-too-risky-to-work/

  • Faul, F., Erdfelder, E., Buchner, A., & Lang, A. G. (2009). Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods, 41(4), 1149–1160. https://doi.org/ 10.3758/BRM.41.4.1149

  • Feldman Barrett, L., & Russell, J. A. (1998). Independence and bipolarity in the structure of current affect. Journal of Personality and Social Psychology, 74(4), 967–984. https://doi.org/ 10.1037/0022-3514.74.4.967

  • Gross, M. M., Crane, E. A., & Fredrickson, B. L. (2010). Methodology for assessing bodily expression of emotion. Journal of Nonverbal Behavior, 34(4), 223–248. https://doi.org/ 10.1007/s10919-010-0094-x

  • Hammerschmidt, D., & Wöllner, C. (2018). The impact of music and stretched time on pupillary responses and eye movements in slow-motion film scenes. Journal of Eye Movement Research, 11(2), 1–17. https://doi.org/ 10.16910/jemr.11.2.10

  • Jessen, S., Obleser, J., & Kotz, S. A. (2012). How bodies and voices interact in early emotion perception. PLoS One, 7(4), Article e36070. https://doi.org/ 10.1371/journal.pone.0036070

  • Johansson, G. (1973). Visual perception of biological motion and a model for its analysis. Perception & Psychophysics, 14(2), 201–211. https://doi.org/ 10.3758/BF03212378

  • Kang, G. E., & Gross, M. M. (2016). The effect of emotion on movement smoothness during gait in healthy young adults. Journal of Biomechanics, 49(16), 4022–4027. https://doi.org/ 10.1016/j.jbiomech.2016.10.044

  • Khalfa, S., Roy, M., Rainville, P., Dalla Bella, S., & Peretz, I. (2008). Role of tempo entrainment in psychophysiological differentiation of happy and sad music? International Journal of Psychophysiology, 68(1), 17–26. https://doi.org/ 10.1016/j.ijpsycho.2007.12.001

  • Knopp, B., Velychko, D., Dreibrodt, J., Endres, D., Knopp, B., Velychko, D., Dreibrodt, J., & Endres, D. (2019). Predicting perceived naturalness of human animations based on generative movement primitive models. ACM Transactions on Applied Perception, 16(3), 1–18. https://doi.org/ 10.1145/3355401

  • Leiner, D. J. (2019). SoSci Survey (Version 3.2.00) [Computer software]. https://www.soscisurvey.de

  • Levine, L. J., & Pizarro, D. A. (2004). Emotion and memory research: A grumpy overview. Social Cognition, 22(5), 530–554. https://doi.org/ 10.1521/soco.22.5.530.50767

  • Mauss, I. B., & Robinson, M. D. (2009). Measures of emotion: A review. Cognition and Emotion, 23(2), 209–237. https://doi.org/ 10.1080/02699930802204677

  • Montepare, J., Koff, E., Zaitchik, D., & Albert, M. (1999). The use of body movements and gestures as cues to emotions in younger and older adults. Journal of Nonverbal Behavior, 23(2), 133–152. https://doi.org/ 10.1023/A:1021435526134

  • Müllensiefen, D., Gingras, B., Musil, J., & Stewart, L. (2014). The musicality of non-musicians: An index for assessing musical sophistication in the general population. PLoS One, 9(2), Article e89642. https://doi.org/ 10.1371/journal.pone.0089642

  • Nebens, R. (2021, April 10). Zack Snyder’s Justice League: Cinematographer responds to all that slow motion. The Direct. https://thedirect.com/article/justice-league-slow-motion-zack-snyder

  • Nilsson, N. C., Serafin, S., & Nordahl, R. (2015). The effect of visual display properties and gain presentation mode on the perceived naturalness of virtual walking speeds. 2015 IEEE Virtual Reality Conference, VR 2015 - Proceedings, 81–88. https://doi.org/ 10.1109/VR.2015.7223328

  • Pan, H., Van Beek, P., & Sezan, M. I. (2001). Detection of slow-motion replay segments in sports video for highlights generation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 3, 1649–1652. https://doi.org/ 10.1109/ICASSP.2001.941253

  • Platz, F., & Kopiez, R. (2012). When the eye listens: A meta-analysis of how audio-visual presentation enhances the appreciation of music performance. Music Perception, 30(1), 71–83. https://doi.org/ 10.1525/mp.2012.30.1.71

  • Pollick, F. E., & Paterson, H. (2008). Movement style, movement features, and the recognition of affect from human movement. In F. Thomas, Shipley, M. Jeffrey (Eds.), Understanding events: From perception to action (pp. 286–308). Oxford Scholarship Online. https://doi.org/ 10.1093/acprof:oso/9780195188370.001.0001

  • Pollick, F. E., Paterson, H. M., Bruderlin, A., & Sanford, A. J. (2001). Perceiving affect from arm movement. Cognition, 82(2), B51–B61. https://doi.org/ 10.1016/S0010-0277(01)00147-0

  • Quinn, S., & Watt, R. (2006). The perception of tempo in music. Perception, 35(2), 267–280. https://doi.org/ 10.1068/p5353

  • Roether, C. L., Omlor, L., Christensen, A., & Giese, M. A. (2009). Critical features for the perception of emotion from gait. Journal of Vision, 9(6), Article 15. https://doi.org/ 10.1167/9.6.15

  • Rose, D., Müllensiefen, D., Lovatt, P., & Orgs, G. (2020). The Goldsmiths Dance Sophistication Index (Gold-DSI): A new psychometric tool to assess individual differences in dance experience. https://doi.org/ 10.31234/osf.io/wapkx

  • Russell, J. A. (1980). A circumplex model of affect. Journal of Personality and Social Psychology, 39(6), 1161–1178. https://doi.org/ 10.1037/h0077714

  • Sievers, B., Polansky, L., Casey, M., & Wheatley, T. (2013). Music and movement share a dynamic structure that supports universal expressions of emotion. Proceedings of the National Academy of Sciences of the United States of America, 110(1), 70–75. https://doi.org/ 10.1073/pnas.1209023110

  • Van Dyck, E., Maes, P. J., Hargreaves, J., Lesaffre, M., & Leman, M. (2013). Expressing induced emotions through free dance movement. Journal of Nonverbal Behavior, 37(3), 175–190. https://doi.org/ 10.1007/s10919-013-0153-1

  • Vuoskoski, J. K., Gatti, E., Spence, C., & Clarke, E. F. (2016). Do visual cues intensify the emotional responses evoked by musical performance? A psychophysiological investigation. Psychomusicology: Music, Mind, and Brain, 26(2), 179–188. https://doi.org/ 10.1037/pmu0000142

  • Van Vugt, F. T., Jabusch, H. C., & Altenmüller, E. (2013). Individuality that is unheard of: Systematic temporal deviations in scale playing leave an inaudible pianistic fingerprint. Frontiers in Psychology, 4, Article 134. https://doi.org/ 10.3389/fpsyg.2013.00134

  • Wachowski, L., & Wachowski, L. (1999). The Matrix. Warner Bros.

  • Wöllner, C. (2020). Call and response: Musical and bodily interactions in jazz improvisation duos. Musicae Scientiae, 24(1), 44–59. https://doi.org/ 10.1177/1029864918772004

  • Wöllner, C., Hammerschmidt, D., & Albrecht, H. (2018). Slow motion in films and video clips: Music influences perceived duration and emotion, autonomic physiological activation and pupillary responses. PLoS One, 13(6), Article e0199161. https://doi.org/ 10.1371/journal.pone.0199161

  • Yin, Y., Jia, J. S., & Zheng, W. (2021). The effect of slow motion video on consumer inference. JMR, Journal of Marketing Research, 58(5), 1007–1024. https://doi.org/ 10.1177/00222437211025054

Appendix

The Appendix includes the equations on which the models were based (see Analyses in the Methods section), tables of model selection processes, and tables of the full model results. In the following, Equation I refers to the naturalness model, Equation II refers to the arousal model, and Equation III refers to the valence model.

Equation I: Percceived naturalness=α +βfluidXfluid+βcomplexXcomplex + βtempoXtempo+βmanipulateXmanipulate+ βfluid*complexXfluidXcomplex

In Equation I, α stands for the fixed intercept, while βfluid and βcomplex represent the betas of the fixed effects of fluidity, body movements, and movement complexity respectively. βtempoand βmanipulaterepresent the betas of the fixed effects of the presentation tempo and tempo manipulation.

Equation II: Arousal=α+βcomplexXcomplex+βfluidXfluid + βmodalityXmodality + βnaturalXnatural + βtempoXtempo+ βtempoXtempo+βmanipulateXmanipulate+βfluid*complexXfluidXcomplex +βfluid*manipulateXfluidXmanipulate +βcomplex*manipulateXcomplexXmanipulate +βfluid*tempoXfluidXtempo +βcomplex*tempoXcomplexXtempo

Equation III: Valence=α+βcomplexXcomplex+βfluidXfluid + βmodalityXmodality + βnaturalXnatural + βtempoXtempo+βmanipulateXmanipulate + βfluid*complexXfluidXcomplex

In Equations II and III, α stands for the fixed intercept, while βfluid and βcomplex represent the betas of the fixed effects of fluidity and body movements and movement complexity respectively.  βmodality and βnatural represent the betas of modality and perceived naturalness. βtempoand βmanipulaterepresent the betas of the fixed effects of the presentation tempo and tempo manipulation. βfluid*complex stands for the beta of the interaction between fluidity and complexity.

Table A1

Summary of the Results of Maximum Likelihood Ratio Tests (χ2 Analysis)

Dependent VariableVariabledfAICBICLog likelihoodχ2p
Emotional ArousalBaseline38163.108180.23−4078.55
+ Complexity47087.837110.67−3539.911077.27< .001***
+ Fluidity57033.367061.91−3511.6856.47< .001***
+ Fluidity: Complexity66917.876952.13−3452.94117.49< .001***
+ Naturalness76896.576936.55−3441.2923.30< .001***
+ Modality86859.196904.87−3421.5939.39< .001***
+ Tempo96795.476846.86−3388.7365.72< .001***
+ Manipulation106792.726849.83−3386.364.74.03*
+ Fluidity*Manipulation116793.566856.38−3385.781.16.28
+ Complexity*Manipulation126794.336862.86−3385.161.24.27
+ Complexity*Tempo136790.496864.73−3382.255.84.02
+ Fluidity*Tempo146783.286863.23−3377.649.21< .001***
Emotional ValenceBaseline36842.486859.61−3418.24
+ Complexity46785.306808.14−3388.6559.18< 0.001***
+ Fluidity56787.046815.59−3388.520.26.61
+ Fluidity: Complexity66778.126812.38−3383.0610.92< .001***
+ Modality76777.256817.22−3381.622.87.09
+ Tempo86777.376823.05−3380.681.88.17
+ Manipulation96779.126830.51−3380.560.25.62
+ Naturalness106295.316352.42−3137.66485.81< .001***
+ Fluidity*Manipulation116295.816358.63−3136.911.50.22
+ Complexity*Manipulation126297.336365.86−3136.670.48.49
+ Fluidity*Tempo136298.756372.98−3136.370.59.44
+ Complexity*Tempo146297.156377.10−3134.583.59.06
Perceived NaturalnessBaseline37698.417715.54−3846.20
+ Complexity47693.707716.54−3842.856.71.01*
+ Fluidity57693.987722.53−3841.991.72.19
+ Tempo67639.317673.57−3813.6556.67< .001***
+ Manipulation77629.547669.52−3807.7711.76< .001***
+ Fluidity*Complexity87630.157675.83−3807.071.40.24
+ Tempo*Manipulation97604.817656.21−3793.4027.34< .001***

Note. The analyses explored the differences in perceptual changes as a function of the fluidity of body movements, the movement complexity, and the presentation tempo.

*p < .05. ***p < .001.

Table A2

Summary of Results From the Multilinear Model Based on Equation I – Dependent Variable Perceived Naturalness

VariableBSE Btp
α0.922.340.39.70
βfluid2.251.261.79.07
βcomplex0.530.590.90.37
βtempo0.280.290.97.33
βmanipulation2.620.654.03< .001***
βfluid*complex−0.470.49−0.95.34
βtempo*manipulate−0.990.19−5.24< .001***

Note. Fluid = Fluidity of body movements; Complex = Movement complexity; Tempo = Presentation tempo; Manipulation = The extent of manipulation. For tempo manipulation, -1 to 0 represents deceleration, 0 stands for no manipulation, and 0 – 1 represents acceleration.

***p < .001.

Table A3

Summary of Results From The Multilinear Model Based on Equation II - Dependent Variable Emotional Arousal

VariableBSE Btp
α−12.5211.82−1.06.29
βfluid7.654.901.56.12
βcomplex2.129.650.22.83
βnatural0.100.025.86< .001***
βmodality0.280.056.03< .001***
βtempo3.600.814.44< .001***
βmanipulation−2.061.09−1.89.06
βfluid*complex−1.165.80−0.20.84
βfluid*manipulation2.393.030.79.43
βcomplex*manipulation−1.473.81−0.38.70
βfluid*tempo−1.740.58−3.03.002**
βcomplex*tempo0.823.570.23.82

Note. For modality, 1 = Visual-only, 2 = Audiovisual. Natural = Perceived naturalness, on a scale from 1 to 7. 1 stands for low and 7 for high naturalness. Values and definitions of the abbreviations are consistent with Table A2.

**p < .01. ***p < .001.

Table A4

Summary of Results From the Multilinear Model Based on Equation III - Dependent Variable Emotional Valence

VariableBSE Btp
α3.610.884.09< .001***
βfluid−0.310.34−0.92.36
βcomplex−0.100.37−0.26.79
βnatural0.370.0223.27< .001***
βmodality−0.090.04−2.16.03*
βtempo0.000.17−0.02.98
βmanipulation0.150.150.99.32
βfluid*complex0.020.340.06.95

Note. Values and definitions of the abbreviations are consistent with Table A2 and A3.

***p < .001.