
AMuCS: Affective multimodal Counter-Strike video game dataset

Background & Summary

Interactive media such as video games are versatile and multi-faceted stimuli. Video games are becoming increasingly realistic, with detailed graphics, accurate physics, convincing emotional characters, and immersive virtual reality. As such, video games elicit complex player experiences, which makes them an attractive subject for the research community.

Over the past several years the collection of player data has become a serious consideration and necessity for both researchers and developers1. Player telemetry allows designers to observe and obtain an accurate representation of several in-game behaviours, which in turn can be used to make important game design decisions and modifications. Telemetry is also often used for matchmaking and ranking players in competitive multiplayer games2 to provide a more enjoyable experience for players at different skill ranges. However, telemetry often only measures in-game behaviours and thus fails to capture the player behaviours and reactions required to obtain a full representation of the player state. Due to their ability to immerse players and to elicit complex emotions, video games have been widely studied in the domains of psychology and affective computing. In these domains, physiological activity, facial expression and body posture are often recorded in addition to in-game measures3,4,5,6,7,8.

In affective computing, the focus is on the user experience, with the goal of understanding user behaviours and emotions to make the playing experience more engaging. This could, for instance, be achieved by adapting the video game content and difficulty according to the players’ experience8. User emotions can be estimated continuously and in real-time by measuring physiological responses, which are linked to the autonomic nervous system (ANS), and other behavioural information such as facial expressions. A common finding is that multimodal recordings are necessary to capture as much of the user experience as possible9,10.

Although the usage of physiology to study player behaviour is quite common in the literature, it is rarely done within a multiplayer setting where all players’ data streams are collected synchronously. Furthermore, although multimodal data collection has been achieved, we are not aware of a study which has collected all the modalities presented in this paper in a multiplayer scenario.

To facilitate research in the fields of game analysis and psychology, several datasets have been made publicly available (see Table 1). The datasets utilize a wide variety of games as a stimulus, from 2D platformers like Super Mario Bros to 3D action adventure games like Assassin’s Creed. Very few datasets use multiplayer games and even fewer collect data from all players concurrently. Multiplayer data could be useful for analyzing synchronous behaviours, interaction dynamics, and more. Typically, contextual data (e.g. game logs) is collected in a laboratory environment together with physiological signals such as electrocardiograms (ECG), electrodermal activity (EDA) and electroencephalograms (EEG).

Table 1 Open video game datasets with physiological or affective modalities.


Annotations vary between the datasets as well. They can be collected through questionnaires that are administered after the experience and ask participants to summarize the experienced intensity of a sentiment. A common type of questionnaire for indicating emotions is the self-assessment manikin11 (SAM), where participants are aided by a pictorial representation of the intensity of the emotions. Another type of annotation is based on a graphical element that represents a one or two-dimensional space of interest where participants can report their subjective feeling12 (continuous-space annotation). The arousal-valence space is commonly used to annotate emotional experiences in a continuous space. This may be accompanied by visual aids positioning categorical emotions in the space. Annotations can also be collected in a continuous-time manner for a specific sentiment13,14. Annotators are asked to watch a recording and at the same time use software to indicate the absolute or relative intensity of a sentiment at each moment. These types of annotations can be used to compare the reactions to specific moments or events throughout the experience. The datasets listed in Table 1 are annotated in one or more of: fun, frustration, engagement, arousal, valence, and others.

Our study aimed to collect a large affective multimodal video game dataset in order to identify robust emotional modalities and train emotion recognition machine learning models. The resulting dataset (AMuCS15) is the largest multiplayer dataset by an order of magnitude (245 vs 30 participants) and includes 11 recorded modalities, compared to the average of 5 modalities in Table 1. It also remains competitive with single-player datasets like BIRAFFE, FUNii, and BIRAFFE2, which have a similar number of participants (206, 190, and 103 respectively). Due to its multimodal nature, this dataset can be used to analyze several facets of video gameplay including but not limited to emotional dynamics, player experience, player performance, player behaviours, and team dynamics. The AMuCS dataset15 has previously been used to study pupil diameter16.

Methods

We recorded a total of 256 participants playing video games in realistic conditions (7.66% female, 0.81% non-binary, 0.40% no answer). The participants’ mean age was 22.68 years (standard deviation of 3.71, range 18 to 36). The languages spoken by the participants varied between Swiss German, French, and English. The data was collected in 71 experimental sessions where groups of 2 or 4 participants played the Counter-Strike: Global Offensive (CS:GO) first person shooter (FPS) video game on a computer. Several modalities were recorded during the game using custom data acquisition software modules.

All data was synchronized using the Lab Streaming Layer (LSL) software library17. More information about the data acquisition software and architecture can be found in our relevant publication18, where we measured the synchronization delays to be within 50 ms on average after offset corrections. Video frames were not sent over LSL to reduce the bandwidth usage on the local network. Face and screen videos were saved directly on the participant’s PC, and video frame numbers and timestamps were sent to LSL for synchronization.

The data was collected on-site at several video game LAN events in Switzerland over the course of two years: SwitzerLAN 2020 (https://switzerlan.ch/), SwitzerLAN 2021, PolyLAN 36 (https://polylan.ch/), and PolyLAN 37. The experimental area was set up in an approximately 5 square meter space within each event. It included 4 gaming PCs, each equipped with the sensors mentioned earlier, and 1 server PC where the game server and LSL Lab Recorder were installed.

The SwitzerLAN 2020 event welcomed approximately 300 gamers in the BernExpo exposition area, a large open space in the city of Bern, Switzerland. This event took place in October of 2020 during the COVID-19 pandemic, and various restrictions were in place such as wearing masks when not seated and temperature checks upon entering the LAN area. The physical setup of the experiment is illustrated in Fig. 1a. The experiment was in a corner of the same area as the LAN; consequently, the environmental lighting and noise were not controlled. Furthermore, the participants were informed that they could keep wearing their mask for the experiment. 14 sessions totaling 41 participants were recorded at this event (sessions numbered 1 to 14).

Fig. 1 Physical experimental setup.

The SwitzerLAN 2021 event welcomed approximately 1200 gamers in the BernExpo exposition area. This event took place in October 2021 and gamers had to show proof of vaccination, recovery, or a PCR test before entering the LAN area. The physical setup of the experiment was similar to the SwitzerLAN 2020 setup. Mask-wearing was not enforced in this event. 18 sessions totaling 67 participants were recorded at this event (sessions 15 to 33).

The PolyLAN 36 event welcomed approximately 200 gamers in a large auditorium of the EPFL campus in Ecublens, Switzerland. This event took place in November of 2021 and had similar restrictions as the SwitzerLAN 2021 event. The physical setup of the experiment was slightly different at this event and is illustrated in Fig. 1b. The experiment was located at the last row of the auditorium. As with the other events, the environmental lighting and noise were not controlled. 7 sessions totaling 32 participants were recorded at this event (sessions 34 to 41).

The PolyLAN 37 event welcomed approximately 1200 gamers in the SwissTech Convention Center on the EPFL campus. This event took place in April 2022 and had similar restrictions as the previous two events. The physical setup is illustrated in Fig. 1a. The experiment was located in an office adjacent to the LAN area, which resulted in much less environmental noise from the rest of the LAN as well as more consistent lighting. 29 sessions totaling 116 participants were recorded at this event (sessions 42 to 71).

The ethics board of the University of Geneva (CUREG) approved the conduct of the study and the sharing of the data under the project title “Emotionally intelligent peripherals for video game streamers and players – video annotations” (IRB committee number CUREG-2021-06-63), and the study conforms to all ethical guidelines set forth by the institution. All participants provided informed consent prior to their inclusion in the study. 245 participants (out of 256 recorded) accepted to share their anonymized data with other research institutions.

The Counter-Strike: Global Offensive game

In our experiments we utilized Valve’s Counter-Strike: Global Offensive (CS:GO). It is a free-to-play and moddable multiplayer first person shooter (FPS) developed in Valve’s Source engine (the engine behind Half-Life 2). It is also popular in the e-sport community. The game includes several game modes: demolition, hostage, deathmatch, and team deathmatch.

The experiment used the team deathmatch game mode where two teams try to eliminate each other. Players started with a 2 minute warmup round, where they could explore the game map and test their opponents without the score counting. After the warmup, players were respawned at random locations and frozen in place for 1 minute. This period could be used to establish a baseline of physiological activity. Once the freeze time was over, the main round started and had a duration of 10 minutes. Each player started with 100 health points and 100 armor points and was randomly placed on the game map. They were equipped with a random set of weapons from an assortment of assault rifles, long range rifles, pistols, light and heavy machine guns, and a knife. The goal of the game was to kill the players on the enemy team as many times as possible while avoiding being killed. A player was killed once their health points reached 0, and they were subsequently revived (respawned) at a random location on the map after 2 seconds. If a player managed to get 2 kills in a row without dying, they were rewarded with an item (healthshot) which restored 50 health points when used.

The game data was recorded using our custom SourceMod (https://www.sourcemod.net/) plugin, which enabled us to send the data over an LSL stream to be synchronized with the other experimental data. OBS was also used to record the screen and sound of both the game and the participant.
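To give a sense of how game data reaches LSL, the following minimal Python sketch shows how timestamped game-state samples can be published on an LSL stream with the pylsl bindings. The actual acquisition used a SourceMod plugin running inside the game server; the stream name, channel labels, and tick callback below are illustrative rather than those used for the dataset.

```python
# Minimal sketch: publishing game-state samples on an LSL stream with pylsl.
# Stream and channel names are illustrative, not the ones used in the dataset.
from pylsl import StreamInfo, StreamOutlet, local_clock

# One stream per player PC: 3 numeric channels sampled at the game tick rate (64 Hz).
info = StreamInfo(name="csgo_gamedata", type="GameData",
                  channel_count=3, nominal_srate=64,
                  channel_format="float32", source_id="player_pc_01")
channels = info.desc().append_child("channels")
for label in ("health", "armor", "damage_dealt"):
    channels.append_child("channel").append_child_value("label", label)

outlet = StreamOutlet(info)

def on_game_tick(health, armor, damage_dealt):
    """Called once per game tick; pushes a timestamped sample to LSL."""
    outlet.push_sample([health, armor, damage_dealt], local_clock())
```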

Experimental protocol

Groups of 2 or 4 participants were spontaneously recruited at the LAN events to play a single round of one versus one or two versus two team deathmatch. The participants first read and signed a consent form which described the experiment and the data to be recorded. Immediately after, they answered a questionnaire with demographic questions (age, gender, handedness) as well as questions about their experience playing various types of video games (Brain and Learning Lab Video Game questionnaire – https://www.unige.ch/fapse/brainlearning/vgq/), their fatigue level19, and their closeness of relationship with the other participants in the experiment group20.

Before attaching the ECG electrodes, the participants were given cotton soaked in alcohol in order to clean the electrode locations on their body. The 3-lead ECG sensor consisted of adhesive wet electrodes and was attached in the Einthoven triangle pattern. Specifically, the positive and negative leads were attached halfway between the shoulder and the sternum on the left and right collar bone respectively, and the reference lead was attached on the right side just below the rib cage (similar to Fig. 2a). The 2-lead EDA sensor consisted of dry electrodes secured by Velcro straps around the proximal phalanx of the index and middle finger. EDA sensors were attached to the hand which was used to manipulate the computer keyboard (similar to Fig. 2b). The specific fingers and electrode locations were chosen to be the least obstructing for the participants while still providing a signal of adequate quality. The “keyboard” hand does not move as much as the “mouse” hand, and we found there were fewer movement artifacts at this location. The respiration belt (a stretch sensor) was attached over the shirt of the participant, around their chest and over the diaphragm, as in Fig. 2c.

Fig. 2 Bitalino sensor placements highlighted in blue.

The participants were asked to sit approximately as they would be sitting during gameplay and then we used the Tobii eyetracker manager software (https://www.tobiipro.com/product-listing/eye-tracker-manager/) to adjust the screen distance and angle such that the eyetracker could reliably detect the eyes. The Tobii eyetracker was mounted on the bottom bezel of the screen. It was calibrated using a five point calibration procedure using the same software. The results were verified by asking the participant to look at various points on the screen. If there were significant errors in the gaze position, the calibration procedure was repeated.

Players adjusted their game settings to their own preference (e.g. mouse sensitivity, UI elements, keyboard bindings, screen resolution and aspect ratio). They then joined the experiment’s game server where they started with the 2 minute warmup round, and continued with the 1 minute baseline period and then the 10 minute main round.

After the game was finished, the sensors were removed and the players used the PAGAN tool21 to self-annotate their own recorded gameplay video (i.e. video of the screen). PAGAN is a web-based platform that can be used by researchers to easily crowdsource continuous annotations of videos. Players self-annotated their own gameplay according to the arousal or valence emotional dimensions using RankTrace22, a relative and unbounded method for continuous annotation. The gameplay video served as a recall aid, and we did not show the video of their own face so as not to bias the annotations towards facial expressions. Participants only annotated one of arousal or valence, not both. This was done to reduce the cognitive load of the annotation process with the goal of producing higher quality annotations. We also chose not to have the participants annotate the video twice (once for each dimension) due to time constraints of the experiment.

All participants were compensated with 10 Swiss Francs for their participation. The player who scored the most points during the match received a gaming mouse (Logitech G305) as a prize.

Data Records

The AMuCS dataset15 consists of 245 participants who agreed to share their data with other research institutions and is available upon request in the Yareta data archive platform. The data is available to any user who signs and accepts the terms of the data use agreement, available at the repository. Users outside Switzerland should select “Switch edu-ID” from the list of sign-in options in Yareta and register for an account to do this. More details can be found in the Usage Notes section.

The dataset contains several modalities:

  • Mouse/keyboard button presses – recorded at an irregular rate (as button presses occurred).

  • Game data (health, armor, position, damage taken, damage received, etc.) – recorded at 64Hz using a custom game plugin.

  • Gameplay video – recorded at 30Hz using Open Broadcasting Studio (OBS).

  • Color and depth video of the face – recorded at up to 30Hz using an Intel RealSense D435 camera.

  • Seat pressure – recorded at 10Hz with a Sensing Tex seat pressure mat.

  • Physiological data (electrocardiogram, electrodermal activity, respiration) – recorded at 100Hz using a Bitalino (r)evolution device.

  • Eyetracker data (gaze, pupil diameter) – recorded at 60Hz using a Tobii pro nano.

A detailed table of the recorded data types is listed in Table 2. The gameplay video also includes the gameplay audio and the microphone recordings on the same audio track. In some instances, the microphone was not recorded due to technical issues. To conform with the requirements of the consent forms, the gameplay videos have been modified for the public dataset to remove the participants’ microphone speech recordings. This was achieved using the fine-tuned Hybrid Transformer Demucs audio source separation model23,24 to remove any speech from the gameplay audio while preserving the in-game audio as much as possible. It is important to note that the modified gameplay audio is likely to contain audio artifacts resulting from this audio processing step.

Table 2 List of recorded data. Only a subset of the game data is listed (CS:GO plugin).


Data pre-processing

The LSL LabRecorder software records data in the extensible data format (XDF). The files contain all local timestamps as well as timing information that facilitates the synchronization of the data streams. We used the Python xdf module (https://github.com/xdf-modules/) to read the files and automatically apply the timestamp synchronization. We then used the pandas package25 in Python to format the data into easily queried data structures and to ultimately convert the xdf files to parquet (https://parquet.apache.org/) or comma separated value (CSV) files. Since the annotations of the gameplay were performed after data collection, they had to be aligned with the rest of the data. This was achieved by using the synchronized frame timing information of the gameplay that was recorded with LSL.
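As a minimal illustration of this conversion step, the sketch below loads an XDF recording with pyxdf (which applies the clock synchronization by default) and writes each stream to a parquet file with pandas. The file name and column labels are illustrative, pyarrow is assumed to be available for the parquet export, and string-valued streams such as key presses would need their own column handling.

```python
# Minimal sketch: reading an XDF recording and exporting each stream to parquet.
import pandas as pd
import pyxdf

streams, header = pyxdf.load_xdf("session_01.xdf")  # clock offsets applied automatically

for stream in streams:
    name = stream["info"]["name"][0]
    n_channels = int(stream["info"]["channel_count"][0])
    labels = [f"ch{i}" for i in range(n_channels)]     # generic labels for illustration
    df = pd.DataFrame(stream["time_series"], columns=labels)
    df.insert(0, "lsl_timestamp", stream["time_stamps"])
    df.to_parquet(f"{name}.parquet", index=False)
```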

For convenience, and due to ethical concerns, we derived some additional features that may be relevant and are either computationally complex or rely on data which will not be made public, such as the face videos. These features include the luminance of the screen, in-game combat, a danger level indicator, and facial features like action unit (AU) activations.

Screen Luminance

To compute the perceived lightness (luminance) of each screen pixel, the Lstar value from CIELAB26 was computed from the RGB values. First, the RGB values were converted from gamma encoding to linear encoding, then the standard sRGB coefficients (0.2126, 0.7152, 0.0722 for R, G, and B respectively) were applied to compute the RGB luminance. Finally, the RGB luminance was converted to the perceived lightness, Lstar, which closely matches human light perception. It is important to note that Lstar does not take the Helmholtz-Kohlrausch effect27 into account, wherein the intense saturation of a spectral hue is perceived as part of the color’s luminance.
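The conversion can be written compactly. The sketch below implements the sRGB-to-Lstar steps described above for an 8-bit RGB frame (a NumPy array of shape height x width x 3), under the assumption that the video frames are standard gamma-encoded sRGB.

```python
# Minimal sketch of the per-pixel perceived lightness (CIE L*) computation.
import numpy as np

def lstar_from_srgb(frame_rgb8):
    srgb = frame_rgb8.astype(np.float64) / 255.0
    # 1) remove the sRGB gamma encoding
    linear = np.where(srgb <= 0.04045, srgb / 12.92, ((srgb + 0.055) / 1.055) ** 2.4)
    # 2) relative luminance Y with the standard sRGB coefficients
    y = 0.2126 * linear[..., 0] + 0.7152 * linear[..., 1] + 0.0722 * linear[..., 2]
    # 3) CIE L* (perceived lightness), 0 = black, 100 = reference white
    return np.where(y <= 216 / 24389, y * (24389 / 27), 116 * np.cbrt(y) - 16)
```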

Having the Lstar value for each pixel, we then averaged the Lstar pixel values within an 8 degree horizontal foveal area of the screen centered at the gaze target of the participant. We used a rectangular area instead of a circular area since it simplified our computations. This foveal area was approximately a 16cm by 9cm rectangular region on the screen with the same aspect ratio as the screen. We did this for each frame of the video recording, always centering on the gaze target at each frame using the eye-tracking data.

In the dataset, we provide the mean luminance of the entire screen, the mean luminance of the gaze region and the central region of the screen as well as the mean luminance of the screen excluding the gaze region and excluding the central region.

Note that the Lstar luminance measures the pixel activations and does not correspond to the absolute luminance as measured by a luminance meter. However, this information can still be useful when analyzing the pupil size and to attenuate the pupil light response from the pupil data as demonstrated in Fanourakis et al.16.

Combat and other special game events

From the game event data we computed some special indicators/events such as the number of enemies that are: in the field of view, in close range (<500 game distance units), and in mid range (<1000 game distance units). We also computed a health danger indicator showing whether the health is below 70%, 50%, or 30%.

We labeled gameplay as combat if the player had received or dealt damage from/to another player within a 5 second window. Once there was a death event (either the player was killed or the enemy was killed), the combat state was reset even if it was within a 5 second window of the combat events previously mentioned. These game event combinations were selected among other recorded game events based on their relevance towards the game’s two main goals: staying alive and killing the enemy. Combat is directly relevant to these two goals since the most common outcome of combat will either be a goal success (enemy killed) or a goal failure (player death).
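A simplified sketch of this labelling rule is given below. It assumes a timestamped table of game events with illustrative event-type names rather than the exact identifiers used by the CS:GO plugin.

```python
# Minimal sketch of the combat labelling rule: a moment is in "combat" if damage
# was dealt or received in the preceding 5 seconds, and the state is reset at
# every death event. Event names are illustrative.
import pandas as pd

COMBAT_WINDOW = 5.0  # seconds

def label_combat(events: pd.DataFrame, timeline: pd.Series) -> pd.Series:
    """events: columns ['timestamp', 'type'] with types such as 'damage_dealt',
    'damage_received', 'player_death', 'enemy_death'.
    timeline: timestamps (seconds) at which to evaluate the combat state."""
    damage_t = events.loc[events["type"].isin(["damage_dealt", "damage_received"]),
                          "timestamp"].to_numpy()
    death_t = events.loc[events["type"].isin(["player_death", "enemy_death"]),
                         "timestamp"].to_numpy()
    labels = []
    for t in timeline:
        recent = damage_t[(damage_t <= t) & (damage_t > t - COMBAT_WINDOW)]
        # a death after the most recent damage event resets the combat state
        in_combat = recent.size > 0 and not ((death_t > recent.max()) & (death_t <= t)).any()
        labels.append(in_combat)
    return pd.Series(labels, index=timeline.index, name="combat")
```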

The “health danger” indicator gives an indication of the probability of achieving or failing the goals. All else being equal, a player with lower health will be killed more quickly during combat. The “number of enemies in field of view” event puts the player in the position to seek goal resolution by taking action to stay alive and/or kill the enemy. The “number of enemies in close/mid range” can also give an indicator of the amount of danger that a player might be in. In conjunction with these combined events some other simple events can be of interest such as “reloading weapon” and “jumping”. These events prevent the player from making a fight or flight decision: a player is not able to fire while reloading, and cannot take cover while jumping. This type of information can be useful when analyzing the game context and summarizing the various events into different phases of the game like in Weber et al.28.

Face features

We applied Baltrusaitis’ OpenFace29 feature extraction on the color video of the face to extract the following features: gaze, facial landmarks, head pose, and continuous facial action unit activations. Although gaze is tracked by the Tobii eyetracker, this additional gaze estimate can be useful in the rare cases when the eyetracker failed to track the participant’s eyes due to bad placement or movement outside of the eyetracker’s operating range.

Technical Validation

In this section we will show the benefits of the large number of modalities and participants in our dataset. We will see how the number of modalities and number of participants in the training set influences machine learning prediction performance. This also gives a baseline of the predictive capacity of our dataset. Note that we did not aim to achieve competitive results and used classical machine learning methods such as gradient boosting. We used the f1 score and Cohen’s Kappa to measure the performance of classifiers and the concordance correlation coefficient (CCC) for regressors.

Multimodal prediction of game events

FPS games are typically fast paced with several different game events happening in rapid succession and in bursts. These game events can elicit physiological and behavioural responses from the players. This relationship allows us to use our dataset to predict game events from physiological signals.

The main challenge is that the physiological responses are not at the same cadence as game events. The elicited fluctuations in physiology through the ANS are at much lower frequencies (generally below 0.5Hz)30,31 than game events from a fast paced game (bursts of events can be well above 2Hz in our dataset). To facilitate the process we can analyze the game events to derive general game states. Weber et al. did so by defining several game phases based on game micro-events for the game “Tactical Ops: Assault on Terror”28. They defined a total of 6 phases: use of in-game menu, safe, danger (enemy in field of view), combat (player uses weapon), under attack, and ghost mode (player has died and is viewing the arena in 3rd person). They then defined events in the context of these phases and found significant differences in heart rate responses to these game events.

We proceed with a similar strategy but define only two phases: safe and danger. We added some complexity to the detection of the danger phase of Weber et al. by taking into account not only the enemies in the field of view but also the distance to the enemy, the current health of the player, and whether the player is reloading their weapon or jumping. We also merged the combat and under-attack phases of Weber et al. with the danger phase. We discarded the ghost phase since this was not enabled in our game, and we also discarded the use of in-game menu phase since the menu was used relatively rarely. Details on how we computed combat and danger can be found in the previous section. We wanted to verify that there is a perceived difference of emotional arousal between the game phases, so we compared the participants’ arousal annotations during portions of the game when the players were safe versus when they were in danger according to the previously computed game phases. The results are shown in Fig. 3, where the mean arousal (z-score) is −0.36 and 0.12 during the safe phase and danger phase respectively. A one-sided Mann-Whitney U test shows that the increase of the arousal annotation values during the danger phase is statistically significant (p-value smaller than 0.001).
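For reference, this test can be run in a few lines with SciPy. The sketch below assumes z-scored arousal samples and a boolean danger-phase mask as inputs, with illustrative variable names.

```python
# Minimal sketch of the one-sided Mann-Whitney U test comparing arousal between
# the danger and safe phases.
import numpy as np
from scipy.stats import mannwhitneyu

def arousal_phase_test(arousal_z: np.ndarray, in_danger: np.ndarray) -> float:
    """Return the one-sided p-value for arousal(danger) > arousal(safe).
    arousal_z: z-scored arousal annotation samples; in_danger: boolean mask."""
    stat, p_value = mannwhitneyu(arousal_z[in_danger], arousal_z[~in_danger],
                                 alternative="greater")
    return p_value
```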

Fig. 3 Participant annotations during each of the game phases. Arousal annotations during the danger phase tend to be higher compared to the safe phase. Orange markings indicate the median value.

The physiological modalities that we used for predicting the two phases were: the EDA, heart rate (HR), respiration, facial action units, and on-screen gaze speed. The modalities were normalized per participant with a z-score. We used the signals of 121 participants who had usable data for these modalities.

A low pass filter with a cutoff of 5Hz was applied to the EDA signal to remove high frequency noise. The same filter was also used for the respiration signal. The instantaneous heart rate was computed from the ECG signal after a low pass filter at 45Hz (to remove high frequency noise) by applying the Hamilton method of R-peak detection from the BioSPPy Python package32, then computing the peak rates and smoothing them using a boxcar smoother of length 10. The on-screen gaze speed was computed from the gaze data by measuring the distance between consecutive gaze points on the screen. This feature gives an indication of saccade and fixation behaviours without information about where on the screen the player is looking. We extracted several features from each of these signals by using a 15 second rolling window with a step size of 10 seconds. The features computed within each window are summarized in Table 3.
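A rough sketch of the heart rate extraction described above is given below, using SciPy for the low-pass filter and BioSPPy's Hamilton R-peak detector. The fourth-order Butterworth filter and the plain moving average standing in for the boxcar smoother are our assumptions for illustration rather than specifications from the text.

```python
# Minimal sketch of instantaneous heart rate extraction from the ECG signal.
import numpy as np
from scipy.signal import butter, filtfilt
from biosppy.signals.ecg import hamilton_segmenter

FS = 100.0  # Bitalino sampling rate (Hz)

def instantaneous_hr(ecg: np.ndarray, fs: float = FS):
    # 45 Hz low-pass to remove high-frequency noise (filter order assumed)
    b, a = butter(4, 45.0 / (fs / 2.0), btype="low")
    ecg_filt = filtfilt(b, a, ecg)
    # Hamilton R-peak detection (BioSPPy); returns R-peak sample indices
    rpeaks = hamilton_segmenter(signal=ecg_filt, sampling_rate=fs)["rpeaks"]
    # instantaneous rate in beats per minute, then length-10 moving-average smoothing
    hr = 60.0 * fs / np.diff(rpeaks)
    hr_smooth = np.convolve(hr, np.ones(10) / 10, mode="same")
    t = rpeaks[1:] / fs  # timestamp (s) of each rate estimate
    return t, hr_smooth
```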

Table 3 Features derived for each windowed signal.


We similarly windowed the game phase signals and extracted only the maximum value in the windows. Due to the fast paced nature of the game the players find themselves more frequently in the danger phase than the safe phase, resulting in imbalanced classes. Across all the windowed game phase signals and participants we had a total of 1050 instances of the safe phase class and 6762 instances of the danger phase class.

We then used a gradient boost classifier from the scikit-learn Python package33 with leave-one-participant-out cross validation to predict the game phase from the physiological modalities. We used the default parameters of the model: log loss, learning rate of 0.1, 100 estimators, Friedman MSE criterion, maximum depth of 3. The results are summarized in Table 4 where we report the mean f1 score and Cohen’s kappa of the test sets. We performed one-sided paired Wilcoxon statistical tests to determine if the increase in performance (f1 score) of the models was statistically significant with p-value smaller than 0.01 (*) or 0.001 (**). The results of the statistical tests are summarized in Table 5.
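This evaluation loop is straightforward to reproduce with scikit-learn. The sketch below assumes a window-level feature matrix X, binary phase labels y (1 = danger), and a participant identifier per window; the default GradientBoostingClassifier parameters correspond to those listed above.

```python
# Minimal sketch of the leave-one-participant-out evaluation of game phase prediction.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import cohen_kappa_score, f1_score
from sklearn.model_selection import LeaveOneGroupOut

def lopo_game_phase(X: np.ndarray, y: np.ndarray, participant: np.ndarray):
    f1s, kappas = [], []
    for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups=participant):
        clf = GradientBoostingClassifier()  # log loss, lr=0.1, 100 estimators, depth 3
        clf.fit(X[train_idx], y[train_idx])
        y_pred = clf.predict(X[test_idx])
        f1s.append(f1_score(y[test_idx], y_pred))
        kappas.append(cohen_kappa_score(y[test_idx], y_pred))
    return np.mean(f1s), np.mean(kappas)
```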

Table 4 Game phase prediction results, leave-one-participant-out cross validation from a pool of 121 participants.


Table 5 One-sided paired Wilcoxon statistical test of game phase model performance (f1 score) with the different training modalities defined in Table 4.


On their own, EDA, gaze speed, and the facial action units have the best performance. In addition, their multimodal fusion led to a significant increase in performance for game phase prediction. With all the modalities together we reach an f1 score of 0.75 (Cohen’s Kappa of 0.52). The HR and respiration signals perform poorly on their own, but when they are combined with other signals the performance is not affected significantly. A more diverse set of features or a different machine learning model may be able to utilize these signals more effectively. Although there is extensive literature showing that heart rate is correlated with emotional arousal, our models were not able to distinguish between the safe phase and the danger phase even though their arousal annotations differed significantly. One potential reason could be that the set of features we extracted from the heart rate was not appropriate. Indeed, heart rate variability (HRV) features are more often used in the literature, and the lack of such features in our case could be the reason why our models performed poorly for this modality. Another potential reason could be that heart rate fluctuations can be induced by both the sympathetic and the parasympathetic system, making it uncertain whether a heart rate response originates from emotional stimuli or from other functions. In combination with the complexity of the game stimuli, it could very well be the case that heart rate on its own is not enough to determine the game phase. In the literature, experiments showing the effects of arousal on the heart rate typically use a single unambiguous emotional stimulus and record enough data (more than 40s) to capture the low frequency fluctuations (0.05Hz to 0.15Hz) of the heart rate which are related to the sympathetic nervous system. On the other hand, the EDA is directly linked to the sympathetic nervous system and emotional stimuli have a more direct influence34. It is also important to note that inter-participant variability in the physiological response to stimuli can have a substantial negative impact on the classification performance of leave-one-participant-out cross validation. However, it is a useful technique to explore input features which generalize well to an unseen population.

Despite these shortcomings, it is evident from our results that there are statistically significant improvements in the performance of models when including multiple diverse modalities. The multimodal aspect of this dataset can therefore be a valuable attribute which can be utilized to train state of the art models. Potential avenues to improve the classification performance include using more complex machine learning models (e.g. deep neural networks) and training more specialized models. The latter could be achieved, for example, by grouping participants according to their gaming experience and training models for each group.

Prediction of emotional arousal annotations

Machine learning models often generalize better when trained with larger datasets. AMuCS15 is the largest multimodal dataset with continuous affect annotations, and we use it to show that model performance improves significantly as we increase the amount of data that is available for training. We focus on emotional arousal as the target for our machine learning models, which also validates the continuous emotional annotations. Due to the high number of iterations necessary for the statistical tests, we kept the model simple and used only EDA and HR as input signals, which further limits the target to arousal.

We fit a gradient boost regressor in Python (using the implementation in the scikit-learn package33) using an increasing amount of training data N, ranging from 5 training participants to 64 training participants in increments of 10. Note that we have access to one more participant compared to what is published and summarized in Table 8 for the relevant modalities. This is because some participants did not give their consent to share their data with other research institutions and are not included in AMuCS15. We used the default parameters of the model: squared error loss, learning rate of 0.1, 100 estimators, Friedman MSE criterion, maximum depth of 3. We performed leave-one-out cross validation. For each left-out participant and for each total training data size N, we randomly selected N participants from the remaining 64 participants. We then fit the regressor using this random selection, and tested on the test participant. We repeated the random selection of N participants (with replacement) to fit and test new models until the mean of the CCC between all random trials was stable (i.e. the measured mean was within  ±0.005 of the real mean with 99% confidence) or a maximum of 1000 trials was reached.
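The stopping rule for the random trials can be expressed as a confidence interval check on the running mean. The sketch below is one way to implement it, where run_trial is an illustrative callable that samples N training participants, fits the regressor, and returns the test CCC; the minimum trial count is our own assumption.

```python
# Minimal sketch of the trial-stopping rule: keep sampling random training
# subsets until the 99% confidence interval on the mean CCC is narrower than
# +/- 0.005, or 1000 trials have been run.
import numpy as np
from scipy import stats

def stable_mean_ccc(run_trial, tol=0.005, confidence=0.99, max_trials=1000, min_trials=10):
    cccs = []
    for _ in range(max_trials):
        cccs.append(run_trial())
        if len(cccs) >= min_trials:
            sem = stats.sem(cccs)  # standard error of the mean
            half_width = stats.t.ppf(0.5 + confidence / 2, df=len(cccs) - 1) * sem
            if half_width <= tol:
                break
    return float(np.mean(cccs)), len(cccs)
```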

We used the EDA and HR (computed from ECG as described earlier in this section) as input and the arousal annotation as target. The input and target modalities were normalized per participant (z-score). We used a window of size 7 seconds and step size 5 seconds for extracting the features.

In Table 6 and Fig. 4 we report the mean CCC of the random trials across all left-out participants for each training data size N. We observe that as we increase N, the mean CCC increases. We performed one-sided paired Wilcoxon statistical tests to confirm that the improvement of the mean CCC as we increase N is statistically significant with a p-value smaller than 0.001 (**) except between values of N = 35 and N = 45 where the p-value is smaller than 0.01 (*).

Table 6 Arousal prediction results, leave-one-participant-out cross validation.


Fig. 4 Boxplots of participants’ mean CCC for each N-number of participants in the training set. Orange markings indicate the median value, green markings indicate the mean value.

One-sided paired Wilcoxon statistical tests show that the performance improvement as we increase the number of participants in the training set is significant. The results are summarized in Table 7.

Table 7 One-sided paired Wilcoxon statistical test of arousal model performance with different training set size N.


In Fig. 4 we also observe an increase in the variance between the participants’ results. The reason for this is that, although the CCC increases as we increase N, it does not increase equally across the participants. To verify this, we plot the change in CCC of each participant in Fig. 5. We can clearly see that the participants have very different changes in performance. Most improve (green curves), some have no statistically significant change (blue curves), and for a few the performance decreases (red curves). This explains the increase in variance that we saw in Fig. 4 even though the results improve on average as we increase N. This trend of increased overall performance and simultaneous increase in the variance can also be observed in the boxplots of the performance difference from N = 5 in Fig. 6.

Fig. 5 Change in CCC for each participant vs N compared to N = 5. Green curves show statistically significant increase at N = 55 (33 participants), red curves show statistically significant decrease at N = 55 (8 participants), blue curves do not have statistically significant changes between N = 5 and N = 55 (24 participants).

Fig. 6 Boxplots of the change in CCC of participants at different N compared to N = 5.

In Fig. 5, we observed that for some participants (8 out of 65), increasing N from 5 to 55 results in a statistically significant decrease in performance. Upon further investigation we found that for 6 of those participants, the performance was generally poor (CCC below 0.1). This could be caused by poor quality annotations, for example if the participant did not understand the task. The other 2 participants had an above average CCC, and we have not determined the precise cause of this performance decrease.

Effect of time lag between the target and input window

In the previous section, we modified the number of participants in the training set while keeping the input features and targets time-aligned (i.e. no offset). A similar methodology was applied to compare how the performance of the gradient boost regressor changes when applying different offsets between the input and target feature windows. In this analysis the number of participants in the training set is constant (N = 64). As previously, we used a 7 second window to compute the EDA and HR features as well as the mean of the arousal annotations. We applied a series of offsets in the range of −5 to +7 seconds. That is, the start of the input feature window ranged from 5 seconds before the start of the target window to 7 seconds after the start of the target window.
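The sketch below illustrates this offset scheme under the assumption that the physiological signals and the arousal annotation are indexed by a common time axis in seconds; the window mean stands in for the full feature set of Table 3, and the column handling is illustrative.

```python
# Minimal sketch of pairing a shifted input-feature window with a fixed target window.
import numpy as np
import pandas as pd

WIN, STEP = 7.0, 5.0  # window length and step, in seconds

def windowed_pairs(signals: pd.DataFrame, arousal: pd.Series, offset: float):
    """signals/arousal share a sorted numeric index in seconds; offset shifts the
    start of the input-feature window relative to the target window."""
    starts = np.arange(0.0, arousal.index.max() - WIN, STEP)
    X, y = [], []
    for t0 in starts:
        feat_win = signals.loc[t0 + offset: t0 + offset + WIN]
        target_win = arousal.loc[t0: t0 + WIN]
        if len(feat_win) and len(target_win):
            X.append(feat_win.mean().to_numpy())  # stand-in for the Table 3 features
            y.append(target_win.mean())
    return np.asarray(X), np.asarray(y)
```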

The results are illustrated in Fig. 7 where we observe that the performance with the different offsets was not significantly different (one-sided paired Wilcoxon tests) from the time-aligned performance (offset = 0 seconds). A potential explanation is that the length of the feature window (7 seconds) for computing input features and the arousal mean is sufficient to include, at least in part, any delayed physiological responses to arousal changes induced by game events. Therefore applying a global offset between the windows does not have any significant effect on the performance of the regressor.

Fig. 7 Boxplots of participants’ mean CCC for each offset between the input and target window. Orange markings indicate the median value, green markings indicate the mean value.

On the other hand, it may be necessary to account for the inter-participant variability of the physiological response delay by computing optimal window delays for each participant before training the regressor. This could be achieved using the cross correlation measure to determine a time-invariant delay for each participant. It may also be the case that the response delay is time-varying. Regularizing the data to account for this time-varying delay (for example, with dynamic time-warping) may lead to an improved performance of the regressor.
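As one possible implementation of the time-invariant delay estimation suggested here, the sketch below uses SciPy's cross correlation to pick the lag that maximizes the correlation between a z-scored physiological feature and the arousal annotation, assuming both are resampled to a common rate; the 10 second lag bound is an assumption for illustration.

```python
# Minimal sketch of per-participant delay estimation via cross correlation.
import numpy as np
from scipy.signal import correlate, correlation_lags

def best_lag(feature: np.ndarray, arousal: np.ndarray, fs: float, max_lag_s: float = 10.0):
    xcorr = correlate(feature, arousal, mode="full")
    lags = correlation_lags(len(feature), len(arousal), mode="full")
    keep = np.abs(lags) <= max_lag_s * fs          # restrict to plausible delays
    lag_samples = lags[keep][np.argmax(xcorr[keep])]
    return lag_samples / fs                        # delay in seconds
```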

Data Quality

All data modalities were visually inspected to determine their quality. We grouped data into four categories: not usable, partial, usable, and good. The first level of inspection was simply to determine how much of the data was missing or outside of an acceptable signal to noise ratio (SNR) during the main round of the game (10 minutes of data). The SNR was mainly a concern with the EDA, ECG, and respiration signals. For the EDA signal, we accounted for the presence and visibility of phasic responses versus other signal artifacts. For the ECG signal, we accounted for the visibility of the QRS complex or R-peaks versus other signal artifacts. The signals were tentatively labeled as:

  • good – if less than 10% of the signal was missing or outside of SNR range

  • usable – if less than 33% of the signal was missing or outside of SNR range

  • partial – if less than 50% of the signal was missing or outside of SNR range

  • not usable – if more than 50% of the signal was missing or outside of SNR range

Next we looked more closely at the quality of some of the signals. For EDA, we accounted for the resolution of the signal. Participants’ base skin conductivity varied significantly and several had intrinsically low skin conductivity. This meant that their phasic responses had very low amplitude and could be more challenging to analyze but still usable, hence we labeled such EDA signals as usable. In several cases the resolution was too low to discern any phasic responses and these were labeled as not usable. At the other extreme, there were several participants whose skin perspiration was too high leading to saturation of the EDA signals. These were also labeled as not usable. If minor artifacts were visible, such as those that may occur during keystrokes, the signal was labeled as usable.

For ECG, we accounted for the visibility of the QRS complex. If all parts of the QRS complex were visible, then we labeled the data as good, if only the R-peaks were visible we labeled the data as usable, otherwise we labeled it as not usable or partial. For respiration, we mainly focused on artifacts from the heart rate. If these artifacts had amplitude larger than 10% of the amplitude changes caused by breathing then the signal was labeled as usable, otherwise as good.

For the face video, if the participant’s face was fully visible throughout the session it was labeled as good, if the face was partially covered (ex. wearing a mask) or otherwise not fully visible it was labeled as partial.

For the arousal and valence annotations we merely indicated if the corresponding annotation fulfilled the base criteria mentioned earlier (the proportion of the data missing). To avoid introducing any bias, we did not impose any of our preconceptions on what a good annotation should look like for the corresponding gameplay. However, a visual analysis of the annotations led to the conclusion that participants labeled their gameplay with dissimilar patterns.

In Table 8 we summarize the number of usable data for some combinations of modalities.

Table 8 Number of sessions (out of 71) and participants (out of 245) with usable data in the AMuCS dataset.


Despite several participants having modalities of low quality or entirely missing, there remains enough data to keep this dataset among the largest currently available. It is also feasible to employ state of the art methods to overcome the issues of missing modalities35,36.

Usage Notes

The dataset can be accessed by following the DOI link https://doi.org/10.26037/yareta:wqmr4bkmrvhc3jkqcscluehjyi.

Interested parties can request access to the data archive on the Yareta platform where they will be asked to complete a data use agreement (DUA), sign it, and submit it along with the request. The request can then be processed for approval. At the time of writing, the Yareta platform requires a Switch edu-ID account (https://www.switch.ch/en/edu-id) for authentication, however, other authentication methods may be available at a future time. Although the Switch edu-ID is primarily used by Swiss universities, an account can be created by anyone irrespective of their affiliation to Swiss academic institutions (https://eduid.ch/registration). The full public dataset, which includes video screen captures and depth videos, amounts to a total of 955GB of data and can only be downloaded in full.

Python version 3.10 was used to process and analyze the data. We made use of the following Python packages: NumPy37, Pandas25, scikit-learn33, SciPy38, BioSPPy32, NeuroKit239, and PyTorch40.

In the documentation directory of the dataset archive we provide a file containing detailed descriptions of each gamedata stream. In the same directory, we also provide a file indicating the quality of each modality for every participant; we recommend that it is taken into consideration when analyzing the data.

In the Python script directory of the dataset archive we provide a script which can be used to merge all the data modalities into a single pandas dataframe synchronized to the timestamps of a modality of choice. It may need to be adapted to meet individual needs but can be used as a starting point. We also provide a Python script example for reading video frames from the gameplay videos or the depth videos.
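For users writing their own merge, the core operation can look like the sketch below, which resamples each modality onto the timestamps of a chosen reference modality with pandas.merge_asof; file and column names are illustrative, and the provided script in the archive should be preferred as the starting point.

```python
# Minimal sketch: aligning two modalities to a reference modality's timestamps.
# Assumes distinct column names per file apart from the shared timestamp column.
import pandas as pd

reference = pd.read_parquet("gamedata.parquet").sort_values("lsl_timestamp")
eda = pd.read_parquet("eda.parquet").sort_values("lsl_timestamp")
gaze = pd.read_parquet("gaze.parquet").sort_values("lsl_timestamp")

merged = reference
for other in (eda, gaze):
    # take the most recent sample of the other modality at each reference timestamp
    merged = pd.merge_asof(merged, other, on="lsl_timestamp", direction="backward")
```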

The units of the recorded EDA data cannot be converted to micro Siemens (μS). Due to the EDA sensor’s limited sensing range we made a hardware modification to the sensor which enabled us to manually adjust the amplification gain as needed and measure conductivity values that were beyond the design specifications of the sensor, albeit at a reduced precision. The gain parameters are thus different for each participant and do not match the manufacturer specifications. The analysis of EDA data should focus on its dynamics rather than its absolute values, and normalization of the data for each individual participant is recommended.
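Such a per-participant z-score can be computed directly on a long-format table, for example as in the sketch below (illustrative column names).

```python
# Minimal sketch of the recommended per-participant normalization of the EDA signal.
import pandas as pd

def zscore_per_participant(df: pd.DataFrame, col: str = "eda") -> pd.Series:
    grouped = df.groupby("participant")[col]
    return (df[col] - grouped.transform("mean")) / grouped.transform("std")
```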

Caution should be used when analyzing the pupil diameter since the environmental and screen luminance were not controlled. Furthermore, certain game events which may induce psychosensory pupil responses also produce distinctive luminance changes on the screen. As a result, the pupil light response is the main driver of pupil diameter changes in our experiment. Attempts to attenuate the pupil light response from the data were somewhat successful but luminance artifacts remained16.

Code availability

The code related to the experimental setup can be found at https://gitlab.unige.ch/sims/esports-data-platform. The individual data acquisition modules can be found at https://gitlab.unige.ch/sims/lsl-modules. Various scripts and packages for analyzing the data can be found at https://gitlab.unige.ch/Guillaume.Chanel/e-sport-ml and access is available upon request. Some basic scripts for reading the data are already included in the dataset archive.

References

  1. Drachen, A., Canossa, A. & Yannakakis, G. N. Player modeling using self-organization in Tomb Raider: Underworld. In 2009 IEEE Symposium on Computational Intelligence and Games, 1–8, https://doi.org/10.1109/CIG.2009.5286500 (IEEE, 2009).

  2. Delalleau, O. et al. Beyond Skill Rating: Advanced Matchmaking in Ghost Recon Online. IEEE Transactions on Computational Intelligence and AI in Games 4, 167–177, https://doi.org/10.1109/TCIAIG.2012.2188833 (2012).


  3. Kivikangas, J. M. et al. A review of the use of psychophysiological methods in game research. Journal of Gaming & Virtual Worlds 3, 181–199, https://doi.org/10.1386/jgvw.3.3.181_1 (2011).


  4. Christy, T. & Kuncheva, L. I. Technological Advancements in Affective Gaming: A Historical Survey. GSTF Journal on Computing (JoC) 3, 38, https://doi.org/10.7603/s40601-013-0038-5 (2014).


  5. Yannakakis, G. N., Martinez, H. P. & Garbarino, M. Psychophysiology in Games. In Emotion in Games, 119–137, https://doi.org/10.1007/978-3-319-41316-7_7 (Springer, 2016).

  6. Clerico, A. et al. Biometrics and classifier fusion to predict the fun-factor in video gaming. In 2016 IEEE Conference on Computational Intelligence and Games (CIG), 1–8, https://doi.org/10.1109/CIG.2016.7860418 (IEEE, 2016).

  7. Aranha, R. V., Correa, C. G. & Nunes, F. L. S. Adapting software with Affective Computing: a systematic review. IEEE Transactions on Affective Computing 3045, 1–1, https://doi.org/10.1109/TAFFC.2019.2902379 (2019).


  8. Chanel, G. & Lopes, P. User Evaluation of Affective Dynamic Difficulty Adjustment Based on Physiological Deep Learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), https://doi.org/10.1007/978-3-030-50353-6_1 (2020).

  9. Gilroy, S. W., Cavazza, M. O. & Vervondel, V. Evaluating multimodal affective fusion using physiological signals. In Proceedings of the 16th International Conference on Intelligent User Interfaces, IUI ’11, 53–62, https://doi.org/10.1145/1943403.1943413 (Association for Computing Machinery, New York, NY, USA, 2011).

  10. Baig, M. Z. & Kavakli, M. A survey on psycho-physiological analysis & measurement methods in multimodal systems. Multimodal Technologies and Interaction 3, https://doi.org/10.3390/mti3020037 (2019).

  11. Bradley, M. M. & Lang, P. J. Measuring emotion: The self-assessment manikin and the semantic differential. Journal of Behavior Therapy and Experimental Psychiatry 25, 49–59, https://doi.org/10.1016/0005-7916(94)90063-9 (1994).


  12. Russell, J. A. A circumplex model of affect. Journal of Personality and Social Psychology 39, 1161–1178, https://doi.org/10.1037/h0077714 (1980).


  13. Cowie, R., Douglas-Cowie, E., Savvidou, S., Sawey, E. M. M. & Schroeder, M. FEELTRACE: An instrument for recording perceived emotion in real time. Proceedings of the ISCA Tutorial and Research Workshop on Speech and Emotion (2000).

  14. Gunes, H. & Schuller, B. Categorical and dimensional affect analysis in continuous input: Current trends and future directions. Image and Vision Computing 31, 120–136, https://doi.org/10.1016/j.imavis.2012.06.016 (2013).


  15. Fanourakis, M. A. & Chanel, G. Affective multimodal counter-strike video game dataset (amucs) – public https://doi.org/10.26037/yareta:wqmr4bkmrvhc3jkqcscluehjyi (2025).

  16. Fanourakis, M. & Chanel, G. Attenuation of the dynamic pupil light response during screen viewing for arousal assessment. Frontiers in Virtual Reality 3, https://doi.org/10.3389/frvir.2022.971613 (2022).

  17. The lab streaming layer development team. sccn/labstreaminglayer: Lab streaming layer https://github.com/sccn/labstreaminglayer

  18. Fanourakis, M., Lopes, P. & Chanel, G. Remote Multi-Player Synchronization using the Labstreaming Layer System. In Foundations of Digital Games Demos https://archive-ouverte.unige.ch/unige:148594 (Malta, 2020).

  19. Greenberg, S., Aislinn, P. & Kirsten, D. Development and Validation of the Fatigue State Questionnaire: Preliminary Findings. The Open Psychology Journal 9, 50–65, https://doi.org/10.2174/1874350101609010050 (2016).


  20. Gächter, S., Starmer, C. & Tufano, F. Measuring the Closeness of Relationships: A Comprehensive Evaluation of the ’Inclusion of the Other in the Self’ Scale. PLOS ONE 10, e0129478, https://doi.org/10.1371/journal.pone.0129478 (2015).


  21. Melhart, D., Liapis, A. & Yannakakis, G. N. PAGAN: Video Affect Annotation Made Easy. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII), 130–136, https://doi.org/10.1109/ACII.2019.8925434/ (IEEE, 2019).

  22. Lopes, P., Yannakakis, G. N. & Liapis, A. RankTrace: Relative and unbounded affect annotation. In 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII), vol. 2018-Janua, 158–163, https://doi.org/10.1109/ACII.2017.8273594 (IEEE, 2017).

  23. Défossez, A. Hybrid spectrogram and waveform source separation. In Proceedings of the ISMIR 2021 Workshop on Music Source Separation (2021).

  24. Rouard, S., Massa, F. & Défossez, A. Hybrid transformers for music source separation. In ICASSP 23 (2023).

  25. The pandas development team. pandas-dev/pandas: Pandas. https://doi.org/10.5281/zenodo.3509134

  26. Robertson, A. R. The CIE 1976 Color-Difference Formulae. Color Research & Application 2, 7–11, https://doi.org/10.1002/j.1520-6378.1977.tb00104.x (1977).


  27. Donofrio, R. L. Review Paper: The Helmholtz-Kohlrausch effect. Journal of the Society for Information Display 19, 658, https://doi.org/10.1889/1.1828693 (2011).


  28. Weber, R., Behr, K.-M., Tamborini, R., Ritterfeld, U. & Mathiak, K. What Do We Really Know About First-Person-Shooter Games? An Event-Related, High-Resolution Content Analysis. Journal of Computer-Mediated Communication 14, 1016–1037, https://doi.org/10.1111/j.1083-6101.2009.01479.x (2009).


  29. Baltrusaitis, T., Zadeh, A., Lim, Y. C. & Morency, L.-P. OpenFace 2.0: Facial Behavior Analysis Toolkit. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), 59–66, https://doi.org/10.1109/FG.2018.00019 (IEEE, 2018).

  30. Berntson, G. G. et al. Heart rate variability: Origins, methods, and interpretive caveats. Psychophysiology 34, 623–648, https://doi.org/10.1111/j.1469-8986.1997.tb02140.x (1997).


  31. Freeman, R. & Chapleau, M. W. Testing the autonomic nervous system. In Said, G. & Krarup, C. (eds.) Peripheral Nerve Disorders, vol. 115 of Handbook of Clinical Neurology, 115–136, https://doi.org/10.1016/B978-0-444-52902-2.00007-2 (Elsevier, 2013).

  32. Carreiras, C. et al. BioSPPy: Biosignal processing in Python. https://github.com/PIA-Group/BioSPPy/ (2015–).

  33. Pedregosa, F. et al. Scikit-learn: Machine learning in python. Journal of Machine Learning Research 12, 2825–2830 (2011).


  34. Dawson, M. E., Schell, A. M. & Filion, D. L. The Electrodermal System. In Cacioppo, J. T., Tassinary, L. G. & Berntson, G. (eds.) Handbook of Psychophysiology, 217–243, https://doi.org/10.1017/CBO9780511546396.007 (Cambridge University Press, Cambridge, 2017).

  35. Jaques, N., Taylor, S., Sano, A. & Picard, R. Multimodal autoencoder: A deep learning approach to filling in missing sensor data and enabling better mood prediction. In 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII), 202–208, https://doi.org/10.1109/ACII.2017.8273601 (2017).

  36. Zuo, H., Liu, R., Zhao, J., Gao, G. & Li, H. Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities. In ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, https://doi.org/10.1109/ICASSP49357.2023.10095836 (2023).

  37. Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362, https://doi.org/10.1038/s41586-020-2649-2 (2020).


  38. Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in python. Nature Methods 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2 (2020).


  39. Makowski, D. et al. Neurokit2: A python toolbox for neurophysiological signal processing. Behavior Research Methods 53, 1689–1696, https://doi.org/10.3758/s13428-020-01516-y (2021).


  40. Ansel, J. et al. Pytorch 2: Faster machine learning through dynamic python bytecode transformation and graph compilation. In 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2 (ASPLOS ’24), https://doi.org/10.1145/3620665.3640366 (ACM, 2024).

  41. Yannakakis, G. N., Martínez, H. P. & Jhala, A. Towards affective camera control in games. User Modeling and User-Adapted Interaction 20, 313–340, https://doi.org/10.1007/s11257-010-9078-0 (2010).


  42. Karpouzis, K., Yannakakis, G. N., Shaker, N. & Asteriadis, S. The platformer experience dataset. In 2015 International Conference on Affective Computing and Intelligent Interaction (ACII), 712–718, https://doi.org/10.1109/ACII.2015.7344647 (IEEE, 2015).

  43. Alchalabi, A. E., Shirmohammadi, S., Eddin, A. N. & Elsharnouby, M. FOCUS: Detecting ADHD Patients by an EEG-Based Serious Game. IEEE Transactions on Instrumentation and Measurement 67, 1512–1520, https://doi.org/10.1109/TIM.2018.2838158 (2018).


  44. Song, M. et al. Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII), 517–523, https://doi.org/10.1109/ACII.2019.8925464 (IEEE, 2019).

  45. Blom, P. M., Bakkes, S. & Spronck, P. Towards Multi-modal Stress Response Modelling in Competitive League of Legends. In 2019 IEEE Conference on Games (CoG), vol. 2019-Augus, 1–4, https://doi.org/10.1109/CIG.2019.8848004 (IEEE, 2019).

  46. Kutt, K. et al. BIRAFFE : bio-reactions and faces for emotion-based personalization. In 3rd Workshop on Affective Computing and Context Awareness in Ambient Intelligence (AfCAI 2019), http://ceur-ws.org/Vol-2609/AfCAI2019_paper_6.pdf (Cartagena, Spain, 2019).

  47. Beaudoin-Gagnon, N. et al. The FUNii Database: A Physiological, Behavioral, Demographic and Subjective Video Game Database for Affective Gaming and Player Experience Research. In 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII), 1–7, https://doi.org/10.1109/ACII.2019.8925502 (IEEE, 2019).

  48. Granato, M., Gadia, D., Maggiorini, D. & Ripamonti, L. A. An empirical study of players’ emotions in VR racing games based on a dataset of physiological data. Multimedia Tools and Applications 79, 33657–33686, https://doi.org/10.1007/s11042-019-08585-y (2020).


  49. Smerdov, A., Zhou, B., Lukowicz, P. & Somov, A. Collection and validation of psychophysiological data from professional and amateur players: a multimodal esports dataset. arXiv XX, 1–12, http://arxiv.org/abs/2011.00958 (2020).

  50. Svoren, H. et al. Toadstool: A Dataset for Training Emotional Intelligent Machines Playing Super Mario Bros. In Proceedings of the 11th ACM Multimedia Systems Conference, 309–314, https://doi.org/10.1145/3339825.3394939 (ACM, New York, NY, USA, 2020).

  51. Alakus, T. B., Gonen, M. & Turkoglu, I. Database for an emotion recognition system based on EEG signals and various computer games – GAMEEMO. Biomedical Signal Processing and Control 60, 101951, https://doi.org/10.1016/j.bspc.2020.101951 (2020).


  52. Melhart, D., Liapis, A. & Yannakakis, G. N. The Affect Game AnnotatIoN (AGAIN) Dataset. arXiv http://arxiv.org/abs/2104.02643 (2021).

  53. Kutt, K. et al. BIRAFFE2, a multimodal dataset for emotion-based personalization in rich affective game environments. Scientific Data 9, 274, https://doi.org/10.1038/s41597-022-01402-6 (2022).


  54. Dresvyanskiy, D. et al. Dycoda: A multi-modal data collection of multi-user remote survival game recordings. In Prasanna, S. R. M., Karpov, A., Samudravijaya, K. & Agrawal, S. S. (eds.) Speech and Computer, 163–177, https://doi.org/10.1007/978-3-031-20980-2_15 (Springer International Publishing, Cham, 2022).


Acknowledgements

This work was funded by Innosuisse with grant number 34316.1 IP.ICT. The authors would like to thank Logitech S.A. for their vital collaboration in this study and for providing computer peripherals and the prizes for the winners of each round. We also thank the organizers of SwitzerLAN and PolyLAN for accommodating our study in their events and our student assistants for the long days and nights helping us collect the data.

Author information

Authors and Affiliations

  1. Social Intelligence and MultiSensing (SIMS) lab, University of Geneva, Geneva, Switzerland

    Marios Fanourakis & Guillaume Chanel

Authors

  1. Marios Fanourakis
  2. Guillaume Chanel

Contributions

G.C. conceived the study. All authors conducted the experiments, analyzed the results, and reviewed the manuscript.

Corresponding author

Correspondence to Marios Fanourakis.

Ethics declarations

Competing interests

This work was done in collaboration with Logitech S.A. but funded by Innosuisse. The authors declare no competing interests (for instance financial, professional, or personal) with Logitech S.A. or any other party.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article


Cite this article

Fanourakis, M., Chanel, G. AMuCS: Affective multimodal Counter-Strike video game dataset. Sci Data 12, 1325 (2025). https://doi.org/10.1038/s41597-025-05596-3

