Emotional Reasoning in the Human-Robot Interaction using Cognitive Robotics (PART-2)

 

                                                        Image by montrealethics.ai
      

Methodology

Study design and participants

We employed a within-subjects experimental design to evaluate our hypothesis. Undergraduate volunteers enrolled in psychology classes at a nearby university took part in the study. The sample included 29 people (17 of whom were female; M=24.6 years, SD=2.42 years). The sample size was determined based on previous research that addressed emotional reactions comprehensively and had sample sizes ranging from 14 to 41. None of the subjects had a history of neurological or neuropsychiatric disorders, and none were using psychoactive medication at the time of the study. All participants signed a written informed consent form. The study was carried out following the most recent version of the Declaration of Helsinki (October 2013) and was authorized by the local university's Ethical Review Board.

Measurements

Baseline measures

Anxiety, motivation, and mood were measured at baseline to check for potential implications of dispositional and endogenous factors on participants' cognitive performances and emotional responses (Gendolla, 2000). The STAI was used to assess trait (20 items) and state (20 items) anxiety using the Italian version. The possible scores ranged from 20 to 80, with the greater the score, the higher the amount of anxiety. Internal consistency was satisfactory for both state (Cronbach's alpha=0.86) and trait (0.90) anxiety in this study. A 10-cm visual analog scale (VAS) was used to assess motivation and mood, ranging from 0 (not at all motivated to engage in this study/very bad mood) to 10 (extremely motivated/extremely good mood).

Valence

After each encounter, the Italian version of the Positive and Negative Affect Schedule was used to assess changes in positive and negative affect. It was created to offer quick measurements of positive and negative affect and comprises of two independent 10-item mood scales. On a 5-point scale (from 1='very slightly or not at all' to 5='very much'), respondents are asked to rate the amount to which they have experienced each emotion over a specific period. The PANAS can be used with a variety of timeframes, however, in this study, the timeframe utilized to measure state effect was 'at this moment.' Both the state positive (Cronbach's =0.83) and negative (Cronbach's =0.85) impact scales demonstrated strong reliability in the Italian version of the PANAS. With values of 0.76 and 0.89, Cronbach's alpha was satisfactory in this investigation.

Non-verbal behavior

All of the test sessions were videotaped for later study of the nonverbal behavioral patterns of each participant. An a priori coding system was established based on the Ethnological Interview Coding System (ECSI) to examine various elements of the subject's emotional and social functioning during two interviews. This study's coding scheme includes seven behavioral patterns that were divided into four major categories to represent people's emotional responses during clinical interviews. The coding method and definition of each behavior type were examined in this study. Two encoders were used to encode the video data. The duration or frequency of each behavioral pattern was determined based on its parameters. The percentage of time (length) spent staring at the doctor/robot over the entire testing session, for example, was measured as the "look" behavioral pattern. That is, the number of times (frequency) that the participant touched their face with their hands was counted to determine the hand-facial behavior pattern. We reduced the video rate to half that of real-time to improve the accuracy of the analysis. Before beginning analysis, encoders were trained to dependably encode behavioral patterns in the same way. Each encoder encoded three videos during training to verify appropriate stability before moving on to full data analysis. The intra-class correlation coefficient (ICC), which reflects the accuracy of the rating procedure, was calculated as a measure of interracial reliability.

Psychophysiological assessment


Bodyguard 2 (Firstbeat), which has been extensively utilized for HR measurement in both laboratory and ambulatory research, was used to continuously monitor heart rate using a conventional electrode setup. A visual inspection of successive R waves (identified by an automatic beat-detection method) was performed, and any irregularities were corrected.

HRV Analysis Software was used to derive time (root mean square of successive differences, RMSSD) and frequency (low-frequency HRV; LF-HRV, high-frequency HRV; HF-HRV, and LF/HF-HRV) domain HRV measures. The RMSSD reflects the integrity of vagus nerve-mediated autonomic control of the heart and is less susceptible to respiratory and movement artifacts, according to Task Force guidelines (Task Force of the European Society of Cardiology, 1996); the HF-HRV reflects parasympathetic activity; the LF-HRV is proportional to sympathetic activity but influenced by parasympathetic tone. The LF/HF-HRV ratio is typically used to calculate the LF/HF-HRV ratio, which is used to interpret LF-HRV principally as a measure of sympathetic tone.

 Cognitive workload


To guarantee that tasks in both contexts weighed similarly on participants' cognitive effort, the NASA Task Load Index was utilized to quantify the subjective workload experienced when engaging with the robot or the clinician. The NASA TLX assesses the overall workload incurred when using a particular program and determines the primary sources of workload, which are estimated across six dimensions: mental demand, physical demand, temporal demand, performance, effort, and frustration. Each dimension's perceived workload is graded on a 20-step bipolar scale. The weight of each workload task dimension is then assessed by comparing each dimension to the others. In each of the six scales, there are 15 possible pairwise comparisons. The number of times each component is selected is counted, and it might range from 0 (not relevant) to 5 (very significant) (more important than any other factor). The overall workload score is computed by multiplying each raw rating by the weight given to that element by the participant. The sum of the weighted ratings is then divided by 15 (the total weights) to provide an absolute workload score that runs from 0 to 100, with 100 representing the participant's highest effort.


Comments

Read Also

Deep Neural Networks for ADMET properties' prediction

What is Terraforming?

Marine eDNA Analysis using DL techniques

Emotional Reasoning in the Human-Robot Interaction using Cognitive Robotics (PART-1)