Sensation seeking and risk adjustment: the role of reward sensitivity in dynamic risky decisions

Qianlan, Yin; Shou, Chen; Tianya, Hou; Wei, Dong; Liu, Taosheng

doi:10.3389/fnbeh.2025.1492312

ORIGINAL RESEARCH article

Front. Behav. Neurosci., 07 February 2025

Sec. Motivation and Reward

Volume 19 - 2025 | https://doi.org/10.3389/fnbeh.2025.1492312

This article is part of the Research TopicExecutive Functions, Reward Systems and Addiction in Adolescents and Young AdultsView all 11 articles

Sensation seeking and risk adjustment: the role of reward sensitivity in dynamic risky decisions

Yin Qianlan^†

Chen Shou^†

Hou Tianya

Dong Wei^*

Taosheng Liu^*

Faculty of Psychology and Mental Health, Naval Medical University, Shanghai, China

Objective: The primary objective of our research is to delve into the relationships between sensation seeking (SS), reward sensitivity (RS), and risk adjustment (RA) within the context of dynamic risk-taking behaviors. By integrating the reinforcement learning model and neural measures obtained from dynamic risk-taking tasks, we aim to explore how these personality traits influence individual decision-making processes and engagement in risk-related activities. We aim to dissect the neural and cognitive mechanisms underlying this interplay, thereby shedding light on the stable brain-based characteristics contributing to the observed variability in risk-taking and decision-making behaviors. Understanding these links could significantly enhance our ability to predict individual differences in risk preferences and develop targeted interventions for managing risky behaviors across different contexts.

Method: We developed a task to measure RA through a structured yet uncertain environment modeled after the Balloon Analog Risk Task. We enlisted 80 young adults to perform this task, and of these, 40 were subjected to electroencephalography (EEG) to assess neural correlates of RS. Subsequently, we analyzed event-related potentials and spectral perturbations to discern neural distinctions related to RS. We compared these distinctions concerning RA among participants exhibiting different levels of SS.

Results: Individuals exhibiting higher levels of SS (HSS) in the study displayed a tendency to disregard past risks, potentially resulting in diminished behavioral adaptability. EEG results indicated that individuals with HSS exhibited reduced neural responses to feedback compared to those with low SS, potentially affecting their feedback processing and decision-making. Moreover, the comparison of effects underscores the significant impact of RS and SS on shaping RA during dynamic decision-making scenarios.

Conclusion: This study has advanced the understanding of how SS and RS influence RA, revealing that RS prompts RA, while individuals with HSS often exhibit blunted RS, leading to worse RA. Future research should focus on the specific aspects of HSS and their implications for decision-making across different risk contexts. Employing advanced neuroimaging and cognitive modeling techniques will be pivotal in unraveling the neural mechanisms driving these individual differences in risky behavior.

1 Introduction

Sensation seeking (SS) denotes individual tendency to seek diverse, new, intricate, and intense sensations and experiences, along with a willingness to take risks across various life domains for the sake of such experiences (Zuckerman, 2013). This characteristic is linked to engaging in real-world risky behaviors and experimental risky decisions. Some researchers have suggested that SS plays a direct role in risky decision-making by being associated with sensitivity toward reward during dynamic decision-making tasks involving uncertain rewards (Bornovalova et al., 2009; Aven and Renn, 2013). In scenarios involving risky decisions with uncertain rewards, reward sensitivity (RS) is connected to the assessment of particular rewards and may show variations in expectancy across different risk levels (Neuser et al., 2020). Research has demonstrated that SS can be both adaptive and maladaptive, depending on how it manifests and interacts with other personality factors. High levels of SS have been associated with various risk behaviors, including substance use and problematic internet use (Chase and Ghane, 2023). Moreover, RS is significant in certain psychiatric disorders, such as bipolar disorder and substance use disorders, which exhibit elevated levels of RS (Whitton et al., 2015; Kim-Spoon et al., 2016). Therefore, both SS and RS play a crucial role in the development of behavioral disorders.

Psychologists differentiate between SS and RS in the context of risky decision-making. SS is considered a stable personality trait, reflecting individual differences in the propensity to seek novel and intense experiences. RS, on the other hand, is viewed as a more state-like characteristic, varying within individuals based on their current motivational and emotional context (Fleeson and Jayawickreme, 2015). Research findings demonstrate that both aspects of personality contribute to molding individual disparities in decision-making processes. Individuals with high SS (HSS) tend to appraise risks lower than those with low SS (LSS). This difference is attributed to HSS individuals temporarily attenuating their attention to or sensitivity toward potential negative outcomes, while LSS individuals maintain a greater focus on potential losses (Horvath and Zuckerman, 1993; Lauriola et al., 2014). RS is thought to influence risk-taking by modulating the valuation of potential rewards or the impact of feedback during learning (Smillie et al., 2006). Hence, it could be understanded that SS may provide a stable framework for predicting risk-taking tendencies, while RS allows for a more nuanced understanding of how individuals may adjust their behaviors based on immediate circumstances and feedback. Importantly, they may be interconnected, such that RS could be a mechanism by which SS influences risk-taking behaviors (Bianca et al., 2023). However, limited empirical evidence exists for these interconnections.

Reward sensitivity is commonly evaluated in dynamic decision-making situations involving uncertain rewards and punishments, where individuals’ choices impact their sequential risk-taking behaviors. A key aspect of RS is reward responsivity, which refers to an individual’s responsiveness to rewards (Ross, 1975). Lower variation in reward responsivity corresponds to decreased RS (DiMenichi and Tricomi, 2016). Moreover, reward responsivity may influence individuals’ adjustment of their risk-taking behavior in response to feedback, as individuals are more likely to repeat rewarded actions and avoid unrewarded ones (Harmon et al., 2021; Lee et al., 2021; Markanday and Galarraga, 2021). The process by which individuals modify their risk-taking behavior based on feedback is also referred to risk adjustment (RA) (Iezzoni, 1997; Ahn et al., 2003; Juhnke et al., 2016; Kelley et al., 2019). RA is a key element in dynamic decision-making, illustrating how individuals flexibly adjust their risk preferences in response to past choice outcomes. Despite the recognized link between RA and RS, there is a scarcity of empirical research on RA in dynamic risky decision-making scenarios and its association with RS and SS.

Dynamic decision-making tasks provide a critical framework for understanding how individuals progressively develop behavioral strategies by systematically exploring the intricate relationships between SS, RS, and RA. By employing reinforcement learning (RL) models, particularly in contexts involving probabilistic monetary outcomes, researchers can effectively illuminate the complex mechanisms underlying reward processing and adaptive decision-making strategies (Botvinick and Braver, 2015; O’Doherty et al., 2017). RL methodologies formalize the acquisition of action values based on past experiences and elucidate the role of different valuation systems in decision control and the mechanism of reward anticipation (Sutton and Barto, 2018). They provide insight into how rewards or losses influence subsequent choice behavior and allow for an investigation into how RS shapes reward-driven behavior, particularly RA (De Wit et al., 2012; Patzelt et al., 2018). According to RL theory, learning is shaped by prediction errors (PEs) – essentially a type of unexpected result in comparison to the anticipated value, and determining this disparity (Sutton and Barto, 2018). These PEs update value expectations, shaping subsequent actions. Individual differences in RS likely moderate the impact of these PEs, effectively influencing the learning rate (Luman et al., 2012). Hence, a computational model incorporating RL principles could offer valuable insights into RA during dynamic risky decision-making. Specifically, such a model could elucidate the interplay between RS, as a state-like characteristic, and SS, a trait-like personality factor, in shaping risk-taking behavior.

Additionally, analyzing stable brain-based characteristics of a neural trait approach can partially account for this diversity in behavior (Nash and Knoch, 2016). Recently, investigations have suggested potential links between the rate of evidence learned from feedback and variations in the relative brain activity of RS via electroencephalography (EEG) (Frank et al., 2015; Fukunaga et al., 2018). Feedback-related negativity (FRN), an event-related potential component arising from the variance in electrical potentials between losses and gains, is sensitive to the valence of outcomes (Frank et al., 2015; Kim-Spoon et al., 2016; Nash and Knoch, 2016). It acts as a neural representation of reward PE and is responsive to discrepancies in reward probability and adjustments in posterior magnitude (Cohen and Ranganath, 2007; Walsh and Anderson, 2011). Furthermore, analyzing EEG data through time-frequency analysis to investigate reward processing has unveiled a wealth of insights. This method aids in distinguishing the distinct impacts of overlapping event-related potentials (ERPs) by segmenting the EEG signal into spectral power. Studies indicate that delta band activity (1–4 Hz) is especially responsive to rewards and positive reward PEs, whereas theta band activity (4–8 Hz) is primarily associated with negative outcomes and unsigned PEs (Sambrook and Goslin, 2016; Brown and Cavanagh, 2020). For these reasons, tracking neural activity during dynamic risky decision-making could contribute to illuminating state-based expression tendencies like SS and RS personality traits.

This article addresses the intricate connections among SS, RS, and RA through a RL framework and EEG signals from a dynamic uncertain decision task. We hypothesized that individuals with HSS would show a greater tendency to modify their choices due to deficiencies in intentional decision-making processes or generally exhibiting reduced preference for rewards. These anticipated findings might indicate that SS could have moderating influences on choice variability and reward, which relate to RA and RS.

2 Materials and methods

2.1 Participants

All participants in this research were students enrolled in a medical university. They were all right-handed individuals with normal or corrected-to-normal vision, devoid of any history of neurological or psychiatric disorders, head trauma, or recent alcohol or tobacco consumption within 2 weeks before the study. The experimental protocols were endorsed by the Research Ethics Committee of Second Military University, Shanghai, China. Each participant provided written informed consent, acknowledging the objectives and methodologies of the study. There were no anticipated risks or discomfort, and participants were compensated upon task completion.

In study 1, 44 participants completed the task in our behavior lab using computers without EEG equipment. In study 2, 44 college students who did not participate in study 1 performed the task in the behavior lab while wearing EEG caps. However, one participant withdrew from study 1, and seven participants from study 2 were excluded from subsequent analyses due to technical issues or movement artifacts during data collection.

2.2 Procedure

Initially, all participants were invited to our laboratory and instructed to complete paper questionnaires, which included assessments of personality traits and demographic details, after providing written informed consent. Following a briefing on the safety protocols associated with EEG devices, participants engaged in computer-based tasks using the Windows 7 operating system and E-prime 2.0 software within a noise-reduced psychology laboratory environment, positioned approximately 60 cm away from a 20-inch LCD screen. Notably, participants in study 2 undertook the task while wearing a 64-channel EEG cap (Biosemi Product) with conductive gel. Both sets of participants completed the tasks within the same laboratory setting.

The task duration in study 1 averaged 15 min without fixed intervals between interfaces. In contrast, in study 2, the task lasted no less than 45 min due to the EEG setup prolonging interface delays and incorporating intermediate breaks. Before commencing the experiment, all participants underwent 15 practice trials, during which they carefully reviewed the task instructions and familiarized themselves with the risk characteristics. Participants wearing EEG caps were instructed to maintain stable head positions, focus on the screen, and minimize muscular movements. However, occasional glances at the keyboard for responses were inevitable, prompting a brief task delay to refocus attention on the screen. Following this delay, feedback on task performance was provided. Upon task completion, participants received compensation proportional to their risk decision-making performance, with an average payment of approximately 100 yuan for all participants and a standard compensation of 100 yuan for participants in study 2 to meet the experimental requirements.

2.3 Sensation Seeking Scale

The Sensation Seeking Scale Version 5 (SSS-V) is a psychometric tool that assesses individual variations in the desire for novel experiences and sensations (Zuckerman, 2007). This scale can evaluate both enduring personality traits and temporary states of SS, making it applicable to individuals across various age groups, including adults and adolescents, and suitable for diverse settings such as clinical, research, and educational environments (Zuckerman and Aluja, 2015). Widely recognized for its reliability and validity in measuring SS, the SSS-V has been extensively utilized in research to investigate the correlation between SS tendencies and a range of behaviors and outcomes (Zuckerman, 2015).

In this study, the Chinese version of the Sensation Seeking Scale Form V (SSS-V) was utilized and completed by all participants before engaging in the behavioral assessments. This scale comprises four sub-scales, each comprising 10 items: thrill and adventure seeking, boredom susceptibility, experience-seeking, and disinhibition (Wang et al., 2013). The total score is derived by summing the responses to all 40 items, providing an overall SS score reflecting the combined contributions. The scale’s internal consistency, as indicated by Cronbach’s alpha, was calculated to be 0.805, demonstrating good reliability. Detailed results of the interrelationships between sub-scales and the outcomes of factor analysis can be found in the Supplementary material.

2.4 Dynamic decision-making task (Balloon Inflation Test)

The Balloon Inflation Test (BIT) was derived from the Balloon Analog Risk Task (BART) (Lejuez et al., 2002). The BIT entailed a calculated risk assessment to quantify RA and incorporated more sophisticated measures for observing behaviors. The task was structured with fixed intervals between interfaces, as illustrated in Figure 1. Participants were briefed that they could earn money by inflating balloons, with the earnings directly linked to the chosen inflation percentage. Following a practice session, participants made trial-by-trial adjustments to optimize their earnings. Each choice corresponded to a specific likelihood of the balloon bursting, and rewards were determined based on the degree of inflation chosen. While the quality of the balloons was randomly generated and undisclosed to the participants, statistically, the optimal choice at the group level was 50%. Behavioral data from each trial, encompassing responses, reaction times, feedback, losses, and earnings, were meticulously recorded using E-prime 2.0 Professional.¹ After completing the 60 trials, participants were queried about their preferred inflation range by selecting options such as “1 – among 10%–30%,” “2 – among 30%–50%,” “3 – among 50%–70%,” and “4 – among 70%–90%.”

FIGURE 1

Figure 1. The diagram depicting BIT with consistent intervals between interfaces. In the balloon inflation task, participants’ focus was directed by the “start choice” phase lasting 1,000 ms, allowing them to choose the degree of inflation. Subsequently, an empty screen was displayed for 1,000 ms, followed by feedback lasting 1,000 ms. A 2,000 ms interval with a blank screen followed before initiating the next trial.

2.5 Electroencephalography recording

During the BIT, EEG recordings were conducted using the Biosemi ActiveTwo amplifier system (Biosemi, Amsterdam, Netherlands) with a bandpass of 0.1–100 Hz and a sampling rate of 512 Hz. This system utilizes two active electrodes, the Common Mode Sense (CMS) and Driven Right Leg (DRL), in place of traditional “ground” electrodes. The CMS is a recording reference, while the DRL serves as ground. Participants wore a flexible electrode cap with 64 Ag/AgCL electrodes arranged according to the International 10–20 system. EEG data were initially recorded using nasorostral as an online reference and then re-referenced algebraically to the average of all 64 channels for each participant. Electrode impedance was maintained below 3 kΩ. Horizontal and vertical electrooculograms were captured to detect blinks and eye movements. Subsequently, all data were processed offline using MATLAB R2020a (Math Works, Natick, MA) and the EEGLAB Toolbox 12.0.1 (Delorme and Makeig, 2004). A 30-Hz low-pass filter and a 0.1-Hz high-pass filter were applied for data filtering. Each EEG epoch commenced 1,000 ms before the feedback onset and concluded 1,000 ms after onset. Prior to averaging, independent Components Analysis was employed to correct for eye-blink and movement artifacts. Any trials contaminated by eye movements, blinks, or muscle potentials exceeding ±100 μv at any electrode were excluded from the analysis.

3 Statistical analysis

3.1 Behavioral data processing

In the context of the BIT, it is crucial to assess the average inflation level and its variation to comprehend participants’ risk-taking behavior. Analyzing the average inflation level provides insight into general risk inclination while studying inflation variability, which can yield valuable information on decision-making processes and response dynamics during the task. Additionally, a detailed examination was conducted on a trial-by-trial basis to improve understanding of decision precision, learning effects, and behavioral dynamics observed in the experimental data. This approach involved treating the number of trials as a fixed-effect factor and examining its influence on inflation choices based on sensation-seeking levels. Linear mixed-effects modeling using R’s lme4 package was conducted to conduct this trial-by-trial analysis. Furthermore, these effects were visualized using the ggplot2 package for comprehensive interpretation.

3.2 Computational modeling

To characterize the learning behavior of participants in BIT and reveal underlying trial-by-trial aspects of decision-making variables, we developed two sets of computational models. We applied them to the behavioral data of participants. These models were based on the simple RL model following the Rescorla-Wagner (RW) learning rule, as well as the RL model incorporating a Kalman (KL) filter (Rescorla, 1972). We assessed the suitability of these models and progressively integrated the most effective ones by comparing their performance. Within the RL model framework, each decision was represented by a 9-option value spanning from 10% (V_t(1)) to 90% (V_t(9)):

{\tilde{V}}_{t} = [V_{t} (1), V_{t} (2), V_{t} (3), V_{t} (4), V_{t} (5), V_{t} (6), V_{t} (7),

V_{t} (8), V_{t} (9)] (1)

In each trial t, V_t (representing the option’s value) was depicted as a two-element vector with a value of zero (indicating not chosen) or one (indicating chosen). Given the nature of the task, where each option was associated with increasing rewards, the likelihood of each option remaining at its current level or advancing to a higher degree was considered. For example, if a participant selected 50%, they were expected to choose 40% in the next trial, indicating a gradual progression in their decision-making. This aligns with the notion that participants were inclined to advance to higher degrees cautiously, as evidenced by their reluctance to jump directly to 70% without first embracing the risk of 60%. This decision-making process was reflected in the probability of choosing option i (indexed from 1 to 9), as outlined below.

P_{t} (i) = \frac{e^{V_{t} (i)}}{e^{V_{t} (i)} + e^{V_{t} (i + 1)}} = \frac{1}{1 + e^{- (V_{t} (i) - V_{t} (i + 1))}} (2)

The values were subsequently transformed into action probabilities through the utilization of a SoftMax function:

P_{t} (i) = Φ (V_{t} (i)) (3)

Where Φ was the inverse logit linking function:

Φ (x) = \frac{e^{x}}{1 + e^{x}} = \frac{1}{1 + e^{- x}} (4)

It is important to note that we incorporate the widely employed inverse SoftMax temperature parameter τ in the model specifications of action probability. This parameter regulates the degree of randomness in decision-making, varying from τ = 0 for entirely random responses to τ = ∞ for selecting the highest value option with certainty. In the basic model, an RW model was employed to represent decision-making, where only the selected value was adjusted based on the PE. In contrast, the unselected value remained unchanged from the previous trial.

P E = R_{t} (i) - V_{t} (i) (5)

V_{t + 1} (i) = V_{t} (i) + α P E (6)

Here, R_t represented the reward received in trial t, and the learning rate (α_t) denoted by RS (0 < α_t 1) determined the influence of the PE on updating the value. Upon inputting values of (1), (2), (3), (4), and (5), we obtained the categorical distribution of choices as:

C h o i c e C a t e g o r i c a l (Φ (τ \times V_{t})) (7)

The model incorporating the KL filter, known as the Pearce-Hall model, is additionally computed with the standard error (Gheza et al., 2018). This error signifies the dynamic learning rate that governs the impact of the PE and employs a form of the delta rule to adjust the estimated value according to the reward PE. KL filter models stand out by monitoring the (posterior) variance of the estimated value for each choice, reflecting estimation uncertainty, and leveraging this information to modify the learning rate adaptively. The lazy KL filter introduces a bias to the learning rate, facilitating a slower learning process. This model introduces additional parameters: the initial learning rate and the asymptotic learning rate, which collectively characterize the progression of the effective learning rate over time. The term “KL gain” represented by k_t functions as a learning rate. Consequently, Equation 6 was revised as:

V_{t + 1} (i) = V_{t} (i) + k_{t} α_{t} PE (8)

The parameter η ? (0, 1) dictates the bias in the updates of the KL gain, potentially leading to a slower learning rate (hence the term “lazy”). In the standard KL filter, this parameter is set at η = 1, whereas in the lazy versions, it is a variable parameter, allowing for less precise updates. The term k_t is influenced by S_t, representing the variance of the posterior distribution of the average reward, incorporating the innovation variance and the reward variance of the option. When t = 1, it signifies the initial variances as priors. These concepts are outlined as follows:

α_{t + 1} = η | PE | + (1 - η) α_{t} (9)

k_{t} = \frac{S_{t} + σ_{ζ}^{2}}{S_{t} + σ_{ζ}^{2} + σ_{ε, t}^{2}} (10)

In our models, logistic regressions are commonly utilized for vectors, and we employed the logistic model due to the U-shaped pattern observed in the expected values in our study. Specifically, the choice of 30% was rewarded equivalently to 70% based on probability. Consequently, we postulated a non-continuous computational process of choice utilities when individuals were making choices, aligning with the fundamental assumption of the logistic regression model. In this analysis, we initialized the values of V_t(i) at 0, while the subsequent choices, V1–V9 (representing the utility of the 9 options calculated as the product of risk probability and reward), were 9, 16, 21, 24, 25, 24, 21, 16, and 9, respectively. The standard deviations derived from reward probability were 3.12, 4.22, 4.83, 5.16, 5.27, 5.16, 4.83, 4.22, and 3.12, respectively.

We evaluated the superior model against alternative computational hypotheses within the hierarchical Bayesian framework (Table 1). To further validate our leading model, we employed two rigorous approaches. Firstly, we conducted a parameter recovery analysis to ensure the accurate and specific identification of all parameters (refer to Supplementary material). We conducted posterior predictive checks by leveraging model comparison to assess relative model performance. These analyses were executed in the R environment utilizing a combination of Rstan for Bayesian inference, hBayesDM for hierarchical Bayesian modeling, and loo for model comparison. The distinction in parameters was assessed using the Wilcoxon Rank Sum Test.

TABLE 1

Table 1. Computational models, model parameters, and model comparison.

3.3 Event-related potentials and spectral perturbations analysis

For each EEG epoch, we established the baseline for ERPs measurements by averaging the voltage recorded during the 200 ms pre-feedback interval and 1,000 ms. Subsequently, FRN was evaluated by calculating the mean wave amplitudes within a time window of 250–320 ms following positive or negative feedback onset. In line with previous studies, the FRN was initially evaluated across three electrodes: Fz, FCz, and Cz, prompting a focused and detailed analysis on these specific electrodes (Euser et al., 2011; Takács et al., 2015). Therefore, our detailed analyses were centered on the Fz, FCz, and Cz electrodes. Additionally, we conducted a time-frequency analysis on single-trial ERP data synchronized with feedback. This analysis involved averaging and baseline correction to ascertain event-related spectral perturbations (ERSPs). The algorithm used for time-frequency analysis is based on the Morlet wavelet transform principle involving convolution between wavelets with peak frequencies and the temporal signal under investigation to obtain a representation of power at different frequencies within the temporal domain signal. The dataset examines a frequency range from 0.1 to 30 Hz in the frequency domain divided into 50 frequency points. The temporal width spans 2 s, encompassing the entire epoch, with a margin of ±1 s around the presentation of the result feedback. The wavelet cycles range from 3 to 14, and trial-specific time-frequency results undergo averaging and normalization processes. To reveal the effect of SS on RS related neural substrates, we compared the difference in these related substrates using the Wilcoxon Rank Sum Test

3.4 Moderation effect statistic

The moderation effect statistic was computed to examine the interaction between SS and RS on RA measured in the dynamic risky decision. We used multiple regression analysis to test the interaction effect, with SS and RS as predictors and the choice degree varied across bins as the outcome variable with choice order as the intercept. The interaction between SS and RS can be defined mathematically as:

Y_{i j} = β_{0} + β_{1} S S_{i} + β_{2} R S_{j} + β_{3} (S S_{i} R S_{j}) + γ_{k} + ?_{i j} (11)

In this context, Y is the choice degree varied across trials, and β₃ signifies the interaction effect that demonstrates the varying impact of RS on the outcome, in response to the dynamic risk associated with RA, based on the level of SS. By including random effects, mixed models can handle correlations within the data and provide more accurate estimates of fixed effects. The analyses were performed in R with the “lmer” function. Additionally, we verified the centering of all variables in the model using the R functions (scale).

4 Results

4.1 Demographic characteristics

A summary outlining the demographic features of all participants is displayed in Table 2. The collective average SS scores stood at 99.63, with a standard deviation of 13.23. As there were no notable distinctions in the behavioral outcomes (Ps > 0.055), we simultaneously evaluated participants’ behavioral performance across both studies.

TABLE 2

Table 2. Sample demographic characteristics, SS scores (SSS), and the averages of behavioral variables.

4.2 Risk adjustment behavior among individuals with varying level of SS

At the initiation of our analysis, we investigated the associations between the Sensation Seeking Scale (SSS) scores and the behavioral indexes derived from the BIT. The findings revealed predominantly insignificant correlations between SSS and its subscale scores with BIT indexes, including reaction time, variability, and maximum scores (refer to the Supplementary material). To further explore the relationship between different levels of SS and their impact on BIT performance, we conducted a three-group analysis, categorizing participants into high (HSS), medium (MSS), and low (LSS) sensation seeking groups based on the 33rd and 66th percentiles of the SSS scores considering a potential nonlinear relationship between SSS and BIT results. This allowed us to more meticulously explore the relationship between different levels of SSS and behaviors. During the grouping process, we ensured a balanced amount of data in each group to guarantee the accuracy and reliability of the results. This resulted in 27 participants being placed into the LSS group, 30 into the MSS group, and the remaining 23 into the HSS group. To further explore behavioral differences between these groups, we analyzed their choice details as depicted in Figure 2. The figure illustrates similar choice distributions for LSS and HSS groups, while MSS preferred lower risk over other groups (χ² = 27.94, df = 2, P < 0.0010, see Figure 2A). The regression results also indicated that trial orders affected choice preferences (β = −0.339, SD = 0.127, P = 0.007, see Figure 2B) and being in the MSS (β = −6.383, SD = 2.444, P = 0.010, see Figure 2B). However, there was no significant difference in the choice of standard deviation among these groups (see Figure 2C).

FIGURE 2

Figure 2. Group-behavioral characteristics of LSS, MSS, and HSS groups. Panel (A) presents a bar graph illustrating the correlation between the distribution of sample proportions and choices. The bars are color-coded to represent distinct groups, with each bar labeled by its corresponding percentage, signifying the proportion of each choice within the group. Panels (B,C) depict three-line graphs, evidently linked to a time series of experiments. The x-axis denotes the experimental stage with increments of five trials, grouped into bins. Meanwhile, the y-axis portrays the mean and standard deviation of choices over five trials. In both graphs, different colored lines signify data from diverse groups. Error bars denote the variability or uncertainty of the measured points within the bin.

4.3 Estimation and comparison of parameter values for RS

We collectively inputted the data from the three groups into the Hierarchical Bayesian models to uncover the distinction at a more detailed level. Utilizing the Markov Chain Monte Carlo (MCMC) method, we performed a comprehensive simulation of data, primarily aimed at estimating the range of parameters as posterior distributions for each parameter at the individual level (Gallagher et al., 2009). The specifics of each hierarchical Bayesian model were outlined in Supplementary material. Based on the leave-one-out information criterion (LOOIC) presented in Table 1, the KL-RL model exhibited slightly superior predictive performance compared to the RW-RL model. Consequently, we concluded that the parameters of the KL-RL model were more effective in representing RS during the learning process. Figure 3 displayed the connections among model parameters, demonstrating a negative correlation between model parameters and choice standard deviation.

FIGURE 3

Figure 3. Relationships between model parameters and behaviors. Each of the small plots in the lower triangle of the matrix is a scatterplot of two variables. The plots on the diagonal are histograms showing the distributions of individual variables. Above the diagonal, there are correlation coefficients. The histogram visually displays the occurrence frequency of values for each variable at the individual level, encompassing the bias of KL gain (eta_ind), inverse temperature (theta_ind), the innovation variance (varm_ind), and initial variance (vari_ind), the mean of the selected degree (Meanchoice), the mean reaction time (MeanRT), the variability in choices (standard deviation of choices, abbreviated as StdDevchoice), and the frequency of choosing 50% (choice counts of 50%). The symbols in this figure represent statistical significance levels: * indicates p < 0.05; ** indicates p < 0.01, *** indicates p < 0.001.

Hence, we separately entered the behavior data of three groups into the three hierarchic models, and the posterior distributions for the bias of KL the bias of KL gain (η, Figure 4A), inverse temperature (τ, Figure 4B), the innovation variance ( $σ_{ξ}^{2}$ , Figure 4C), and initial variance ( $σ_{ε}^{2}$ , Figure 4D) were calculated at the individual level. As depicted in Figure 4, only one parameter showed significant differences among the groups with varying levels of SS. The HSS group exhibited the highest value, which was significantly different from the LSS and MSS groups (Ps < 0.038); however, there was no significant difference between the MSS and LSS groups (P = 0.923). These findings suggest that individuals with HSS may have a stronger inclination toward unfamiliar risks and show persistence in their decision-making.

FIGURE 4

Figure 4. The posterior distributions for individual parameters of KL_RL model. Panels (A–D) depict the model parameters distributions in the three groups, respectively. The violin plots combine elements of a box plot with a kernel density estimation. The thick black line inside each colored shape shows the interquartile range (IQR), and the white point represents the median. The width of the colored shape at different points on the y-axis shows the probability density of the parameter values: the wider the section, the higher the probability of the parameter taking a value within that range. The different colors per group allow for a quick visual comparison across groups.

4.4 The correlation between SS and neural substrates of RS

In BIT, successful inflation results in a “Win” and corresponding scores, while bursting the balloon leads to a “Loss.” Therefore, measuring RS is based on the differences in event-related potentials recorded from Fz\FCz\Cz channels. Figure 5A illustrates significant variations between the two conditions at time points around 250–450 ms, evident in overall ERPs and individual participant trial differences. Moving forward to Figure 5B, it seems that participants’ trial differences reach their peak latency around 300 ms, further supported by statistical analysis shown in Figure 5C. A detailed analysis of ERP differences and their correlation with SS was presented in Supplementary material. This analysis demonstrates that the time window between approximately 250–450 ms exhibits significant differences in the grand-averaged ERPs between the two experimental conditions.

FIGURE 5

Figure 5. The different representations of the ERP data of two conditions. (A) Mean ERPs with 95% confidence intervals. The black dots on the x-axis indicate time points with significant paired t-test results (P < 0.05). The dark-blue line represents the win condition, the light-blue line represents the loss condition, and the red line signifies the comparison between loss and win conditions. (B) The temporal evolution of ERP variances between the two conditions for individual participants is displayed. (C) The progression of individual differences over time, with participants arranged along the y-axis. ERP amplitudes are color-coded to correspond with the outcomes depicted in panel (B). Significant time points are highlighted with transparency. A bootstrap cluster sum method was employed to adjust for multiple comparisons.

Moreover, we conducted ERSP analysis focusing on the signals between 250 and 450 ms. Figure 6 illustrates the ERS in the 1–30 Hz frequency range within the notable time-frequency window. Consistent with the ERP outcomes, we observed heightened power at FCz following the feedback, aligning with earlier findings related to FRN. In Figure 6D, statistical significance was evident within the 200–400 ms timeframe and across frequencies spanning 1–10, showing notable differences in the comparison at the low- and middle frequency bands. Additionally, topographic maps for the ERSP results, contrasted across different channels, were provided in the Supplementary material.

FIGURE 6

Figure 6. Event-related synchronization related to feedback. Panels (A,B) illustrates brain activity’s intensity or synchronization levels. The color scheme typically indicates intensity or power, with blue possibly representing lower synchronization or power and red indicating higher synchronization or power. Panel (C) may present a T-contrast of the (Win vs. Loss) conditions. Panel (D) displays P values, a measure of statistical significance; the blue areas indicate time points and frequencies where the data significantly differ from background or control conditions.

To demonstrate the impact of SS on RS concerning neural pathways, we compared individuals with varying levels of SS. This comparison revealed significant differences between high and low levels of SS in terms of both the FRN amplitude (P = 0.037, Figure 7A) and the power within the 1-10 Hz frequency band (P = 0.0072, Figure 7C). No significant difference showed in the LPP amplitude (Figure 7B). Individuals with HSS exhibited smaller amplitudes and weaker responses to negative feedback than those with LSS.

FIGURE 7

Figure 7. The difference of neural substrates for RS among groups with different levels of SS. Panels (A–C) depict the amplitude of the feedback-related negativity (FRN), the late positive potential (LPP) amplitude, and the power within the 1-10 Hz frequency band, respectively.

4.5 The moderating effect of SS on RS and RA

Three regression models were constructed to examine the moderate impact of SS (Table 3). These models used RA to represent the variation in choice degree as the outcome variable. The random effect was defined as choice order (categorized 10 times), and the common fixed effect was represented by levels of SS. Model1 included the individual-level model parameter ( $σ_{ξ}^{2}$ ) and tits interaction with SS from study 1 and 2 samples. In model 2 and 3, FRN, band power, and their interaction with SS were the predictive variables with samples from study 2. When compared to the HSS group, it was observed that the interaction between SS and FRN significantly predicted variations in choices across bins, indicating that at higher levels of SS, the positive predictive effect of FRN on choice variation became stronger.

TABLE 3

Table 3. Summaries of effects on RA.

5 Discussion

As we predicted, SS may have a moderating influence on choice variability, demonstrating RA through RS. This was confirmed by the RL model method and further supported by neural substrates showing the relationship between SS and RS. Regarding behavioral findings, individuals with HSS are likely to display a decreased tendency to alter choices and low levels of RS in response to reward feedback. Additionally, EEG results revealed a significant effect of SS on the neural substrates of RS. In particular, FRN related to RS predicted RA, and its effect was moderated by SS. These results highlight SS’s behavioral presentation and neural substrates and contribute to understanding the complex interplay between SS and RS in influencing RA.

Previous studies also found that SS and RS are closely connected in terms of the pursuit of reward and novelty (Zuckerman, 1994; Bornovalova et al., 2009; Xu et al., 2019). HSS individuals are more sensitive to rewards but less sensitive to punishment compared with LSS. The potential explanation centers around motivation in SS and the “hyperactive approach system” (Kruschwitz et al., 2012). HSS might be inversely linked to RS, as it may require a higher degree of risk to meet the seekers’ presumed excitement and arousal for reward (Zuckerman, 2007). In particular, we made an unexpected observation after dividing the comparison groups into three levels, which differs from traditional pairwise comparisons. Our observations indicated that the MSS group demonstrated a decreased tendency for risk compared to the HSS group. This indicates that the MSS group may lead to a lower preference for risk-taking. However, these distinctions are only apparent in the average inclination and do not persist in choice variability. These results indicate a difference in the relationship between SS and risk inclination compared to SS and RA. Advancedly, we used an RL model combined with a KL filter to investigate the potential impact of SS on RA. We also incorporated the influence of RS using relevant parameters. The KL filter’s ability to handle uncertainty and evolving situations makes it ideal for examining the effects of SS and RS on RA in decision-making processes involving learning from feedback (Zhang et al., 2021; Kim et al., 2022). Through model analysis, we found that individuals with HSS tend to disregard past risks and concentrate excessively on immediate stimuli and potential rewards, causing them to neglect previous encounters with risks. This behavioral pattern can impact their adaptability and decision-making flexibility in dynamic environments.

Reward sensitivity has evolved from being a trait related to the behavioral approach and inhibition system to being seen as a state personality that can be applied in human behavioral experiments (Kale et al., 2018; Fischer and Karl, 2020). However, most field studies used self-reported surveys, with only a few investigating the relationship between state and trait personality in experimental behavioral tasks, especially those related to risky decision-making. As a result, our research utilized an experimental task to control the outcome of a risky decision. Subsequent choices were influenced by outcomes that likely relate more to feedback or reward-based learning processes. This served as a practical measure for assessing how personalities affect decision outcomes. Similar to previous studies, we discovered that RS (the tendency to repeat previously rewarded actions while avoiding losses) played a pivotal role in shaping decisions during reward-based learning and decision-making situations (Cockburn and Holroyd, 2018). RS directly links to individual learning rates that scale the effect of PEs on updating values (Wu et al., 2017). These concepts extend to environments involving dynamic uncertain decisions, where RS involves comparing a choice’s expected value with its actual outcome – signaling subjective-value representation or “satisfaction” with a choice’s result. These results suggest that RS affects how risk is evaluated and adjusted afterward, which implies that people who are highly sensitive to rewards tend to take more risks when making decisions and adapt their risk preferences based on their experiences. Hence, this emphasizes the importance of RS in shaping adjustments made for risky decisions in dynamic environments.

Examining consistent brain-based traits in individuals using a neural trait approach can help explain some of the variations in behavior (Nash and Knoch, 2016). We aimed to investigate further the underlying neural mechanisms of SS and its influence on RS-related behaviors. We employed EEG to examine the brain activities associated with RS during a reward-based decision-making task to achieve this. Our findings indicated that individuals with HSS showed reduced responsiveness to different types of feedback compared to those with LSS. Consistent with our observations regarding differences in FRN, previous studies have demonstrated that the magnitude of FRN changes based on learning from PEs generated by RL models (Walsh and Anderson, 2011). FRNs have been associated with risky decision-making behavior (Polezzi et al., 2010; Takács et al., 2015; Zhong et al., 2020). The emergence of an active FRN signal during such decisions may reflect a conflict between expected and actual outcomes, leading to emotions related to regret or disappointment. This suggests that features defining FRNs make them valuable indicators for monitoring neural activity during dynamic risky decision-making, offering insights into overarching pattern analysis and potentially shedding light on state-based expression tendencies such as SS and RS personality traits. Our study provided supporting evidence for these aspects of the value of FRN.

Besides, time-frequency analysis revealed that individuals with HSS exhibited increased theta band power for reward in the prefrontal cortex during decision-making, suggesting their heightened cognitive processing and attention allocation to reward-related information. Consistent with prior findings, the links between theta (4∼8 Hz) signals associated with cognitive control and adjustment of thresholds (Cavanagh et al., 2010; Narayanan et al., 2013), as well as associations between trial-to-trial variation in subthalamic nucleus activity and variance in decision thresholds based on computational modeling analysis (Cohen and Cavanagh, 2011; Crowley et al., 2014; Rawls et al., 2020). Our findings indicate statistical significance between 200 and 400 ms, spanning frequencies of 1–10 Hz. This suggests that low-frequency activity could be a reliable marker for reward processing, providing more immediate and accurate understanding of the cognitive mechanisms underlying decision-making in situations with potential risks (Ratcliff and Frank, 2012; Cavanagh and Frank, 2014). Hence, more EEG signals can be utilized to examine neural mechanisms within RL models during dynamic decision-making and provide additional details about enduring brain-based characteristics.

Our study contributed novel advancements by widely employing BIT design without necessitating numeracy skills, as seen in BART (Lejuez et al., 2002). The variances between BIT and BART are rooted in the inflation method, transitioning from continuous clicks in BART to a single click in BIT. This modification in BIT allows for a more explicit revelation of decision-makers genuine preferences. Consequently, the choice variability reflects RA quantitatively, as each decision is uniformly assessed due to the consistent risk associated with each renewed balloon. Another crucial disparity is that while the burst probability signifies potential risk in both tasks, this crucial information cannot be acquired in BART due to its floating burst point, unlike in BIT, where risk ratios for each option are deliberately designed. Hence, the RL model in BIT facilitates the cognitive process of reward-driven choices and generates structured model parameters that index RS. These distinctive features of BIT contribute to the differentiation of reward responsivity from RS, where RA is directly influenced by behaviors impacting the most recent rewarded choice. At the same time, RS is associated with the learning pace in a reward-centric learning setting. Prior research utilizing questionnaires grounded in approach-avoidance personality theories has faced difficulties integrating perception/valuation sensitivity with motivation/action sensitivity, which impacts the observed behaviors (Corr and McNaughton, 2012; Nash and Knoch, 2016). The challenge arises in handling perception/valuation sensitivity and motivation/action sensitivity in self-report questionnaires. In BIT, money allows for easy manipulation through the presentation or omission of rewards, offering a significant advantage in modifying stimuli and altering states. Similar to concepts in previous quantitative frameworks related to self-valuation for rewards, RS scales the subjective value of potential choices guiding decision-making processes. In contrast, reward responsivity influences responses and behavioral control, known as RA. Consequently, BIT analyzed by RL models can provide direct indicators for RS and RA, which is suitable for neural analysis of RS.

Various significant limitations need consideration when interpreting the study results. The approach of treating SS as a trait with a total score, instead of analyzing subscale scores like experience-seeking and thrill-seeking, was chosen to facilitate comparisons with prior research and enhance the integrated understanding of SS. While utilizing total scores of SSS offers advantages, it also poses limitations. Future research should delve deeper into individual SS subscales and explore interaction effects between these subscales and other psychological constructs, such as RS, to enhance comprehension of the intricate dynamics of risk-taking. Additionally, while our study focused on quantifying RA using unknown but ordered risks within cognitive processes of dynamic decision-making, it is important to note that dynamic risk decision-making encompasses various contexts. As a result, the results from BIT experiment may not generalize to other forms of risk due to potential changes in the formation of risk aversion. Lastly, our outcomes relied on trial-by-trial variance instead of block-between-block analysis, presenting difficulties in merging ERP data and requiring artifact removal and noise cancellation across trials. Nonetheless, utilizing advanced techniques such as time-frequency analysis, source analysis, or even microstate analysis for ERP could provide resolutions and unveil additional evidence concerning the neural underpinnings associated with personality traits influencing risky decision-making.

6 Conclusion

In conclusion, our research findings have provided valuable insights into the influence of RS and SS on RA. The evidence suggests that individuals with high RS tend to take more risks and adapt their risk preferences based on their experiences. This underscores the importance of RS in shaping adjustments made for risky decisions in dynamic environments. Moreover, our study demonstrated that individuals with HSS showed reduced responsiveness to different types of feedback compared to those with LSS. This suggests SS may impact how individuals process and respond to feedback in decision-making contexts.

Overall, our findings contribute to a better understanding of how individual differences in RS and SS can shape decision-making processes, with potential implications for decision-making strategies and interventions in real-world settings. About solutions, our research highlights the importance of developing more sensitive behavioral measures for RA and neural presentation for sensation-seeking traits. Future research could benefit from employing advanced methods such as cognitive modeling and source analysis of ERP to investigate further the neural substrates related to personality traits influencing risky decision-making.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in this article/Supplementary material.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Navy Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YQ: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. CS: Data curation, Investigation, Validation, Writing – original draft, Writing – review & editing. HT: Investigation, Methodology, Writing – original draft, Writing – review & editing. DW: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – review & editing. LT: Conceptualization, Data curation, Funding acquisition, Project administration, Resources, Supervision, Validation, Visualization, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This article is supported by the Project of Prevention and Protection for Mental Health (grant no. [2024]432-73) and the Health Bureau’s Institutional Research Collaboration Project (grant no. 2023-TQ-LC-013).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnbeh.2025.1492312/full#supplementary-material

Abbreviations

BIT, Balloon Inflation Test; EEG, electroencephalography; ERP, event-related potentials; ERS, event-related synchronization; ERSPs, event-related spectral perturbations; FRN, feedback-related negativity; HSS, high sensation seeking; KL, Kalman filter; LOOIC, leave-one-out information criterion relative to the winning model; LSS, low sensation seeking; MCMC, Markov Chain Monte Carlo; MSS, middle level of sensation seeking; PE, prediction error; RL, reinforcement learning; RW-RL, the simple RL model according to the Rescorla-Wagner learning rule; KL-RL, the RL model with Kalman filter; RA, risk adjustment; RS, reward sensitivity; SS, sensation seeking; SD, standard deviation.

Footnotes

^ https://www.pstnet.com/

References

Ahn, D.-H., Conrad, J., and Dittmar, R. F. (2003). Risk adjustment and trading strategies. Rev.Financ. Stud. 16, 459–485. doi: 10.1093/rfs/hhg001

Crossref Full Text | Google Scholar

Aven, T., and Renn, O. (2013). Real or hypothetical monetary rewards modulates risk taking behavior. J. Risk Res. 12, 1–11. doi: 10.3724/SP.J.1041.2013.00874

PubMed Abstract | Crossref Full Text | Google Scholar

Bianca, P. A., Elaine, N. A., Arthur, A., Tracy, C., and Robert, M. (2023). Sensory processing sensitivity and its relation to sensation seeking. Curr. Res. Behav. Sci. 4:100100. doi: 10.1016/j.crbeha.2023.100100

Crossref Full Text | Google Scholar

Bornovalova, M. A., Cashman-Rolls, A., O’Donnell, J. M., Ettinger, K., Richards, J. B., deWit, H., et al. (2009). Risk taking differences on a behavioral task as a function of potential reward/loss magnitude and individual differences in impulsivity and sensation seeking. Pharmacol. Biochem. Behav. 93, 258–262. doi: 10.1016/j.pbb.2008.10.023

PubMed Abstract | Crossref Full Text | Google Scholar

Botvinick, M., and Braver, T. (2015). Motivation and cognitive control: from behavior to neural mechanism. Annu. Rev. Psychol. 66, 83–113. doi: 10.1146/annurev-psych-010814-015044

PubMed Abstract | Crossref Full Text | Google Scholar

Brown, D. R., and Cavanagh, J. F. (2020). Novel rewards occlude the reward positivity, and what to do about it. Biol. Psychol. 151:107841. doi: 10.1016/j.biopsycho.2020.107841

PubMed Abstract | Crossref Full Text | Google Scholar

Cavanagh, J. F., and Frank, M. J. (2014). Frontal theta as a mechanism for cognitive control. Trends Cogn. Sci. 18, 414–421.

Google Scholar

Cavanagh, J. F., Frank, M. J., Klein, T. J., and Allen, J. J. (2010). Frontal theta links prediction errors to behavioral adaptation in reinforcement learning. Neuroimage 49, 3198–3209. doi: 10.1016/j.neuroimage.2009.11.080

PubMed Abstract | Crossref Full Text | Google Scholar

Chase, H. W., and Ghane, M. (2023). Seeking pleasure, finding trouble: functions and dysfunctions of trait sensation seeking. Curr. Addict. Rep. 10, 140–148. doi: 10.1007/s40429-023-00484-5

Crossref Full Text | Google Scholar

Cockburn, J., and Holroyd, C. B. (2018). Feedback information and the reward positivity. Int. J. Psychophysiol. 132 (Pt. B), 243–251. doi: 10.1016/j.ijpsycho.2017.11.017

PubMed Abstract | Crossref Full Text | Google Scholar

Cohen, M. X., and Cavanagh, J. F. (2011). Single-trial regression elucidates the role of prefrontal theta oscillations in response conflict. Front. Psychol. 2:30. doi: 10.3389/fpsyg.2011.00030

PubMed Abstract | Crossref Full Text | Google Scholar

Cohen, M. X., and Ranganath, C. (2007). Reinforcement learning signals predict future decisions. J. Neurosci. 2, 371–378. doi: 10.1523/JNEUROSCI.4421-06.2007

PubMed Abstract | Crossref Full Text | Google Scholar

Corr, P. J., and McNaughton, N. (2012). Neuroscience and approach/avoidance personality traits: a two stage (valuation–motivation) approach. Neurosci. Biobehav. Rev. 36, 2339–2354. doi: 10.1016/j.neubiorev.2012.09.013

PubMed Abstract | Crossref Full Text | Google Scholar

Crowley, M. J., van Noordt, S. J., Wu, J., Hommer, R. E., South, M., Fearon, R., et al. (2014). Reward feedback processing in children and adolescents: medial frontal theta oscillations. Brain Cogn. 89, 79–89. doi: 10.1016/j.bandc.2013.11.011

PubMed Abstract | Crossref Full Text | Google Scholar

De Wit, S., Watson, P., Harsay, H. A., Cohen, M. X., van de Vijver, I., and Ridderinkhof, K. R. (2012). Corticostriatal connectivity underlies individual differences in the balance between habitual and goal-directed action control. J. Neurosci. 32, 12066–12075. doi: 10.1523/JNEUROSCI.1088-12.2012

PubMed Abstract | Crossref Full Text | Google Scholar

Delorme, A., and Makeig, S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. doi: 10.1016/j.jneumeth.2003.10.009

PubMed Abstract | Crossref Full Text | Google Scholar

DiMenichi, B. C., and Tricomi, E. (2016). Are you smarter than a teenager? Maybe not when it comes to reinforcement learning. Neuron 92, 1–3.

Google Scholar

Euser, A. S., Van Meel, C. S., Snelleman, M., and Franken, I. H. (2011). Acute effects of alcohol on feedback processing and outcome evaluation during risky decision-making: an ERP study. Psychopharmacology 217, 111–125. doi: 10.1007/s00213-011-2264-x

PubMed Abstract | Crossref Full Text | Google Scholar

Fischer, R., and Karl, J. A. (2020). The network architecture of individual differences: personality, reward-sensitivity, and values. Pers. Individ. Differ. 160:109922.

Google Scholar

Fleeson, W., and Jayawickreme, E. (2015). Whole trait theory. J. Res. Pers. 56, 82–92. doi: 10.1016/j.jrp.2014.10.009

PubMed Abstract | Crossref Full Text | Google Scholar

Frank, M. J., Gagne, C., Nyhus, E., Masters, S., Wiecki, T. V., Cavanagh, J. F., et al. (2015). fMRI and EEG predictors of dynamic decision parameters during human reinforcement learning. J. Neurosci. 35, 485–494. doi: 10.1523/JNEUROSCI.2036-14.2015

PubMed Abstract | Crossref Full Text | Google Scholar

Fukunaga, R., Purcell, J. R., and Brown, J. W. (2018). Discriminating formal representations of risk in anterior cingulate cortex and inferior frontal gyrus. Front. Neurosci. 12:553. doi: 10.3389/fnins.2018.00553

PubMed Abstract | Crossref Full Text | Google Scholar

Gallagher, K., Charvin, K., Nielsen, S., Sambridge, M., and Stephenson, J. (2009). Markov chain Monte Carlo (MCMC) sampling methods to determine optimal models, model resolution and model choice for Earth Science problems. Mar. Petroleum Geol. 26, 525–535.

Google Scholar

Gheza, D., De Raedt, R., Baeken, C., and Pourtois, G. (2018). Integration of reward with cost anticipation during performance monitoring revealed by ERPs and EEG spectral perturbations. Neuroimage 173, 153–164. doi: 10.1016/j.neuroimage.2018.02.049

PubMed Abstract | Crossref Full Text | Google Scholar

Harmon, D. A., Haas, A. L., and Peterkin, A. (2021). Experimental tasks of behavioral risk taking in alcohol administration studies: a systematic review. Addict. Behav. 113:106678. doi: 10.1016/j.addbeh.2020.106678

PubMed Abstract | Crossref Full Text | Google Scholar

Horvath, P., and Zuckerman, M. J. (1993). Sensation seeking, risk appraisal, and risky behavior. Pers. Individ. Differ. 14, 41–52. doi: 10.1016/0191-8869(93)90173-Z

Crossref Full Text | Google Scholar

Iezzoni, L. I. (1997). The risks of risk adjustment. JAMA 278, 1600–1607. doi: 10.1001/jama.278.19.1600

PubMed Abstract | Crossref Full Text | Google Scholar

Juhnke, C., Bethge, S., and Mühlbacher, A. C. (2016). A review on methods of risk adjustment and their use in integrated healthcare systems. Int. J. Integrated Care 16:4. doi: 10.5334/ijic.2500

PubMed Abstract | Crossref Full Text | Google Scholar

Kale, D., Stautz, K., and Cooper, A. (2018). Impulsivity related personality traits and cigarette smoking in adults: a meta-analysis using the UPPS-P model of impulsivity and reward sensitivity. Drug Alcohol Dependence 185, 149–167. doi: 10.1016/j.drugalcdep.2018.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

Kelley, N. J., Finley, A. J., and Schmeichel, B. J. (2019). After-effects of self-control: the reward responsivity hypothesis. Cogn. Affect. Behav. Neurosci. 19, 600–618. doi: 10.3758/s13415-019-00694-3

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, S., Petrunin, I., and Shin, H.-S. (2022). “A review of Kalman filter with artificial intelligence techniques,” in Proceedings of the 2022 Integrated Communication, Navigation and Surveillance Conference (ICNS), (Dulles), 1–12.

Google Scholar

Kim-Spoon, J., Deater-Deckard, K., Holmes, C., Lee, J., Chiu, P., and King-Casas, B. (2016). Behavioral and neural inhibitory control moderates the effects of reward sensitivity on adolescent substance use. Neuropsychologia 91, 318–326. doi: 10.1016/j.neuropsychologia.2016.08.028

PubMed Abstract | Crossref Full Text | Google Scholar

Kruschwitz, J. D., Simmons, A. N., Flagan, T., and Paulus, M. P. (2012). Nothing to lose: processing blindness to potential losses drives thrill and adventure seekers. Neuroimage 59, 2850–2859. doi: 10.1016/j.neuroimage.2011.09.048

PubMed Abstract | Crossref Full Text | Google Scholar

Lauriola, M., Panno, A., Levin, I. P., and Lejuez, C. W. (2014). Individual differences in risky decision making: a meta-analysis of sensation seeking and impulsivity with the balloon analogue risk task. J. Behav. Decis. Making 27, 20–36.

Google Scholar

Lee, H., Ryu, D., and Son, J. (2021). Risk-adjusted valuation in the worker’s economic decision making. Finance Res. Lett. 46:102408. doi: 10.1016/j.frl.2021.102408

Crossref Full Text | Google Scholar

Lejuez, C. W., Read, J. P., Kahler, C. W., Richards, J. B., Ramsey, S. E., Stuart, G. L., et al. (2002). Evaluation of a behavioral measure of risk taking: the Balloon Analogue Risk Task (BART). J. Exp. Psychol. Appl. 8:75. doi: 10.1037//1076-898x.8.2.75

PubMed Abstract | Crossref Full Text | Google Scholar

Luman, M., Van Meel, C. S., Oosterlaan, J., and Geurts, H. M. (2012). Reward and punishment sensitivity in children with ADHD: validating the sensitivity to punishment and sensitivity to reward questionnaire for children (SPSRQ-C). J. Abnorm. Child Psychol. 40, 145–157.

Google Scholar

Markanday, A., and Galarraga, I. (2021). The cognitive and experiential effects of flood risk framings and experience, and their influence on adaptation investment behaviour. Clim. Risk Manag. 34:100359. doi: 10.1016/j.crm.2021.100359

PubMed Abstract | Crossref Full Text | Google Scholar

Narayanan, N. S., Cavanagh, J. F., Frank, M. J., and Laubach, M. (2013). Common medial frontal mechanisms of adaptive control in humans and rodents. Nat. Neurosci. 16, 1888–1895. doi: 10.1038/nn.3549

PubMed Abstract | Crossref Full Text | Google Scholar

Nash, K., and Knoch, D. (2016). Individual Differences in Decision-Making: A Neural Trait Approach to Study Sources of Behavioral Heterogeneity, Neuroeconomics. Berlin: Springer, 191–209.

Google Scholar

Neuser, M. P., Kühnel, A., Svaldi, J., and Kroemer, N. B. (2020). Beyond the average: the role of variable reward sensitivity in eating disorders. Physiol. Behav. 223:112971. doi: 10.1016/j.physbeh.2020.112971

PubMed Abstract | Crossref Full Text | Google Scholar

O’Doherty, J. P., Cockburn, J., and Pauli, W. M. (2017). Learning, reward, and decision making. Annu. Rev. Psychol. 68, 73–100. doi: 10.1146/annurev-psych-010416-044216

PubMed Abstract | Crossref Full Text | Google Scholar

Patzelt, E. H., Hartley, C. A., and Gershman, S. J. (2018). Computational phenotyping: using models to understand individual differences in personality, development, and mental illness. Pers. Neurosci. 1:e18. doi: 10.1017/pen.2018.14

PubMed Abstract | Crossref Full Text | Google Scholar

Polezzi, D., Sartori, G., Rumiati, R., Vidotto, G., and Daum, I. (2010). Brain correlates of risky decision-making. Neuroimage 49, 1886–1894. doi: 10.1016/j.neuroimage.2009.08.068

PubMed Abstract | Crossref Full Text | Google Scholar

Ratcliff, R., and Frank, M. J. (2012). Reinforcement-based decision making in corticostriatal circuits: mutual constraints by neurocomputational and diffusion models. Neural Comput. 24, 1186–1229. doi: 10.1162/NECO_a_00270

PubMed Abstract | Crossref Full Text | Google Scholar

Rawls, E., Miskovic, V., Moody, S. N., Lee, Y., Shirtcliff, E. A., and Lamm, C. (2020). Feedback-related negativity and frontal midline theta reflect dissociable processing of reinforcement. Front. Hum. Neurosci. 13:452. doi: 10.3389/fnhum.2019.00452

PubMed Abstract | Crossref Full Text | Google Scholar

Rescorla, R. A. (1972). Informational variables in Pavlovian conditioning. Psychol. Learn. Motiv. 6, 1–46.

Google Scholar

Ross, M. (1975). Salience of reward and intrinsic motivation. J. Pers. Soc. Psychol. 32, 245. doi: 10.1037/0022-3514.32.2.245

Crossref Full Text | Google Scholar

Sambrook, T. D., and Goslin, J. (2016). Principal components analysis of reward prediction errors in a reinforcement learning task. Neuroimage 124 (Pt. A), 276–286. doi: 10.1016/j.neuroimage.2015.07.032

PubMed Abstract | Crossref Full Text | Google Scholar

Smillie, L. D., Pickering, A. D., and Jackson, C. J. (2006). The new reinforcement sensitivity theory: implications for personality measurement. Pers. Soc. Psychol. Rev. 10, 320–335. doi: 10.1207/s15327957pspr1004_3

PubMed Abstract | Crossref Full Text | Google Scholar

Sutton, R. S., and Barto, A. G. (2018). Reinforcement Learning: An Introduction. Cambridge, MA: MIT press.

Google Scholar

Takács, Á, Kóbor, A., Janacsek, K., Honbolygó, F., Csépe, V., and Németh, D. (2015). High trait anxiety is associated with attenuated feedback-related negativity in risky decision making. Neurosci. Lett. 600, 188–192.

Google Scholar

Walsh, M., and Anderson, J. (2011). Learning from delayed feedback: neural responses in temporal credit assignment. Cogn. Affect. Behav. Neurosci. 11, 131–143. doi: 10.3758/s13415-011-0027-0

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, J., Chen, J., Yang, L., and Gao, S. (2013). Meta-analysis of the relationship between sensation seeking and internet addiction. Adv. Psychol. Sci. 21:1720. doi: 10.3724/SP.J.1042.2013.01720

PubMed Abstract | Crossref Full Text | Google Scholar

Whitton, A. E., Treadway, M. T., and Pizzagalli, D. A. (2015). Reward processing dysfunction in major depression, bipolar disorder and schizophrenia. Curr. Opin. Psychiatry 28:7. doi: 10.1097/YCO.0000000000000122

PubMed Abstract | Crossref Full Text | Google Scholar

Wu, X., Wang, T., Liu, C., Wu, T., Jiang, J., Zhou, D., et al. (2017). Functions of learning rate in adaptive reward learning. Front. Hum. Neurosci. 11:592. doi: 10.3389/fnhum.2017.00592

PubMed Abstract | Crossref Full Text | Google Scholar

Xu, S., Luo, L., Xiao, Z., Zhao, K., Wang, H., Wang, C., et al. (2019). High sensation seeking is associated with behavioral and neural insensitivity to increased negative outcomes during decision-making under uncertainty. Cogn. Affect. Behav. Neurosci. 19, 1352–1363. doi: 10.3758/s13415-019-00751-x

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, X., Song, Z., and Wang, Y. (2021). “Reinforcement learning-based Kalman filter for adaptive brain control in brain-machine interface,” in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Virtual Conference, United States, 6619–6622.

Google Scholar

Zhong, N., Chen, T., Zhu, Y., Su, H., Ruan, X., Li, X., et al. (2020). Smaller feedback-related negativity (FRN) reflects the risky decision-making deficits of methamphetamine dependent individuals. Front. Psychiatry 11:320. doi: 10.3389/fpsyt.2020.00320

PubMed Abstract | Crossref Full Text | Google Scholar

Zuckerman, M. (1994). Behavioral Expressions and Biosocial Bases of Sensation Seeking. Cambridge, MA: Cambridge University Press.

Google Scholar

Zuckerman, M. (2007). The sensation seeking scale V (SSS-V): still reliable and valid. Pers. Individ. Differ. 43, 1303–1305. doi: 10.1016/j.paid.2007.03.021

Crossref Full Text | Google Scholar

Zuckerman, M. (2013). Sensation Seeking in Entertainment. New York, NY: Routledge.

Google Scholar

Zuckerman, M. (2015). Sensation Seeking: Behavioral Expressions and Biosocial Bases. Philadelphia, PA: Elsevier, University of Delaware.

Google Scholar

Zuckerman, M., and Aluja, A. (2015). Measures of Sensation Seeking, Measures of Personality and Social Psychological Constructs. New York, NY: Elsevier, 352–380.

Google Scholar

Keywords: sensation seeking, risk adjustment, reward sensitivity, risky decisions, cognitive model, neural activity

Citation: Qianlan Y, Shou C, Tianya H, Wei D and Liu T (2025) Sensation seeking and risk adjustment: the role of reward sensitivity in dynamic risky decisions. Front. Behav. Neurosci. 19:1492312. doi: 10.3389/fnbeh.2025.1492312

Received: 06 September 2024; Accepted: 16 January 2025;
Published: 07 February 2025.

Edited by:

Alessandra Maria Passarotti, University of Illinois Chicago, United States

Reviewed by:

Joseph Glicksohn, Bar-Ilan University, Israel
MoonSoo Lee, Korea University, Republic of Korea

Copyright © 2025 Qianlan, Shou, Tianya, Wei and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dong Wei, c29waGllZG9uZ3dlaUAxNjMuY29t; Taosheng Liu, bGl1dGFvc2hlbnZAc21tdS5lZHUuY24=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.