A Novel, Fast, Reliable, and Data-Driven Method for Simultaneous Single-Trial Mining and Amplitude—Latency Estimation Based on Proximity Graphs and Network Analysis

Dimitriadis, Stavros I.; Brindley, Lisa; Evans, Lisa H.; Linden, David E.; Singh, Krish D.

doi:10.3389/fninf.2018.00059

ORIGINAL RESEARCH article

Front. Neuroinform., 19 November 2018

Volume 12 - 2018 | https://doi.org/10.3389/fninf.2018.00059

A Novel, Fast, Reliable, and Data-Driven Method for Simultaneous Single-Trial Mining and Amplitude—Latency Estimation Based on Proximity Graphs and Network Analysis

Stavros I. Dimitriadis^1,2,3,4,5,6^*

Lisa Brindley⁷

Lisa H. Evans^1,4

David E. Linden^1,3,4,5,6

Krish D. Singh^1,4

¹Cardiff University Brain Research Imaging Centre, School of Psychology, Cardiff University, Cardiff, United Kingdom
²Neuroinformatics Group, Cardiff University Brain Research Imaging Centre, School of Psychology, Cardiff University, Cardiff, United Kingdom
³Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, United Kingdom
⁴School of Psychology, Cardiff University, Cardiff, United Kingdom
⁵Neuroscience and Mental Health Research Institute, School of Medicine, Cardiff University, Cardiff, United Kingdom
⁶MRC Centre for Neuropsychiatric Genetics and Genomics, School of Medicine, Cardiff University, Cardiff, United Kingdom
⁷Department of Psychology, Cardiff Metropolitan University, Cardiff, United Kingdom

Both amplitude and latency of single-trial EEG/MEG recordings provide valuable information regarding functionality of the human brain. In this article, we provided a data-driven graph and network-based framework for mining information from multi-trial event-related brain recordings. In the first part, we provide the general outline of the proposed methodological approach. In the second part, we provide a more detailed illustration, and present the obtained results on every step of the algorithmic procedure. To justify the proposed framework instead of presenting the analytic data mining and graph-based steps, we address the problem of response variability, a prerequisite to reliable estimates for both the amplitude and latency on specific N/P components linked to the nature of the stimuli. The major question addressed in this study is the selection of representative single-trials with the aim of uncovering a less noisey averaged waveform elicited from the stimuli. This graph and network-based algorithmic procedure increases the signal-to-noise (SNR) of the brain response, a key pre-processing step to reveal significant and reliable amplitude and latency at a specific time after the onset of the stimulus and with the right polarity (N or P). We demonstrated the whole approach using electroencephalography (EEG) auditory mismatch negativity (MMN) recordings from 42 young healthy controls. The method is novel, fast and data-driven succeeding first to reveal the true waveform elicited by MMN on different conditions (frequency, intensity, duration, etc.). The proposed graph-oriented algorithmic pipeline increased the SNR of the characteristic waveforms and the reliability of amplitude and latency within the adopted cohort. We also demonstrated how different EEG reference schemes (REST vs. average) can influence amplitude-latency estimation. Simulation results revealed robust amplitude-latency estimations under different SNR and amplitude-latency variations with the proposed algorithm.

Introduction

A prerequisite for the studying of evoked potentials (EPs) is the distinction of the true brain's response due to a stimulus from the brain ongoing activity. To uncover true brain activity, a large number of single trials (STs) should be collected and averaged to reveal the brain's response waveform. The assumptions that single-trials are time-locked and contaminated by Gaussian noise of zero-mean are both oversimplified (Laskaris et al., 2004). For example, the brain state of each subject changes from time moment to time moment due to shifts of attention and the fatigue level, while habituation during the task and/or previous incidental experience with the nature of the task are significant factors that alter behavior even in short duration recordings (Laskaris and Ioannides, 2001; Laskaris et al., 2003).

Both single-trial amplitude and latency of EEG/MEG signals contain valuable information regarding brain functionality in various conditions and targeted groups. For example, increased latency variation may be associated with: ADHD (De Pascalis et al., 2008), aging (Fein and Turetsky, 1989; Fjell et al., 2009), IQ scores (Geurts et al., 2008), mild cognitive impairment (MCI; Laskaris et al., 2013), and in psychosis (Bodatsch et al., 2011).

Exploring single-trial differences between groups and/or conditions demands a proper unbiased manipulation of single-trials in order to extract reliable amplitude and latency. This is a non-trivial and demanding task for brain responses given the complexity of both brain activity and the acquired EEG/MEG recordings due to low signal-to-noise ratio (SNR) of single-trial EEG/MEG responses (Fein and Turetsky, 1989; Laskaris and Ioannides, 2001) and are usually integrated signals derived from multiple brain processes (Gevins, 1984).

Only a few exploratory studies attempted to convey information from STs. To deal with the poor SNR, different methods have been proposed in the past. The basic characteristic of previous techniques to solve the aforementioned issue based on classification and categorization of single-trials (Zouridakis et al., 1997; Geva, 1998; Lange et al., 2000). The final outcome of this procedure is the categorization of STs into homogeneous classes. Each of these classes may reflect different brain behavior like spontaneous reaction time, anticipation or reflect the variability of the regional response dynamics (Laskaris et al., 2003). Complementary, Laskaris et al., proposed a summarization of STs via Voronoi testellation procedure, minimal spanning tree, and Breadth-first graph (BFS) search procedure in order to reorder prototypical responses (Laskaris et al., 2004). The ordered prototypes reflected the variability of the single-trials while their source localization of neuromagnetic recordings with Magnetic Field Tomography (MFT) algorithm succeeded to link this variability with the related sources on different brain areas and time windows. MFT is a non-linear solution of the ill-posed biomagnetic inverse problem and it is applied independently to each single snapshot (timeslice) of either resting-state activity or single trial (or average) magnetoencephalographic (MEG) brain signal. Complementary, they very first mentioned that the ongoing activity before the onset of the stimulus is functionally coupled with the subsequent regional response (Laskaris and Ioannides, 2001; Laskaris et al., 2003). Recently, they demonstrated how the “reflex level” of spontaneous activity of various cognitive subsystems shape the brain activity during cognitive tasks stimulating the same subareas (Cole et al., 2016).

Several methods have examined STs with the goal of extracting the related amplitudes and latencies. These methods can be categorized into two groups: the ones that need an a priori template and those with no waveform constraints. A few methods need the shape of the target signal which should be defined a priori (for example, Woody, 1967; Mayhew et al., 2006). Second, only a few methods allowed the free variability of STs (for example, Pham et al., 1987; Laskaris et al., 2004), whereas others incorporate in the analytic pathway the constraint of both types of variation (for example, Jaskowski and Verleger, 1999). Third, few methods assume that the data comes from a single signal (for example, Pham et al., 1987; Jaskowski and Verleger, 1999) whereas others allow multi-trials to have their own amplitude and latency (for example, Laskaris et al., 2004; Mayhew et al., 2006; Da Pelo et al., 2018). Methodologies that analyze every trial as a unique brain response are on the right place compared to averaging across all trials. Variability of single-trials is very informative for amplitude and latency estimation and should be treated properly. Fourth, others methods provide algorithms that are susceptible to noise (Jaskowski and Verleger, 2000), whereas for others this susceptibility is reduced by incorporating basis functions. Disadvantages of the aforementioned are either the a priori selection of a template waveform derived from grand-averaged time series (Hu et al., 2011) and/or the low performance in low SNR conditions.

In optimal scenarios, principal component analysis (PCA) could be used for mining electroencephalographic and magnetoencephalographic responses (Friston et al., 1996). Alternatively, independent component analysis (ICA) could be adopted for dimensionality reduction and learning purposes of multi-trial responses (Makeig et al., 1996).

The purpose of the present paper is to demonstrate a fast, reliable, and completely data-driven methodology based on data mining, graph, and network analysis in order to reveal “true” variability of the single-trials and accurate detection of amplitude and latency linked to responses on specific stimuli. It is more than evident that single-trials are noisy even in the most optimal scenarios and experimental protocols. The motivation for the presented algorithmic steps arose after analyzing EEG STs from the famous mismatch negativity (MMN) auditory task (Näätänen et al., 2004). Single-trials were completely noisy, missing even a clear peak across trials and the multi-feature paradigm. Our analysis combined a member of proximity graph called Gabriel graph (GG) and network analysis to reveal prototypical single-trials covering the whole space of their variability and then mixed them into a combined characteristic single-trial per condition. The proposed analysis is an appropriate tool for geometrical data and vectorial pattern analysis of single-trials.

The scope of our analysis on the adopted MMN paradigm for demonstration of the methodology focused on optimizing the SNR of the selected single-trials under the objective criterion to reveal the best type of filter (IIR/FIR), its order and the degree k of the GG single-trial network that choose the number of selected single-trials.

Methods

Feature Extraction

Firstly, we construct a similarity distance matrix between every pair of ST using the distance correlation estimator (Székely and Rizzo, 2009). The distance matrix called hereafter DM tabulates the distance between the temporal variability of two time series. The distance correlation is a measure of statistical dependence between two random variables or random vectors.

Embedding in a Feature Space

After constructing the DM derived by the pair-wise estimation of the temporal variability of STs, we embedded the DM in a 2D feature space. Here, we adopted multidimensional scaling (MDS) a high popular dimensionality technique among neuroscientists. This approach will help us to detect and visualize the variability of STs within a common embedded feature space.

Constructing a Proximity Graph on the Embedded Space

A proximity graph is simply a graph in which two vertices are connected by an edge if and only if the vertices satisfy particular geometric requirements. “Proximity” here means spatial distance. Many of these graphs can be formulated with respect to many metrics, but the Euclidean metric is used most frequently. Here, we used the Euclidean distance as a proper spatial distance metric.

Let L(p,q) be the intersection of the circle about p with a radius of dist(p,q) and the circle about q with a radius of dist(q,p). This is called a lune. The relative neighborhood graph RNG(V) of a set of points V, is the graph that has an edge (p,q) if and only if the intersection of L(p,q) and V is empty (Figure 1A).

FIGURE 1

Figure 1. Proximity graphs. The relationship between relative neighborhood graph (RNG), Gabriel graph (GG), Euclidean minimal spanning tree (EMST), and Delaunay triangulation (DT). (A) Relationships between proximity graphs. (B) Schematic illustration of proximity graphs with five nodes.

Let C(p,q) be the circle centered on the point halfway between p and q, and with a radius of half the distance between p and q. The GG of a set of points V, GG(V), is the graph that has an edge (p,q) if and only if the intersection of C(p,q) and V is empty (Figure 1B).

In mathematics, the GG of a set S of points in the Euclidean plane expresses one notion of proximity or nearness of those points. The GG is a subgraph of the Delaunay triangulation (Matula and Sokal, 1980). Complementary, the GG contains as a subgraph the Euclidean minimum spanning tree, the RNG, and the nearest neighbor graph (Gabriel and Sokal, 1969; see Figure 1).

If we also consider the Euclidean minimum spanning tree (which is a tree that minimizes the total edge length connecting all points) and the Delauney triangulation (which maximizes the minimum angle over all triangulations of a set of points), we get the following relationship:

\begin{matrix} E M S T \subseteq R N G \subseteq G G \subseteq D T & (1) \end{matrix}

Mathematically in GG, two points i and j are connected if the square of the distance between them, $d_{i j}^{2}$ , is less than the sum of the squared distance between each of these points and any other point k. Under GG main rule, we connect i and j if $d_{i j}^{2} ≺ d_{i k}^{2} ≺ d_{j k}^{2} (2)$ for all k points. Schematically, we demonstrated Gabriel's rule in Figure 2.

FIGURE 2

Figure 2. Geometric illustration of Gabriel's rule regarding the connection of two points.

Figure 3 illustrates a right (Figure 3A) and a wrong (Figure 3B) connections under the main rule of GG. Figure 3C illustrates a GG produced by 100 random points in a 2D plain. The main outcome of this approach is the construction of a GG in the 2D feature space where nodes are the single-trials. Two STs are connected if within the circle passes from their 2D coordinates no other ST is encapsulated. With this approach, GG notion demands to cover the feature space and to sample single-trial variability without over-representing. One can see GG as a denoising procedure to manipulate properly single-trials. GG captures the backbone of single-trials in the embedded space.

FIGURE 3

Figure 3. (A) Right, (B) wrong connected points in GG, and (C) the GG of 100 random 2D points.

Network Analysis in Gabriel Graph (GG)

The construction of a connected GG on the 2D embedded space of single-trials opens the window to adopt well-known approachsed derived from network theory. Here, in order to detect representative prototypical STs, we used the degree of each node in GG in order to detect the hubs. The degree of a node is a trivial network metric which describes the number of direct neighbors of each node. In simple words, degree counts the number of nodes with which each node is directly connected. Here, we optimized the selection of degree k using as an optimized objective criterion the increment of SNR of grand average single-trial.

Grand Average of Single Trials

After selecting the prototypical STs that simultaneously capture the variability of STs, we estimated their grand-average. The selection of prototypical single-trials based on the selection of degree k from the GG with the aim to improve the SNR.

Estimation of Amplitude–Latency

Based on the grand average signal constructed by averaging the prototypical STs, we estimated the amplitude and latency. Both amplitude and latency were extracted completely data-driven by detecting prominent peak from the whole time series as a global maxima.

Estimating the Variability of Single-Trials

To access the variability of STs, we estimated the global efficiency (GE) on a network level based on the subgraph defined by the nodes of GG linked to the extracted prototypical single-trials.

Global efficiency (GE) for a network W of N × N nodes is the inverse of the harmonic mean of the shortest path length between each pair of nodes and reflects the overall efficiency of parallel information transfer in the network (Latora and Marchiori, 2001).

\begin{array}{l} G E = \frac{1}{N} \sum_{i \in N} \frac{Σ_{j \in N, j \neq i} {(d_{i j})}^{- 1}}{N - 1} & (2) \end{array}

Here as W, we used the GG while N denotes the prototypical STs.

MATLAB Toolbox

A MATLAB toolbox will be released from the author's website, researchgate and github upon acceptance of the paper (https://github.com/stdimitr/GG_SINGLE_TRIALS_MINING/tree/master). We will demonstrate the pipeline with a few recordings which will be available to any researcher.

Influence of Reference on Waveform, Amplitude, and Latency Estimation in Single-Trial Analysis

The influence of the reference is a critical issue for electroencephalography (EEG) and event-related potentials (ERPs) studies. It seems that brain connectivity and network analysis is more robust compared to the estimation of single power (Dimitriadis et al., 2010). A recent study proposed the infinity REST reference as an appropriate common reference system for EEG analysis (Yao, 2001; Yao et al., 2005; Qin et al., 2010; Chella et al., 2016; Huang et al., 2017). Another study compared different EEG reference systems in different simulation scenarios at both sensor and source level. They demonstrated REST infinity reference is the most preferable system across the highly used reference systems in the literature (Lei and Liao, 2017). Here, we adopted also REST reference system in comparison with the average system.

Alternative Single-Trial Mining Algorithms

To demonstrate the superiority and the simplicity of our method compared to others, we repeated the whole analysis using PCA, singular value decomposition (SVD), and multi-linear regressor analysis (Hu et al., 2011). Both methodologies have been applied using average and REST reference.

Simulations

We simulated the original responses using multivariate autoregressive model (MVAR) (Anderson et al., 1998) and optimized the model selection with Akaike criterion (Aikake, 1974). The simulations followed two scenarios: In the first one, both amplitude and latency parameters were the same for the peak while in the second one both amplitude-latency varies independently apart from the peak. Practically, in the first scenario, the peaks were shifted and scaled by the same amount while in the second within each trial the peak was shifted and scaled by different values.

We simulated 42 datasets (equals the number of the subjects) each consisting of 128 trials with 205 samples (400 ms). The peak varied over trials in amplitude (lognormal distributed with mean 1 and st.d. 1.2, restricted between low and high values of the empirical dataset), and latency (normally distributed with mean 0 and st.d. 150, 170, or 170 ms). The simulation was based on recordings derived from the FZ sensor at direction-left (DIR-L) condition and for deviant-minus-standard stimulus.

All simulations were performed using MVAR for the estimated waveform under three signal-to-noise (SNR) conditions (SNR = 0.5, 1, and 2), using correlated noise. Noise was simulated using an AR(5) process with coefficients estimated from baseline trials of the empirical data.

Empirical Application in an Auditory Mismatch Negativity (MMN) Protocol

The proposed methodology is demonstrated in an auditory MMN multi-feature paradigm developed byNäätänen et al. (2004). The MMN peaks at about 100–300 ms after change onset but this latency varies slightly according to the specific paradigm or the type of regularity that is violated. According to the adopted protocol, MMN is usually evoked by a change of frequency (Low-High), direction (Low-High), intensity (Low-High), duration, and gap, for both standard and deviant stimuli (Näätänen et al., 2004). Two standard tones preceded every deviant tone. Each condition was recorded in 128 trials while the protocol was designed such as to avoid any habituation of the sequence. The total number of trials was 8 × 128 for standard trials (averaged each pair of standard trials) and 2 × 128 for deviants.

We recorded a total of 42 subjects (age = 23.75 ± 1.28, 24 females) using a BIOSEMI system with 64 channels (10–20 System; Jasper, 1958). Additional electrodes were placed on the mastoid processes. The electrooculogram (EOG) was recorded from above and below the left eye [vertical (V)EOG] and from the outer canthi [horizontal (H)EOG]. The electroencephalogram (EEG; range DC-419 Hz; sampling rate 2,048 Hz) was acquired referenced to linked electrodes located midway between POz and PO3/PO4, respectively, and was re-referenced off-line to the average of the signal at the mastoids. Trials containing large EOG artifact were rejected, as were trials containing A/D saturation or baseline drift exceeding 80 μV. Prior to any further analysis, we corrected the multichannel recordings from artifacts (muscle, blinks, cardiac) using ICA with EEGLAB (Delorme and Makeig, 2004; Dimitriadis et al., 2015, 2016a,b). Data were filtered off-line (0.5–45 Hz) and down-sampled to 512 Hz, resulting in an epoch of 400 ms after the onset of the stimulus or 205 samples. DC offset was removed by subtracting from each channel each low pass filtered component using FIR filter [Roll off 0.001–0.05 Hz, butter, (IIR), 6 dB attenuation in the stop band]. Afterward, each trial was corrected with the baseline.

Our analysis focused on midline FCZ, FZ, and CZ EEG sensors. We assessed the reliability of the proposed techniques in terms of amplitude, latency, and signal power analysis.

All the subjects who participated in this study gave written informed consent. The whole study has been approved by the ethical committee in School of Psychology in Cardiff University as part of a big multi-modal study.

Our analysis on the adopted MMN paradigm for demonstration of the methodology focused on optimizing the SNR of the selected single-trials under the objective criterion to reveal the best type of filter (IIR/FIR), its order and the degree k applied to GG single-trial network for the selection of the representative singe-trials was the SNR (see next section).

Results

The Proposed Methodology in Simple Steps

We plot all the trials of a representative condition (high intensity–deviant) from a subject (Figure 4A). Afterward, we estimated with distance correlation (Székely and Rizzo, 2009), the pair-wise associations of single-trials tabulated in a similarity matrix (Figure 4B). Then, we embedded this similarity matrix in a 2D space with MDS algorithm in order to visualize the variability of single-trials (Figure 4C). Using the 2D points of the trials as an input to the GG, we constructed the GG demonstrated in Figure 4D. Green lines represent the connected trials under the notion of GG. To sample the right representative single-trials, we estimated the degree k of each node in GG. We used two criterion to uncover the hubs on this GG based on the degree of each node: degree k = 1–4. In Figures 4E,G, we demonstrated with red circles the selected trials/nodes in the GG. We selected the hubs nodes/trials based on their degree in the GG network on the assumption that these trials encapsulate the variability of the single-trials out sharing redundant information and also being on the core of variability avoiding the selection of outliers in the periphery of single-trials 2D projection.

FIGURE 4

Figure 4. Butterworth IIR order 1: Outline of the proposed graph-based methodology (High intensity condition—deviant). (A) Plot of single trials from a single subject. (B) Similarity matrix that tabulates the pair-wise associations of single-trials with distance correlation metric. (C) Embedding the similarity matrix in (B) in a 2D space with multidimensional scaling (MDS) algorithm where each blue dot refers to a single-trial. (D) Construct the Gabriel graph (GG) based on the 2D positions of the single trials. Green lines represent the connections under the notion of GG. (E) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 3 to detect the single trials representing with red circles. (F) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (E). We selected 41 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. (G) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 4 [compared to 3 in (E)] to detect the single trials representing with red circles. (H) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (G). We selected 15 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. Amplitude and latency are estimated in the grand-averaged based on the global maxima.

Finally, we estimated the grand-averaged trial by averaging the selected single-trials with aforementioned network-based criterion with the objective criterion of improving the SNR of the selected single-trials. The resulting single-trial in both cases is presented in Figure 4F,H. Based on the example in Figure 4 where we used a Butterworth filter of order 1 (order since we used zero-phase filtering with filtfilt.m function of Matlab), the best result obtained with degree k ≥ 4 where we selected 15 trials from the 128. We detected a negative peak around 150 ms after the onset of the stimulus (Figure 6H). In the next section, we will demonstrate the effect of filter type, order and the selected degree in our network using as an objective criterion the improvement of SNR across the selected single-trials with our methodology.

It is important to mention here that we applied the methodology independently for standard, deviant, and deviant-minus-standard. As a promiment characteristic peak, we revealed the dominant positive for deviant, standard, and negative for deviant-minus-standard after 100 ms of the onset of the stimulus.

The Effect of FIR/IIR Filter Settings

In Figures 4–6, we demonstrated the steps of the proposed fast, reliable, and data-driven methodology under a graph-based framework. We revealed that both the type of the filter (FIR/IIR) and its order can alter the characteristic waveform for each condition and subject.

We used the eegfilt matlab function a provided in EEGLAB for FIR filtering of single-trials (Delorme and Makeig, 2004) and the butter MATLAB function for IIR filtering. We used a zero-phase filter in both cases applied on the concatenated trials separately for each stimulus (standard or deviant) and for each subject. The effect of filter with Butterworth bandpass filter can be seen on the representative time series in Figure 4H (order 1) vs. Figure 5H (order 2). Order of one gave the best results for Butterworth bandpass IIR filter. In contrary, the best result for FIR filter using eegfilt function was obtained with order 2 (Figure 6H). In Figure 4H, the characteristic negative was detected around 100 ms after the onset of the stimulus while in Figure 6H, the negative peak located 150 ms after the onset of the stimulus. The effect of type and order of filter was demonstrated in high intensity condition for deviant stimulus from a single subject. Our analysis on the adopted MMN paradigm for demonstration of the methodology revealed as the best option based on SNR for filtering the FIR using eegfilt function and with order 2. The main objective criterion to reveal the best type of filter (IIR/FIR), order and degree k for the selection of the representative singe-trials was the SNR (see next section).

FIGURE 5

Figure 5. Butterworth IIR order 2: Outline of the proposed graph-based methodology (High intensity condition—deviant). (A) Plot of single trials from a single subject. (B) Similarity matrix that tabulates the pair-wise associations of single-trials with distance correlation metric. (C) Embedding the similarity matrix in (B) in a 2D space with multidimensional scaling (MDS) algorithm where every blue dot represents a single-trial. (D) Construct the Gabriel graph (GG) based on the 2D positions of the single trials. Green lines represent the connections under the notion of GG. (E) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 3 to detect the single trials representing with red circles. (F) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (E). We selected 40 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. (G) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 4 [compared to 3 in (E)] to detect the single trials representing with red circles. (H) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (G). We selected 25 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. Amplitude and latency are estimated in the grand-averaged based on the global maxima.

FIGURE 6

Figure 6. FIR order 2: Outline of the proposed graph-based methodology (High intensity condition—deviant). (A) Plot of single trials from a single subject. (B) Similarity matrix that tabulates the pair-wise associations of single-trials with distance correlation metric. (C) Embedding the similarity matrix in (B) in a 2D space with multidimensional scaling (MDS) algorithm where every blue dot represents a single-trial. (D) Construct the Gabriel graph (GG) based on the 2D positions of the single trials. Green lines represent the connections under the notion of GG. (E) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 3 to detect the single trials representing with red circles. (F) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (E). We selected 44 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. (G) Detection of hubs-representative single-trials based on their degree k in the GG. Here, we selected k = 4 [compared to 3 in (E)] to detect the single trials representing with red circles. (H) Characteristic grand-averaged single-trial derived from the averaging of the selected hubs/single trials (red circles) in (G). We selected 15 signals from 128 trials. Blue waveform denotes the grand-average from the whole set of trials while the black from the selected single-trials. Amplitude and latency are estimated in the grand-averaged based on the global maxima.

Improvement of Signal-To-Noise Ratio With the Proposed Methodology

We evaluated the selection of the filter type, its order and the degree k in GG for the selection of single-trials independently for each condition, standard/deviant, subject and recording EEG sensors (FZ, FCZ, CZ). The parameters were filter type (FIR/IIR), order (1,2,3), and degree k (1–4). Finally, we estimated the SNR from the selected single-trials via the proposed methodology, adopting a formula previously proposed (Laskaris et al., 2004). We scored each of the 2 × 3 × 4 = 24 different sets of parameters across conditions (8 deviant and 8 standard stimuli = 16) and recording EEG sites (three locations) the number of times where the SNR was maximum across the 24 sets. Formula (3) describes the objective criterion for the selection of best settings for each subject across the 24 different combinations. Our results demonstrated clearly a maximization of SNR for every subject with FIR filter of order 2 and with most of the cases (39 out of 42) with degree k ≥ 4.

\begin{matrix} S c o r e = \frac{\sum_{f i l t e r = 1}^{2} \sum_{o r d e r = 1}^{3} \sum_{\deg r e e = 1}^{4} \arg \max S N R (16 s t i m u l i, 3 r e c o r d i n g s i t e s)}{16 s t i m u l i x 3 s t i m u l i} & (3) \end{matrix}

The group-averaged score was 96.34 with standard deviation 2.31 with the best choice for the FIR filter (42 out of 42 subjects), order 2 (42 out of 42 subjects), and k = 4 (39 out of 42 subjects and 3 with k = 3). Table 1 summarizes the group-averaged SNR from each stimulus and EEG sensor location for the standard stimuli. We presented results from FZ location where the majority of group-differences in terms of amplitude, latency, signal power, and variability were more pronounced compared to FCZ and CZ.

TABLE 1

Table 1. Group-averaged SNR for each condition across the three selected EEG sensors for standard stimuli.

Results of SNR for the grand-averaged signal was <1 and one can see in Figures 3–6, it is a bad strategy to estimate a peak for this noisy averaged trial (blue line in Figures 4–6F,H).

Reliability of Amplitude, Latency, and Signal Power

We assessed the reliability of amplitude, latency and signal power estimates for each MMN feature, EEG sensors and for standard, deviant, and deviant-minus-standard with the coefficient of variation (CV). The CV was estimated as follow:

\begin{array}{l} C V = \frac{g r o u p m e a n (a m p l i t u d e, l a t e n c y)}{g r o u p s t d (a m p l i t u d e, l a t e n c y)} & (4) \end{array}

Amplitude and Latencies

Tables 2–4 demonstrated the group mean amplitude for standard, deviant, and deviant – standard for each condition of the MMN experimental protocol and for the three EEG sensors. We estimated the CV (Formula 4) across the cohort for every MMN feature for standard, deviant, and deviant – standard and for FZ (Table 2), FCZ (Table 3), and CZ (Table 4) EEG sensors. It is obvious that CV of the amplitude was higher for FZ EEG sensor.

TABLE 2

Table 2. FZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 3

Table 3. FCZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 4

Table 4. CZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

Tables 5–7 demonstrated the group mean latencies for standard, deviant, and deviant – standard for each condition of the MMN experimental protocol and for the three EEG sensors. We estimated the CV across the cohort for every MMN feature for standard, deviant, and deviant – standard and for FZ (Table 5), FCZ (Table 6), and CZ (Table 7) EEG sensors. It is obvious that CV of the latency was higher for FZ EEG sensor.

TABLE 5

Table 5. FZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 6

Table 6. FCZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 7

Table 7. CZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

We repeated the whole analysis by selecting a subset of single-trials from each condition and subject starting from the first 20% of the trials per condition till 100% with a step of 5% in order to explore how the number of trials affect amplitude/latency estimations. We revealed that the CV of amplitude/latency reached high values close to the ones tabulated in Tables 2–7 when the number of trials ranged between [85–95%] of the total amount of STs. The aforementioned results underline the importance of detect significant true amplitude/latency estimations in MMN paradigm.

Signal Power

Figures 7–9 demonstrated the group mean signal power for standard, deviant, and deviant – standard for each condition of the MMN experimental protocol and for the three EEG sensors. We estimated the signal power for every MMN feature for standard, deviant, and deviant – standard and for FZ (Figure 7), FCZ (Figure 8), and CZ (Figure 9) EEG sensors. In Figures 7–9, we demonstrated the signal power for each condition and std, dev, and std-dev for the whole set of trials and also for the selection of a subset of trials. CV of signal power was higher for the subset of trials compared to the whole set of trials.

FIGURE 7

Figure 7. FZ-Group-averaged signal power for each condition, for standard, deviants, and their difference (deviant-minus-standard) in both groups and across seven frequency bands. We demonstrate the signal power for the whole set of trials and for the selection of a subset of trials.

FIGURE 8

Figure 8. FCZ-Group-averaged signal power for each condition, for standard, deviants, and their difference (deviant-minus-standard) in both groups and across seven frequency bands. We demonstrate the signal power for the whole set of trials and for the selection of a subset of trials.

FIGURE 9

Figure 9. CZ-Group-averaged signal power for each condition, for standard, deviants, and their difference (deviant-minus-standard) in both groups and across seven frequency bands. We demonstrate the signal power for the whole set of trials and for the selection of a subset of trials.

Variability of Single-Trials via Global Efficiency Estimations

To assess the variability of STs, we estimated the global efficiency (GE) on a network level based on the subgraph defined by the nodes of GG linked to the extracted prototypical single-trials (red nodes in Figures 4, 5, 6E,G). We constructed the weighted graph by estimating the Euclidean distance between every pair of hub areas. Then, we normalized by the maximum value and we inversed the normalized weights in order to express functionality. This practically means that the higher the distance between two 2D points, the less efficient the communication. In a global level, the higher the GE the more efficient is the communication of the nodes within the network which further means that the nodes are closer in 2D points. So higher values of GE can be linked directly to less variability.

Table 8 tabulates the group-averaged GE for each condition in the three midline located EEG sensors for standard stimuli.

TABLE 8

Table 8. Group-averaged GE for each condition across the three selected EEG sensors for (deviant-minus-standard) stimuli.

Influence of EEG Reference System to Amplitude and Latency Estimation

We compared average to the REST reference system in terms of amplitude and latency estimation across EEG sensor locations and in standard, deviant, and standard-deviant stimuli. Comparing Tables 2–4 vs. Tables 9–11 for amplitude and Tables 5–7 vs. Tables 12–14 for latency, we revealed alterations of the group mean amplitude and latency. In both cases, the CV was too high while in some cases especially in the amplitude of deviant – standard, the CV was higher for the REST reference. For more detailed results, see section 1 in Supplementary Material.

TABLE 9

Table 9. FZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 10

Table 10. FCZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 11

Table 11. CZ EEG sensor: Group-averaged amplitude for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 12

Table 12. FZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 13

Table 13. FCZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

TABLE 14

Table 14. CZ EEG sensor: Group-averaged latency for each condition and for standard, deviants, and their difference (deviant-minus-standard).

Tables 12–14 demonstrated the group mean latencies for standard, deviant, and deviant – standard for each condition of the MMN experimental protocol and for the three EEG sensors. We estimated the CV across the cohort for every MMN feature for standard, deviant, and deviant – standard and for FZ (Table 5), FCZ (Table 6), and CZ (Table 7) EEG sensors. It is clear that CV of the latency was higher for the FZ EEG sensor.

Comparison With Alternative Mining Algorithms

We compared our methodology with PCA and multi-linear regressor analysis (Hu et al., 2011). The first one proposed a multiple linear regression (MLR) and multiple linear regression with dispersion term (MLRd) to estimate the single-trial latency and amplitude of ERP peaks. Regressors (an average and its temporal derivative) for each ERP peak are calculated from the average ERP waveform within a given post-stimulus interval (in this case, 0–0.3 s) for each subject. These regressors are then applied against each ST within the same post-stimulus interval and used to model each single-trial ERP peak. In MLRd, variability matrices that capture the variations of latency and morphology of each ERP peak are generated by simultaneously shifting and compressing the average ERP waveform (step 1). These variability matrices, whose order of trials (with the latency shifted and the morphology varied simultaneously) is of no importance, are fed to a PCA (step 2). The resulting three main principal components (PCs) are used to define three regressors for each peak within a given post-stimulus interval (in this case, 0–0.5 s; step 3). These regressors are then applied against each ST within the same post-stimulus interval and used to model each single-trial ERP peak (step 4). The methodology is explained in details in Hu et al. (2011).

Since the original methodology focused on the estimation of amplitude-latency per single-trial, we grand-averaged the single-trials after first applying the regressors.

The second one is PCA where we kept the first PCs that explained more than 95% of the variance of single-trials.

In Figure 10, we illustrate the resulting grand-averaged time series from subject 1 and stimulus DIR-L for standard, deviant, and deviant-minus-standard using the multi-linear regressor algorithm and the average reference system. Complementary, Figure 11 demonstrates the effect of REST reference on the grand-averaged time series illustrated in Figure 16. Both grand-averaged time series were extracted from FZ EEG sensor.

FIGURE 10

Figure 10. Grand-averaged response for DIR-L from subject 1 using multi-linear regressor analysis and average reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus.

FIGURE 11

Figure 11. Grand-averaged response for DIR-L from subject 1 using multi-linear regressor analysis and REST reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus.

In Figure 12, we illustrate the resulting grand-averaged time series from subject 1 and stimulus DIR-L for standard-deviant and deviant-minus-standard using PCA algorithm and the average reference system. On this exemplar, we demonstrate the 4th and 5th PC per case. Complementary, Figure 13 demonstrates the effect of REST reference on the grand-averaged time series illustrated in Figure 12. Both grand-averaged time series were extracted from FZ EEG sensor. We adopted the same stimulus, sensor location, and subject with multi-linear regressor analysis for comparison purposes between multi-linear regressor analysis and PCA. For further details, see section 2 in Supplementary Material.

FIGURE 12

Figure 12. Grand-averaged response for DIR-L from subject 1 using PCA analysis and average reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus. We plotted the characteristic time series derived from PCA analysis, the plain average from the whole set of time series and the time series derived from the proposed method.

FIGURE 13

Figure 13. Grand-averaged response for DIR-L from subject 1 using PCA analysis and REST reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus. We plotted the characteristic time series derived from PCA analysis, the plain average from the whole set of time series and the time series derived from the proposed method.

In Figure 14, we illustrate the resulting grand-averaged 4th and 5th singular time series from subject 1 and stimulus DIR-L for standard, deviant, and deviant-minus-standard using SVD algorithm and the average reference system. Complementary, Figure 15 demonstrates the effect of REST reference on the grand-averaged first two right singular time series illustrated in Figure 14. Both grand-averaged time series were extracted from FZ EEG sensor. We adopted the same stimulus, sensor location and subject with multi-linear regressor analysis for comparison purposes between multi-linear regressor analysis and PCA.

FIGURE 14

Figure 14. Grand-averaged of the 4th and 5th singular-values for DIR-L from subject 1 using SVD analysis and average reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus.

FIGURE 15

Figure 15. Grand-averaged of the 4th and 5th singular values for DIR-L from subject 1 using PCA analysis and REST reference system (FZ-Sensor). (A) Standard stimulus, (B) Deviant stimulus, (C) Deviant-minus-standard stimulus.

The three algorithms independently of the EEG reference system failed to detect an accurate amplitude and latency. The main reason is that both algorithms are sensitive to the grand-averaged response which in many cases like in MMN experimental paradigm are too noisy to get astable waveform that can be used as representative time series of brain response.

The proposed data mining scheme worked better compared to the three comparable techniques and also it is a parameter free method that can easily be used in any experimental multi-trial paradigm.

Simulations

Based on the results derived from the simulations, we revealed that both amplitude and latency are within acceptable limits. Simulations have shown that estimates of amplitude and latency are within acceptable limits (Figures 16, 17). Only if SNR is low and latency variation is low, estimates become unreliable. Figures 16, 17 illustrate the simulation based on recordings derived from the FZ sensor at DIR-L condition and for deviant-standard stimulus using the two simulated scenarios.

FIGURE 16

Figure 16. The simulation based on recordings derived from the FZ sensor in the DIR-L condition and for deviant-minus-standard stimulus. Real vs. estimated amplitude estimates for different amounts of SNR and latency variation.

FIGURE 17

Figure 17. The simulation based on recordings derived from the FZ sensor in the DIR-L condition and for deviant-minus-standard stimulus. Real vs. estimated latency estimates for different amounts of SNR and latency variation.

Finally, only in the case that both SNR and latency variation are low, estimates become unreliable (see Figures 16, 17).

Discussion

The proposed methodology can reliably sample the representative single-trials in order to simultaneously express their variability and also to reconstruct a grand-average with high SNR. A high SNR was detected across subjects, conditions and recording EEG locations which secure a reliable estimation of the amplitude and latency of the characteristic peak elicited from the whole set of MMN features. The whole approach relies on graph theory by constructing a distance network from the 2D projected STs in a common feature space. The construction of the distance matrix based on members/nodes of proximity graph, called GG. Then, we detected from GG, the hubs nodes/single-trials using the degree k of each node as an input. These hubs can clearly describe the variability of single-trials and also reconstruct a waveform with a high SNR which clearly demonstrates a characteristic peak. We presented also how the final reconstructed waveform changed due to different types of filters and the related order. To get the best combination of filter type, order and selection of degree k directly linked to the selected hubs, we employed SNR as a true objective function. Our results can be summarized below:

• FIR filter with eegfilt function of order 2 and degree = 4 demonstrates the highest SNR across conditions and subjects

• The reliability of amplitude and latency was higher for FZ EEG sensor compared to FCZ and CZ based on CV

• CV of signal power was higher for the subset of trials compared to the whole set of trials.

• Amplitude and latency are sensitive to EEG reference system

• REST reference system improved the CV of amplitude in the deviant-minus-standard stimuli

• Similar amplitude/latency estimations were revealed with the 85–95% of the total amount of single-trials

• PCA, SVD, and multi-linear regressor manipulation of single-trials failed to retrieve a robust waveform, latency, and amplitude estimation.

• Amplitude and latency estimations with the proposed method are reliable unless SNR and latency variation is too low.

There are several extensions that can be applied to the present methodology in specific steps of the analytic algorithm. First of all, we demonstrated the effect of different filtering schemes where alternative adaptive filters can be used (Mandic and Goh, 2009). One can select different metrics to estimate the pair-wise correlation between single-trials. Complementary, one can use different members of proximity graphs like minimal spanning tree alone or in combination with voronoi testellation (Laskaris and Ioannides, 2001; Laskaris et al., 2004) and RNG. Here, we tested both of them but we revealed best results with GG. Additionally, it would be very interesting to apply source-localization algorithms on the representative single-trials to connect single-trials with sources and the timing of related activity (Laskaris and Ioannides, 2001; Laskaris et al., 2003). For example, one can localize the early segment of activity and the later one, in order to demonstrate the early activation of auditory cortex and the later activation in frontal lobe (Rinne et al., 2000).

Regarding the adopted experimental paradigm to demonstrate this methodology, the MMN mechanism consists of an auditory-based frontal lobe network. After the pre-processing of the content of MMN by the targeted sensory system here the auditory cortex, frontal areas are activated playing a significant role in the elicitation of a reflex (Näätänen and Michie, 1979). The MMN generators come from temporal and frontal lobes and the related activity is captured mainly by fronto-central EEG sensors (FZ, FCZ, CZ) and also from temporal electrodes (T3, T4) (Rinne et al., 2000). Two studies aimed to reveal, both with EEG and MEG recordings, the origin of the elicited activity linked to MMN. Dipole modeling techniques applied to MMN (Scherg et al., 1989) and its magnetic counterpart (MMNm) (Hari et al., 1984) were found to have generators in the auditory cortex and in the temporal lobes. Complementary, the analysis of scalp-potential distribution revealed a right-hemispheric MMN source, which mainly was located over the frontal lobe (Giard et al., 1990; Deouell et al., 1998). A more recent paper using a simultaneous EEG-MEG recording set up, source-localized both EEG and MEG activity in an auditory MMN (Rinne et al., 2000). They validated the hypothesis that frontal MMN generators are activated later than generator in the auditory cortex. For a review of MMN generators in both healthy and disease groups and various settings, an interested reader can refer to a detailed review (Garrido et al., 2009).

The MMN is an ERP elicited by the occurrence of a rare event (deviance) in a regular acoustic environment, and is assumed to reflect a pre-attentive mechanism for change detection. Cortical generators of MMN are located in the superior temporal planes bilaterally which are responsible for the sensory memory part of change detection and frontal lobe sources responsible for triggering an attention shift upon change detection (for a review see Deouell, 2007). These bilateral temporal-frontal generators of MMN can be better detected with EEG compared to MEG while the combination of both modalities was suggested (Hämäläinen et al., 1993). Apart from bilateral auditory-cortex activation which underlines a pre-perceptual change detection with a short time-delay (Rinne et al., 2000), a predominant right hemispheric frontal process could be detected linked to involuntary attention switch to auditory change (Rinne et al., 2005, 2006). The dominant hemisphere of the MMN response due to acoustic changes is the right hemisphere (Levänen et al., 1996). For that reason, it is important in a next study to further explore amplitude-latency estimations also in bilateral frontal electrode sites complementary to EEG sensors located in the midline.

In the present study, we focused on the presentation of a data-driven methodology for a proper analysis of single-trials. We demonstrated high reliability in amplitude, latency, variability, and signal power for the whole cohort of young adults. Additionally, the majority of the fronto-central EEG channels should be studied to uncover any significant asymmetries of the brain activity between the two groups. Complementary, the main focus of this study was to enhance the reliability of the proposed methodology to reveal high SNR grand-averaged trials in various MMN conditions and reliable estimates of amplitude, latency, and variability in a healthy group. The REST reference system improved the CV of amplitude in the deviant-minus-standard stimuli while PCA, SVD, and multi-linear regressor manipulation of single-trials failed to retrieve a robust waveform, latency, and amplitude estimation. The proposed data-driven scheme worked better compared to the three well-known comparable methodologies. Moreover, it is parameter free method that can easily be adjusted to any multi-trial experimental paradigm using EEG-MEG recordings at both sensor and source levels. Finally, amplitude and latency estimations with the proposed method are reliable unless SNR and latency variation is too low.

The whole methodology will be valuable for neuroscientists particularly interest in defining a reliable biomarker based on ERP studies in various cognitive states (Picton et al., 2000; Espeseth et al., 2009; Horvath et al., 2018) and also in disease brain states such as the Alzheimer's Disease (Tsolaki et al., 2017).

Conclusions

We presented a fast, reliable, and data-driven methodology for simultaneously data-mining single-trials and amplitude-latency estimation. The method relies on graph and network analysis as appropriate tools of geometrical data analysis and vectorial pattern analytic tools of single-trials. We demonstrated the effect of filtering settings on the grand-averaged trial and the related amplitude-latency estimates. Additionally, the whole methodology was presented in an auditory EEG MMN task with the aim to detect reliable amplitude, latency, and signal power derived from the appropriate preselection of single-trials. Based on the data-driven approach of the current methodology, the whole analysis could be of high value for various evoked/event-related potentials in various neuroimaging studies including EEG, MEG, and fMRI.

Author Contributions

Conception of the research, methods and design, data analysis, and drafting the manuscript: SD; Critical revision of the manuscript: LB, LE, DL, KS; All authors read and approved the final version of the manuscript.

Funding

SD and DL were supported by MRC grant MR/K004360/1 (Behavioural and Neurophysiological Effects of Schizophrenia Risk Genes: A Multi-locus, Pathway Based Approach). SD is supported by a MARIE-CURIE COFUND EU-UK research fellowship. Data acquisition and curation was supported by the National Centre for Mental Health (NCMH), a collaboration between Cardiff, Swansea and Bangor Universities funded by Welsh Government through Health and Care Research Wales. This work was supported by the MRC Partnership Grant MR/K005464/1.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to acknowledge Cardiff RCUK funding scheme.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fninf.2018.00059/full#supplementary-material

References

Aikake, H. (1974). A new look at the statistical model identification. IEEE Trans. Automat. Contr. 19, 716–723. doi: 10.1109/TAC.1974.1100705

CrossRef Full Text | Google Scholar

Anderson, C. W., Stolz, E. A., and Shamsunder, S. (1998). Multivariate autoregressive models for classification of spontaneous electroencephalographic signals during mental tasks. IEEE Trans. Biomed. Eng. 45, 277–286. doi: 10.1109/10.661153

PubMed Abstract | CrossRef Full Text | Google Scholar

Bodatsch, M., Ruhrmann, S., Wagner, M., Müller, R., Schultze-Lutter, F., Frommann, I., et al. (2011). Prediction of psychosis by mismatch negativity. Biol. Psychiatry 69, 959–966. doi: 10.1016/j.biopsych.2010.09.057

PubMed Abstract | CrossRef Full Text | Google Scholar

Chella, F., Pizzella, V., Zappasodi, F., and Marzetti, L. (2016). Impact of the reference choice on scalp EEG connectivity estimation. J. Neural Eng. 13:036016. doi: 10.1088/1741-2560/13/3/036016

PubMed Abstract | CrossRef Full Text | Google Scholar

Cole, M. W., Ito, T., Bassett, D. S., and Schultz, D. H. (2016). Activity flow over resting-state networks shapes cognitive task activations. Nat. Neurosci. 19, 1718–1726. doi: 10.1038/nn.4406

PubMed Abstract | CrossRef Full Text | Google Scholar

Da Pelo, P., De Tommaso, M., Monaco, A., Stramaglia, S., Bellotti, R., and Tangaro, S. (2018). Trial latencies estimation of event-related potentials in EEG by means of genetic algorithms. J. Neural Eng. 15:026016. doi: 10.1088/1741-2552/aa9b97

PubMed Abstract | CrossRef Full Text | Google Scholar

De Pascalis, V., Varriale, V., and Matteoli, A. (2008). Intelligence and p3 components of the event-related potential elicited during an auditory discrimination task with masking. Intelligence 36, 35–47. doi: 10.1016/j.intell.2007.01.002

CrossRef Full Text | Google Scholar

Delorme, A., and Makeig, S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods. 134, 9–21. doi: 10.1016/j.jneumeth.2003.10.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Deouell, L. Y. (2007). The frontal generator of the mismatch negativity revisited. J. Psychophysiol. 21, 188–20. doi: 10.1027/0269-8803.21.34.188

CrossRef Full Text | Google Scholar

Deouell, L. Y., Bentin, S., and Giard, M. H. (1998). Mismatch negativity in dichotic listening: evidence for interhemispheric differences and multiple generators. Psychophysiology 35, 355–365. doi: 10.1111/1469-8986.3540355

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitriadis, S. I., Laskaris, N. A., Bitzidou, M. A., Tarnanas, I., and Tsolaki, M. (2015). A novel biomarker of amnestic MCI based on dynamic Cross-Frequency Coupling patterns during cognitive brain responses. Front. Neurosci. 9:350. doi: 10.3389/fnins.2015.00350

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitriadis, S. I., Laskaris, N. A., Simos, P. G., Fletcher, J. M., and Papanicolaou, A. C. (2016a). Greater repertoire and temporal variability of Cross-Frequency Coupling (CFC) modes in resting-state neuromagnetic recordings among children with reading difficulties. Front. Hum. Neurosci. 10:163. doi: 10.3389/fnhum.2016.00163

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitriadis, S. I., Laskaris, N. A., Tsirka, V., Vourkas, M., and Micheloyannis, S. (2010). What does delta band tell us about cognitive processes: a mental calculation study. Neurosci. Lett. 483, 11–15. doi: 10.1016/j.neulet.2010.07.034

PubMed Abstract | CrossRef Full Text | Google Scholar

Dimitriadis, S. I., Tarnanas, I., Wiederhold, M., Wiederhold, B., Tsolaki, M., and Fleisch, E. (2016b). Mnemonic-Strategy Training of elderly at risk for dementia enhance integration of information processing via cross-frequency-coupling. Alzheimers Dement. (N Y). 2, 241–249. doi: 10.1016/j.trci.2016.08.004

CrossRef Full Text | Google Scholar

Espeseth, T., Rootwelt, H., and Reinvang, I. (2009). Apolipoprotein E modulates auditory event-related potentials in healthy aging. Neurosci Lett. 459, 91–95. doi: 10.1016/j.neulet.2009.04.053

PubMed Abstract | CrossRef Full Text | Google Scholar

Fein, G., and Turetsky, B. (1989). P300 latency variability in normal elderly: effects of paradigm and measurement technique. Electroencephalogr. Clin. Neurophysiol. 72, 384–394.

PubMed Abstract | Google Scholar

Fjell, A., Rosquist, H., and Walhovd, K. (2009). Instability in the latency of p3a/p3b brain potentials and cognitive functioning in aging. Neurbiol. Aging 30, 2065–2079. doi: 10.1016/j.neurobiolaging.2008.01.015

CrossRef Full Text | Google Scholar

Friston, K. J., Stephan, K. M., Heather, J. D., Frith, C. D., Ioannides, A. A., Liu, L. C., et al. (1996). A multivariate analysis of evoked responses in EEG and MEG data. NeuroImage 3, 167–174. doi: 10.1006/nimg.1996.0018

PubMed Abstract | CrossRef Full Text | Google Scholar

Gabriel, K. R., and Sokal, R. R. (1969). A new statistical approach to geographic variation analysis. Syst. Zool. Soc. Syst. Biol. 18, 259–270. doi: 10.2307/2412323

CrossRef Full Text | Google Scholar

Garrido, M. I., Kilner, J. M., Stephan, K. E., and Friston, K. J. (2009). The mismatch negativity: a review of underlying mechanisms. Clin. Neurophysiol. 120, 453–463. doi: 10.1016/j.clinph.2008.11.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Geurts, H. M., Grasman, R., Verté, S., Oosterlaan, J., Roeyers, H., van Kammen, S. M., et al. (2008). Intra-individual variability in ADHD, autism spectrum disorders and Tourette's syndrome. Neuropsychologia 46, 3030–3041. doi: 10.1016/j.neuropsychologia.2008.06.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Geva, A. (1998). Feature extraction and state identification in biomedical signals using hierarchical fuzzy clustering. Med. Biol. Eng. Comput. 36, 608–614. doi: 10.1007/BF02524432

PubMed Abstract | CrossRef Full Text | Google Scholar

Gevins, A. (1984). Analysis of the electromagnetic signals of the human brain: milestones, obstacles, and goals. IEEE Trans. Biomed. Eng. 31, 833–850.

PubMed Abstract | Google Scholar

Giard, M. H., Perrin, F., Pernier, J., and Bouchet, P. (1990). Brain generators implicated in the processing of auditory stimulus deviance: a topographic event-related potential study. Psychophysiology 27, 627–640. doi: 10.1111/j.1469-8986.1990.tb03184.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Hämäläinen, M., Hari, R., Ilmoniemi, R. J., Knuutila, J., and Lounasmaa, O. V. (1993). Magnetoencephalography—theory, instrumentation, and applications to noninvasive studies of the working human brain. Rev. Mod. Phys. 65, 413–497. doi: 10.1103/RevModPhys.65.413

CrossRef Full Text | Google Scholar

Hari, R., Hämäläinen, M., Ilmoniemi, R., Kaukoranta, E., Reinikainen, K., Salminen, J., et al. (1984). Responses of the primary auditory cortex to pitch changes in a sequence of tone pips: neuromagnetic recordings in man. Neurosci. Lett. 50, 127–132. doi: 10.1016/0304-3940(84)90474-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Horvath, A., Szucs, A., Csukly, G., Sakovics, A., Stefanics, G., and Kamondi, A. (2018). EEG and ERP biomarkers of Alzheimer's disease: a critical review. Front. Biosci. (Landmark Ed). 23, 183–220. doi: 10.2741/4587

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, L., Liang, M., Mouraux, A., Wise, R. G., Hu, Y., and Iannetti, G. D. (2011). Taking into account latency, amplitude, and morphology: improved estimation of single-trial ERPs by wavelet filtering and multiple linear regression. J. Neurophysiol. 106, 3216–3229. doi: 10.1152/jn.00220.2011

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, Y., Zhang, J., Cui, Y., Yang, G., He, L., and Liu, Q. (2017). How different EEG references influence sensor level functional connectivity graphs. Front. Neurosci. 11:368. doi: 10.3389/fnins.2017.00368

PubMed Abstract | CrossRef Full Text | Google Scholar

Jaskowski, P., and Verleger, R. (1999). Amplitude and latencies of single-trial ERP's estimated by a maximum-likelihood method. IEEE Trans. Biomed. Eng. 46, 987–993. doi: 10.1109/10.775409

CrossRef Full Text | Google Scholar

Jaskowski, P., and Verleger, R. (2000). An evaluation of methods for single-trial estimation of p3 latency. Psychophysiology 37, 153–162. doi: 10.1111/1469-8986.3720153

PubMed Abstract | CrossRef Full Text | Google Scholar

Jasper, H. H. (1958). The ten-twenty electrode system of the international federation. Electroencephalogr. Clin. Neurophysiol. 10, 371–375.

Lange, D., Siegelmann, H., Pratt, H., and Gideon, I. (2000). Overcoming selective ensemble averaging: unsupervised identification of event-related brain potentials. IEEE Trans. Biomed. Eng. 47, 822–826. doi: 10.1109/10.844236

PubMed Abstract | CrossRef Full Text | Google Scholar

Laskaris, N., Fotopoulos, S., and Ioannides, A. (2004). Mining information from event-related recordings. IEEE Signal Process. Magaz. 21, 66–77. doi: 10.1109/MSP.2004.1296544

CrossRef Full Text | Google Scholar

Laskaris, N., and Ioannides, A. (2001). Exploratory data analysis of evoked response single trials based on minimal spanning tree. Clin. Neurophysiol. 112, 698–712. doi: 10.1016/S1388-2457(00)00560-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Laskaris, N., Liu, L. C., and Ioannides, A. (2003). Single-trial variability in early visual neuromagnetic responses: an explorative study based on the regional activation contributing to the N70m peak. NeuroImage 20, 765–783. doi: 10.1016/S1053-8119(03)00367-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Laskaris, N., Tarnanas, I., Tsolaki, M. N., Vlaikidis, N., and Karlovasitou, A. K. (2013). Improved detection of amnestic MCI by means of Discriminative Vector Quantization of Single-Trial cognitive ERP responses. J. Neurosci. Methods 212, 344–354. doi: 10.1016/j.jneumeth.2012.10.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Latora, V., and Marchiori, M. (2001). Efficient behavior of small-world networks. Phys. Rev. Lett. 87:198701. doi: 10.1103/PhysRevLett.87.198701

PubMed Abstract | CrossRef Full Text | Google Scholar

Lei, X., and Liao, K. (2017). Understanding the influences of EEG reference: a large-scale brain network perspective. Front. Neurosci. 11:205. doi: 10.3389/fnins.2017.00205

PubMed Abstract | CrossRef Full Text | Google Scholar

Levänen, S., Ahonen, A., Hari, R., McEvoy, L., and Sams, M. (1996). Deviant auditory stimuli activate human left and right auditory cortex differently. Cereb. Cortex 6, 288–296.

PubMed Abstract | Google Scholar

Makeig, S., Bell, A. J., Jung, T. P., and Sejnowski, T. J. (1996). Independent component analysis of electroencephalographic data. Adv. Neural Inf. Proces. Syst. 8, 145–151.

Google Scholar

Mandic, D. P., and Goh, S. L. (2009). Complex Valued Nonlinear Adaptive Filters: Noncircularity, Widely Linear and Neural Models, Research Monograph in the Wiley Series in Adaptive and Learning Systems for Signal Processing, Communications, and Control. New York, NY: John Wiley & Sons.

Google Scholar

Matula, D. W., and Sokal, R. R. (1980). Properties of Gabriel graphs relevant to geographic variation research and clustering of points in the plane. Geogr. Anal. 12, 205–222. doi: 10.1111/j.1538-4632.1980.tb00031.x

CrossRef Full Text | Google Scholar

Mayhew, S., Iannetti, G., Woolrich, M., and Wise, R. (2006). Automated single-trial measurement of amplitude and latency of laser-evoked potentials (LEPs) using multiple linear regression. Clin. Neurophysiol. 117, 1331–1344. doi: 10.1016/j.clinph.2006.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Näätänen, R., and Michie, P. T. (1979). Early selective-attention effects on the evoked potential: a critical review and reinterpretation. Biol. Psychol. 8, 81–136. doi: 10.1016/0301-0511(79)90053-X

PubMed Abstract | CrossRef Full Text | Google Scholar

Näätänen, R., Pakarinen, S., Rinne, T., and Takegata, R (2004). The mismatch negativity (MMN): towards the optimal paradigm. Clin. Neurophysiol. 115, 140–144. doi: 10.1016/j.clinph.2003.04.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Pham, D., Moecks, J., Koelher, W., and Gasser, T. (1987). Variable latencies of noisy signals: estimation and testing in brain potential data. Biometrika 74, 525–533. doi: 10.2307/2336691

CrossRef Full Text | Google Scholar

Picton, T., Bentin, S., Berg, P., Donchin, E., Hillyard, S., Johnson, R. Jr., et al. (2000). Guidelines for using human event-related potentials to study cognition: recording standards and publication criteria. Psychophysiology 37, 127–152. doi: 10.1111/1469-8986.3720127

PubMed Abstract | CrossRef Full Text | Google Scholar

Qin, Y., Xu, P., and Yao, D. (2010). A comparative study of different references for EEG default mode network: the use of the infinity reference. Clin. Neurophysiol. 121, 1981–1991. doi: 10.1016/j.clinph.2010.03.056

PubMed Abstract | CrossRef Full Text | Google Scholar

Rinne, T., Alho, K., Ilmoniemi, R. J., Virtanen, J., and Naatanen, R. (2000). Separate time behaviors of the temporal and frontal mismatch negativity sources. NeuroImage 12, 14–19. doi: 10.1006/nimg.2000.0591

PubMed Abstract | CrossRef Full Text | Google Scholar

Rinne, T., Degerman, A., and Alho, K. (2005). Superior temporal and inferior frontal cortices are activated by infrequent sound duration decrements: an fMRI study. NeuroImage 26, 66–72. doi: 10.1016/j.neuroimage.2005.01.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Rinne, T., Särkkä, A., Degerman, A., Schröger, E., and Alho, K. (2006). Two separate mechanisms underlie auditory change detection and involuntary control of attention. Brain Res. 1077, 135–143. doi: 10.1016/j.brainres.2006.01.043

PubMed Abstract | CrossRef Full Text | Google Scholar

Scherg, M., Vajsar, J., and Picton, T. W. (1989). A source analysis of the late human auditory evoked potentials. J. Cogn. Neurosci. 1, 336–355. doi: 10.1162/jocn.1989.1.4.336

PubMed Abstract | CrossRef Full Text | Google Scholar

Székely, G. J., and Rizzo, M. L. (2009). Partial distance correlation with methods for dissimilarities. Ann. Stat. 42, 2382–2412. doi: 10.1073/pnas.1522586113

CrossRef Full Text | Google Scholar

Tsolaki, A. C., Kosmidou, V., Kompatsiaris, I. Y., Papadaniil, C., Hadjileontiadis, L., Adam, A., et al. (2017). Brain source localization of MMN and P300 ERPs in mild cognitive impairment and Alzheimer's disease: a high-density EEG approach. Neurobiol Aging 55, 190–201. doi: 10.1016/j.neurobiolaging.2017.03.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Woody, C. (1967). Characterization of an adaptive filter for the analysis of variable latency neuro-electric signals. Med. Biol. Eng. Comput. 5, 539–554. doi: 10.1007/BF02474247

CrossRef Full Text | Google Scholar

Yao, D. (2001). A method to standardize a reference of scalp EEG recordings to a point at infinity. Physiol. Meas. 22, 693–711. doi: 10.1088/0967-3334/22/4/305

PubMed Abstract | CrossRef Full Text | Google Scholar

Yao, D., Wang, L., Oostenveld, R., Nielsen, K. D., Arendt-Nielsen, L., and Chen, A. C. (2005). A comparative study of different references for EEG spectral mapping: the issue of the neutral reference and the use of the infinity reference. Physiol. Meas. 26, 173–184. doi: 10.1088/0967-3334/26/3/003

PubMed Abstract | CrossRef Full Text | Google Scholar

Zouridakis, G., Boutros, N., and Jansen, B. (1997). A fuzzy clustering approach to study the auditory P50 component in schizophrenia. Psychiatry Res. 69, 169–181. doi: 10.1016/S0165-1781(96)02979-4

PubMed Abstract | CrossRef Full Text

Keywords: single-trials, data-mining, proximity graphs, network analysis, amplitude, latency, reliability, signal to noise ratio (SNR)

Citation: Dimitriadis SI, Brindley L, Evans LH, Linden DE and Singh KD (2018) A Novel, Fast, Reliable, and Data-Driven Method for Simultaneous Single-Trial Mining and Amplitude—Latency Estimation Based on Proximity Graphs and Network Analysis. Front. Neuroinform. 12:59. doi: 10.3389/fninf.2018.00059

Received: 21 February 2018; Accepted: 20 August 2018;
Published: 19 November 2018.

Edited by:

Arjen van Ooyen, VU University Amsterdam, Netherlands

Reviewed by:

Jose Manuel Ferrandez, Universidad Politécnica de Cartagena, Spain
Sebastiano Stramaglia, Università degli Studi di Bari, Italy

Copyright © 2018 Dimitriadis, Brindley, Evans, Linden and Singh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Stavros I. Dimitriadis, c3RpZGltaXRyaWFkaXNAZ21haWwuY29t; ZGltaXRyaWFkaXNTQGNhcmRpZmYuYWMudWs=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.