Optimizing neuroscience data management by combining REDCap, BIDS and SQLite: a case study in Deep Brain Stimulation

Stawiski, Marc; Bucciarelli, Vittoria; Vogel, Dorian; Hemm, Simone

doi:10.3389/fninf.2024.1435971

TECHNOLOGY AND CODE article

Front. Neuroinform. , 05 September 2024

Volume 18 - 2024 | https://doi.org/10.3389/fninf.2024.1435971

Optimizing neuroscience data management by combining REDCap, BIDS and SQLite: a case study in Deep Brain Stimulation

$\r\nMarc Stawiski&#x;$ Marc Stawiski^†

Vittoria Bucciarelli^*†

Dorian Vogel $Simone Hemm\r\n$ Simone Hemm

Neuroengineering Group, Institute for Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland

Neuroscience studies entail the generation of massive collections of heterogeneous data (e.g. demographics, clinical records, medical images). Integration and analysis of such data in research centers is pivotal for elucidating disease mechanisms and improving clinical outcomes. However, data collection in clinics often relies on non-standardized methods, such as paper-based documentation. Moreover, diverse data types are collected in different departments hindering efficient data organization, secure sharing and compliance to the FAIR (Findable, Accessible, Interoperable, Reusable) principles. Henceforth, in this manuscript we present a specialized data management system designed to enhance research workflows in Deep Brain Stimulation (DBS), a state-of-the-art neurosurgical procedure employed to treat symptoms of movement and psychiatric disorders. The system leverages REDCap to promote accurate data capture in hospital settings and secure sharing with research institutes, Brain Imaging Data Structure (BIDS) as image storing standard and a DBS-specific SQLite database as comprehensive data store and unified interface to all data types. A self-developed Python tool automates the data flow between these three components, ensuring their full interoperability. The proposed framework has already been successfully employed for capturing and analyzing data of 107 patients from 2 medical institutions. It effectively addresses the challenges of managing, sharing and retrieving diverse data types, fostering advancements in data quality, organization, analysis, and collaboration among medical and research institutions.

1 Introduction

Over the past few years, neuroscience research has undergone a significant transformation. Technological advancements have enabled acquisition of extensive, multi-modal datasets whose analysis can elucidate mechanisms underlying neurological disorders and unveil new treatments (Dipietro et al., 2023). However, extracting meaningful insights from neuroscience data is not a trivial problem. A first challenge is posed by dealing with a multifaceted data landscape (Dash et al., 2019). Neuroscience studies entail, in fact, the generation of heterogeneous data ranging from neuroimages to clinical evaluations, to demographics. The integration of diverse data types plays a crucial role in deriving clinically relevant conclusions (Sejnowski et al., 2014). Moreover, such data is sourced from different clinical institutions and is often gathered at various time intervals to facilitate longitudinal studies (Dipietro et al., 2023). Data collection within hospital settings often relies on traditional methods such as paper-based documentation or static spreadsheets (Poline et al., 2012; Saczynski et al., 2013). These methods, despite being straightforward and widespread in clinics, may present some limitations in data standardization and accessibility (Wilcox et al., 2012). After collection, data needs to be transferred to research institutes to perform further analyses. According to data sharing best practices (Ferguson et al., 2014), data should follow the FAIR (Findable, Accessible, Interoperable, Reusable) principles (Wilkinson et al., 2016). However, sharing data recorded using the above-mentioned methods for collaborative research purposes, raises concerns about data security and integrity. Therefore, systematic data management is imperative to leverage the insights derived from neuroscience multi-modal data. This can be achieved by employing systems facilitating efficient data capture, organization, management, and secure sharing (DiEuliis and Giordano, 2016; Dash et al., 2019; Dipietro et al., 2023).

In the last years several Electronic Data Capture (EDC) systems, such as REDCap (Harris et al., 2009), CARAT (Turner et al., 2011) and CIGAL (Voyvodic et al., 2011), have emerged to address the limitations of paper-based and spreadsheet-based data collection. These systems offer web-based interfaces for multi-modal data capture and can also provide data verification functions (Turner and Van Horn, 2012; Li and Liang, 2022). However, EDC systems do not support image files (Deserno et al., 2013). Medical images are usually stored in specialized Picture Archiving and Communication Systems (PACS) (Choplin et al., 1992). While PACS offer secure storage and retrieval of medical images, they are not designed for integrating clinical data with imaging data (Scott et al., 2011). Furthermore, lack of standardization in file structures and naming, severely hinders efficient data management in analysis pipelines, data sharing and study reproducibility (Zwiers et al., 2022). This issue has been tackled by introducing data storage specifications such as Experimental Directory Structure (ExDir) (Dragly et al., 2018) or Brain Imaging Data Structure (BIDS) (Gorgolewski et al., 2016). The latter is a standard describing how to organize neuroimaging data and related metadata following a specific folder structure and naming convention. BIDS was initially designed to accommodate raw imaging data. Guidelines to store analysis-derived data have been recently introduced but they remain quite broad and may not encompass all file types generated in the various neuroscience fields. Given its growing usage in the neuroimaging community, the last years have witnessed the development of tools providing semi-automated organization of imaging datasets according to BIDS guidelines (Lopez-Novoa et al., 2019; Zwiers et al., 2022; Levitas et al., 2024) or bridging between BIDS datasets and software solutions (Jegou et al., 2022). In order to exploit the joint power of imaging and clinical data, a recent work has proposed a solution integrating data collected through REDCap with imaging data organized according to BIDS (Kuplicki et al., 2021). However, it lacks the ability to present a comprehensive data overview. Addressing this last point, several extensive data management and sharing systems have been developed. They have adopted diverse strategies to provide functionalities such as data capture, storage, sharing, import, export, processing, retrieval, quality control, provenance tracking (Marcus et al., 2007; Keator et al., 2009; Van Horn and Toga, 2009; Prodanov, 2011; Scott et al., 2011; Das et al., 2012; Book et al., 2013; Muehlboeck et al., 2014; Woodman et al., 2014; Grigis et al., 2017). Nevertheless, they may entail considerable load related to installation, maintenance, and user training (Kuplicki et al., 2021). Moreover, there is no universal solution fitting all the possible use-cases arising in the neuroscience field (Van Horn and Toga, 2009). To achieve the maximum possible efficiency, a data management system should be tailored to the workflows generating and using the data stored in it.

Therefore, this manuscript presents a data management system providing solutions to handle extensive, heterogeneous data and designed to support Deep Brain Stimulation (DBS) research pipelines. DBS is a widely employed neurosurgical procedure for managing symptoms of a variety of movement and psychiatric disorders (Hariz et al., 2013; Johnson et al., 2013; Wårdell et al., 2022). During the whole DBS process huge amounts of heterogeneous data are generated. Group analysis (e.g., analysis of data collected from multiple patients) is crucial in enhancing understanding of the relationship between stimulation parameters and clinical outcomes (Neudorfer et al., 2021; Nordin et al., 2022). With the aim of enhancing the efficiency and reproducibility of DBS research, the system described in this paper leverages the integration of REDCap, BIDS and SQLite.¹ It offers the following functionalities:

(1) Straightforward data capture in hospital settings

(2) Secure data sharing between clinics and research institutes

(3) Management of clinical and imaging data types

(4) Standard-based imaging data organization

(5) Linkage of image files and related metadata

(6) Access to heterogeneous data through a unique tool

2 System design and workflows

This section describes data acquired during a typical DBS procedure and the methodological framework used to create a data management system for DBS research. The presented comprehensive data management system combines existing data aggregation platforms and standards, with custom software for data handling. It utilizes REDCap as EDC system, BIDS as image storing standard, and an SQLite database as a comprehensive data store. Custom Python scripts connect the different components, ensuring efficient platform interoperability.

2.1 DBS procedure and data

2.1.1 Raw data acquisition

DBS entails the implantation of electrodes in specific deep brain targets (e.g. Subthalamic Nucleus, Zona Incerta), whose electrical stimulation leads to disease symptom reduction (Lozano et al., 2019). The DBS procedure varies between medical centers but can, in general, be split into three main phases: pre-operative planning, implantation and post-operative follow-up (Hemm and Wårdell, 2010; Shah et al., 2020). All three phases include the generation and acquisition of heterogeneous types of patient data. Pre-operative planning consists in delineating the precise brain target and trajectory to reach it, based on pre-operative anatomical images (D’Haese et al., 2005; Rezai et al., 2006). Patient demographics, medical history and anatomical imaging (e.g. CT or MRI) data, electrode type and targeting information are therefore collected in this first phase. During surgery electrophysiology techniques, such as microelectrode recording (MER) and intra-operative stimulation tests are often applied to identify the optimal electrode position (Schrader et al., 2002; Miyagi et al., 2009). The DBS leads are inserted afterwards in the brain for chronic stimulation. Post-operatively the final electrode position is checked through CT or MRI imaging (Chabardes et al., 2015). DBS parameters (e.g. active contact, current amplitude, pulse width) are tuned in a series of consultations which involve symptom evaluation through disease-specific clinical scoring scales (Koeglsperger et al., 2019; Wårdell et al., 2022). Outcomes of intra-operative and post-operative stimulation tests are recorded in paper forms.

2.1.2 Data analysis

The data acquired during the DBS procedure constitutes the input to analysis workflows, which in turn create additional data streams. In particular, the generation of electrodes, electrode trajectories and brain tissue models (brain conductivity matrix) allows to simulate the Volume of Tissue Activated (VTA) by specific stimulation configurations (Butson et al., 2006; Åström et al., 2015; Alonso et al., 2016; Horn et al., 2019; Latorre and Wårdell, 2019; Butenko et al., 2020; Neudorfer et al., 2023). VTAs of multiple patients can be jointly examined in a common anatomical space, also called anatomical template or atlas, obtained by fusing together preprocessed images of several patients (Evans et al., 1994; Lemaire et al., 2010; Vogel et al., 2020). Sometimes additional anatomical information such as manually outlined structures (Lemaire et al., 2010) can be projected and added to the template. Statistical approaches applied to patient data in a common reference space allow to extract probabilistic stimulation areas (Butson et al., 2011; Eisenstein et al., 2014; Reich et al., 2019; Nordin et al., 2022). These can in turn be used to learn more about the therapeutic targets or they can be transformed to the image space of new patients and used to support in determining their optimal stimulation parameters (Roediger et al., 2021; Nordenström et al., 2022).

2.1.3 Data type categories

DBS data from two medical institutions (University Hospitals of Basel and Clermont-Ferrand) was collected and analyzed to validate the system presented in this manuscript. The data types handled by the system were classified into 2 macro-categories. The first includes raw data acquired during the DBS procedure, which can be further subdivided into clinical and imaging data. The second category encompasses all the data created by the analysis workflows.

• Raw data:

◦ Clinical data: detailed patient demographics, medical history, medication schedules, surgical details (e.g. targeting and final implanted position information), stimulation parameters, symptoms scoring.

◦ Imaging data: pre and post-operative anatomical images (e.g. CT or MRI scans), labeled anatomical structures.

• Analysis data: preprocessed anatomical images (e.g. skull-stripped), reconstructed electrode trajectories, electrode COMSOL² models, brain conductivity matrix, simulated VTAs, anatomical atlas images, segmented anatomical structures, probabilistic stimulation maps, images transformed from patient reference space to atlas reference space and vice-versa, transforms and warps needed to perform image transformations.

The main data types collected during the DBS procedure by the medical centers contributing to this work, are summarized in Figure 1A. The group analysis workflow schema is represented in Figure 1B.

FIGURE 1

Figure 1. Overview of Deep Brain Stimulation procedure and group analysis workflow. The figure shows the various types of data generated at each step, in light of the DBS protocols followed by the two medical centers contributing to the data collection in this work. (A) The DBS procedure can be split in three phases: pre-operative planning, implantation and post-operative follow-up. Data generated during pre-operative planning encompasses patients’ demographic information, medical history, anatomical brain images, the model of DBS electrode chosen for implantation and the planned target point and trajectory. During implantation recording of brain cells activity (microelectrode recording) and stimulation tests help to determine the final, optimal electrode position. In the post-operative follow-up additional anatomical imaging data and stimulation parameters evaluations (screening) data are collected. Moreover, information on the final chosen chronic stimulation configuration is also recorded. (B) The group analysis workflow firstly requires patients’ anatomical images preprocessing. Subsequently Volume of Tissue Activated (VTA) simulations can be run for each patient and each stimulation configuration. Several patients’ anatomical images can be fused together to create a common anatomical space (anatomical atlas) (Vogel et al., 2020). Patients VTAs are then transformed into the atlas reference space so that they can be superimposed and analyzed together to generate Probabilistic Stimulation Maps. These maps indicate brain areas leading to high symptom improvement when stimulated. When data from a new patient is acquired, the stimulation map can be transformed to their reference space. The data elements in the left part of image (B) are collected during the DBS procedure, while the ones on the right (gray background) are generated during the data analysis.

2.2 Data management system use case analysis

A user-centered design approach was employed to define system requirements. This involved identifying the various user roles and their corresponding use cases. By analyzing these use cases functional requirements were derived. These outline the specific actions the system should be able to perform. In a second step, non-functional requirements were added, addressing the system’s characteristics such as security, performance, and scalability. The system caters to two primary user groups: clinicians and researchers, each with distinct use cases.

• Clinician

◦ Use Case 1.1: Easy Data Entry: clinicians can effortlessly record patient information through user-friendly, structured forms within the web interface. This minimizes the data entry burden and promotes accuracy.

◦ Use Case 1.2: Data Export for Statistical Analysis: clinicians can export collected data, either for a single patient or an entire cohort. Data can be exported in a format suitable for statistical analysis using a software of choice.

◦ Use Case 1.3: Data Edit: clinicians can edit data or fill in incomplete records.

• Researcher

◦ Use Case 2.1: Data Retrieval: researchers can access clinician-captured data. They can filter the data according to specific metadata to narrow their analyses to specific patient groups.

◦ Use case 2.2: Data Edit: researchers can correct or update data.

◦ Use case 2.3: Data Analysis: researchers can apply processing steps to retrieve data, including image processing and clinical data analysis.

◦ Use case 2.4: Results Saving and Sharing: researchers can integrate generated results back into the system and make them available to other users.

Use cases 2.1, 2.3 and 2.4 are fundamental steps of the DBS group analysis workflow (see section 2.1.2). The overall use case diagram of the data management system is shown in Figure 2.

FIGURE 2

Figure 2. Use case diagram for the DBS data management system. The system features clinicians and researchers as actors. Filled lines without arrow indicate the association between the actor and the use case. Filled lines with white-filled arrow point indicate that a use case is generalization of another. In this diagram “Image processing” and “Clinical data analysis” are 2 specifications of the general use case ‘Data analysis’. The dotted lines indicate ‘Include’ relationships between use cases: in order to be able to save and share results a data analysis needs to be performed (’Data Analysis’ is therefore included by the “Results Saving and Sharing” use case). The “Data Analysis” use case depends, in turn, on the “Data Retrieval” one.

2.3 System components

REDCap (Harris et al., 2009) was chosen as secure EDC web platform to streamline patient-related clinical data. Its user-friendly interface allows to create structured forms with data validation, and role-based access control (Supplementary Figures 1–6). REDCap serves as a centralized repository, simplifying retrieval and analysis. It offers real-time data capture, accessibility, and built-in reporting tools for a better understanding of patient cohorts and treatment outcomes (Server info: REDCap 14.3.2 PHP 7.4.33 (Linux/Unix OS) MariaDB 10.5.23).

BIDS (Gorgolewski et al., 2016) was chosen as imaging data storing standard. It acts as a common language for organizing brain imaging data, including MRI scans which are vital for DBS research. This standardized format ensures consistency in file naming conventions and folder structures across studies, streamlining collaboration and data sharing (used version: v1.9.0).

SQLite empowers research with its lightweight design. This self-contained database eliminates server needs and simplifies deployment, allowing researchers to focus on their work. Furthermore, SQLite’s SQL compatibility ensures familiar data management through a proven language (used version: 3.45.2).

Python excels as a data handling tool for research thanks to its extensive libraries like Pandas and NumPy, which enable efficient manipulation and analysis of complex datasets. The open-source nature of Python’s data engineering ecosystem fosters collaboration and facilitates the sharing of code and libraries for data handling tasks.

These components, except for REDCap, are designed for portability, meaning they can be run on any operating system without modification. This independence allows them to be easily rebuilt and deployed across different environments.

2.4 Clinical data management

For each participating center a REDCap project was established in the REDCap instance, incorporating all relevant user roles and access rights. Customized forms were designed to best capture data originating from diverse sources. They encompass patient information, intra-operative details (e.g. targeting information, microelectrode recordings, implanted electrode position), and post-operative data including chronic stimulation settings. The acquired data is automatically stored on the REDCap server. To facilitate data retrieval and management, a software pipeline was developed in Python³. This pipeline utilizes REDCap’s API with appropriate tokens to automatically pull data from the server. The retrieved data undergoes a transformation process based on pre-defined mapping files, using libraries like regular expressions (regex) for pattern matching and data cleaning, and Pandas for data handling. Finally, SQL queries are generated and employed to insert the transformed data into the SQLite database (Figure 3A). The entire process, from database creation (if necessary) to data retrieval, cleaning, and import, is encapsulated within the software pipeline. The generic nature of the code allows its application across all REDCap projects.

FIGURE 3

Figure 3. (A) The clinical data management workflow starts with the filling of customized REDCap webforms. The data is then synchronized with REDCap server for secure storage. Data is exported in.csv format via API and then stored in the SQLite database by a python script. (B) Imaging data is firstly acquired in DICOMDIR format and converted in NifTI format by a custom script. NifTI files are then named according to BIDS and stored in a repository with a BIDS-compliant structure. Image metadata (e.g., session, acquisition, suffix…) and file paths are saved in the SQLite database by a python script.

2.5 Imaging data management

A dedicated data processing pipeline integrates the imaging data into the data management system. DICOM⁴ data from the neurosurgical planning station of participating centers is anonymized and exported in DICOMDIR format. This DICOMDIRs includes pre- and post-operative CT scans, as well as various MRIs with some labels and trajectories. The pipeline sets up a root directory for the image data storage. Then it utilizes a pre-built DICOM to Neuroimaging Informatics Technology Initiative (NIfTI) format converter. The converter automatically extracts information from the DICOMDIRs, converts the DICOM data into the NIfTI format, and stores the converted files according to the BIDS standard. Finally, a self-developed Python tool (Image2BIDS2SQLite⁵) generates image references through SQL queries and loads them into the SQLite database (Figure 3B).

The BIDS-compliant data structure (Figure 4) features a division between raw and analysis data (see section 2.1.3). Data differing from raw anatomical images is, in fact, saved within dedicated ‘derivatives’ folders. Inside the ‘derivatives’ folder, files are categorized according to the subject to which they refer. This classification is performed in 2 steps: firstly by distinguishing the type of subject (e.g. ‘Patient’, ‘Atlas’, ‘Electrode’) and then by indicating each subject with its unique subject ID. The subject type ‘Atlas’ includes the group analysis results contained in a common reference space. Since anatomical atlases are generated from multiple patients’ images, several atlases can exist. Thus, the ‘version’ distinguishes atlases generated from different data or image modalities. The ‘Electrode’ subject type includes the electrodes used for implantation. Each electrode is characterized by a different structure and therefore a different model to be used for electrical field simulations. Within the subjects’ subfolders, files are organized by content rather than by process. For instance, the folder ‘Segmentations’ contains segmented anatomical structures. Naming conventions such as the tag “PatientInAtlas” are used to facilitate tracking transformations between different imaging spaces. Such information is also captured in the ‘space’ field of the transformed image file. For instance, when transforming the file sub-<subject_id>_ses->session<_acq->ppp<_T1w.nii.gz to the image space of sub-ATLAS_ses-<version>, the output file will be named sub-<subject_id>_space-<version>_ses-<session>_acq-<ppp>_T1w.nii.gz and will be saved in the folder derivatives/sub-<subject_id>/PatientInAtlas. This structure essentially functions as a clear legend, allowing researchers to readily identify the data used and generated during various stages of the workflow.

FIGURE 4

Figure 4. BIDS-compliant folder structure for Deep Brain Stimulation imaging files and processing outputs. All the files generated from some processing or analysis step are saved as BIDS ‘derivatives’. The derivatives files are grouped firstly by subject type (’Patient’, ‘Atlas’, ‘Electrode’) and on a second level by subject ID. The naming convention of files in folders such as ‘PatientInAtlas’ or ‘AtlasInPatient’ allows to keep track of transformations between different image spaces. Anatomical atlas images are saved as belonging to an ATLAS subject. Anatomical atlases generated from different patient groups or from data acquired with different imaging modalities are distinguished by the ‘version’ parameter. The legend allows identifying the groups of files generated or used in the different steps of group analysis in Deep Brain Stimulation.

2.6 Comprehensive data management

The system relies on SQLite, a relational database management system, to seamlessly integrate clinical data and image paths within a centralized system. This approach streamlines data organization and facilitates efficient retrieval of clinical information alongside corresponding image files.

The patient-centric database schema consists of 28 tables containing 230 data fields (e.g., patient ID, age, implant position, image file path) and 33 established relationships between these tables. These tables can be grouped into specific categories based on information content (e.g., Clinical Data, Imaging Data, Stimulation Configuration Evaluations, Targeting). Notably, all groups except for the Electrode table have a direct connection to the central patient table (Figure 5).

FIGURE 5

Figure 5. Patient-centric relational database schema, including clinical data pulled from REDCap and references to raw and processed imaging files stored according to BIDS. The database schema is designed mirroring the different data types collected during the DBS procedure.

Clinical data tables encompass information collected during clinic visits. The “Clinical Data General” table stores medication data, while three other tables within this group capture data from relevant DBS clinical evaluations. Tables within the “Stimulation Configuration Evaluation” group store data regarding intra-operative or post-operative stimulation settings, their clinical effects, and potential side effects. “Microelectrode Recordings” tables specifically house data from intra-operative recordings used to identify distinct brain regions. The “Targeting” tables contain data relevant to pre-operative planning and arc settings. “Chronic Stimulation Configuration” stores information about patients’ post-operative electrode settings for daily use. The “Implanted Position” table captures the placement of the electrode after surgery. The “Electrodes” table stores data about the studies’ relevant electrodes and links to the “Files” table to establish a direct path to electrode-specific files.

Imaging data is managed through four tables (Figure 6). The “Files” table stores the relationship to the patient or electrode, the file path, and the general file type. If a file originates from another file, this information is documented in the “source_id” field, which links to the original file. The “transformation_id” field describes the relationship to the “Transformations” table, specifying the transform used to generate the file. The “Transformations” table stores the “transform_id” representing a transform (warp) file and the “target_id” representing the target image file (e.g. the reference image space of the transformation output). The remaining tables, “Bids” and “Labels,” share the same ID as the “Files” table. They store additional information specific to file types. For instance, the “Labels” table includes a field indicating the anatomical brain structure represented by the label file, while the “Bids” table contains BIDS-specific details for images, such as acquisition time and modality.

FIGURE 6

Figure 6. Relational database tables handling references to imaging files. The table ‘Bids’ contains some BIDS-specific image information, while the table ‘Files’ allows to retrieve the file path and the subject to which the files refer. Moreover, together with the table ‘Transformations’ it allows to keep track of image transformations between different spaces (e.g., patient, anatomical atlas). Finally, if the file represents a labeled anatomical structure, some additional information is captured in the table ‘Labels’.

In conjunction with the database, a collection of Python functions was designed to facilitate data insertion, updating, deletion, and querying operations. This approach obviates the necessity for researchers to write intricate SQL queries while managing data for analysis.

3 System functionalities and practical application

This paper presents a data management system that leverages the strengths of REDCap, BIDS and SQLite to streamline data capture and organization in neuroscience based on an application example in DBS research. The system effectively addresses the challenges associated with managing diverse data types, including patient-related information, medical images, and post-processing results.

3.1 Data management system overview

The system performs three main tasks:

1. Clinical Data Management

2. Image Data Management

3. Comprehensive Data Management

These components work together seamlessly to ensure efficient data capture, organization, and retrieval (Figure 7).

FIGURE 7

Figure 7. Overview of the components of the data collection and management framework. Clinical and imaging data is collected in the hospital. Clinical data is entered in the REDCap web-platform while imaging data is stored and transferred in DICOMDIR format. DICOMDIR data is converted in NifTI format and stored in a BIDS-compliant repository. References to medical images, images metadata and clinical data are stored in an SQLite database providing a unified interface to all the heterogeneous types of data.

The data management system addresses the requirements of both clinicians and researchers involved in DBS studies. Specifically, it enabled functionalities that were previously unattainable with earlier data collection and management techniques and enhanced the efficiency and automation of existing functions. This section provides a detailed overview of the new framework’s impact compared to the previously used methods, referring to the use case analysis of section 2.2 and the identified functional system requirements listed in Tables 1, 2. The system also addresses the non-functional requirements of data security and integrity, system efficiency, usability, accessibility and regulatory compliance. Table 3 presents a short description of such requirements and how they are fulfilled. In all three tables the requirements are identified by an alphanumeric code (e.g. F.x indicates functional ones, nF.x indicates non-functional ones).

TABLE 1

Table 1. Clinician use cases: The table details the functional requirements of the developed data management system related to the clinician use cases.

TABLE 2

Table 2. Researcher use cases: The table details the functional requirements of the developed data management system related to the researcher use cases.

TABLE 3

Table 3. The table details the non-functional requirements of the developed data management system. These requirements are more general and therefore apply to all actors.

3.2 Clinician requirements

REDCap’s web-based interface allows clinicians to enter patient data efficiently through structured forms satisfying all four requirements from use case 1.1 (Easy Data Entry). On the other hand, paper or spreadsheet-based data collection does not satisfy requirements 1.1-F.2 (Identify mandatory data fields) and 1.1-F.3 (Field-level data validation) and requirement 1.1-F.4 (Record data as batch) is completely not applicable for such methodologies. REDCap also facilitates exporting relevant datasets (either entire cohorts or specific records) to enable clinicians to perform further analyses, satisfying all three requirements from use case 1.2 (Data Export). Paper or spreadsheet-based data collection methodologies can satisfy requirement 1.2-F.5 (Export whole data of entire cohorts) by copying the files. They can also meet requirement 1.2-F.6 (Export filtered data of defined patients) but implying a substantial manual effort while requirement 1.2-F.7 (Export data in different formats for statistical analysis) cannot be fulfilled without the usage of an additional file converter. Moreover, paper and spreadsheet-based methodologies do not allow to track changed and missing values, not suiting the requirements of use case 1.3. At the same time, REDCap shows the up-to-date forms status, and tracks changes in each value field.

3.3 Researcher requirements

For researchers, accessing and re-formatting data collected in hospital settings can be a time-consuming and error prone task (Use Case 2.1: Data Retrieval). Thanks to REDCap’s secure API researchers can avoid having to manually transfer clinical data which is automatically streamed to the SQLite database. Therefore, requirement 2.1-F.10 (Access clinician-captured data) is fulfilled in a matter of minutes against the hours or days needed when extracting information from paper or Excel-based reports. Furthermore, the data format is homogeneous across all patients included in the same REDCap project. On the downside, the image data currently still needs to be manually moved. Following the data transfer, the SQLite database enables data storage in a self-defined data schema. The researcher can then query both clinical and imaging data with a specific granularity while being informed of data provenance and modifications. Requirements 2.1-F.11 (Organize and store data with a well-defined granularity for efficient management), 2.1-F.12 (Employ search queries to rapidly retrieve specific clinical or imaging data) and 2.1-F.13 (Be aware of data provenance and of the transformations applied to data) are seamlessly satisfied. Even the most well-organized file systems and spreadsheets would only partially fulfill these requirements. In addition, data retrieval would take tens of minutes to one hour depending on the dataset size and domain-expertise of the user, compared to the few minutes required to query the database independently from the user’s familiarity with the data.

Requirement 2.2-F.14 (Edit existing data entries for corrections or updates) is satisfied by both new and old frameworks. Nevertheless, the suite of Python functions available for inserting or updating database entries offers a unique method of interacting with the data. This approach eliminates the need to adapt the methodology based on varying data formats. Similar considerations can be made regarding requirements 2.3-F.15 (Interact seamlessly with external data processing tools for analysis workflows), 2.4-F.16 (Share research findings with authorized users within the system) and 2.4-F.17 (Save analysis output files according to a defined standard). In principle both the SQLite database and a simple file system can be accessed by processing pipelines and permit to save output files as desired. The strength of the proposed framework resides in the univocity of the data source interrogated by analysis workflows, relieving the user from needing to know each file’s specific location. Furthermore, saving the analysis results according to the BIDS standard, along with the automatic creation of references to these files in the database, prevents data duplication: a reduction of the total number of files by two-thirds was observed in the context of this work.

Figure 8 shows how the use cases for clinicians and researchers play out with the support of the presented data management system.

FIGURE 8

Figure 8. Graphical representation of how the use cases for clinicians and researchers play out with the support of the presented data management system. During data acquisition REDCap allows clinicians to easily carry out use case 1.1 (Data Entry), use case 1.2 (Data Export) and use case 1.3 (Data Edit). The file organization according to BIDS and storage of both clinical data and references to imaging files in the SQLite database enables researchers to efficiently retrieve specific data (use case 2.1). Such data is fed to processing pipelines for analysis (use case 2.3) and results are saved back into the database and the BIDS project folder (use case 2.4).

3.4 Application example

To validate the proposed data management system, data was collected and analyzed from two medical institutions: University Hospitals of Basel and Clermont-Ferrand.

Separate REDCap projects were created for each institution, ensuring consistency while allowing for some customization. Both projects utilized forms with a similar structure, but specific fields were adapted to each center’s data collection needs. Notably, the forms were designed with direct data validation in mind. For example, in the targeting information section text entries were restricted to prevent insertion of invalid X, Y, and Z coordinate inputs.

Collaborating physicians diligently filled out these forms. For each patient, completion of 10 distinct forms was required. The process began with the patient information form, capturing demographics and relevant medical history. Subsequent forms documented pre- and post-operative clinical evaluations and medications, available imaging data with accompanying metadata, planned electrode placement parameters, intra-operative microelectrode recordings, details of stimulation procedures and their effects, final electrode positioning, and post-operative screening results. Importantly, many forms were filled out multiple times throughout the process.

In addition to data entry in REDCap, physicians also identified research-relevant images and plans from the neurosurgical planning stations (Brainlab Elements⁶) and exported those. Following data collection by physicians and image data transfer, the system seamlessly integrated the information into the database. This integration was achieved using the two pipelines described in Methods sections 2.4 and 2.5.

The system allowed to successfully collect data from 107 Essential Tremor (ET) and Parkinson’s Disease (PD) patients, including an average of 35 imaging files per patient. These files encompass MRI and CT scans, along with labeled anatomical structures. The data is securely stored and managed within the data management system. An illustrative scenario where database-driven data integration is crucial occurs during the simulation of brain tissue volume activated by specific stimulation parameters. This process involves generating a patient-specific brain tissue model from their MRI scan and inputting the stimulation configuration into the simulation software. The developed database serves as a centralized repository for all necessary information: stimulation parameters can be accessed via querying the ‘Stimulations’ table, while the file path to the patient’s brain image is retrievable from the ‘Files’ table. Upon generation of simulation files, references to them are stored in both the ‘Files’ and ‘Bids’ tables. Furthermore, the system’s true power lies in its ability to facilitate group-level analysis. Its design allows the streamlined selection of all requisite files for the generation of disease-specific anatomical atlases or stimulation maps in a singular, efficient step. Additionally, the system processes and stores files generated during post-processing steps, guaranteeing data integrity and enabling clear traceability throughout the research process. The repository NeuroDataManagementSystem⁷ contains extracted sample REDCap data and image files, together with the BIDS project folder and populated SQLite database resulting from the application of the REDCap2SQLite and Image2BIDS2SQLite pipelines.

4 Discussion

Streamlining data capture and management is critical for advancing neuroscience research. Tackling the challenges posed by handling extensive volumes of heterogeneous data collected across multiple sites requires the implementation of a robust data management system. Such a system can bring enhancements in data quality (I), organization (II), analysis (III) and collaboration (IV) among medical and scientific institutions. The system presented in this manuscript offers progress in all of these points. The usage of a web-platform facilitates the transition away from paper or spreadsheet-based data collection methodologies. Data validation functionalities allow to minimize errors associated with manual data entry, ensuring data accuracy and consistency (I). Intuitive web forms require minimal computer proficiency by the user. Furthermore, the compliance to General Data Protection Regulation (GDPR) guarantees secure data sharing between researchers and clinicians, fostering collaborative research efforts (IV). All of these attributes, coupled with its cost-free accessibility, made REDCap the optimal selection for clinical data acquisition (Harris et al., 2009).

Data structure standardization strongly impacts cross-lab collaborations and study reproducibility (Zehl et al., 2016; Zwiers et al., 2022). Considering its expanding adoption within the neuroscience community, we structured our imaging data in accordance with the BIDS standard (Gorgolewski et al., 2016). A customized tool automates the conversion of image files from DICOM to NifTI format and arranges them in a BIDS-compliant structure, taking care of the laborious task of manually naming and saving files. BIDS was initially designed to facilitate the organization of raw imaging data and associated metadata. Nonetheless, judicious usage of the derivatives folder allowed to incorporate additional files generated by analysis pipelines while preserving adherence to the standard. As a result of this implementation, output files deriving from group analysis and transformations across various image spaces can be readily monitored and accessed (II). Recently, BIDS has also been adopted by Lead-DBS (Neudorfer et al., 2023). Lead-DBS is a neuroimaging platform allowing to perform image processing and electrical field simulation operations. Their folder structure also features a first subdivision by subject. However, files are then grouped by processing step rather than by content. Moreover, files belonging to a common anatomical space (e.g. atlas files) are not collected in a group analysis folder but they are included in patient folders. This highlights an alternative perspective: the utilization of BIDS derivatives offers flexibility in organizing files and accommodating various data types within the standard, albeit leading to increased variability in folder structures across different research groups. Consequently, the implementation of additional guidelines for organizing derivatives could prove advantageous.

Group and longitudinal analyses can strongly benefit from the joint power of demographics, clinical and imaging data. Nevertheless, a problem which frequently arises when dealing with heterogeneous data is maintaining the connections between metadata and the data objects to which they refer (Zehl et al., 2016). Recently, a solution for integrating data collected via REDCap with imaging data structured according to BIDS has been presented (Kuplicki et al., 2021). It consists in exporting clinical measures stored in REDCap in tsv files saved in the BIDS folders. However, this solution does not optimally facilitate data retrieval and fails to provide a holistic picture of the collected data. To overcome these limitations, we modeled and implemented an SQLite relational database. Self-developed tools enable the automatic population of database records with data sourced from REDCap and references to imaging files. This approach allows the database to provide a complete overview of the available data and to serve as a unified interface for accessing it. In recent years, data management systems based on a relational database backend have emerged within the neuroscience community (Marcus et al., 2007; Keator et al., 2009; Van Horn and Toga, 2009; Prodanov, 2011; Scott et al., 2011; Das et al., 2012; Book et al., 2013; Muehlboeck et al., 2014; Woodman et al., 2014; Grigis et al., 2017). Nonetheless, a considerable portion of these systems currently lacks compatibility with BIDS-compliant data standards. Furthermore, although these systems primarily focus on improving neuroscience data sharing among researchers, they have not been specifically designed with consideration for specific clinical and research workflows such as inherent to DBS. Consequently, their adoption would have required significant effort in user training and adaptation to DBS-specific use cases and pipelines. The database presented in this manuscript employs a DBS-specific data model, structured to correspond with the various sources of data involved in the DBS procedure. Moreover, a patient-centric approach was followed, as was also done in Neuroinformatics Database providing a platform for storing, analyzing and sharing neuroimaging data [NiDB (Book et al., 2013)] and Longitudinal Online Research and Imaging System [LORIS (Das et al., 2012)], dealing with multi-center, heterogeneous data acquisition, organization and dissemination. This decision was made considering its compatibility with both clinical procedures and analytical frameworks.

By harnessing the synergy between REDCap, BIDS, and SQLite, a data management system fostering heterogeneous data integration and handling for collaborative research purposes has been successfully engineered. In addition, although the database schema is DBS-specific, the pipelines enabling inter-platform communication and the overall framework concept can be easily employed in other neuroscience fields. The adoption of such a system holds the potential to substantially enhance the speed, efficiency, and robustness of research advancements reliant on large-scale data analyses. One of the advantages of utilizing a system comprising three interoperable yet independent components is that the temporary failure of one element does not compromise the functionality of the others. For instance, a temporary unavailability of the SQLite database does not entirely impede data retrieval: clinical data can still be directly exported from REDCap, and imaging data can be retrieved from the BIDS project folder, together with the relevant metadata contained in the JSON sidecar files. Although this would increase the complexity and time required for data integration, it would not completely halt workflows or, more importantly, result in data loss. Nonetheless, this system still constitutes the initial prototype of a comprehensive data management platform. Moving forward, some limitations will require attention and resolution in subsequent developments. The database is presently equipped with a suite of Python functions facilitating straightforward data query, insertion, update, and deletion. An initial enhancement would involve integrating a graphical user interface (GUI). This augmentation would enable researchers lacking coding proficiency to query data and visualize the query results, thereby enhancing the usability and user-friendliness of the system. A subsequent step would entail integrating statistical analyses or processing procedures with the system. In this way such analyses could be directly executed from the GUI rather than using external scripts. Finally, analyses outcomes data are currently accessible exclusively to authorized users within our institution. Implementing web-based access to the database would facilitate retrieval of DBS research output data by members of the scientific community. This would thus increase accessibility and utilization of such data.

5 Conclusion

Modeling and managing workflows play a crucial role in both present and future neuroscience endeavors. This is especially noticeable in light of big data continuous evolution and increasing collaborations in the scientific community. This work demonstrates the successful integration of a translational tool for streamlining data collection and organization between clinics and research institutes in the field of Deep Brain Stimulation. The proposed framework captures both standardized imaging data and comprehensive patient-metadata within a unified system. The system has demonstrably achieved successful validation through data collection and analysis from two medical institutions. It offers efficient storage and management of large-scale clinical and imaging data, while simultaneously simplifying data retrieval for group-level analysis. This approach enables researchers in the field of DBS to leverage the richness of diverse data types, potentially leading to improved clinical decision-making and ultimately, better patient outcomes.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

MS: Writing – review and editing, Writing – original draft, Software, Methodology, Data curation, Conceptualization. VB: Writing – review and editing, Writing – original draft, Software, Conceptualization. DV: Writing – review and editing, Validation, Supervision, Conceptualization. SH: Writing – review and editing, Supervision, Project administration, Funding acquisition.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was financially supported by the Swiss National Science Foundation (Grant No. 205320-207491).

Acknowledgments

We thank Prof. Jean-Jacques Lemaire (Université Clermont Auvergne), PhD Jérôme Coste (Université Clermont Auvergne), Dr. Peter Fuhr (University Hospital Basel), Dr. Ute Gschwandtner (University Hospital Basel), Dr. Ethan Taub (University Hospital Basel), Dr. Siegward Elsas (Clinic Arlesheim) for sharing their expertise in the field of Deep Brain Stimulation and contributing to the data collection through REDCap. We also thank Prof. Markus Degen (University of Applied Sciences and Arts Northwestern Switzerland) for setting up and maintaining the REDCap server at our institution. Finally we acknowledge the usage of the generative AI model Adobe Firefly for the generation of some icons included in the manuscript’s figures.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fninf.2024.1435971/full#supplementary-material

Footnotes

References

Alonso, F., Latorre, M., Göransson, N., Zsigmond, P., and Wårdell, K. (2016). Investigation into deep brain stimulation lead designs: A patient-specific simulation study. Brain Sci. 6:39. doi: 10.3390/brainsci6030039

PubMed Abstract | Crossref Full Text | Google Scholar

Åström, M., Diczfalusy, E., Martens, H., and Wårdell, K. (2015). Relationship between neural activation and electric field distribution during deep brain stimulation. IEEE Trans. Biomed. Eng. 62, 664–672. doi: 10.1109/TBME.2014.2363494

PubMed Abstract | Crossref Full Text | Google Scholar

Book, G. A., Anderson, B. M., Stevens, M. C., Glahn, D. C., Assaf, M., and Pearlson, G. D. (2013). Neuroinformatics database (NiDB) – A modular, portable database for the storage, analysis, and sharing of neuroimaging data. Neuroinformatics 11, 495–505. doi: 10.1007/s12021-013-9194-1

PubMed Abstract | Crossref Full Text | Google Scholar

Butenko, K., Bahls, C., Schröder, M., Köhling, R., and van Rienen, U. (2020). OSS-DBS: Open-source simulation platform for deep brain stimulation with a comprehensive automated modeling. PLoS Comput. Biol. 16:e1008023. doi: 10.1371/journal.pcbi.1008023

PubMed Abstract | Crossref Full Text | Google Scholar

Butson, C. R., Cooper, S. E., Henderson, J. M., and McIntyre, C. C. (2006). “Predicting the effects of deep brain stimulation with diffusion tensor based electric field models,” in Medical image computing and computer-assisted intervention – MICCAI 2006, eds R. Larsen, M. Nielsen, and J. Sporring (Berlin: Springer), 429–437. doi: 10.1007/11866763_53

PubMed Abstract | Crossref Full Text | Google Scholar

Butson, C. R., Cooper, S. E., Henderson, J. M., Wolgamuth, B., and McIntyre, C. C. (2011). Probabilistic analysis of activation volumes generated during deep brain stimulation. Neuroimage 54, 2096–2104. doi: 10.1016/j.neuroimage.2010.10.059

PubMed Abstract | Crossref Full Text | Google Scholar

Chabardes, S., Isnard, S., Castrioto, A., Oddoux, M., Fraix, V., Carlucci, L., et al. (2015). Surgical implantation of STN-DBS leads using intraoperative MRI guidance: Technique, accuracy, and clinical benefit at 1-year follow-up. Acta Neurochir. 157, 729–737. doi: 10.1007/s00701-015-2361-4

PubMed Abstract | Crossref Full Text | Google Scholar

Choplin, R. H., Boehme, J. M., and Maynard, C. D. (1992). Picture archiving and communication systems: An overview. RadioGraphics 12, 127–129. doi: 10.1148/radiographics.12.1.1734458

PubMed Abstract | Crossref Full Text | Google Scholar

D’Haese, P.-F., Cetinkaya, E., Konrad, P. E., Kao, C., and Dawant, B. M. (2005). Computer-aided placement of deep brain stimulators: From planningto intraoperative guidance. IEEE Trans. Med. Imaging 24, 1469–1478. doi: 10.1109/TMI.2005.856752

PubMed Abstract | Crossref Full Text | Google Scholar

Das, S., Zijdenbos, A. P., Harlap, J., Vins, D., and Evans, A. C. (2012). LORIS: A web-based data management system for multi-center studies. Front. Neuroinform. 5:37. doi: 10.3389/fninf.2011.00037

PubMed Abstract | Crossref Full Text | Google Scholar

Dash, S., Shakyawar, S. K., Sharma, M., and Kaushik, S. (2019). Big data in healthcare: Management, analysis and future prospects. J. Big Data 6:54. doi: 10.1186/s40537-019-0217-0

Crossref Full Text | Google Scholar

Deserno, T. M., Haak, D., Samsel, C., Gehlen, J., and Kabino, K. (2013). “Integrating image management and analysis into OpenClinica using web services,” in Proceedings of the medical imaging 2013: Advanced PACS-based imaging informatics and therapeutic applications, (SPIE), (Orlando, FL), 88–97. doi: 10.1117/12.2008004

Crossref Full Text | Google Scholar

DiEuliis, D., and Giordano, J. (2016). “Neurotechnological convergence and ‘big data’: A force-multiplier toward advancing neuroscience,” in Ethical reasoning in big data: An exploratory analysis, eds J. Collmann and S. A. Matei (Cham: Springer International Publishing), 71–80. doi: 10.1007/978-3-319-28422-4_6

Crossref Full Text | Google Scholar

Dipietro, L., Gonzalez-Mego, P., Ramos-Estebanez, C., Zukowski, L. H., Mikkilineni, R., Rushmore, R. J., et al. (2023). The evolution of big data in neuroscience and neurology. J. Big Data 10:116. doi: 10.1186/s40537-023-00751-2

PubMed Abstract | Crossref Full Text | Google Scholar

Dragly, S.-A., Hobbi Mobarhan, M., Lepperød, M. E., Tennøe, S., Fyhn, M., Hafting, T., et al. (2018). Experimental directory structure (Exdir): An alternative to HDF5 without introducing a new file format. Front. Neuroinform. 12:16. doi: 10.3389/fninf.2018.00016

PubMed Abstract | Crossref Full Text | Google Scholar

Eisenstein, S. A., Koller, J. M., Black, K. D., Campbell, M. C., Lugar, H. M., Ushe, M., et al. (2014). Functional anatomy of subthalamic nucleus stimulation in Parkinson disease: STN DBS location and PD. Ann. Neurol. 76, 279–295. doi: 10.1002/ana.24204

PubMed Abstract | Crossref Full Text | Google Scholar

Evans, A. C., Kamber, M., Collins, D. L., and MacDonald, D. (1994). “An MRI-based probabilistic atlas of neuroanatomy,” in Magnetic resonance scanning and epilepsy, eds S. D. Shorvon, D. R. Fish, F. Andermann, G. M. Bydder, and H. Stefan (Boston, MA: Springer US), 263–274. doi: 10.1007/978-1-4615-2546-2_48

Crossref Full Text | Google Scholar

Ferguson, A. R., Nielson, J. L., Cragin, M. H., Bandrowski, A. E., and Martone, M. E. (2014). Big data from small data: Data-sharing in the “long tail” of neuroscience. Nat. Neurosci. 17, 1442–1447. doi: 10.1038/nn.3838

PubMed Abstract | Crossref Full Text | Google Scholar

Gorgolewski, K. J., Auer, T., Calhoun, V. D., Craddock, R. C., Das, S., Duff, E. P., et al. (2016). The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Sci. Data 3:160044. doi: 10.1038/sdata.2016.44

PubMed Abstract | Crossref Full Text | Google Scholar

Grigis, A., Goyard, D., Cherbonnier, R., Gareau, T., Papadopoulos Orfanos, D., Chauvat, N., et al. (2017). Neuroimaging, genetics, and clinical data sharing in python using the cubicweb framework. Front. Neuroinform. 11:18. doi: 10.3389/fninf.2017.00018

PubMed Abstract | Crossref Full Text | Google Scholar

Hariz, M., Blomstedt, P., and Zrinzo, L. (2013). Future of brain stimulation: New targets, new indications, new technology. Mov. Disord. 28, 1784–1792. doi: 10.1002/mds.25665

PubMed Abstract | Crossref Full Text | Google Scholar

Harris, P. A., Taylor, R., Thielke, R., Payne, J., Gonzalez, N., and Conde, J. G. (2009). Research electronic data capture (REDCap) - A metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 42, 377–381. doi: 10.1016/j.jbi.2008.08.010

PubMed Abstract | Crossref Full Text | Google Scholar

Hemm, S., and Wårdell, K. (2010). Stereotactic implantation of deep brain stimulation electrodes: A review of technical systems, methods and emerging tools. Med. Biol. Eng. Comput. 48, 611–624. doi: 10.1007/s11517-010-0633-y

PubMed Abstract | Crossref Full Text | Google Scholar

Horn, A., Li, N., Dembek, T. A., Kappel, A., Boulay, C., Ewert, S., et al. (2019). Lead-DBS v2: Towards a comprehensive pipeline for deep brain stimulation imaging. Neuroimage 184, 293–316. doi: 10.1016/j.neuroimage.2018.08.068

PubMed Abstract | Crossref Full Text | Google Scholar

Jegou, A., Roehri, N., Medina Villalon, S., Colombet, B., Giusiano, B., Bartolomei, F., et al. (2022). BIDS Manager-Pipeline: A framework for multi-subject analysis in electrophysiology. Neurosci. Inform. 2:100072. doi: 10.1016/j.neuri.2022.100072

Crossref Full Text | Google Scholar

Johnson, M. D., Lim, H. H., Netoff, T. I., Connolly, A. T., Johnson, N., Roy, A., et al. (2013). Neuromodulation for brain disorders: Challenges and opportunities. IEEE Trans. Biomed. Eng. 60, 610–624. doi: 10.1109/TBME.2013.2244890

PubMed Abstract | Crossref Full Text | Google Scholar

Keator, D. B., Wei, D., Gadde, S., Bockholt, H. J., Grethe, J. S., Marcus, D., et al. (2009). Derived data storage and exchange workflow for large-scale neuroimaging analyses on the BIRN grid. Front. Neuroinform. 3:2009. doi: 10.3389/neuro.11.030.2009

PubMed Abstract | Crossref Full Text | Google Scholar

Koeglsperger, T., Palleis, C., Hell, F., Mehrkens, J. H., and Bötzel, K. (2019). Deep brain stimulation programming for movement disorders: Current concepts and evidence-based strategies. Front. Neurol. 10:410. doi: 10.3389/fneur.2019.00410

PubMed Abstract | Crossref Full Text | Google Scholar

Kuplicki, R., Touthang, J., Al Zoubi, O., Mayeli, A., Misaki, M., NeuroMAP-Investigators, et al. (2021). Common data elements, scalable data management infrastructure, and analytics workflows for large-scale neuroimaging studies. Front. Psychiatry 12:682495. doi: 10.3389/fpsyt.2021.682495

PubMed Abstract | Crossref Full Text | Google Scholar

Latorre, M. A., and Wårdell, K. (2019). A comparison between single and double cable neuron models applicable to deep brain stimulation. Biomed. Phys. Eng. Express 5:025026. doi: 10.1088/2057-1976/aafdd9

Crossref Full Text | Google Scholar

Lemaire, J. J., Sakka, L., Ouchchane, L., Caire, F., Gabrillargues, J., and Bonny, J. M. (2010). Anatomy of the human thalamus based on spontaneous contrast and microscopic voxels in high-field magnetic resonance imaging. Neurosurgery 66, 161–172.

Google Scholar

Levitas, D., Hayashi, S., Vinci-Booher, S., Heinsfeld, A., Bhatia, D., Lee, N., et al. (2024). ezBIDS: Guided standardization of neuroimaging data interoperable with major data archives and platforms. Sci. Data 11:179. doi: 10.1038/s41597-024-02959-0

PubMed Abstract | Crossref Full Text | Google Scholar

Li, X., and Liang, H. (2022). Project, toolkit, and database of neuroinformatics ecosystem: A summary of previous studies on “Frontiers in Neuroinformatics.” Front. Neuroinform. 16:902452. doi: 10.3389/fninf.2022.902452

PubMed Abstract | Crossref Full Text | Google Scholar

Lopez-Novoa, U., Charron, C., Evans, J., and Beltrachini, L. (2019). “The BIDS toolbox: A web service to manage brain imaging datasets,” in Proceedings of the 2019 IEEE SmartWorld, ubiquitous intelligence & computing, advanced & trusted computing, scalable computing & communications, cloud & big data computing, internet of people and smart city innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), (Leicester), 378–382. doi: 10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00108

Crossref Full Text | Google Scholar

Lozano, A. M., Lipsman, N., Bergman, H., Brown, P., Chabardes, S., Chang, J. W., et al. (2019). Deep brain stimulation: Current challenges and future directions. Nat. Rev. Neurol. 15, 148–160. doi: 10.1038/s41582-018-0128-2

PubMed Abstract | Crossref Full Text | Google Scholar

Marcus, D. S., Olsen, T. R., Ramaratnam, M., and Buckner, R. L. (2007). The extensible neuroimaging archive toolkit: An informatics platform for managing, exploring, and sharing neuroimaging data. Neuroinformatics 5, 11–33.

Google Scholar

Miyagi, Y., Okamoto, T., Morioka, T., Tobimatsu, S., Nakanishi, Y., Aihara, K., et al. (2009). Spectral analysis of field potential recordings by deep brain stimulation electrode for localization of subthalamic nucleus in patients with Parkinson’s disease. Ster. Funct. Neurosurg. 87, 211–218.

Google Scholar

Muehlboeck, J.-S., Westman, E., and Simmons, A. (2014). TheHiveDB image data management and analysis framework. Front. Neuroinform. 7:49. doi: 10.3389/fninf.2013.00049

PubMed Abstract | Crossref Full Text | Google Scholar

Neudorfer, C., Butenko, K., Oxenford, S., Rajamani, N., Achtzehn, J., Goede, L., et al. (2023). Lead-DBS v3.0: Mapping deep brain stimulation effects to local anatomy and global networks. Neuroimage 268:119862. doi: 10.1016/j.neuroimage.2023.119862

PubMed Abstract | Crossref Full Text | Google Scholar

Neudorfer, C., Elias, G. J. B., Jakobs, M., Boutet, A., Germann, J., Narang, K., et al. (2021). Mapping autonomic, mood and cognitive effects of hypothalamic region deep brain stimulation. Brain 144, 2837–2851. doi: 10.1093/brain/awab170

PubMed Abstract | Crossref Full Text | Google Scholar

Nordenström, S., Petermann, K., Debove, I., Nowacki, A., Krack, P., Pollo, C., et al. (2022). Programming of subthalamic nucleus deep brain stimulation for Parkinson’s disease with sweet spot-guided parameter suggestions. Front. Hum. Neurosci. 16:925283. doi: 10.3389/fnhum.2022.925283

PubMed Abstract | Crossref Full Text | Google Scholar

Nordin, T., Vogel, D., Österlund, E., Johansson, J., Blomstedt, P., Fytagoridis, A., et al. (2022). Probabilistic maps for deep brain stimulation - Impact of methodological differences. Brain Stimul. 15, 1139–1152. doi: 10.1016/j.brs.2022.08.010

PubMed Abstract | Crossref Full Text | Google Scholar

Poline, J.-B., Breeze, J. L., Ghosh, S. S., Gorgolewski, K., Halchenko, Y. O., Hanke, M., et al. (2012). Data sharing in neuroimaging research. Front. Neuroinform. 6:9. doi: 10.3389/fninf.2012.00009

PubMed Abstract | Crossref Full Text | Google Scholar

Prodanov, D. (2011). Data ontology and an information system realization for web-based management of image measurements. Front. Neuroinform. 5:25. doi: 10.3389/fninf.2011.00025

PubMed Abstract | Crossref Full Text | Google Scholar

Reich, M. M., Horn, A., Lange, F., Roothans, J., Paschen, S., Runge, J., et al. (2019). Probabilistic mapping of the antidystonic effect of pallidal neurostimulation: A multicentre imaging study. Brain 142, 1386–1398. doi: 10.1093/brain/awz046

PubMed Abstract | Crossref Full Text | Google Scholar

Rezai, A. R., Kopell, B. H., Gross, R. E., Vitek, J. L., Sharan, A. D., Limousin, P., et al. (2006). Deep brain stimulation for Parkinson’s disease: Surgical issues. Mov. Disord. 21, S197–S218. doi: 10.1002/mds.20956

PubMed Abstract | Crossref Full Text | Google Scholar

Roediger, J., Dembek, T. A., Wenzel, G., Butenko, K., Kühn, A. A., and Horn, A. (2021). StimFit–a data-driven algorithm for automated deep brain stimulation programming. Mov. Disord. 37, 574–584. doi: 10.1002/mds.28878

PubMed Abstract | Crossref Full Text | Google Scholar

Saczynski, J. S., McManus, D. D., and Goldberg, R. J. (2013). Commonly utilized data collection approaches in clinical research. Am. J. Med. 126:16. doi: 10.1016/j.amjmed.2013.04.016

PubMed Abstract | Crossref Full Text | Google Scholar

Schrader, B., Hamel, W., Weinert, D., and Mehdorn, H. M. (2002). Documentation of electrode localization. Mov. Disord. 17, S167–S174.

Google Scholar

Scott, A., Courtney, W., Wood, D., de la Garza, R., Lane, S., King, M., et al. (2011). COINS: An innovative informatics and neuroimaging tool suite built for large heterogeneous datasets. Front. Neuroinform. 5:33. doi: 10.3389/fninf.2011.00033

PubMed Abstract | Crossref Full Text | Google Scholar

Sejnowski, T. J., Churchland, P. S., and Movshon, J. A. (2014). Putting big data to good use in neuroscience. Nat. Neurosci. 17, 1440–1441. doi: 10.1038/nn.3839

PubMed Abstract | Crossref Full Text | Google Scholar

Shah, A., Vogel, D., Alonso, F., Lemaire, J.-J., Pison, D., Coste, J., et al. (2020). Stimulation maps: Visualization of results of quantitative intraoperative testing for deep brain stimulation surgery. Med. Biol. Eng. Comput. 58, 771–784. doi: 10.1007/s11517-020-02130-y

PubMed Abstract | Crossref Full Text | Google Scholar

Turner, J. A., and Van Horn, J. (2012). Electronic data capture, representation, and applications for neuroimaging. Front. Neuroinform. 6:16. doi: 10.3389/fninf.2012.00016

PubMed Abstract | Crossref Full Text | Google Scholar

Turner, J. A., Lane, S. R., Bockholt, H. J., and Calhoun, V. D. (2011). The clinical assessment and remote administration tablet. Front. Neuroinform. 5:31. doi: 10.3389/fninf.2011.00031

PubMed Abstract | Crossref Full Text | Google Scholar

Van Horn, J. D., and Toga, A. W. (2009). Is it time to re-prioritize neuroimaging databases and digital repositories? Neuroimage 47, 1720–1734. doi: 10.1016/j.neuroimage.2009.03.086

PubMed Abstract | Crossref Full Text | Google Scholar

Vogel, D., Shah, A., Coste, J., Lemaire, J.-J., Wårdell, K., and Hemm, S. (2020). Anatomical brain structures normalization for deep brain stimulation in movement disorders. Neuroimage Clin. 27:102271. doi: 10.1016/j.nicl.2020.102271

PubMed Abstract | Crossref Full Text | Google Scholar

Voyvodic, J. T., Glover, G. H., Greve, D., Gadde, S., and Fbirn, T. (2011). Automated real-time behavioral and physiological data acquisition and display integrated with stimulus presentation for fMRI. Front. Neuroinform. 5:27. doi: 10.3389/fninf.2011.00027

PubMed Abstract | Crossref Full Text | Google Scholar

Wårdell, K., Nordin, T., Vogel, D., Zsigmond, P., Westin, C.-F., Hariz, M., et al. (2022). Deep brain stimulation: Emerging tools for simulation, data analysis, and visualization. Front. Neurosci. 16:834026. doi: 10.3389/fnins.2022.834026

PubMed Abstract | Crossref Full Text | Google Scholar

Wilcox, A. B., Gallagher, K. D., Boden-Albala, B., and Bakken, S. R. (2012). Research data collection methods: From paper to tablet computers. Med. Care 50:S68. doi: 10.1097/MLR.0b013e318259c1e7

PubMed Abstract | Crossref Full Text | Google Scholar

Wilkinson, M. D., Dumontier, M., Aalbersberg, J., Appleton, G., Axton, M., Baak, A., et al. (2016). The FAIR Guiding principles for scientific data management and stewardship. Sci. Data 3:160018. doi: 10.1038/sdata.2016.18

PubMed Abstract | Crossref Full Text | Google Scholar

Woodman, M. M., Pezard, L., Domide, L., Knock, S. A., Sanz-Leon, P., Mersmann, J., et al. (2014). Integrating neuroinformatics tools in TheVirtualBrain. Front. Neuroinform. 8:36. doi: 10.3389/fninf.2014.00036

PubMed Abstract | Crossref Full Text | Google Scholar

Zehl, L., Jaillet, F., Stoewer, A., Grewe, J., Sobolev, A., Wachtler, T., et al. (2016). Handling metadata in a neurophysiology laboratory. Front. Neuroinform. 10:26. doi: 10.3389/fninf.2016.00026

PubMed Abstract | Crossref Full Text | Google Scholar

Zwiers, M. P., Moia, S., and Oostenveld, R. (2022). BIDScoin: A user-friendly application to convert source data to brain imaging data structure. Front. Neuroinform. 15:770608. doi: 10.3389/fninf.2021.770608

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Neuroscience data, data management, Brain Imaging Data Structure (BIDS), Electronic Data Capture (EDC), Deep Brain Stimulation (DBS)

Citation: Stawiski M, Bucciarelli V, Vogel D and Hemm S (2024) Optimizing neuroscience data management by combining REDCap, BIDS and SQLite: a case study in Deep Brain Stimulation. Front. Neuroinform. 18:1435971. doi: 10.3389/fninf.2024.1435971

Received: 21 May 2024; Accepted: 19 August 2024;
Published: 05 September 2024.

Edited by:

Xi-Nian Zuo, Beijing Normal University, China

Reviewed by:

Karsten Tabelow, Weierstrass Institute for Applied Analysis and Stochastics (LG), Germany
Chitresh Bhushan, GE Global Research, United States

Copyright © 2024 Stawiski, Bucciarelli, Vogel and Hemm. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Vittoria Bucciarelli, dml0dG9yaWEuYnVjY2lhcmVsbGlAZmhudy5jaA==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Optimizing neuroscience data management by combining REDCap, BIDS and SQLite: a case study in Deep Brain Stimulation

1 Introduction

2 System design and workflows

2.1 DBS procedure and data

2.1.1 Raw data acquisition

2.1.2 Data analysis

2.1.3 Data type categories

2.2 Data management system use case analysis

2.3 System components

2.4 Clinical data management

2.5 Imaging data management

2.6 Comprehensive data management

3 System functionalities and practical application

3.1 Data management system overview

3.2 Clinician requirements

3.3 Researcher requirements

3.4 Application example

4 Discussion

5 Conclusion

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

Footnotes

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good