Skip to main content

ORIGINAL RESEARCH article

Front. Neuroinform., 24 June 2021

The Stroke Neuro-Imaging Phenotype Repository: An Open Data Science Platform for Stroke Research

\r\nHossein Mohammadian ForoushaniHossein Mohammadian Foroushani1Rajat DharRajat Dhar2Yasheng ChenYasheng Chen3Jenny GurneyJenny Gurney4Ali HamzehlooAli Hamzehloo2Jin-Moo LeeJin-Moo Lee3Daniel S. Marcus*Daniel S. Marcus4*
  • 1Department of Electrical and System Engineering, School of Engineering, Washington University in St. Louis, St. Louis, MO, United States
  • 2Division of Neurocritical Care, Department of Neurology, Washington University School of Medicine, St. Louis, MO, United States
  • 3Division of Cerebrovascular Disease, Department of Neurology, Washington University School of Medicine, St. Louis, MO, United States
  • 4Department of Radiology, Washington University School of Medicine, St. Louis, MO, United States

Stroke is one of the leading causes of death and disability worldwide. Reducing this disease burden through drug discovery and evaluation of stroke patient outcomes requires broader characterization of stroke pathophysiology, yet the underlying biologic and genetic factors contributing to outcomes are largely unknown. Remedying this critical knowledge gap requires deeper phenotyping, including large-scale integration of demographic, clinical, genomic, and imaging features. Such big data approaches will be facilitated by developing and running processing pipelines to extract stroke-related phenotypes at large scale. Millions of stroke patients undergo routine brain imaging each year, capturing a rich set of data on stroke-related injury and outcomes. The Stroke Neuroimaging Phenotype Repository (SNIPR) was developed as a multi-center centralized imaging repository of clinical computed tomography (CT) and magnetic resonance imaging (MRI) scans from stroke patients worldwide, based on the open source XNAT imaging informatics platform. The aims of this repository are to: (i) store, manage, process, and facilitate sharing of high-value stroke imaging data sets, (ii) implement containerized automated computational methods to extract image characteristics and disease-specific features from contributed images, (iii) facilitate integration of imaging, genomic, and clinical data to perform large-scale analysis of complications after stroke; and (iv) develop SNIPR as a collaborative platform aimed at both data scientists and clinical investigators. Currently, SNIPR hosts research projects encompassing ischemic and hemorrhagic stroke, with data from 2,246 subjects, and 6,149 imaging sessions from Washington University’s clinical image archive as well as contributions from collaborators in different countries, including Finland, Poland, and Spain. Moreover, we have extended the XNAT data model to include relevant clinical features, including subject demographics, stroke severity (NIH Stroke Scale), stroke subtype (using TOAST classification), and outcome [modified Rankin Scale (mRS)]. Image processing pipelines are deployed on SNIPR using containerized modules, which facilitate replicability at a large scale. The first such pipeline identifies axial brain CT scans from DICOM header data and image data using a meta deep learning scan classifier, registers serial scans to an atlas, segments tissue compartments, and calculates CSF volume. The resulting volume can be used to quantify the progression of cerebral edema after ischemic stroke. SNIPR thus enables the development and validation of pipelines to automatically extract imaging phenotypes and couple them with clinical data with the overarching aim of enabling a broad understanding of stroke progression and outcomes.

Introduction

Stroke is the second leading cause of death throughout the world, and the leading cause of long-term disability (George et al., 2017). The management of acute stroke is now a time-sensitive emergency that requires organized multidisciplinary care. The early hours after stroke onset frequently map the trajectory of subsequent neurologic disability, complications, and outcomes. Big data analyses can provide an opportunity to implement precision medicine approaches to stroke (Liebeskind, 2018). Pooling of multi-center data sets can advance our understanding of the clinical and biologic factors contributing to outcomes. This has led to a surge of interest and effort to collaborate on stroke research by combining clinical and genomic databases to better understand the biology of stroke and its complications. One of the largest such collaborations [the International Stroke Genetics Consortium (ISGC)] has integrated data on stroke incidence and recovery with genetic data on over 60,000 cases to provide further novel insights into stroke biology (Malik, 2018). There has been special interest in acute stroke phenotypes and outcomes, leading to collaborations within the ISGC and the formation of the Genetics of Neurological Instability after Ischemic Stroke (GENISIS) multi-center study (Heitsch et al., 2021). GENESIS has acquired extensive clinical and genomic data on over 6,000 acute stroke patients.

Brain imaging has a key role in providing further insights about complications after stroke. Indeed, most stroke patients have at least one brain imaging study performed during their acute hospitalization, primarily for diagnostic purposes on presentation. Follow-up scans are often obtained to evaluate the size of infarction and to exclude the development of hemorrhagic transformation. An endeavor is underway to describe the design and rationale for the genetic analysis of acute and chronic cerebrovascular neuroimaging phenotypes detected on clinical magnetic resonance imaging (MRI) in patients with acute ischemic stroke within the scope of the MRI-GENetics Interface Exploration (MRI-GENIE) study (Giese et al., 2017, 2020). Another similar effort with focus on MRI data is Enhancing Neuroimaging Genetics through Meta-Analysis (ENIGMA) Stroke Recovery repository which tries to understand brain and behavior relationships using well-powered meta- and mega-analytic approaches. ENIGMA Stroke Recovery has data from over 2,100 stroke patients collected across 39 research studies and 10 countries around the world (Liew et al., 2020). Although MRI can provide detailed anatomic information, it is challenging to obtain in the acute setting so there is not enough sample which makes knowledge discovery and practical applications limited. Computed tomography (CT) is the most frequently employed modality for acute stroke imaging due to its widespread availability, lower cost, and greater speed of scanning (especially important in acutely unstable patients where “time is brain”) (Tong et al., 2014). Thus, many millions of CT exams of stroke patients with information on stroke location, infarct size, development of edema, and hemorrhagic transformation are available globally. The evaluation of these parameters is not scalable by human raters when leveraging imaging data from thousands of patients. As a result, a big data approach is required to assess images at scale, including identifying quantitative image features and developing automated tools to extract them. This imaging analysis can then be coupled with analysis of clinical and genomics data from these subjects, facilitating large-scale genomic analysis of acute complications after stroke that are best represented by imaging features (Dhar et al., 2018).

Given the potential of imaging to advance our understanding of stroke and its complications, the GENISIS study endeavored to share all clinically performed brain imaging on enrolled subjects. The Stroke Neuroimaging Phenotype Repository (SNIPR) was created as a stroke-focused medical imaging repository that could serve as a platform for this and other stroke-related research. SNIPR is based on the open source XNAT imaging informatics platform, developed at Washington University in St. Louis (WUSTL) (Marcus, 2007). SNIPR provides an environment to securely host and share clinical data and imaging scans from large international stroke cohorts. It also allows the development and deployment of image processing pipelines to extract imaging biomarkers from these stroke scans. SNIPR is deployed on a high-performance computing system that enables these pipelines to be executed as containerized applications at massive scale. SNIPR enables coupling the imaging results with clinical data, with the overarching aim of enabling a broad understanding of stroke progression and outcomes.

Materials and Methods

Stroke Neuroimaging Phenotype Repository was initiated in 2018 to manage the imaging data from the GENISIS multi-center acute stroke genetics study. SNIPR is a modified instantiation of XNAT, an extensible open source imaging informatics platform. The SNIPR image repository is built primarily to support images in DICOM format, the medical imaging industry standard (Pianykh, 2009). Users have multiple paths to importing such images into SNIPR (Figure 1). Exams obtained in the Washington University in St. Louis School of Medicine (WUSTL)-affiliated hospital system can be imported directly from the clinical image repository or Picture Archiving and Communication System (PACS) using XNAT’s PACS query interface, which extracts the images from PACS, anonymizes them, and stores them to the SNIPR file repository. Users can also upload image sessions using the XNAT desktop application, which anonymizes images locally prior to transferring to SNIPR. To guarantee secure transmission of data, outside institutions are restricted to upload through the SNIPR website rather than over the DICOM network protocol, which lacks support for user authentication and authorization. SNIPR’s DICOM anonymization service automatically removes sensitive patient metadata from the file headers following an anonymization profile approved by the WUSTL Institutional Review Board and Department of Radiology Clinical Informatics Section. Once uploaded to SNIPR, relevant DICOM metadata are extracted into the SNIPR database, allowing users to identify and search for data sets via specific acquisition parameters. Each research institution has protected access to its project data, which are only accessible to users who have been explicitly granted access to it. SNIPR uses XNAT’s standard user management, project and data organization, and data access controls. XNAT’s standard project management mechanism provides project owners with flexible control over access to their data (Marcus, 2013; Gurney et al., 2017). Projects can be set to public, protected, or private (Figure 2). Protected project data are only accessible to users who have been explicitly granted access to the project but descriptive project metadata (title, investigator name, and keywords) are accessible to all users. SNIPR can offer a variety of informatics and computational services to support multi-center projects. Access to data can be restricted by site for study coordinators and site investigators, while the overall investigator and administrative core are provided with a pooled view of all the sites’ data. To create projects, upload data, and access non-public data, an individual must create a password- protected user account and login each time they visit the site. Accounts are enabled by system administrators after verification of the submitted credentials. User requests for access to specific data sets are administered by the investigators of the contributed studies, and the data use terms may vary across data sets. SNIPR provides forms for users to request access to data and for investigators to review and approve or reject requests. To further control access, project owners can use the XNAT sharing feature to expose subsets of project data into a secondary project. When users are granted access to a project, they can use a variety of mechanisms to download the data. Imaging sessions can be downloaded as a zip-formatted file or straight to a directory. SNIPR provides a single-session zip downloader, a multi-session bulk downloader through the website, and an interface to download data via a REST programming interface. The multi-session bulk downloader provides support to stop and resume downloads and is able to restart interrupted downloads from previous sessions. All other data in SNIPR such as clinical and image processing results are most typically downloaded by the website into a CSV file. The XNAT REST interface enables users to also download non-imaging data in different formats such as CSV, HTML, XML, and JavaScript Object Notation (JSON). Users can also use the XNAT Advanced Search capability to create their own custom data sets for download.

FIGURE 1
www.frontiersin.org

Figure 1. Imaging data can be extracted from picture archiving and communication system (PACS) using DICOM interfaces implemented in XNAT. Alternatively, users can upload DICOM image files using a desktop application. SNIPR automatically removes patient identifying information from DICOM metadata. Clinical attributes can be downloaded from Electronic Data Collection (EDC) systems in spreadsheet format and uploaded into SNIPR via spreadsheets or entered into web-based form. XNAT automatically links the clinical and image data to enable searching and aggregation across domains.

FIGURE 2
www.frontiersin.org

Figure 2. SNIPR uses XNAT’s standard project management mechanism which gives project owners flexible control over access to their data. Projects can be set to public, protected, or private. Protected project data are only accessible to users who have been explicitly granted access to the project.

Additional data types such as imaging and clinical phenotypes can also be added by specifying new data type plugins which are based on XNAT’s standard XML Schema extension mechanism. Indeed, this capability to introduce additional data types empowers SNIPR to support managing multimodal data under a common research subject identifier. When research identifiers are determined by a project owner, then image studies can be directly pulled from PACS and anonymized during import, but clinical data can be downloaded from Electronic Data Collection (EDC) systems in spreadsheet format and uploaded into SNIPR. These anonymized data can be stored under a common research subject identifier in SNIPR so as to support subject level study for various stroke phenotyping at large scale. SNIPR manages subject-level clinical and derived radiological data (Table 1) and session level clinical and derived radiological data (Table 2). The clinical fields in SNIPR reflect measurements widely used in stroke research and are readily expanded to accommodate future studies. These measurements include, but are not limited to, the NIH Stroke Scale/Score (NIHSS) which quantifies stroke severity based on weighted evaluation findings, the Modified Rankin Scale (mRS) which measures degree of disability/dependence after a stroke and the Trial of Org 10172 in Acute Stroke Treatment (TOAST) which denotes five subtypes of ischemic stroke. Assessment of cerebral edema and hemorrhagic transformation after ischemic stroke also follow established grading systems (Larrue et al., 2001; Strbian et al., 2013). Users can use their spreadsheet to upload these clinical data for thousands of subjects at any time and edit these data manually and enter derived radiological data manually after image review.

TABLE 1
www.frontiersin.org

Table 1. SNIPR can pull stroke’s clinical data from REDCap and manage these data at subject level.

TABLE 2
www.frontiersin.org

Table 2. SNIPR can pull stroke’s clinical data from REDCap and manage these data at session level.

Automated imaging data analysis is implemented in SNIPR using XNAT’s Container Service. The Container Service uses a containerized computing architecture to encapsulate applications in portable structures that are readily distributed and deployed to large-scale compute clusters (Di Tommaso et al., 2015; Ismail et al., 2015; Zheng and Thain, 2015; Pienaar et al., 2017). The Container service manages the process of loading container images from public or private repositories (e.g., Docker Hub) to SNIPR’s Docker cluster, provides a user interface for configuring execution parameters and launching container instances to process specific data, and orchestrates execution of containers on the Docker server (Figure 3). SNIPR currently includes containerized applications to perform various tasks such as CT scan type classifier, image format conversion, image pre-registration for skull stripping, longitudinal image-registration, and brain image segmentation. When a containerized application is launched by a user, the container service mounts the necessary data from the SNIPR database in a directory on the container file system from which the containerized application can read and write data. At the conclusion of execution, any files written by the application are exported to the SNIPR database. SNIPR’s automation service can be configured to automatically execute containerized applications when images are uploaded to the database.

FIGURE 3
www.frontiersin.org

Figure 3. SNIPR is equipped with a container service plugin to manage cluster of containers with Docker swarm on computing cluster attached. Containers used in SNIPR pipelines such as CT scan classifier, DICOM to NIFTI, preregistration, registration, and segmentation are pushed to and pulled from Docker image hub.

Stroke Neuroimaging Phenotype Repository supports development and execution of automated and semi-automated processing pipelines. As an example, a containerized stroke edema pipeline was developed to automate image segmentation and measurement of CSF volume in serial CT scans in stroke patients (Figure 4). The pipeline includes five containerized modules, including neural network-based labeling of image acquisition type (Szegedy et al., 2015; Mohammadian Foroushani et al., 2020a), DICOM to NIfTI conversion (Li, 2016), FSL-based preregistration for skull stripping (Jenkinson, 2011), ANTS-based registration of longitudinal brain masks (Avants et al., 2009), U-Net-based segmentation of CSF (Chen et al., 2016), and its volumetric calculation (Dhar et al., 2020). This parallels the recent recommendations for processing head CT data (Muschelli, 2019). After finishing the process, each executed container uses XNAT’s web-based application programming interface (API) to store its output back into SNIPR. The SNIPR user interface provides tools to monitor pipeline progress and review pipeline output, enabling an interactive workflow to check output quality, tune parameters, and relaunch pipeline steps. The pipeline is being used in the large SNIPR patient cohorts to study how dynamic change in brain volumetric values can predict trajectory of edema (Mohammadian Foroushan et al., 2020b).

FIGURE 4
www.frontiersin.org

Figure 4. Outline of stroke edema image processing pipeline to analyze CSF volumes from large cohorts of stroke patients.

Results

As shown in Figure 5, SNIPR currently hosts nine projects with 2,406 subjects. Of the 6,149 imaging sessions on SNIPR, 6,122 are CT and 85 are MRI. A small number of CT angiography and CT perfusion scans have also been contributed. Each research institution has protected access to its project data, which is only accessible to users who have been explicitly granted access to that project, but all users can see titles of all projects. Table 3 shows summary statistic of all imaging data in SNIPR. Accordingly, SNIPR users have different access levels to projects and SNIPR hosts imaging data of different stroke types such as ischemic, including those with stroke large vessel occlusion and subarachnoid hemorrhage.

FIGURE 5
www.frontiersin.org

Figure 5. SNIPR hosts eight study cohorts and 2,246 subjects now. SNIPR provides project owners with flexible control over access to their data.

TABLE 3
www.frontiersin.org

Table 3. SNIPR now manages CT and MR data of 2,246 stroke patients with different disease types such as ischemic stroke, large vessel occlusion stroke, and subarachnoid hemorrhage.

Web-based reports provide user-friendly access to the demographic, clinical, and imaging attributes captured in SNIPR. As listed in Table 1, researchers can get an overall view of the patient’s stroke complexity such as disease type, stroke side, and lesion location in Subject Summary Field table in patient’s webpages along with having access to their different CT session studies. Figure 6 shows a sample stroke patient whose overall stroke complexity is shown in a table along with list of imaging session studies. So, the researchers can get an overview of stroke complexity for this patient based on this table, and then review imaging sessions for further investigation. As listed in Table 2, SNIPR can also manage session level clinical and derived radiological data. Figure 7 shows a sample session webpage which incorporates various session level clinical and derived radiological data along with each imaging study for this stroke patient. Researchers can get a detailed summary of radiological data such as history of surgery, midline shift, and so on in Radiological Data Fields table and have access to different imaging scans in that specific session study webpage.

FIGURE 6
www.frontiersin.org

Figure 6. SNIPR can manage clinical data and imaging CT session studies for each subject. Each patient has a subject page where user can have access to patient’s imaging visits and also patient’s clinical data in subject summary table.

FIGURE 7
www.frontiersin.org

Figure 7. SNIPR can manage clinical data and imaging CT scan studies for all imaging session studies of a patient. Each patient has several visits, each of which has a session page where users can have access to different imaging CT studies and also radiological information particular to that visit.

Discussion

SNIPR was developed as a central repository to host and provide secure access to anonymized stroke imaging and associated clinical data, and to serve as a sandbox environment for developing computational pipelines to perform large-scale quantitative analysis. Data type plugins were added to XNAT to extend its support for stroke-related phenotype data type generated by these pipelines and extracted from the electronic medical record. To support scalable computing, advanced containerization tools, including Docker Swarm, were deployed along with a custom Docker scheduling plugin for XNAT. An automated processing pipeline to extract CSF volume in serial CT images was developed and subsequently deployed as a pipeline on SNIPR for use by other projects.

While other repositories such as ENIGMA and MRI-GENIE are critically built with the focus on research MRI with limited longitudinal data, SNIPR offers longitudinal clinical CT (and MRI) which offers much larger numbers and broader representation of available imaging. Also, other repositories have focused on a variety of all neurological and psychiatric disorders or principally on stroke recovery, while SNIPR focuses on stroke-specific data, and it targets acute imaging and relevant acute phenotypes like edema and hemorrhagic transformation. Accordingly, SNIPR can equip researchers to launch processing pipelines at scale to study any stroke phenotype and its trajectory over time to study stroke and its dynamics effectively.

Stroke Neuroimaging Phenotype Repository development is ongoing, with a particular focus on development of additional imaging biomarkers, including predictors of hemorrhagic transformation, measurements of collateral flow, and support of large-scale multicenter clinical studies to integrate different stroke phenotypic studies at massive scale and validate our new discoveries on various populations. The XNAT-based SNIPR database and IT infrastructure will be expanded further to support large-scale data contributions by external collaborators. Data structures will be developed to capture all stroke imaging modalities and multimodal phenotypes. Workflows will be implemented to upload and document data from all data contributors such as anonymization of image metadata, detection and removal of alphanumeric characters in images, and obscuring of identifying facial features. A suite of image quality control pipelines will be implemented to automatically assign quality metrics to uploaded images. A standardized imaging derived disease phenotype, including intracranial compartment, CSF, and infarct volumes, will also be generated for each patient exam and it will be integrated with associated clinical data. New algorithms will be deployed onto SNIPR using XNAT’s Docker container service to enable scalable reproducible processing and validation. A data dashboard will also be implemented as an information management tool that will visually track, integrate, analyze, and display demographic information and processing results for each subject and across all subjects in each project.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author/s.

Author Contributions

HM drafted the initial manuscript and performed the pipeline development. RD collected the clinical data and revised the manuscript. YC, JG, and AH revised the manuscript. DM and J-ML supervised this project, reviewed the manuscript, and made critical revisions. All authors contributed to the article and approved the submitted version.

Funding

This work was funded by the National Institutes of Neurological Disorders and Stroke through grants to RD (K23 NS099440), J-ML (R01 NS085419), and DM (R01 EB009352 and P30 NS098577).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors would like to thank Stephen Moore, William Horton, Rick Herrick, and Matt Kelsey, and others on the XNAT development team at the Washington University School of Medicine and Kate Alpert at Radiologics, Inc. for their technical support. They would also like to thank their data collaborators in United States, Poland, Finland, and Spain.

References

Avants, B. B., Tustison, N., and Song, G. (2009). Advanced normalization tools (ANTS). Insight J. 2, 1–35.

Google Scholar

Chen, Y., Dhar, R., Heitsch, L., Ford, A., Fernandez-Cadenas, I., Carrera, C., et al. (2016). Automated quantification of cerebral edema following hemispheric infarction: application of a machine-learning algorithm to evaluate CSF shifts on serial head CTs. NeuroImage Clin. 12, 673–680. doi: 10.1016/j.nicl.2016.09.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhar, R., Chen, Y., An, H., and Lee, J. M. (2018). Application of machine learning to automated analysis of cerebral edema in large cohorts of ischemic stroke patients. Front. Neurol. 9:687.

Google Scholar

Dhar, R., Chen, Y., Hamzehloo, A., Kumar, A., Heitsch, L., He, J., et al. (2020). Reduction in cerebrospinal fluid volume as an early quantitative biomarker of cerebral edema after ischemic stroke. Stroke 51, 462–467. doi: 10.1161/strokeaha.119.027895

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Tommaso, P., Palumbo, E., Chatzou, M., Prieto, P., Heuer, M. L., and Notredame, C. (2015). The impact of Docker containers on the performance of genomic pipelines. PeerJ 3:e1273. doi: 10.7717/peerj.1273

PubMed Abstract | CrossRef Full Text | Google Scholar

George, M. G., Fischer, L., Koroshetz, W., Bushnell, C., Frankel, M., Foltz, J., et al. (2017). CDC grand rounds: public health strategies to prevent and treat strokes. MMWR 66, 479–481. doi: 10.15585/mmwr.mm6618a5

PubMed Abstract | CrossRef Full Text | Google Scholar

Giese, A.-K., Schirmer, M. D., Dalca, A. V., Sridharan, R., Donahue, K. L., Nardin, M., et al. (2020). White matter hyperintensity burden in acute stroke patients differs by ischemic stroke subtype. Neurology 95, e79–e88.

Google Scholar

Giese, A. K., Schirmer, M. D., Donahue, K. L., Cloonan, L., Irie, R., Winzeck, S., et al. (2017). Design and rationale for examining neuroimaging genetics in ischemic stroke: the MRI-GENIE study. Neurol. Genet. 3:e180.

Google Scholar

Gurney, J., Olsen, T., Flavin, J., Ramaratnam, M., Archie, K., Ransford, J., et al. (2017). The Washington University central neuroimaging data archive. Neuroimage 144, 287–293. doi: 10.1016/j.neuroimage.2015.09.060

PubMed Abstract | CrossRef Full Text | Google Scholar

Heitsch, L., Ibanez, L., Carrera, C., Binkley, M. M., Strbian, D., Tatlisumak, T., et al. (2021). Early neurological change after ischemic stroke is associated with 90-day outcome. Stroke 52, 132–141.

Google Scholar

Ismail, B. I., Goortani, E. M., Ab Karim, M. B., Tat, W. M., Setapa, S., Luke, J. Y., et al. (2015). “Evaluation of docker as edge computing platform,” in Proceedings of the 2015 IEEE Conference on Open Systems (ICOS), Melaka.

Google Scholar

Jenkinson, J. (2011). Lindsay Reid, Midwifery in Scotland: a history. Scottish History Press: Erskine, Renfrewshire, 2011. pp. 304. £24.95 paperback. ISBN 978 0 9564477 0 8. Innes Rev. 62, 252–255.

Google Scholar

Larrue, V., von Kummer, R., Müller, A., and Bluhmki, E. (2001). Risk factors for severe hemorrhagic transformation in ischemic stroke patients treated with recombinant tissue plasminogen activator. Stroke 32, 438–441. doi: 10.1161/01.str.32.2.438

CrossRef Full Text | Google Scholar

Li, X. (2016). The first step for neuroimaging data analysis: DICOM to NIfTI conversion. J. Neurosci. Methods 264, 47–56. doi: 10.1016/j.jneumeth.2016.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Liebeskind, D. S. (2018). Big data for a big problem: precision medicine of stroke in neurocritical care. Crit. Care Med. 46, 1189–1191. doi: 10.1097/ccm.0000000000003165

PubMed Abstract | CrossRef Full Text | Google Scholar

Liew, S.-L., Zavaliangos-Petropulu, A., Jahanshad, N., Lang, C. E., Hayward, K. S., Lohse, K. R., et al. (2020). The ENIGMA Stroke Recovery Working Group: big data neuroimaging to study brain-behavior relationships after stroke. Human Brain Mapp. doi: 10.1002/hbm.25015

PubMed Abstract | CrossRef Full Text | Google Scholar

Malik, R. (2018). Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat. Genet. 50, 524–537.

Google Scholar

Marcus, D. S. (2007). The extensible neuroimaging archive toolkit. Neuroinformatics 5, 11–33. doi: 10.1385/ni:5:1:11

CrossRef Full Text | Google Scholar

Marcus, D. S. (2013). Human Connectome Project informatics: quality control, database services, and data visualization. Neuroimage 80, 202–219. doi: 10.1016/j.neuroimage.2013.05.077

PubMed Abstract | CrossRef Full Text | Google Scholar

Mohammadian Foroushani, H., Dhar, R., Chen, Y., Gurney, J., Hamzehloo, A., Lee, J. M., et al. (2020a). “SNIPR: stroke neuroimaging phenotype repository,” in Medical Imaging 2020: Imaging Informatics for Healthcare, Research, and Applications. 2020. International Society for Optics and Photonics SPIE, eds P.-H. Chen and T. M. Deserno (Bellingham, WA: SPIE).

Google Scholar

Mohammadian Foroushan, H., Hamzehloo, A., Kumar, A., Chen, Y., Heitsch, L., Slowik, A., et al. (2020b). Quantitative serial CT imaging-derived features improve prediction of malignant cerebral edema after ischemic stroke. Neurocrit. Care 33, 785–792. doi: 10.1007/s12028-020-01056-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Muschelli, J. (2019). Recommendations for processing head CT data. Front. Neuroinform. 13:61.

Google Scholar

Pianykh, O. S. (2009). Digital Imaging and Communications in Medicine (DICOM): A Practical Introduction and Survival Guide. Berlin Heidelberg: Springer-Verlag.

Google Scholar

Pienaar, R., Bernal, J., Rannou, N., Ellen Grant, P., Hähn, D., Turk, A., et al. (2017). “Architecting and building the future of healthcare informatics: cloud, containers, big data and CHIPS,” in Proceedings of the Future Technologies Conference (FTC), Vancouver.

Google Scholar

Strbian, D., Meretoja, A., Putaala, J., Kaste, M., and Tatlisumak, T., and Helsinki Stroke Thrombolysis Registry Group (2013). Cerebral edema in acute ischemic stroke patients treated with intravenous thrombolysis. Int. J. Stroke 8, 529–534. doi: 10.1111/j.1747-4949.2012.00781.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., et al. (2015). “Going deeper with convolutions,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA.

Google Scholar

Tong, E., Hou, Q., Fiebach, J. B., and Wintermark, M. (2014). The role of imaging in acute ischemic stroke. Neurosurg. Focus 36:E3.

Google Scholar

Zheng, C., and Thain, D. (2015). “Integrating containers into workflows: a case study using makeflow, work queue, and docker,” in Proceedings of the 8th International Workshop on Virtualization Technologies in Distributed Computing, Portland, OR.

Google Scholar

Keywords: big data, containerized pipeline, deep learning, informatics, phenotype repository, stroke neuroimaging, XNAT

Citation: Mohammadian Foroushani H, Dhar R, Chen Y, Gurney J, Hamzehloo A, Lee J-M and Marcus DS (2021) The Stroke Neuro-Imaging Phenotype Repository: An Open Data Science Platform for Stroke Research. Front. Neuroinform. 15:597708. doi: 10.3389/fninf.2021.597708

Received: 21 August 2020; Accepted: 24 May 2021;
Published: 24 June 2021.

Edited by:

Giuseppe Luigi Banna, United Lincolnshire Hospitals NHS Trust, United Kingdom

Reviewed by:

Graham J. Galloway, University of Queensland, Australia
Mark Etherton, Massachusetts General Hospital and Harvard Medical School, United States

Copyright © 2021 Mohammadian Foroushani, Dhar, Chen, Gurney, Hamzehloo, Lee and Marcus. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Daniel S. Marcus, dmarcus@wustl.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.