Skip to main content

SYSTEMATIC REVIEW article

Front. Neurol. , 14 March 2025

Sec. Artificial Intelligence in Neurology

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1505533

This article is part of the Research Topic Applied Neuroimaging for the Diagnosis and Prognosis of Cerebrovascular Disease View all 6 articles

Exploring the potential of machine learning and magnetic resonance imaging in early stroke diagnosis: a bibliometric analysis (2004–2023)

Jian-cheng LouJian-cheng LouXiao-fen YuXiao-fen YuJian-jun YingJian-jun YingDa-qiao SongDa-qiao SongWen-hua Xiong
Wen-hua Xiong*
  • Yiwu Hospital of Traditional Chinese Medicine, Yiwu, China

Objective: To examine the focal areas of research in the early diagnosis of stroke through machine learning identification of magnetic resonance imaging characteristics from 2004 to 2023.

Methods: Data were gathered from the Science Citation Index-Expanded (SCI-E) within the Web of Science Core Collection (WoSCC). Utilizing CiteSpace 6.2.R6, a thorough analysis was conducted, encompassing publications, authors, cited authors, countries, institutions, cited journals, references, and keywords. This investigation covered the period from 2004 to 2023, with the data retrieval completed on December 1, 2023, in a single day.

Results: In total, 395 articles were incorporated into the analysis. Prior to 2015, the annual publication count was under 10, but a significant surge in publications was observed post-2015. Institutions and authors from the USA and China have established themselves as mature academic entities on a global scale, forging extensive collaborative networks with other institutions. High-impact journals in this field predominantly feature in top-tier publications, indicating a consensus in the medical community on the application of machine learning for early stroke diagnosis. “deep learning,” “magnetic resonance imaging,” and “stroke” emerged as the most attention-gathering keywords among researchers. The development in this field is marked by a coexisting pattern of interdisciplinary integration and refinement within major disciplinary branches.

Conclusion: The application of machine learning in the early prediction and personalized medical plans for stroke patients using neuroimaging characteristics offers significant value. The most notable research hotspots currently are the optimal selection of neural imaging markers and the most suitable machine learning algorithm models.

Introduction

Stroke is an acute cerebrovascular disorder, precipitates enduring cerebral damage, disability, and even mortality upon its onset (13). Studies have identified it as the second leading cause of death worldwide (4). Notably, 11% of stroke survivors experience a recurrence within a year, and 39% within a decade (5). Generally, strokes arise either from blood flow obstruction (ischemic strokes, constituting 87%) or intracerebral hemorrhage (hemorrhagic strokes, accounting for 10%) (6, 7). Regardless of the type, prompt medical intervention is crucial, as early diagnosis and treatment significantly influence the outcome.

Magnetic resonance imaging (MRI) stands as the gold index in stroke diagnosis (810), boasting high temporal and spatial resolution capabilities that enable meticulous observation of subtle cerebral vascular changes (11, 12). Moreover, the analytical methods derived from multimodal MRI data facilitate a nuanced identification of cerebral structural and functional network regulations (13, 14). Hence, the objective visualization tools provided by MRI technology are instrumental in the early diagnosis of stroke. However, many patients fail to adhere to medical advice for regular MRI follow-ups, leading to acute stroke episodes (15, 16). Consequently, there’s an urgent need in the medical field for a sophisticated neuroimaging algorithm capable of early stroke prediction, mitigating the issue of clinical data scarcity due to patient non-compliance.

Machine learning (ML) algorithms can automate the interpretation of abnormal imaging patterns, accelerating the diagnostic process in urgent scenarios (1719). They integrate data from diverse sources, including MRI, clinical records, and vital signs, to assess an individual’s future stroke risk. Additionally, ML enhances the precision and sensitivity of stroke diagnosis, particularly in early stages, by learning from extensive datasets (20). Deep learning, a subset of ML, represents one of the most advanced and specialized approaches within the broader ML framework. While ML encompasses a wide range of algorithms, deep learning focuses on neural network architectures capable of automatically extracting high-level features from complex data. In stroke diagnosis, deep learning has garnered significant attention due to its exceptional performance in processing MRI data, particularly for identifying subtle imaging markers of early stroke. Consequently, the bibliometric analysis in this study focuses on ML as the overarching framework, while acknowledging deep learning as a key contributor to advancements in this field. In recent years, ML has been increasingly applied to early stroke detection, with its reliability validated by authoritative multicentric randomized controlled trials (RCTs) (21) and meta-analysis (22) of high evidentiary value. However, the field of academic collaboration networks, developmental trends, and research frontiers in using ML for early stroke diagnosis through MRI feature recognition still lacks extensive bibliometric research.

CiteSpace software, a visualization tool, qualitatively and quantitatively elucidates the interconnected contributions of authors, regions, institutions, and their collaboration networks (23). Its most notable attribute is the insight into research hotspots and frontier areas, along with predictions on specific field’s future development trajectories. Compared to traditional literature reviews and meta-analyses, the bibliometric analysis facilitated by CiteSpace offers a more profound and insightful perspective (2426). This study aims to utilize CiteSpace to comprehensively search the WoSCC for relevant literature from the past two decades, conducting a bibliometric analysis on core authors, their collaboration networks, journals, countries, and affiliated academic institutions. This will deepen our understanding of the frontiers and developmental trends in the early diagnosis of stroke using ML to identify MRI characteristics.

Materials and methods

Data sources and search strategy

Data sources

The data for this study were sourced from the Science Citation Index-Expanded (SCI-E) within the Web of Science Core Collection (WoSCC), a citation-based database that provides detailed citation information and abstracts. This allows for the calculation of bibliometric indicators such as cited authors, journals, impact factors, h-indexes, and citation reports. WoSCC encompasses over 12,000 high-quality academic journals spanning more than 250 disciplines, offering a comprehensive collection of interdisciplinary publications.

Compared to databases like PubMed, which focus primarily on biomedical literature and lack citation metrics, WoSCC offers superior bibliometric capabilities, including citation networks, co-authorship relationships, and keyword co-occurrence trends. Current bibliometric tools also do not support multi-database integration due to challenges such as inconsistent data formats and record duplication, complicating the research process and compromising data consistency.

While relying solely on WoSCC may exclude articles indexed in databases like PubMed or Scopus, which could contain relevant studies on machine learning and MRI in stroke diagnosis, WoSCC’s interdisciplinary coverage and citation-based metrics make it suitable for this study. A broad search strategy was applied to minimize dataset bias. Future research could explore multi-database integration as methodologies evolve to address associated challenges.

Search strategy

The data retrieval strategy encompassed key topics such as “stroke,” “magnetic resonance imaging,” and “machine learning” (Figure 1). This encompassed a nearly two-decade span of publications, from December 1, 2004, to December 1, 2023, with the retrieval completed within a single day, December 10, 2023. We imposed no geographical restrictions on the publishing countries, but required the language to be English and the research type to be “article” (2729). Details of the retrieval strategy and results are provided in Table 1. A total of 395 articles were identified, which, after importing into CiteSpace and eliminating duplicates, confirmed the absence of redundancies.

Figure 1
www.frontiersin.org

Figure 1. Map of literature screening process related to machine learning and MRI in early stroke diagnosis.

Table 1
www.frontiersin.org

Table 1. The topic search query.

Analysis tool

The visualizations generated by CiteSpace 6.2.R6 typically include nodes, links, colors, clusters, and timelines. Nodes usually represent various research papers, authors, journals, or keywords, with the size of a node often indicating its significance or impact, such as citation frequency. Links denote the relationships between nodes, like citation or collaboration connections, with the thickness of a line possibly indicating the strength or frequency of the relationship. Different colors may represent different time periods or various research fields or categories. Clusters, composed of closely connected nodes, signify specific research themes or areas, aiding in understanding the primary branches and trends within a research field. The timeline exhibits the evolution of keywords or themes over time. Interpreting these visualizations aids in uncovering hot topics, developmental trends, and relationships in research concerning the application of ML in the field of stroke.

The parameters used in CiteSpace 6.2.R6 were as follows: time slices covered the period from 2004 to 2023, with each slice representing 1 year. All terms were included, such as “title,” “abstract,” “author keywords,” and “keywords plus.” To enhance the clarity of the final visualizations and facilitate the observation of relationships between publications, we set the g-index’s k-value to 50 and employed the Pathfinder algorithm (30, 31).

Results

Annual publications

Figure 2 illustrates the annual publication trend in using ML to analyze MRI characteristics for the early diagnosis of stroke. It was observed that prior to 2015, the quantity of publications remained at a relatively low level, correlating significantly with the nascent phase of ML as an emerging discipline. From 2015 onwards, there has been a substantial increase in publications, attributed to advancements in the computational capabilities of ML and the refinement of algorithmic architectures. These developments have shown promise in enhancing the accuracy of diagnosing stroke, its subtyping, and prognostic predictions (32). Our investigation revealed that in 2015, the U.S. Food and Drug Administration (FDA) approved several ML-based medical devices, such as RapidAI® and Viz.ai®, which have played a pivotal role in the early diagnosis and treatment decision-making of strokes. The trajectory of the trend line leads us to infer that in the next 5–10 years, a new peak in the volume of publications is likely to emerge.

Figure 2
www.frontiersin.org

Figure 2. Map of annual publications related to machine learning and MRI in early stroke diagnosis.

In addition to describing publication trends, the correlation analysis highlights a moderate positive relationship between the emergence of machine learning and its application in stroke research. This result indicates that advancements in ML directly influenced its adoption in stroke diagnosis and treatment. For example, the spike in publications after 2015 aligns with the FDA approval of ML-based medical tools, such as RapidAI® and Viz.ai®, which are designed to enhance diagnostic workflows.

Analysis of authors

Figures 3 and Table 2 display the authorship information behind the 395 published articles. Each node represents an author, with the connecting lines indicating collaborative relationships between them. The top 10 authors, in descending order, are: Castillo, Jose (9 publications); Wang, Yongjun (9 publications); Jing, Jing (8 publications); Campos, Francisco (8 publications); Meng, Xia (7 publications); Iglesias-rey, Ramon (7 publications); Sobrino, Tomas (6 publications); Li, Zixiao (6 publications); Chen, Cheng (5 publications); Zhao, Xingquan (5 publications). It was observed that all of the top 10 authors hail from Spain and China. This pattern reflects the strong research infrastructure and significant investments in ML and medical research within these countries. For instance, Spain has long been recognized for its clinical stroke research expertise, while China has emerged as a leader in applying ML technologies to medical diagnostics due to its large patient datasets and growing interdisciplinary collaborations.

Figure 3
www.frontiersin.org

Figure 3. Map of authors related to machine learning and MRI in early stroke diagnosis.

Table 2
www.frontiersin.org

Table 2. Top 10 authors related to machine learning and magnetic resonance imaging in early stroke diagnosis.

The collaboration network reveals that higher node degree correlates strongly with author centrality, suggesting that prominent authors often serve as key hubs in multi-center studies. For instance, Jose Castillo and Yongjun Wang exhibit significant influence in coordinating international collaborations, reflecting their pivotal roles in advancing research in this domain.

Analysis of countries

Figures 4 and Table 3 present the collaborative network among countries in this field of research, revealing a network comprising 54 nodes and 267 edges. The top contributors by publication volume are the People’s Republic of China (136); USA (135); England (54); Germany (51); and Canada (34). However, a closer analysis reveals an interesting distinction between publication volume and centrality, which measures the influence of a country within the collaboration network. For example, while China leads in publication volume, its centrality is relatively low, indicating fewer collaborative connections with other nations compared to Germany (centrality: 0.24) and the USA (centrality: 0.14). This discrepancy suggests that while China and the USA dominate in output, Germany plays a more integrative role in fostering international collaborations. Such insights underline the importance of not only the quantity but also the quality and connectivity of research contributions in advancing the field.

Figure 4
www.frontiersin.org

Figure 4. Map of countries related to machine learning and MRI in early stroke diagnosis.

Table 3
www.frontiersin.org

Table 3. Top 10 frequency and centrality of countries related to machine learning and magnetic resonance imaging in early stroke diagnosis.

In the country network, there is a strong positive correlation between publication volume and collaboration frequency, highlighting the synergy between research activity and international partnerships. Interestingly, the correlation is weaker for centrality, indicating that publication volume does not always reflect the strategic importance of a country within the network. For instance, Germany, with a centrality of 0.24, leads in bridging interdisciplinary collaborations despite ranking fourth in publication volume.

Analysis of institutions

Figure 5 and Table 4 display the collaborative network of institutions, comprising 431 nodes and 1,709 edges. The top 10 institutions in terms of publication volume are as follows: University of California System (33 publications); Harvard University (33 publications); Massachusetts General Hospital (23 publications); Harvard Medical School (21 publications); Capital Medical University (20 publications); Chinese Academy of Sciences (18 publications); Chinese Academy of Medical Sciences—Peking Union Medical College (14 publications); Helmholtz Association (14 publications); Mayo Clinic (14 publications); University of California Los Angeles (13 publications). It was observed that academic institutions affiliated with the USA dominate the top 10 rankings. The interconnections between institutions across various countries highlight a significant network of collaborations, which is poised to further advance the discipline in this field.

Figure 5
www.frontiersin.org

Figure 5. Map of institutions related to machine learning and MRI in early stroke diagnosis.

Table 4
www.frontiersin.org

Table 4. Top 10 publications of institutions related to machine learning and magnetic resonance imaging in early stroke diagnosis.

Institutional collaboration analysis shows a moderate positive correlation between node degree and publication output. Institutions such as the University of California System and Harvard University, which exhibit high node degrees, also demonstrate strong interconnectivity, fostering impactful collaborations that push the boundaries of ML applications in stroke research.

Analysis of cited journals

Figure 6 and Table 5 showcase the cited journal network, consisting of 838 nodes and 4,674 edges. The top 10 journals by citation frequency are: Stroke (248 citations); NeuroImage (215 citations); Neurology (183 citations); PLoS One (145 citations); Brain (127 citations); Lancet Neurol (125 citations); J Cereb Blood Flow Metab (125 citations); Am J Neuroradiol (119 citations); Ann Neurol (117 citations); Hum Brain Mapp (109 citations). Additionally, journals with notable centrality (indicated by purple rings) include Ann NY Acad Sci (0.12); Am J Neuroradiol (0.11); Acta Neuropathol (0.11). These journals primarily cover neurology, neuroimaging, and computer science. For instance, Stroke has an impact factor of 8.3, with NeuroImage and Neurology also being top-tier journals in this field.

Figure 6
www.frontiersin.org

Figure 6. Map of cited journals related to machine learning and MRI in early stroke diagnosis.

Table 5
www.frontiersin.org

Table 5. Top 10 frequency and centrality of cited journals related to machine learning and magnetic resonance imaging in early stroke diagnosis.

The journals Stroke and NeuroImage exhibit the highest normalized citation impact, indicating their influence in bridging neurology and imaging studies. Additionally, metrics such as h-index and Eigenfactor score were analyzed for the top-cited journals to further evaluate their academic impact. For instance, Stroke has an impact factor of 8.3 and an h-index of 150, showcasing its long-standing relevance in stroke research. Similarly, NeuroImage demonstrates a significant h-index of 230, reflecting its importance in neuroimaging and machine learning studies. The cited journal analysis, complemented by impact metrics, highlights the interplay between foundational stroke research and emerging machine learning methodologies. This integrative approach provides robust evidence of the academic networks and key journals shaping this interdisciplinary field.

Keywords co-occurrence and citation burst analysis

Figure 7 and Table 6 depict the network graph of keywords, encompassing 690 nodes and 2,513 edges. The top 10 keywords are: deep learning (46 occurrences); machine learning (45 occurrences); brain (39 occurrences); magnetic resonance imaging (36 occurrences); stroke (31 occurrences); MRI (25 occurrences); functional connectivity (25 occurrences); risk (24 occurrences); ischemic stroke (24 occurrences); Alzheimer’s disease (23 occurrences). Analyzing the frequency and centrality of these keywords reveals that “deep learning,” “magnetic resonance imaging,” and “stroke” have emerged as prominent themes in this research area.

Figure 7
www.frontiersin.org

Figure 7. Map of keywords related to machine learning and MRI in early stroke diagnosis.

Table 6
www.frontiersin.org

Table 6. Top 10 frequency and centrality of keywords related to machine learning and magnetic resonance imaging in early stroke diagnosis.

Figure 8 illustrates the top 20 keywords with the most robust citation bursts. The beginning and end of each burst are, respectively, marked as “Start” and “End,” with the increase in influence correlating with the rise in the “Strength” value. The pale blue region delineates the study period, while the red portion signifies the start and peak of the bursts. It was observed that “machine learning” exhibited the highest burst strength, reaching 6.57. Furthermore, early attention to “functional MRI” and “diffusion tensor imaging” indicates that changes in brain structure and function had been applied in this field from an early stage. Mid-period keywords like “executive function” and “default mode network” experienced high burst rates, signifying researchers’ growing focus on the interconnections between deep brain networks. In later periods, the frequent emergence of terms such as “prediction,” “classification,” and “machine learning” underscores the extensive application of ML in recent years for the early diagnosis of stroke and the development of individualized treatment plans to prevent the high mortality associated with acute stroke incidents.

Figure 8
www.frontiersin.org

Figure 8. Top 20 keywords with the strongest citation bursts.

The co-occurrence network reveals a strong positive correlation between keyword centrality and burst strength. Keywords such as “deep learning” and “machine learning” not only occur frequently but also drive significant citation bursts, underscoring their pivotal roles in shaping the field. Additionally, the temporal analysis suggests that early bursts in keywords like “functional MRI” paved the way for mid-period focuses on “executive function” and later trends emphasizing “classification” and “prediction.”

Keywords timeline

Figure 9 presents the evolution and interconnections of keywords, arranged chronologically. The timeline extends from left to right, delineating the emergence and disappearance of research keywords from 2004 to 2023. Additionally, the diagram clusters various themes. A total of nine clusters (#0 to #8) are depicted. The first cluster (#0), labeled “temporal consistency,” focuses on topics like deep learning, automated WMH detection, and amorphous object segmentation. The second cluster (#1), identified as “classification method,” concentrates on magnetic resonance imaging, cerebral blood flow, and related themes. The third cluster is marked as “functional connectivity strength,” highlighting areas such as intrinsic functional connectivity and graph theory analysis. The fourth cluster, labeled “final infarct volume,” focuses on chronic venous disease, peripheral artery disease, and similar subjects. The fifth cluster, named “rural-urban disparities,” is centered around risk factors and minority health.

Figure 9
www.frontiersin.org

Figure 9. Map of keywords timeline related to machine learning and MRI in early stroke diagnosis.

Cluster dependencies of reference

Figure 10 showcases the dependency relationships among clusters based on referenced literature. Areas coded in different colors represent distinct clusters of references, while arrows indicate the developmental relationships between these clusters. The convergence of arrows signifies the emergence of new disciplinary branches, while the merging of arrowheads indicates the integration of different disciplines. This is because the tail of an arrow represents the cutting edge of current knowledge, while the head points to the sources of foundational literature.

Figure 10
www.frontiersin.org

Figure 10. Map of reference of cluster dependencies related to machine learning and MRI in early stroke diagnosis.

The developmental pattern in this field primarily exists in a dual form: the coexistence of interdisciplinary integration and the refinement of major disciplinary branches. A detailed analysis reveals several cutting-edge directions of interdisciplinary integration: Neuroimaging and machine learning: The integration of advanced imaging modalities such as diffusion tensor imaging and convolutional neural networks has enabled the development of automated lesion segmentation and early diagnostic models for ischemic stroke. This integration bridges computational algorithms and clinical radiology, advancing both fields. Neurology and bioinformatics: The use of bioinformatics tools in analyzing imaging markers has enhanced the understanding of the molecular underpinnings of stroke. For example, integrating genetic data with MRI-based phenotypes offers new insights into personalized therapeutic strategies. Clinical decision support systems and artificial intelligence: AI-driven CDSS, leveraging ML-based prognostic models, has facilitated real-time decision-making in stroke management, particularly in identifying optimal therapeutic windows. This interdisciplinary collaboration integrates medical informatics, neurology, and computer science. For instance, cluster #17 represents the fusion of neuroimaging and ML (clusters #0 and #10), while cluster #5 reflects the branching of ML applications into specific subdomains of neurology (clusters #1 and #3). These trends underscore the pivotal role of interdisciplinary integration in advancing the field of early stroke diagnosis.

Correlation analysis of the reference dependency network indicates a significant relationship between cluster size and the number of interdisciplinary connections. Larger clusters, such as #0 and #10, exhibit higher connectivity, reflecting their central roles in integrating neuroimaging and ML methodologies. This interdependency highlights the importance of large clusters in driving knowledge transfer and innovation across disciplinary boundaries.

Discussion

A bibliometric analysis was conducted using CiteSpace, focusing on the early diagnosis of stroke through ML identification of MRI characteristics from 2004 to 2023. This analysis encompassed the collaborative networks of core authors, affiliated institutions, countries, and journals. Comprehensive data were provided, highlighting the focal points and trends in the early diagnosis of stroke using ML to identify MRI characteristics.

General information

This study illustrates that over the past two decades, a total of 395 publications have been released in the field of early stroke diagnosis using ML to identify characteristics in MRI. The findings show that prior to 2015, the annual number of publications was consistently below 10, reflecting the nascent stage of ML as an academic discipline. Since 2015, there has been a marked increase in publications, a development attributed to the enhanced computational capabilities of ML and the refinement of its algorithmic structures, demonstrating potential in improving the accuracy of stroke diagnosis, subtype classification, and prognostic prediction. The trend line suggests an anticipation of a new peak in publication volume within the next 5–10 years.

An analysis of authors, countries, and their affiliated institutions with higher publication numbers reveals that institutions and authors from the USA and China have established mature academic communities on a global scale, forming extensive collaborative networks with other institutions. This indicates the reliance of ML on the technological level and talent reserves of a country. Notably, despite the lower volume of publications from less developed countries, these nations may still experience high stroke incidence rates.

Through the analysis of interdependencies among clusters in references, it was found that the development pattern in the field of early stroke diagnosis using ML for MRI primarily exists in a form of coexistence between interdisciplinary integration and the refinement of major disciplinary branches. This unique characteristic is likely to promote resource integration, cross-disciplinary idea exchange, and academic innovation within the field.

In summary, ML as an emerging discipline, has shown immense value in early diagnosis of stroke through neuroimaging, medical efficiency, and personalized treatment, with a significant rise in publication volume in recent years. Based on the trends in annual publication volume and innovation in ML algorithms, significant advancements are expected in the next 5–10 years, ultimately aiming to provide precise medical services for stroke patients.

Research hotspots

Research hotspots in a field are encapsulated by keywords that represent the core content and central themes of studies within that domain. Techniques such as keyword co-occurrence analysis, keyword clustering, and keyword citation bursts enable the monitoring of various emerging trends in a given field. In the realm of using ML for early stroke diagnosis through the identification of MRI features, two primary research hotspots have emerged: the optimal selection of neural imaging markers and the most appropriate ML algorithm models.

Optimal selection of neural imaging markers

In the pursuit of the most effective neural imaging markers for stroke patients using MRI, researchers primarily focus on cerebral blood flow, brain structure, or brain function. From the perspective of cerebral blood flow, arterial spin labeling (ASL) is a predominant research method. For instance, Liu’s et al. (33) study indicates that the combination of ML and ASL can predict the outcomes of acute ischemic strokes by examining collateral circulation. Regarding brain structure, diffusion weighted imaging (DWI) often serves as the main analytical approach. Yu et al. (34) and Zhu et al. (35), for example, discovered that deep learning using DWI and clinical data is highly sensitive in predicting patients with low-perfusion strokes. From the aspect of brain function, the most valued approach is brain network analysis using functional connectivity (FC) from functional MRI. Li’s et al. (36) findings suggest that multispectral FC variations in brain regions are potential targets for differentiating stroke patients’ recovery and treatment processes. Lu et al. (37) demonstrated that acupuncture could modulate bilateral cerebral hemispheres through distinct targets, restoring abnormal FC and thus facilitating post-stroke motor recovery. Moreover, many studies advocate the integration of multimodal MRI datasets as neural imaging markers, surpassing the predictive accuracy of early stroke onset compared to single-modality data (3840).

Optimal machine learning algorithm models

ML algorithms are diverse and continually evolving, with researchers exploring various models for neuroimaging data from MRIs. Pérez Malla et al. (41) and Nishi et al. (42), for instance, regard convolutional neural networks (CNNs) as the most advanced method for early stroke prediction. Billot et al. (43) found that a combination of support vector machines (SVMs) and random forests (RFs) also exhibits commendable performance. Pinto et al. (44) proposed a fully automated deep learning approach encompassing both unsupervised and supervised learning, achieving satisfactory accuracy. Emerging deep learning techniques, such as deep neural networks (DNNs) and reinforcement learning (RL), are gaining traction due to their ability to automatically extract features from raw data and further improve model accuracy. These methods show substantial promise in enhancing early stroke detection by providing deeper, more nuanced insights into complex MRI data patterns. The introduction of these advanced techniques could lead to significant improvements in diagnostic accuracy, particularly in early-stage stroke diagnosis, where subtle changes in brain tissue are often challenging to detect.

Recent advancements in explainable AI (XAI) have also contributed to the interpretability of machine learning models, which is crucial in clinical settings. XAI approaches aim to provide transparent reasoning behind model predictions, enabling healthcare professionals to better understand and trust the automated results. Furthermore, multimodal approaches combining MRI with other data sources, such as genetic or clinical data, are gaining momentum. These methods harness the complementary strengths of different types of data to create more robust models that enhance diagnostic accuracy and prognostic prediction. By integrating various data types, multimodal systems can capture a more comprehensive view of the patient’s condition, improving decision-making in early stroke diagnosis.

The integration of ML with neuroimaging offers significant potential for bridging current gaps in early stroke diagnosis. While traditional methods rely on visual interpretation of MRI scans, ML techniques allow for the automated detection and quantification of subtle patterns that may be overlooked by human evaluators. The ability of ML algorithms to process large volumes of complex MRI data and generate predictive models can improve diagnostic accuracy, particularly in the early stages of stroke, when clinical symptoms may not yet be fully manifest. Furthermore, ML can help to identify imaging markers that correlate with stroke outcomes, offering personalized treatment options for patients. This integration is particularly promising in addressing the challenge of time-sensitive diagnoses, where rapid and accurate assessments can directly impact patient prognosis and recovery. As ML algorithms, including deep learning and multimodal approaches, continue to evolve, their capacity to enhance diagnostic workflows, reduce human error, and accelerate decision-making processes in clinical settings will be invaluable in overcoming the challenges of early stroke diagnosis.

Conclusion

The application of ML in the early diagnosis, prediction, and individualized medical plans for stroke patients using neuroimaging features offers immense value. This study specifically focused on the role of ML in early stroke detection and prediction by analyzing its capacity to identify subtle imaging markers and enhance diagnostic precision in the critical early stages. The most compelling current research hotspots are the optimal selection of neural imaging markers and the most suitable ML algorithm models for these purposes. In the future, researchers can continue to develop high-performance algorithms, further advancing early diagnosis and personalized treatment strategies in this scientific domain.

Limitations

This study is subject to several limitations. Firstly, it primarily relies on data accessible within the WOSCC database. CiteSpace is incapable of integrating data from varied databases or performing citation analysis on sources outside of WOSCC. Secondly, while CiteSpace proves invaluable in detecting and visualizing emerging trends, it does not delve into the underlying mechanisms of machine algorithm models in the application of identifying MRI features for early stroke diagnosis. Despite these constraints, we have successfully employed CiteSpace to illustrate the latest research trends in the application of machine algorithm models in the early diagnosis of stroke through the recognition of MRI characteristics.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

J-cL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. X-fY: Conceptualization, Data curation, Investigation, Resources, Writing – original draft, Writing – review & editing. J-jY: Data curation, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing. D-qS: Data curation, Investigation, Software, Visualization, Writing – original draft, Writing – review & editing. W-hX: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We extend our gratitude to Chaomei Chen of Drexel University for his contributions in developing CiteSpace. Additionally, we would like to express our appreciation to the reviewers for their valuable insights, which have enabled us to enhance this manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

MRI, Magnetic resonance imaging; ML, Machine learning; RCTs, Randomized controlled trials; WOSCC, Web of Science Core Collection; SCI-E, Science Citation Index-Expanded; ASL, Arterial spin labeling; DWI, Diffusion weighted imaging; FC, Functional connectivity; CNNs, Convolutional neural networks; SVMs, Support vector machines; RFs, Random forests.

References

1. Tu, WJ, and Wang, LDSpecial Writing Group of China Stroke Surveillance Report. China stroke surveillance report 2021. Mil Med Res. (2023) 10:33. doi: 10.1186/s40779-023-00463-x

PubMed Abstract | Crossref Full Text | Google Scholar

2. Eskioglou, E, Huchmandzadeh Millotte, M, Amiguet, M, and Michel, P. National institutes of health stroke scale zero strokes. Stroke. (2018) 49:3057–9. doi: 10.1161/STROKEAHA.118.022517

PubMed Abstract | Crossref Full Text | Google Scholar

3. Papanagiotou, P, and White, CJ. Endovascular reperfusion strategies for acute stroke. JACC Cardiovasc Interv. (2016) 9:307–17. doi: 10.1016/j.jcin.2015.11.014

PubMed Abstract | Crossref Full Text | Google Scholar

4. Owolabi, MO, Thrift, AG, Mahal, A, Ishida, M, Martins, S, Johnson, WD, et al. Primary stroke prevention worldwide: translating evidence into action. Lancet Public Health. (2022) 7:e74–85. doi: 10.1016/S2468-2667(21)00230-9

PubMed Abstract | Crossref Full Text | Google Scholar

5. Mohan, KM, Wolfe, CD, Rudd, AG, Heuschmann, PU, Kolominsky-Rabas, PL, and Grieve, AP. Risk and cumulative risk of stroke recurrence: a systematic review and meta-analysis. Stroke. (2011) 42:1489–94. doi: 10.1161/STROKEAHA.110.602615

PubMed Abstract | Crossref Full Text | Google Scholar

6. Hwang, J, Kalra, A, Shou, BL, Whitman, G, Wilcox, C, Brodie, D, et al. Epidemiology of ischemic stroke and hemorrhagic stroke in venoarterial extracorporeal membrane oxygenation. Crit Care. (2023) 27:433. doi: 10.1186/s13054-023-04707-z

PubMed Abstract | Crossref Full Text | Google Scholar

7. Roger, VL, Go, AS, Lloyd-Jones, DM, Benjamin, EJ, Berry, JD, Borden, WB, et al. Heart disease and stroke statistics—2012 update: a report from the American Heart Association. Circulation. (2012) 125:e2–e220. doi: 10.1161/CIR.0b013e31823ac046

PubMed Abstract | Crossref Full Text | Google Scholar

8. Shah, S, Luby, M, Poole, K, Morella, T, Keller, E, Benson, RT, et al. Screening with MRI for accurate and rapid stroke treatment: SMART. Neurology. (2015) 84:2438–44. doi: 10.1212/WNL.0000000000001678

PubMed Abstract | Crossref Full Text | Google Scholar

9. Fiebach, JB, Schellinger, PD, Gass, A, Kucinski, T, Siebler, M, Villringer, A, et al. Stroke magnetic resonance imaging is accurate in hyperacute intracerebral hemorrhage: a multicenter study on the validity of stroke imaging. Stroke. (2004) 35:502–6. doi: 10.1161/01.STR.0000114203.75678.88

PubMed Abstract | Crossref Full Text | Google Scholar

10. Liu, L, Chang, J, Liu, Z, Zhang, P, Xu, X, and Shang, H. Hybrid contextual semantic network for accurate segmentation and detection of small-size stroke lesions from MRI. IEEE J Biomed Health Inform. (2023) 27:4062–73. doi: 10.1109/JBHI.2023.3273771

PubMed Abstract | Crossref Full Text | Google Scholar

11. Raimondo, L, Oliveira, ĹAF, Heij, J, Priovoulos, N, Kundu, P, Leoni, RF, et al. Advances in resting state fMRI acquisitions for functional connectomics. NeuroImage. (2021) 243:118503. doi: 10.1016/j.neuroimage.2021.118503

PubMed Abstract | Crossref Full Text | Google Scholar

12. Zhou, XC, Huang, YB, Wu, S, Hong, SW, Tian, Y, Hu, HJ, et al. Lever positioning manipulation alters real-time brain activity in patients with lumbar disc herniation: an amplitude of low-frequency fluctuation and regional homogeneity study. Psychiatry Res Neuroimaging. (2023) 334:111674. doi: 10.1016/j.pscychresns.2023.111674

PubMed Abstract | Crossref Full Text | Google Scholar

13. Xuan, K, Xiang, L, Huang, X, Zhang, L, Liao, S, Shen, D, et al. Multimodal MRI reconstruction assisted with spatial alignment network. IEEE Trans Med Imaging. (2022) 41:2499–509. doi: 10.1109/TMI.2022.3164050

PubMed Abstract | Crossref Full Text | Google Scholar

14. Choksi, B, Mozafari, M, VanRullen, R, and Reddy, L. Multimodal neural networks better explain multivoxel patterns in the hippocampus. Neural Netw. (2022) 154:538–42. doi: 10.1016/j.neunet.2022.07.033

PubMed Abstract | Crossref Full Text | Google Scholar

15. Schwarzbach, CJ, Eichner, FA, Rücker, V, Hofmann, AL, Keller, M, Audebert, HJ, et al. The structured ambulatory post-stroke care program for outpatient aftercare in patients with ischaemic stroke in Germany (SANO): an open-label, cluster-randomised controlled trial. Lancet Neurol. (2023) 22:787–99. doi: 10.1016/S1474-4422(23)00216-8

PubMed Abstract | Crossref Full Text | Google Scholar

16. Olasoji, EB, Uhm, DK, Awosika, OO, Doré, S, Geis, C, and Simpkins, AN. Trends in outpatient rehabilitation use for stroke survivors. J Neurol Sci. (2022) 442:120383. doi: 10.1016/j.jns.2022.120383

PubMed Abstract | Crossref Full Text | Google Scholar

17. Qiu, W, Kuang, H, Teleg, E, Ospel, JM, Sohn, SI, Almekhlafi, M, et al. Machine learning for detecting early infarction in acute stroke with non-contrast-enhanced CT. Radiology. (2020) 294:638–44. doi: 10.1148/radiol.2020191193

PubMed Abstract | Crossref Full Text | Google Scholar

18. Bonkhoff, AK, and Grefkes, C. Precision medicine in stroke: towards personalized outcome predictions using artificial intelligence. Brain. (2022) 145:457–75. doi: 10.1093/brain/awab439

PubMed Abstract | Crossref Full Text | Google Scholar

19. Castaneda-Vega, S, Katiyar, P, Russo, F, Patzwaldt, K, Schnabel, L, Mathes, S, et al. Machine learning identifies stroke features between species. Theranostics. (2021) 11:3017–34. doi: 10.7150/thno.51887

PubMed Abstract | Crossref Full Text | Google Scholar

20. Chavva, IR, Crawford, AL, Mazurek, MH, Yuen, MM, Prabhat, AM, Payabvash, S, et al. Deep learning applications for acute stroke management. Ann Neurol. (2022) 92:574–87. doi: 10.1002/ana.26435

PubMed Abstract | Crossref Full Text | Google Scholar

21. Oikonomou, EK, Spatz, ES, Suchard, MA, and Khera, R. Individualising intensive systolic blood pressure reduction in hypertension using computational trial phenomaps and machine learning: a post-hoc analysis of randomised clinical trials. Lancet Digit Health. (2022) 4:e796–805. doi: 10.1016/S2589-7500(22)00170-4

PubMed Abstract | Crossref Full Text | Google Scholar

22. Feng, J, Zhang, Q, Wu, F, Peng, J, Li, Z, and Chen, Z. The value of applying machine learning in predicting the time of symptom onset in stroke patients: systematic review and meta-analysis. J Med Internet Res. (2023) 25:e44895. doi: 10.2196/44895

PubMed Abstract | Crossref Full Text | Google Scholar

23. Zhou, XC, Huang, YB, Liu, Z, Wu, HJ, Huang, HZ, Tian, Y, et al. Bibliometric analysis of functional magnetic resonance imaging studies on manual therapy analgesia from 2002–2022. J Pain Res. (2023) 16:2115–29. doi: 10.2147/JPR.S412658

PubMed Abstract | Crossref Full Text | Google Scholar

24. Li, X, Wei, W, Wang, Y, Wang, Q, and Liu, Z. Global trend in the research and development of acupuncture treatment on Parkinson’s disease from 2000 to 2021: a bibliometric analysis. Front Neurol. (2022) 13:906317. doi: 10.3389/fneur.2022.906317

PubMed Abstract | Crossref Full Text | Google Scholar

25. Liang, YD, Li, Y, Zhao, J, Wang, XY, Zhu, HZ, and Chen, XH. Study of acupuncture for low back pain in recent 20 years: a bibliometric analysis via CiteSpace. J Pain Res. (2017) 10:951–64. doi: 10.2147/JPR.S132808

PubMed Abstract | Crossref Full Text | Google Scholar

26. Chen, YM, and Wang, XQ. Bibliometric analysis of exercise and neuropathic pain research. J Pain Res. (2020) 13:1533–45. doi: 10.2147/JPR.S258696

PubMed Abstract | Crossref Full Text | Google Scholar

27. Xi, M, and Gao, X. Bibliometric analysis of research relating to IgA nephropathy from 2010 to 2021. Med Sci Monit. (2022) 28:e937976. doi: 10.12659/MSM.937976

PubMed Abstract | Crossref Full Text | Google Scholar

28. He, Z, Dai, L, Zuo, Y, Chen, Y, Wang, H, and Zeng, H. Hotspots and frontiers in pulmonary arterial hypertension research: a bibliometric and visualization analysis from 2011 to 2020. Bioengineered. (2022) 13:14667–80. doi: 10.1080/21655979.2022.2100064

PubMed Abstract | Crossref Full Text | Google Scholar

29. Wu, F, Gao, J, Kang, J, Wang, X, Niu, Q, Liu, J, et al. Knowledge mapping of exosomes in autoimmune diseases: a bibliometric analysis (2002–2021). Front Immunol. (2022) 13:939433. doi: 10.3389/fimmu.2022.939433

PubMed Abstract | Crossref Full Text | Google Scholar

30. Chen, C, Hu, Z, Liu, S, and Tseng, H. Emerging trends in regenerative medicine: a scientometric analysis in CiteSpace. Expert Opin Biol Ther. (2012) 12:593–608. doi: 10.1517/14712598.2012.674507

PubMed Abstract | Crossref Full Text | Google Scholar

31. Chen, C, Dubin, R, and Kim, MC. Emerging trends and new developments in regenerative medicine: a scientometric update (2000–2014). Expert Opin Biol Ther. (2014) 14:1295–317. doi: 10.1517/14712598.2014.920813

PubMed Abstract | Crossref Full Text | Google Scholar

32. Mainali, S, Darsie, ME, and Smetana, KS. Machine learning in action: stroke diagnosis and outcome prediction. Front Neurol. (2021) 12:734345. doi: 10.3389/fneur.2021.734345

PubMed Abstract | Crossref Full Text | Google Scholar

33. Liu, S, Fan, D, Zang, F, Gu, N, Yin, Y, Ge, X, et al. Collateral circulation detected by arterial spin labeling predicts outcome in acute ischemic stroke. Acta Neurol Scand. (2022) 146:635–42. doi: 10.1111/ane.13694

PubMed Abstract | Crossref Full Text | Google Scholar

34. Yu, Y, Christensen, S, Ouyang, J, Scalzo, F, Liebeskind, DS, Lansberg, MG, et al. Predicting hypoperfusion lesion and target mismatch in stroke from diffusion-weighted MRI using deep learning. Radiology. (2023) 307:e220882. doi: 10.1148/radiol.220882

PubMed Abstract | Crossref Full Text | Google Scholar

35. Zhu, H, Jiang, L, Zhang, H, Luo, L, Chen, Y, and Chen, Y. An automatic machine learning approach for ischemic stroke onset time identification based on DWI and FLAIR imaging. NeuroImage Clin. (2021) 31:102744. doi: 10.1016/j.nicl.2021.102744

PubMed Abstract | Crossref Full Text | Google Scholar

36. Li, J, Cheng, L, Chen, S, Zhang, J, Liu, D, Liang, Z, et al. Functional connectivity changes in multiple-frequency bands in acute basal ganglia ischemic stroke patients: a machine learning approach. Neural Plast. (2022) 2022:1560748. doi: 10.1155/2022/1560748

PubMed Abstract | Crossref Full Text | Google Scholar

37. Lu, M, Du, Z, Zhao, J, Jiang, L, Liu, R, Zhang, M, et al. Neuroimaging mechanisms of acupuncture on functional reorganization for post-stroke motor improvement: a machine learning-based functional magnetic resonance imaging study. Front Neurosci. (2023) 17:1143239. doi: 10.3389/fnins.2023.1143239

PubMed Abstract | Crossref Full Text | Google Scholar

38. Yu, H, Wang, Z, Sun, Y, Bo, W, Duan, K, Song, C, et al. Prognosis of ischemic stroke predicted by machine learning based on multi-modal MRI radiomics. Front Psychiatry. (2023) 13:1105496. doi: 10.3389/fpsyt.2022.1105496

Crossref Full Text | Google Scholar

39. Carlson, HL, Craig, BT, Hilderley, AJ, Hodge, J, Rajashekar, D, Mouches, P, et al. Structural and functional connectivity of motor circuits after perinatal stroke: a machine learning study. NeuroImage Clin. (2020) 28:102508. doi: 10.1016/j.nicl.2020.102508

PubMed Abstract | Crossref Full Text | Google Scholar

40. Chong, B, Wang, A, Borges, V, Byblow, WD, Alan Barber, P, and Stinear, C. Investigating the structure-function relationship of the corticomotor system early after stroke using machine learning. NeuroImage Clin. (2022) 33:102935. doi: 10.1016/j.nicl.2021.102935

PubMed Abstract | Crossref Full Text | Google Scholar

41. Pérez Malla, CU, Valdés Hernández, MDC, Rachmadi, MF, and Komura, T. Evaluation of enhanced learning techniques for segmenting ischaemic stroke lesions in brain magnetic resonance perfusion images using a convolutional neural network scheme. Front Neuroinform. (2019) 13:33. doi: 10.3389/fninf.2019.00033

PubMed Abstract | Crossref Full Text | Google Scholar

42. Nishi, H, Oishi, N, Ishii, A, Ono, I, Ogura, T, Sunohara, T, et al. Deep learning-derived high-level neuroimaging features predict clinical outcomes for large vessel occlusion. Stroke. (2020) 51:1484–92. doi: 10.1161/STROKEAHA.119.028101

PubMed Abstract | Crossref Full Text | Google Scholar

43. Billot, A, Lai, S, Varkanitsa, M, Braun, EJ, Rapp, B, Parrish, TB, et al. Multimodal neural and behavioral data predict response to rehabilitation in chronic poststroke aphasia. Stroke. (2022) 53:1606–14. doi: 10.1161/STROKEAHA.121.036749

PubMed Abstract | Crossref Full Text | Google Scholar

44. Pinto, A, Pereira, S, Meier, R, Wiest, R, Alves, V, Reyes, M, et al. Combining unsupervised and supervised learning for predicting the final stroke lesion. Med Image Anal. (2021) 69:101888. doi: 10.1016/j.media.2020.101888

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: stroke, machine learning, magnetic resonance imaging, bibliometric analysis, WoSCC

Citation: Lou J-c, Yu X-f, Ying J-j, Song D-q and Xiong W-h (2025) Exploring the potential of machine learning and magnetic resonance imaging in early stroke diagnosis: a bibliometric analysis (2004–2023). Front. Neurol. 16:1505533. doi: 10.3389/fneur.2025.1505533

Received: 03 October 2024; Accepted: 24 February 2025;
Published: 14 March 2025.

Edited by:

Mingming Lu, Characteristic Medical Center of Chinese People’s Armed Police Force, China

Reviewed by:

Han Cong, Fifth Medical Center of the PLA General Hospital, China
Francesca Galassi, University of Rennes 1, France
Baobao Li, Fifth Medical Center of the PLA General Hospital, China

Copyright © 2025 Lou, Yu, Ying, Song and Xiong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wen-hua Xiong, eGlvbmd3ZW5odWEzQDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more