Skip to main content

REVIEW article

Front. Oncol., 17 August 2023
Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

A systematic review of progress on hepatocellular carcinoma research over the past 30 years: a machine-learning-based bibliometric analysis

Kiseong LeeKiseong Lee1Ji Woong Hwang*Ji Woong Hwang2*Hee Ju SohnHee Ju Sohn2Sanggyun SuhSanggyun Suh2Sun-Whe KimSun-Whe Kim2
  • 1Humanities Research Institute, Chung-Ang University, Seoul, Republic of Korea
  • 2Department of Surgery, Chung-Ang University Gwangmyeong Hospital, Chung-Ang University College of Medicine, Gwangmyeong, Republic of Korea

Introduction: Research on hepatocellular carcinoma (HCC) has grown significantly, and researchers cannot access the vast amount of literature. This study aimed to explore the research progress in studying HCC over the past 30 years using a machine learning-based bibliometric analysis and to suggest future research directions.

Methods: Comprehensive research was conducted between 1991 and 2020 in the public version of the PubMed database using the MeSH term “hepatocellular carcinoma.” The complete records of the collected results were downloaded in Extensible Markup Language format, and the metadata of each publication, such as the publication year, the type of research, the corresponding author’s country, the title, the abstract, and the MeSH terms, were analyzed. We adopted a latent Dirichlet allocation topic modeling method on the Python platform to analyze the research topics of the scientific publications.

Results: In the last 30 years, there has been significant and constant growth in the annual publications about HCC (annual percentage growth rate: 7.34%). Overall, 62,856 articles related to HCC from the past 30 years were searched and finally included in this study. Among the diagnosis-related terms, “Liver Cirrhosis” was the most studied. However, in the 2010s, “Biomarkers, Tumor” began to outpace “Liver Cirrhosis.” Regarding the treatment-related MeSH terms, “Hepatectomy” was the most studied; however, recent studies related to “Antineoplastic Agents” showed a tendency to supersede hepatectomy. Regarding basic research, the study of “Cell Lines, Tumors,’’ appeared after 2000 and has been the most studied among these terms.

Conclusion: This was the first machine learning-based bibliometric study to analyze more than 60,000 publications about HCC over the past 30 years. Despite significant efforts in analyzing the literature on basic research, its connection with the clinical field is still lacking. Therefore, more efforts are needed to convert and apply basic research results to clinical treatment. Additionally, it was found that microRNAs have potential as diagnostic and therapeutic targets for HCC.

1 Introduction

According to the World Health Organization, hepatocellular carcinoma (HCC) is the fifth most common neoplasm and the third leading cause of cancer-related death worldwide (1). The major risk factors of HCC include chronic hepatitis B or C virus, excessive alcohol consumption, diabetes, and potentially nonalcoholic fatty liver disease (2). Additionally, liver cirrhosis, regardless of its etiology, is a potential risk factor for the development of HCC (3).

Several recommendations and guidelines have suggested the use of contrast-enhanced ultrasound (CEUS), contrast-enhanced computed tomography (CT), or magnetic resonance imaging (MRI) as the diagnostic modalities for HCC surveillance. Abdominal ultrasound has traditionally been considered the fundamental method for HCC surveillance and is still advocated as the primary test for this purpose. The addition of serum biomarkers, particularly alpha-fetoprotein (AFP), can enhance the sensitivity of abdominal ultrasound in detecting HCC at an early stage (4). Patients with HCC have access to various therapeutic approaches, including liver transplantation, surgical resection, percutaneous ablation, radiation, and transarterial and systemic therapies (5). Therefore, a multidisciplinary approach is necessary to tailor the treatment strategy to the patient’s tumor stage, liver function, and performance status.

Studies on HCC have grown significantly, with researchers being unable to access the vast amount of literature. Thus, to reflect current research trends, bibliometric analysis may be used to collect and investigate the most relevant information from a large volume of literature (6). Furthermore, it can enable the research to progress to a macroscopic perspective and provide directions for future study (7).

Natural language processing (NLP) is a type of machine learning method used for human language analyses. Among these methods, the latent Dirichlet allocation (LDA) is one of the most classical topic modeling techniques and is usually used to analyze scientific publications comprehensively by identifying research topics and categorizing articles into these topics (8, 9). Recently, LDA has been used in several bibliometric analyses in the field of anal cancer, rectal cancer, and cardiovascular research (1012).

Therefore, this study aimed to explore the research progress in studying HCC over the past 30 years using a machine learning-based bibliometric analysis and to suggest future research directions.

2 Methods

In April 2022, comprehensive research was performed in the public version of the PubMed database, using the MeSH term “hepatocellular carcinoma” for the period between 1991 and 2020 (Figure S1). The articles that lacked an abstract or were not written in English were excluded. The complete records of the results were downloaded in Extensible Markup Language format, and R was used to extract the metadata of each article, such as the publication year, the type of research, the title, the abstract, and the MeSH terms. Overall, 12,898 MeSH terms were identified in the included publications, and only MeSH terms that appeared 100 times or more were included in this analysis (n = 980). No ethical approval from an institutional review board was required since this study was a bibliometric analysis.

An LDA algorithm was adopted to identify the relevant topics from the abstract of each publication in the Python platform to analyze the research topics in numerous scientific publications. An LDA is an analytical method that identifies the main topic of a document based on the probability of a combination of words that appear together frequently. An LDA generates a topic probability distribution of each document from a large collection of documents, and one topic with the largest weight in the probability distribution becomes the main topic of the document. This study examined the abstract of papers related to HCC published over the past 30 years and calculated the probability of the appearance of each paper’s main topics and vocabulary. The number of topics was designated as 50 to align with existing research, and the name of the topic was assigned manually based on the vocabulary combination constituting each topic and the included literature title. Furthermore, LDA processing was performed using coding in Python with the gensim package (https://radimrehurek.com/gensim/). A Louvain algorithm was generated to further investigate the relationship between these topics. The network visualizations were constructed using Gephi software (https://gephi.org/) (13). All mentioned codes, including the Python code, were available on Github (https://github.com/kisleepublic/analysisHCC/; GitHub is a website and cloud-based service that helps developers store and manage their code).

3 Results

3.1 Overview of the data

Overall, 62,856 articles related to HCC over the past 30 years were searched and finally included in this bibliometric analysis. We found that 671 publications on HCC research were available in 1991, which increased to 5,240 in 2020 (Figure 1). In the last 30 years, there has been significant and constant growth in the annual publications about HCC (annual percentage growth rate: 7.34%).

FIGURE 1
www.frontiersin.org

Figure 1 Annual publication on hepatocellular carcinoma.

The highest number of publications involved clinical studies (n = 30,115, 47.9%), followed by research articles (n = 21,530, 34.3%) and reviews (n = 5,851, 9.3%). Figure 2 shows the distribution of the publication types by year. Clinical studies accounted for approximately half of the publications, and their proportion increased steadily. The proportion of research articles, case reports, and clinical trials decreased, whereas that of reviews and meta-analyses tended to increase gradually.

FIGURE 2
www.frontiersin.org

Figure 2 Distribution of publication types per year.

The most productive country was China (n = 18,476, 35.1%), followed by Japan (n = 9,248, 17.6%) and the United States (n = 6,283, 11.9%) (Figure 3). China also had the highest number of multiple country publications (MCP) (n = 1,478, MCP ratio; 0.081), while Egypt had the highest MCP ratio (n = 647, MCP ratio = 0.198) (Table S1).

FIGURE 3
www.frontiersin.org

Figure 3 The contribution of the country to publication on hepatocellular carcinoma. SCP, single country publications; MCP, multiple country publications.

3.2 MeSH term analysis

Further analyses were performed using the MeSH terms found in the publications. Notably, 980 MeSH terms appeared 100 times or more and were included in this analysis, with a total of 945,754 times of occurrence (Supplemental Data 1). Table S2 shows the most widely studied top 20 MeSH terms in HCC research. Subsequently, we investigated focused diagnosis-, treatment-, and basic research-related MeSH terms.

Among the diagnosis-related terms, “Liver Cirrhosis” was the most studied. However, in the 2010s, “Biomarkers, Tumor” began to outpace “Liver Cirrhosis” (Figure 4A). In contrast to the rapid increase of “Biomarkers, Tumors,” the appearance of “alpha-fetoproteins,” considered an important biomarker of HCC, showed a slight increase. Rather, the rapid increase of “Biomarkers, Tumor” was accompanied by an increase of “MicroRNAs.” Regarding the MeSH terms related to hepatitis, which are the representative risk factors of HCC, four terms (“Hepatitis B virus,” “Hepatitis B,” “Hepatitis B, Chronic,” and “Hepatitis C”) were included in the top 10 terms and showed a steadily increasing tendency. Notably, the MeSH terms related to hepatitis B showed a steep increase, whereas “Hepatitis C” demonstrated no significant increase. Regarding imaging modalities, “Magnetic Resonance Imaging” was included in the top 10 terms and showed a steady increase.

FIGURE 4
www.frontiersin.org

Figure 4 Accumulative occurrence of the top 10 MeSH terms regarding diagnosis (A), treatment (B), basic research (C) of hepatocellular carcinoma over a 5-year trend, and newly appeared after 2000 (D).

Concerning the treatment-related MeSH terms, “Hepatectomy” was the most studied; however, recent studies related to “Antineoplastic Agents” showed a tendency to supersede hepatectomy (Figure 4B). The recent increase in research on “Antineoplastic Agents” appears to reflect the increase in studies on “Sorafenib,” which suggested that the increase in research on “Sorafenib” played an important role in the increase in research on “Antineoplastic Agents.” Studies related to other treatment modalities, such as hepatectomy, liver transplantation, and TACE, showed a steady increase, whereas recently, those on “Catheter Ablation” demonstrated a tendency to decrease slightly.

Regarding basic research, the study of “Cell Lines, Tumors,’’ appeared after 2000 and has been the most studied among these terms (Figure 4C). Conversely, “Tumor Cells, Cultured’’ were the most studied until the early 2000s; however, there has been a significant decline since then. Among the tumor cell lines, “Hep G2 Cells” were included in the top 10 MeSH terms and appeared in the late 2000s, showing a rapid increase.

By analyzing the MeSH terms that appeared after 2000, a subgroup analysis was also performed on newly emerging research directions. It accounted for 11.6% of the total number of MeSH terms and appeared only 55,988 times (Figure 4D). Among these terms, the number of publications with the topics, including “Cell Proliferation,” “Hep G2 cells,” and “microRNAs” substantially increased and appeared in 2003, 2006, and 2006, respectively. Although MeSH terms related to basic research comprised most of the top 10, “Young Adults” and “Sorafenib” were included in the top 10 as clinical-related MeSH terms.

3.3 LDA analysis

Overall, 61,891 publications with available abstracts were analyzed using the LDA method, excluding those without an abstract. The LDA method provided further information about popular topics and their relationships by generating a topic network from the abstract text analysis (Figure 5). The network was categorized into three clusters, including “basic research” (in purple), “diagnosis and treatment research” (in orange), and “epidemiology research” (in green). The focalization of topics and the weight of connections between the topics were demonstrated as the size of the circle and the thickness of the line, respectively.

FIGURE 5
www.frontiersin.org

Figure 5 Topic cluster network analysis using latent Dirichlet allocation. Purple, basic research; orange, diagnosis and treatment research; and green, epidemiology research. The size of the circle represents the number of articles on each topic, and the thickness of the line represents the weight of the connection between each topic.

The cluster of basic research constituted the largest proportion of these topics, where “proliferation of HCC cells,” “apoptosis,” and “gene and protein expression” were the top three research topics. In the cluster of diagnosis and treatment research, “survival analysis, prognosis,” “imaging with contrast dye,” and “TACE” were the top three research topics. In the cluster of epidemiology research, “risk factor,” “hepatitis virus infection,” and “hepatitis virus_carcinogenesis” were the top three research topics.

The basic research cluster included half of the entire set of publications but had a poor connection with the other two clusters, while comparatively, the epidemiology research cluster showed many associations with other clusters. Except for “gene and protein expression,” most of the topics in the basic research cluster had a strong connection only to other topics in the basic research cluster. The topics “gene and protein expression” demonstrated strong connections to the topics “survival analysis, prognosis” and “pathology” in other clusters.

4 Discussion

To date, the analysis of publications, which grew in volume over time, has been conducted directly by researchers, resulting in a limit to the number of publications that could be analyzed. With the development of artificial intelligence technology, the bibliographic analysis of numerous publications has become possible. A bibliometric analysis is usually performed to analyze research trends in various academic fields and to help suggest future research directions. We used an NLP, which is a type of machine learning to systematically analyze 62,856 HCC-related literature published over the past 30 years. To the best of our knowledge, this was the first machine learning bibliometric analysis on HCC research. We also visualized the results from a macroscopic perspective to make the analysis of the detected trends more intuitive.

The AFP is currently the most accepted phase 5 biomarker and is used frequently in the surveillance and diagnosis of HCC (14). However, AFP has limitations as a biomarker due to its low sensitivity and specificity for early HCC (15). As shown in Figure 4A, the topic “Biomarkers Tumor” has grown significantly, while the number of publications on the topic “alpha-fetoproteins” has increased slightly, indicating that research on novel biomarkers for HCC, particularly for early HCC, is needed, and that many related studies have been conducted. Figure 4D shows that “MicroRNAs” appeared after 2000 with a rapidly increasing frequency. The recent rapid increase in “Biomarkers, Tumor” may be due to the increase in these “MicroRNAs.” MicroRNAs play an important role in the pathogenesis of HCC and are expressed differentially even in the early stages of HCC (16). The emerging role of microRNAs as novel potential biomarkers via early diagnosis may change the face of HCC surveillance (17, 18).

The European Association for the Study of the Liver guidelines has recommended the use of CEUS, contrast-enhanced CT, or MRI as the diagnostic modalities for HCC surveillance (19). Many studies have suggested that MRI has a higher diagnostic value than CEUS or CT, particularly for small HCC, because of its higher sensitivity and accuracy (20, 21). Our study also demonstrated that “MRI” was the only imaging modality included in the top 10 diagnosis-related MeSH terms, and its frequency of appearance incidence has increased steadily.

Generally, therapeutic strategies for patients with HCC are determined based on Barcelona Clinic Liver Cancer criteria (2). In the early stage, surgical resection and local ablative therapies are established therapeutic options (22, 23); in the intermediate stage, TACE is considered the standard treatment, while for advanced HCC, systemic therapy is the preferred treatment modality. As shown in the treatment-related MeSH term analysis, surgical treatments, such as liver resection and transplantation, were still shown to be the most common; however, they were overtaken by the rapid emergence of “Antineoplastic Agents.” The rise of “Antineoplastic Agents” appeared to be closely related to the increase of “Sorafenib.” Sorafenib is a multi-tyrosine kinase inhibitor and the first molecular targeted agent to show efficacy in advanced HCC (24), and it was included in one of the most common MeSH terms in our study. The overall survival of patients with HCC remains poor compared to other gastrointestinal, despite substantial research in systemic therapies; therefore, more effective systemic agents are needed.

Tumor cell lines are created by isolating cancer cells from a tumor and growing them in a laboratory setting to study the characteristics of the cancer cells, including their growth patterns, gene expression, and response to drugs (2527). Hep G2 cells are one of the most used liver cancer cell lines due to their availability, easy handling, and the fact that they are derived from human HCC (28, 29). In our study, “Hep G2 Cells” among tumor cell lines were included in the top 10 basic research-related MeSH terms. According to our study, “Hep G2 cells’’ began to appear in HCC research after 2000 along with “MircroRNAs’’ and it subsequently showed a rapid increase in appearance. Researchers can use Hep G2 cells to manipulate the expression of specific microRNAs, which will enable them to study the potential of microRNAs as diagnostic and therapeutic targets for HCC (3033).

Basic research on HCC is crucial for understanding the fundamental mechanisms of HCC and can lead to identifying potential therapeutic targets. Our LDA analysis showed that half of the entire number of studies on HCC focused on basic research. However, as found in other bibliometric studies, there has been little correlation between basic research and clinical studies in the research field of HCC (34, 35). One potential reason for this lack of correlation may have been a communication gap between the basic researchers and the clinicians. Another factor may have been the difficulty of translating basic research findings into clinical applications. Therefore, bridging these communication gaps through interdisciplinary collaboration and information sharing may help translate basic research findings into clinical research. Improving communication and collaboration between researchers and clinicians and investing in translational research efforts may also accelerate the progress toward more effective treatments for patients with HCC.

A bibliometric analysis may provide a comprehensive overview of a specific research field for performing quantitative analyses of academic literature. To the best of our knowledge, this is the first bibliometric analysis of HCC using machine learning techniques. Therefore, this study has important significance because it shows the developmental process of HCC research and possible future research directions.

However, our study had some limitations. First, only the Pubmed database was used in this study, and other databases, such as Web of Science, Scopus, and Embase, were excluded, which may have resulted in the exclusion of some research on HCC. Second, because only the MeSH term was used in the publication search, some publications not related to the subject may not have been included. This may have led to selection bias in our study. Finally, the newest research area of significance, such as immunotherapy, may not have been well discussed in this study due to the methodological problems inherent in bibliometric analyses and the low number of published studies in this area. However, we believe that machine learning bibliometric analysis may be a new tool that can enable scientists to comprehensively analyze large amounts of data.

In conclusion, this study was the first bibliometric study to analyze more than 60,000 publications on HCC over the past 30 years using machine learning. Our study showed that a machine learning bibliometric analysis may help researchers investigate large datasets, understand current trends, and identify research directions. Despite significant efforts in basic research, the connection with clinical practice is still lacking. Therefore, more efforts are needed to convert and apply the results of basic research to clinical treatment. Additionally, it was found that microRNAs have potential as diagnostic and therapeutic targets for HCC.

Author contributions

Conceptualization, JH and S-WK; methodology, KL; formal analysis, KL; Writing-original draft preparation, HS and SS; Writing-review and editing, JH. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (Ministry of Science and ICT) (No.RS-2022-00165595).

Acknowledgments

We would like to thank Editage (www.editage.co.kr) for English language editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1227991/full#supplementary-material

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2018) 68:394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Singal AG, El-Serag HB. Hepatocellular carcinoma from epidemiology to prevention: translating knowledge into practice. Clin Gastroenterol Hepatol (2015) 13:2140–51. doi: 10.1016/j.cgh.2015.08.014

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Fattovich G, Stroffolini T, Zagni I, Donato F. Hepatocellular carcinoma in cirrhosis: incidence and risk factors. Gastroenterology (2004) 127:S35–50. doi: 10.1053/j.gastro.2004.09.014

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Singal AG, Lampertico P, Nahon P. Epidemiology and surveillance for hepatocellular carcinoma: new trends. J Hepatol (2020) 72:250–61. doi: 10.1016/j.jhep.2019.08.025

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Vogel A, Meyer T, Sapisochin G, Salem R, Saborowski A. Hepatocellular carcinoma. Lancet (2022) 400:1345–62. doi: 10.1016/S0140-6736(22)01200-4

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Zou X, Vu HL, Huang H. Fifty years of accident analysis & prevention: a bibliometric and scientometric overview. Accid Anal Prev (2000) 144:105568. doi: 10.1016/j.aap.2020.105568

CrossRef Full Text | Google Scholar

7. Stout NL, Alfano CM, Belter CW, Nitkin R, Cernich A, Lohmann Siegel K, et al. A Bibliometric analysis of the landscape of cancer rehabilitation research (1992-2016). J Natl Cancer Inst (2018) 110:815–24. doi: 10.1093/jnci/djy108

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res (2003) 3:992–1022.

Google Scholar

9. Chen X, Xie H, Wang FL, Liu Z, Xu J, Hao T. A bibliometric analysis of natural language processing in medical research. BMC Med Inform Decis Mak (2018) 18:14. doi: 10.1186/s12911-018-0594-x

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Wang K, Feng C, Li M, Pei Q, Li Y, Zhu H, et al. A bibliometric analysis of 23,492 publications on rectal cancer by machine learning: basic medical research is needed. Therap Adv Gastroenterol (2020) 13:1756284820934594. doi: 10.1177/1756284820934594

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Gal D, Thijs B, Glänzel W, Sipido KR. Hot topics and trends in cardiovascular research. Eur Heart J (2019) 40:2363–74. doi: 10.1093/eurheartj/ehz282

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Franco P, Segelov E, Johnsson A, Riechelmann R, Guren MG, Das P, et al. A machine-learning-based bibliometric analysis of the scientific literature on anal cancer. Cancers (Basel) (2022) 14:1697. doi: 10.3390/cancers14071697

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Jacomy M, Venturini T, Heymann S, Bastian M. ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software. PloS One (2014) 9:e98679. doi: 10.1371/journal.pone.0098679

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Tzartzeva K, Obi J, Rich NE, Parikh ND, Marrero JA, Yopp A, et al. Surveillance imaging and alpha fetoprotein for early detection of hepatocellular carcinoma in patients with cirrhosis: a meta-analysis. Gastroenterology (2018) 154:1706–18. doi: 10.1053/j.gastro.2018.01.064

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Parikh ND, Mehta AS, Singal AG, Block T, Marrero JA, Lok AS. Biomarkers for the early detection of hepatocellular carcinoma. Cancer Epidemiol Biomarkers Prev (2020) 29:2495–503. doi: 10.1158/1055-9965.EPI-20-0005

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Oura K, Morishita A, Masaki T. Molecular and functional roles of microRNAs in the progression of hepatocellular carcinoma-a review. Int J Mol Sci (2020) 21:8362. doi: 10.3390/ijms21218362

PubMed Abstract | CrossRef Full Text | Google Scholar

17. D’Anzeo M, Faloppi L, Scartozzi M, Giampieri R, Bianconi M, Del Prete M, et al. The role of micro-RNAs in hepatocellular carcinoma: from molecular biology to treatment. Molecules (2014) 19:6393–406. doi: 10.3390/molecules19056393

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Khare S, Khare T, Ramanathan R, Ibdah JA. Hepatocellular carcinoma: the role of microRNAs. Biomolecules (2022) 12:645. doi: 10.3390/biom12050645

PubMed Abstract | CrossRef Full Text | Google Scholar

19. EASL Clinical Practice Guidelines. Management of hepatocellular carcinoma. J Hepatol (2018) 69:182–236. doi: 10.1016/j.jhep.2018.03.019

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Chen X, Li M, Guo R, Liu W, Li J, Zong X, et al. The diagnostic performance of contrast-enhanced CT versus extracellular contrast agent-enhanced MRI in detecting hepatocellular carcinoma: direct comparison and a meta-analysis. Abdom Radiol (NY) (2022) 47:2057–70. doi: 10.1007/s00261-022-03484-7

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Semaan S, Vietti Violi N, Lewis S, Chatterji M, Song C, Besa C, et al. Hepatocellular carcinoma detection in liver cirrhosis: diagnostic performance of contrast-enhanced CT vs. MRI with extracellular contrast vs. gadoxetic acid. . Eur Radiol (2020) 30:1020–30. doi: 10.1007/s00330-019-06458-4

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Reig M, Forner A, Rimola J, Ferrer-Fàbrega J, Burrel M, Garcia-Criado Á, et al. BCLC strategy for prognosis prediction and treatment recommendation: the 2022 update. J Hepatol (2022) 76:681–93. doi: 10.1016/j.jhep.2021.11.018

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Tümen D, Heumann P, Gülow K, Demirci CN, Cosma LS, Müller M, et al. Pathogenesis and current treatment strategies of hepatocellular carcinoma. Biomedicines (2022) 10:3202. doi: 10.3390/biomedicines10123202

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Llovet JM, Ricci S, Mazzaferro V, Hilgard P, Gane E, Blanc JF, et al. Sorafenib in advanced hepatocellular carcinoma. N Engl J Med (2008) 359:378–90. doi: 10.1056/NEJMoa0708857

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Giordano S, Columbano A. Met as a therapeutic target in HCC: facts and hopes. J Hepatol (2014) 60:442–52. doi: 10.1016/j.jhep.2013.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yang G, Xu Q, Wan Y, Zhang L, Wang L, Meng F. Circ-CSPP1 knockdown suppresses hepatocellular carcinoma progression through miR-493-5p releasing-mediated HMGB1 downregulation. Cell Signal (2021) 86:110065. doi: 10.1016/j.cellsig.2021.110065

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Zheng S, Bian H, Li J, Shen Y, Yang Y, Hu W. Differentiation therapy: unlocking phenotypic plasticity of hepatocellular carcinoma. Crit Rev Oncol Hematol (2022) 180:103854. doi: 10.1016/j.critrevonc.2022.103854

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Dehn PF, White CM, Conners DE, Shipkey G, Cumbo TA. Characterization of the human hepatocellular carcinoma (hepg2) cell line as an in vitro model for cadmium toxicity studies. In Vitro Cell Dev Biol Anim (2004) 40:172–82. doi: 10.1290/1543-706X(2004)40<172:COTHHC>2.0.CO;2

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Donato MT, Tolosa L, Gómez-Lechón MJ. Culture and functional characterization of human hepatoma HepG2 cells. Methods Mol Biol (2015) 1250:77–93. doi: 10.1007/978-1-4939-2074-7_5

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Dai HT, Wang ST, Chen B, Tang KY, Li N, Wen CY, et al. microRNA-375 inhibits the Malignant behaviors of hepatic carcinoma cells by targeting NCAPG2. Neoplasma (2022) 69:16–27. doi: 10.4149/neo_2021_210318N358

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Du X, Fan W, Chen Y. microRNA-520f inhibits hepatocellular carcinoma cell proliferation and invasion by targeting TM4SF1. Gene (2018) 657:30–8. doi: 10.1016/j.gene.2018.03.003

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Chen X, Zhang Q, Ma W, Lan T, Hong Z, Yuan Y. The abnormal expression of microRNA-542-3p in hepatocellular carcinoma and its clinical significance. Dis Markers (2018) 2018:3973250. doi: 10.1155/2018/3973250

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Zhang Y, Li T, Qiu Y, Zhang T, Guo P, Ma X, et al. Serum microRNA panel for early diagnosis of the onset of hepatocellular carcinoma. Med (Baltimore) (2017) 96:e5642. doi: 10.1097/MD.0000000000005642

CrossRef Full Text | Google Scholar

34. Zhang Z, Xia F, Li X. A comprehensive overview of the parathyroid tumor from the past two decades: machine learning-based bibliometric analysis. Front Endocrinol (Lausanne) (2021) 12:811555. doi: 10.3389/fendo.2021.811555

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Zhang Z, Wang Z, Huang Y. A bibliometric analysis of 8,276 publications during the past 25 years on cholangiocarcinoma by machine learning. Front Oncol (2021) 11:687904. doi: 10.3389/fonc.2021.687904

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: hepatocellular carcinoma, bibliometric analysis, machine learning, latent Dirichlet allocation, research trend

Citation: Lee K, Hwang JW, Sohn HJ, Suh S and Kim S-W (2023) A systematic review of progress on hepatocellular carcinoma research over the past 30 years: a machine-learning-based bibliometric analysis. Front. Oncol. 13:1227991. doi: 10.3389/fonc.2023.1227991

Received: 24 May 2023; Accepted: 31 July 2023;
Published: 17 August 2023.

Edited by:

John Gibbs, Hackensack Meridian Health, United States

Reviewed by:

Saber A Amin, University of Nebraska Medical Center, United States
Haihong Zhu, School of Medicine, Zhejiang University, China

Copyright © 2023 Lee, Hwang, Sohn, Suh and Kim. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ji Woong Hwang, ZGF0dG9yZWVAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.