Application of artificial intelligence in the diagnosis of hepatocellular carcinoma
•,,,,.
...
Abstract
Hepatocellular carcinoma (HCC) is a major cause of cancer-related deaths worldwide. This review explores the recent progress in the application of artificial intelligence (AI) in radiological diagnosis of HCC. The Barcelona Classification of Liver Cancer criteria guides treatment decisions based on tumour characteristics and liver function indicators, but HCC often remains undetected until intermediate or advanced stages, limiting treatment options and patient outcomes. Timely and accurate diagnostic methods are crucial for enabling curative therapies and improving patient outcomes. AI, particularly deep learning and neural network models, has shown promise in the radiological detection of HCC. AI offers several advantages in HCC diagnosis, including reducing diagnostic variability, optimising data analysis and reallocating healthcare resources. By providing objective and consistent analysis of imaging data, AI can overcome the limitations of human interpretation and enhance the accuracy of HCC diagnosis. Furthermore, AI systems can assist healthcare professionals in managing the increasing workload by serving as a reliable diagnostic tool. Integration of AI with information systems enables comprehensive analysis of patient data, facilitating more informed and reliable diagnoses. The advancements in AI-based radiological diagnosis hold significant potential to improve early detection, treatment selection and patient outcomes in HCC. Further research and clinical implementation of AI models in routine practice are necessary to harness the full potential of this technology in HCC management.
Introduction
Hepatocellular Carcinoma (HCC) one of the leading cancers globally.1 2 A census conducted by the global cancer observatory in 2020 showed rising rates of HCC globally, particularly within North African and East Asian populations.1 The aetiology of HCC is unique in its potential for curative therapy through surgical resection, radiofrequency ablation and liver transplants for patients with early stages of the disease.3–7 Currently, the most widely accepted classification of HCC is the Barcelona Classification of Liver Cancer (BCLC) criteria which characterise HCCs based on the number and size of nodules, supported by secondary parameters such as liver function indicators.8 The BCLC guidelines also guide treatment options by physicians and curative treatments are generally reserved only for BCLC A and BCLC 0 patients.8 Unfortunately, symptomatic presentation of HCC generally only occurs in intermediate or advanced stage disease (BCLC B and C, respectively) where treatment options and efficacy are limited.9–11 As such, timely and accurate diagnostic methods are crucial as they are open to the possibility for more curative therapies and are likely to significantly improve patient outcomes.12–14 Current first-line surveillance methods include laboratory parameters such as alpha-feto protein (AFP) and imaging techniques such as liver ultrasounds, which are preferred due to their non-invasive nature.15–17 Recent advances in artificial intelligence (AI) have opened new applications in conjunction with existing diagnostic tools. Improvements in deep learning (DL) and neural network models have seen comparable or in some cases superior outcomes compared with physicians in the diagnosis of disease.18–20 Within HCC, AI models have been incorporated in the radiological and histological detection of the disease. This paper will discuss the recent advances of AI in the radiological diagnosis of HCC in the hope of providing a clearer landscape of the present and future of its application.
Terminology
The field of AI has constantly evolved since its inception with new concepts and applications arising every few years. As such, adequate understanding of frequently used terminologies is indispensable in discussions on the topic. This section will provide a brief introduction to commonly used nomenclature within the field of AI. First, AI refers to any science or engineering involved in making intelligent machines, especially intelligent computer programmes, for problem solving.21 AI covers a broad range of concepts that aim to allow computers to understand human intelligence but is not confined by what is biologically observable. AI models may be generally separated into two concepts; narrow AI which are trained to support a specific task and general AI which would possess the self-awareness to identify, learn and solve problems with minimal intervention22 (figure 1). Currently, general AI models are purely theoretical and no true general AI models have been created.23 As such, this paper will focus on narrow AI which has seen implementation clinically. DL and machine learning (ML) are subfields of AI and refer to a system that seeks to identify meaningful relationships and patterns from observed data.24 25 Technically, DL models differ from ML models through its method of pattern recognition. ML systems refer to any system that extracts information from one or more supplied data sets and applies it to achieve a specified outcome. DL models are a subset of ML which uses multiple layers of non-linear processing units for feature extraction and classification, creating a hierarchical system capable of incorporating large amounts of data for analysis.26 Additionally, classical ML models rely on supervised learning, in which human operators manually delineate the pertinent features required to classify the image, while DL models can be trained using both labelled and unlabelled data sets.27 Lastly, artificial neural networks (ANNs) are an application of ML which aim to imitate human pattern recognition behaviour by modelling the way biological neurons signal one another.28 29 This involves three general layers: a node layer, one or more hidden layers and an output layer. The node layer receives, and weighs data provided to produce a quantifiable signal. If the produced signal exceeds a specified threshold, the node is ‘lit’ and activates downstream layers, mimicking the way neurons are fired through sufficient stimuli. Through that, ANNs can perform complex analytics and adapt to unique tasks based on the activation of different layers. Convolutional neural networks (CNNs) are an advancement of ANNs, consisting of multiple layers of ANNs which surpass the classical method in performance, pattern recognition and object recognition.30 31 It is important to note that while these terminologies describe unique concepts within the field of AI, significant overlaps exist between them. For example, while DL is considered a subset of ML, its ability for unsupervised learning can be applied to ANNs and thus could also be considered a subset of ANNs. As such, accurate nomenclature is crucial when discussing the topic.
Overview of nomenclature associated with artificial intelligence. Definitions are based on the International Business Machines (IBM) corporation descriptions. The different components of a neural network have been colour-coded; red circles represent the input layer, yellow circles represent the hidden layers and green circles represent the output layer.
Role of AI in diagnosis of HCC
The advantages of AI in diagnosing HCC are widespread but will be summarised here in three roles: the reduction of diagnostic variability, the reallocation of healthcare resources and the optimisation of data analysis. First, reduction of diagnostic variability. The diagnosis of HCC relies on an interplay of radiological, histological and cytological parameters.32 33 Of the three, radiological identification of HCC remains the preferred method for its non-invasive nature but is the most prone to variability due to factors such as experience of the radiologist and interpreting physician, patient factors and workflow variability.34 35 In a study by Covert et al evaluating the effect of interoperator variability on MRI-based manual segmentation of 140 HCC lesions, interoperator variability was found to be relatively insignificant (reliability coefficient for response evaluation criteria in solid tumours (RECIST) diameter=0.966) but was estimated to affect the perception of tumour-control probability by up to 25%.36 Theoretically, this could indicate a difference in treatment regimen and outcome between clinicians. As such, the benefit of AI is twofold; its empirical analysis enables objective interpretation of an image while its non-biological nature helps ensure consistent analysis regardless of time or patient load. The second role of AI in imaging would be the reallocation of healthcare resources. The non-biological nature of AI ensures consistent, empirical analysis of patient data while consistent research has shown comparable, if not superior, diagnostic capabilities of AI to physicians in a variety of diseases.37 This holds true in the diagnosis of HCC where, in a preliminary study evaluating the use of a CNN for the interpretation of 228 ultrasound videos, the overall detection rate of the AI (89.8%; 95% CI 84.5% to 95.0%) was significantly higher compared with non-radiologist physicians (29.1%; 95% CI 21.2% to 37.0%, p<0.001) and radiologists (70.9%; 95% CI 63.0% to 78.8%, p<0.001).38 As such, the implementation of AI systems could help ease the workload of physicians by incorporating a constantly available tool to assist the diagnosis of HCC. Lastly, the optimisation of data analysis. The integration of information systems in healthcare has provided clinicians with an assortment of patient data at their disposal, enabling more reliable and informed diagnoses. However, this increase in volume also necessitates optimal integration to ensure maximum benefit. An example within HCC is a model developed by Kim et al which integrated patient history (such as hepatitis B/C and cirrhosis), patient characteristics (such as age and sex), antiviral treatments, imaging data and laboratory parameters to generate a risk score for developing HCC in patients with chronic hepatitis B.39
Principles of radiomics
Radiomics is the emerging field involving the extraction of quantitative features and subsequent analysis of radiological images using AI models.40 The process of training a radiomics model has been extensively described in literature41–45 and can be summarised into five steps: image requisition and preprocessing, segmentation, feature extraction, model training and model validation.44 Image requisition involves selecting imaging data and identifying potential features causing variability when training the AI model. Crucially, developing a protocol for image requisition requires a balance of standardisation (to reduce noise and confounding) and variability (to ensure generalisability of the model in a clinical setting).46 Imaging data are then segmented into regions of interests delineating the tumour and its surrounding areas before being fed into the selected AI model for feature extraction. The models are then refined for accuracy, complexity and efficiency before being validated through a validation data set (usually derived from a subset of the original data). Lastly, the model is tested on a ‘blind’ training data set to determine its performance. Current radiomics models in HCC have been developed for ultrasound, CTs and MRIs. The following sections will explore some of the notable developments of AI models for each imaging technique.
Application in ultrasound
Liver ultrasonography has been a standard radiological modality for HCC surveillance due to its non-invasiveness and absent risk of radiation or contrast exposure.47 However, the utility of ultrasound in detecting HCC is being increasingly debated due to its suboptimal performance, particularly in early HCC, and its operator-dependent variability. Ultrasound detection of HCC remains relatively reliable with a meta-analysis of 32 studies (comprising 13 367 patients of varying HCC severity) concluding with an overall sensitivity of 84% (95% CI 76% to 92%). However, the study also evaluated the sensitivity of detecting early HCC at 47% (95% CI 33% to 61%).48 This disparity is likely due to most patients with HCC presenting with cofounding comorbidities such as cirrhosis which produce a coarse pattern on ultrasound, precluding identification of small HCC nodules.49 Additional factors such as obesity, fatty-liver disease and fibrosis may similarly impair the quality of liver ultrasound, thereby obfuscating the presence of smaller HCC nodules in early HCC.50 While newer applications, for example, contrast ultrasounds and diagnostic guidelines have helped limit the diagnostic variability of ultrasounds in HCC, recent interest has been on the implementation of AI models in interpreting ultrasound images. A CNN model developed by Tiyarattanachai et al, highlighted the diagnostic potential of AI systems in liver ultrasounds.38 There, a pretrained CNN model (RetinaNet51) was fed 25 557 images of various common ultrasound findings (eg, HCC, liver cysts and haemangiomas) followed by refinement using 228 ultrasound videos with difficult frames to create a model specialising in the differentiation of ultrasound findings. The model was then tested against 175 videos containing 127 lesions and its performance was compared with physicians (both non-radiologists and radiologists) to assess its potential. Overall, the AI system achieved an overall detection rate of 89.8% (114/127 lesions; 95% CI 84.5% to 95.0%) and a 100% detection rate for HCC (23/23 lesions; 95% CI 85.2% to 100%). Comparatively, overall detection rates for non-radiologist physicians and radiologists were 29.1% and 70.9%, respectively (both p<0.001) while detection rates for HCC were 39.1% (p<0.001) and 69.6% (p=0.016), respectively. This study demonstrated the feasibility of AI systems in detecting HCC through ultrasound with detection rates significantly higher than both non-radiologist and radiologist clinicians. Current clinical guidelines suggests early surveillance for HCC using ultrasound and serum AFP.52 However AFP remains negative in nearly two-thirds of patients at any stage of HCC53 and screening outcomes using ultrasound alone remain suboptimal.48 A DL model detailed by Zhang et al highlighted the potential of AI as a screening tool for AFP-negative HCCs.54 The CNN model (Xception55) was trained using a total of 305 images of HCC and focal nodular hyperplasia taken using B-mode ultrasound and model testing was done using 102 B-mode ultrasound images. HCC staging, lesion size, echogenicity and liver function were heterogenous in both training and testing data sets. Diagnostic performance was then compared with four other available CNN models (MobileNet,56 Resnet50,57 Densenet121,58 InceptionV3).59 In total, the model displayed promising diagnostic capabilities with an overall area under the curve (AUC) of 93.68% (95% CI 88.6% to 98.8%), sensitivity of 96.08%, specificity of 76.92% and accuracy of 86.41%, outperforming other CNN models and providing an optimistic non-invasive tool for HCC surveillance. Table 1 summarises the key studies evaluating the use of AI in the interpretation of liver ultrasound.
Table 1
|
Overview of studies involving AI in HCC ultrasound as of January 2023
Application in CT
CT is a primary surveillance method for HCC, exhibiting utility in particular for detecting dysplastic lesions and early stage HCC.60 61 Current guidelines recommend a multiphasic CT with extracellular contrast agents as a first-line option for the diagnosis and staging of HCC.62 A meta-analysis by Lee et al evaluating the diagnostic capabilities of CT imaging for detecting HCC concluded with an overall sensitivity of 72% (95% CI 62% to 80%).63 Additionally, the study reported a sensitivity of 74% and specificity of 81% when CT imaging was used as the initial diagnostic tool for focal liver lesions detected during surveillance. These results highlight that while acceptable, there is significant room for optimisation in CTs as a diagnostic tool for HCC and the incorporation of AI has shown promise in that regard. A DL model outlined by Wang et al reported optimistic performance in identifying patients with HCC from CT data with an AUC of 0.887 (95% CI 0.855 to 0.919) for an internal data set and 0.883 (95% CI 0.855 to 0.911) for an external data set.64 In the paper, 647 individuals with HCC of various stage of disease and 6865 non-HCC individuals were analysed using plain and contrast-enhanced CT. Obtained images were then fed into two CNN models (NoduleNet and HCCNet) for training, and performance testing was done on an internal and external data set. A subset of the test data set was also reviewed by three radiologists to obtain a comparison of performance. For the internal data set, the AI model achieved a sensitivity of 78.4% (95% CI 72.4% to 83.7%), specificity of 84.4% (95% CI 78.0% to 89.6%) and an overall accuracy of 81.0% (95% CI 76.8% to 84.8%). For the external data set, the model achieved a sensitivity of 89.4% (95% CI 85.0% to 92.8%), specificity of 74.0% (95% CI 68.5% to 78.9%) and an accuracy of 81.3% (95% CI 77.8% to 84.5%). Comparatively, the AI model performed similarly to the radiologists in terms of accuracy (0.853 vs 0.818, p = 0.107), sensitivity (0.815 vs 0.753, p = 0.064) and specificity (0.902 vs 0.903, p = 0.981) for the internal data set, with similar findings of the external data set. However, the true utility of the model was displayed when it was used to assist the radiologists in improving their diagnostic performance as seen by a significant improvement in diagnostic accuracy for the internal data set (0.873 vs 0.818, p = 0.026) and external data set (0.854 vs 0.793, p = 0.017). This study indicated the potential for AI systems to standardise the diagnosis and stratification of HCC when using CT imaging. Table 2 summarises the key studies evaluating the use of AI in the interpretation of liver CTs.
Table 2
|
Overview of studies on the application of AI in HCC CTs as of January 2023
Application in MRI
Liver MRI is another first-line imaging tool for the diagnosis and classification of HCC.62 Advantages of MRI include its high-quality imaging of the entire liver, contrasting enabling detection and classification of HCC and lack of ionising radiation allowing safer routine use.65–67 MRIs have also shown utility in detecting small HCC nodules more consistently than CTs. In a meta-analysis of 34 articles, MRIs had an overall sensitivity of 88% (95% CI 83% to 92%) and specificity of 94% (95% CI 85% to 98%). However, MRIs also had a significantly lower sensitivity in detecting lesions less than 2 cm with a sensitivity of 62% for lesions <2 cm and 95% for lesions >2 cm (p<0.02). These results highlight that while MRIs generally have higher detection rates, more optimisations may be possible and in particular for detecting early HCC. Current AI models have shown promise in differentiating common liver lesions such as HCC, haemangioma and metastatic tumours through feature extraction of MRIs. In a study by Oyama et al, non-contrast-enhanced fat-suppressed 3D T1-weight gradient echo MRIs of 150 hepatic tumours (50 HCC, 50 haemangiomas and 50 metastatic tumours) were fed into an inhouse ML model. There, the images were ranked based on their likely identity (ie, HCC, haemangioma or metastatic tumour). Overall, the ML model was able to differentiate HCCs and metastatic tumours with an accuracy of 92% (92/100 lesions), sensitivity of 100% (50/50 lesions), specificity of 84% (42/50 lesions) and an AUC of 95%. Similarly, the model differentiated HCCs with haemangiomas with an accuracy of 90% (90/100 lesions), sensitivity of 96% (48/50 lesions), specificity of 84% (42/50) and AUC of 95%. Despite the limited analysis, the results support the potential of AI systems in the interpretation of MRIs.68 Another study by Oestmann et al evaluating the use of CNNs in identifying HCCs on contrast-enhanced MRIs identified key factors affecting the diagnostic accuracy of ML models.69 In the study, 118 patients (73 HCC and 45 non-HCC) were evaluated using contrast-enhanced MRIs and the images were analysed in a CNN. Atypical lesions, defined as lesions not displaying typical MRI appearances based on the Liver Imaging Reporting and Data System criteria (ie, arterial hyperenhancement, washout and enhancing rim/pseudocapsule),70 were included to determine the correlation between lesion grading and CNN performance. The study concluded with the CNN demonstrating an overall accuracy of 87.3% and sensitivities/specificities for HCC and non-HCC lesions of 92.7%/82.0% and 82.0%/92.7%, respectively. CNN performance was also directly correlated with the lesion grading system, becoming less accurate the more atypical imaging features the lesion showed. This study highlighted the potential pitfall of an ML system as their strict algorithm lacked the flexibility to consistently identify atypical lesions. Table 3 summarises the key studies evaluating the use of AI in the interpretation of liver MRIs.
Table 3
|
Summary of studies on the application of AI in HCC MRIs as of January 2023
Other applications of AI in HCC diagnosis
Apart from directly aiding in the interpretation of radiological images to diagnose HCC, several other applications have emerged to assist in the surveillance and classification of HCC. One such example would be the development of integrated ML models for risk-score prediction.39 A recent model developed by Kim et al aimed to predict the risk of HCC in Korean and Caucasian patients with hepatitis B. The ML model was developed which incorporated ultrasound images and 10 baseline parameters to assess the risk of HCC in patients with hepatitis B virus (HBV). These parameters were the presence of cirrhosis, age, platelet count, antiviral agent used, sex, serum alanine aminotransferase levels, baseline serum levels of HBV DNA, serum levels of albumin and bilirubin, and hepatitis B e-antigen (HBeAg) status, which are listed in the order of importance. The results were prospectively validated against a Korean (n=5817) and Caucasian (n=1640) cohort and assessed with a Concordance Index (c-index).71 The model achieved a c-index of 0.79 (95% CI 0.78 to 0.80) and 0.81 (95% CI 0.79 to 0.83), respectively, in the Korean and Caucasian cohort and highlights the potential of AI models in the integration of patient parameters and for the risk stratification of HCC.
Current challenges and future directions
The recent boom of AI models into mainstream popularity has placed increased focus on their applications in healthcare. However, despite constant improvements in hardware and software capabilities, several challenges remain unaddressed. First, the standardisation of study design. Due to the relative infancy of ML and CNN models in healthcare, limited regulations exist governing their application. This holds true for their use in HCC diagnostics, where there is currently scarce data on their efficacy in actual patient settings. As of writing this paper, there is currently no consensus on the feasibility of AI in large-scale HCC radiomics nor standardised approaches on how results should be reported or interpreted. A recent meta-analysis on the quality assessment standards in systematic reviews of AI accuracy have also shown studies to be afflicted by biasness.72 These findings included 243 out of 423 (57.5%) studies reporting a high or unclear risk of patient selection bias, 110 studies (26.0%) reporting a risk of bias in index test selection, 121 studies (28.6%) reporting a risk of bias in reference standard and 157 studies (37.1%) reporting a risk of bias in study flow. The study highlights the need for AI-specific quality assessment tools and guidelines to facilitate the safe use of AI tools into clinical practice. Several guidelines have been proposed such as the Standard Protocol Items: Recommendations for Interventional Trials - Artificial Intelligence (SPIRIT-AI) and Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT AI) guidelines which propose a framework for designing and reporting AI-related clinical trials to ensure robustness and accuracy of results.73 Additionally, while several studies exist evaluating the performance of AI systems, the vast majority of these have been retrospective in nature and prospective evidence in the field remains limited.
Another challenge is the quality assessment of data sets. Proper training and validation of ML models are crucial to ensure accuracy which necessitates quality data sets.74 Currently, most published studies on the applications of AI in HCC diagnostics have focused on improving the quality of their models through additions and optimisation on the model itself (such as optimising feature selection and increasing CNN layers). However, limited studies have been published focusing on the impact of data set quality on ML training and the ideal features of a good data set.75 76 Differences in diagnostic equipment and workflows have also made it difficult to accurately identify an optimal imaging workflow for training AI models. A systematic review by Fatania et al provides insight into how the optimal radiological features could be evaluated.77 In the paper, the intensity standardisation techniques of 12 studies were classified and evaluated to assess reliability based on the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) quality-assessment criteria.78 Despite the small sample size (n=12) and focus on gliomas, the paper by Fatania et al serves as a helpful framework for developing future evaluative studies for AI-related image acquisition methods in HCC. Another interesting approach has been the use of a supplementary AI model to automatically detect and exclude poor-quality data in a training set.79 The proposed model, based on a novel technique of untrainable data cleansing, identifies and removes a subset of the data that AI models are unable to correctly label (classify) during the AI training process. The feasibility of their model was evaluated using a publicly available data set of 624 paediatric X-rays, of which 200 images were flagged as ‘noisy’ by the system. From there, a trained radiologist was given a subset of the data (100 correct images and 100 ‘noisy’ images based on the AI’s classification) for interpretation. Results showed that the reader consensus between the radiologist’s label and the original label was significantly higher for the correct images (Cohen’s κ=0.65) compared with the noisy images (Cohen’s κ=0.05). These results indicate a high discordance in interpretation of the ‘noisy’ images that would make them suboptimal for a training data set. However, the feasibility of such a model remains unevaluated in HCC radiomics.
Lastly, the availability of large data sets remains scare. In addition to the quality of data, sufficient images are also required to optimise ML performance. Most published studies have relied solely on internal data sets for the training and validation of their models. Although the use of internal validation is acceptable, the development of new and existing public databases for external validation would be a core for the future of the ML landscape. The benefit of a large, public data set is multifold. First, it improves the accuracy of feature extraction, improving the diagnostic performance of the AI model.80 81 The fostering of a culture of cooperation would assist in the creation of databases of sufficient size and detail to adequately train new AI models.82 83 Second, it provides the opportunity for smaller hospitals, which may lack the experience or resources to create custom ML systems, to incorporate AI into their diagnostic frameworks. To that end, several sources have already made their data sets publicly available,81 84 85 marking a good first step towards an integrative data network. However, key concerns such as the cross-compatibility of data sets between hospital information systems, the safety and security of patient data, and the economic cost of developing the system must be addressed.
Conclusion
The recent surge of AI into the mainstream has raised discussion about their use in healthcare. To that extent, many hospitals have begun evaluating their feasibility. In this paper, we discussed the current landscape of AI application in the radiological diagnosis of HCC and its future directions. Current research has been optimistic, with AI models consistently reported to outperform clinicians in the interpretation of ultrasound, CT and MRI. However, rather than replace the role of clinicians, AI systems should be seen as tools bolstering the confidence of clinicians (in particular inexperienced or non-radiology trained clinicians) in their diagnosis of HCC. Despite the positive findings, it is prudent that we also acknowledge that the implementation of AI into healthcare will come with new challenges such as the financial feasibility, standardisation of use and security of data systems. All in all, the future of AI systems in diagnosing HCC looks bright but large-scale prospective is still needed.
Contributors: All authors have made substantial contributions to all of the following: (1) The conception and design of the study, or acquisition of data, or analysis and interpretation of data, (2) Drafting the article or revising it critically for important intellectual content, (3) Final approval of the version to be submitted. All authors approve the final version of the manuscript and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.
Funding: The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests: CHN has served as a consultant for Boxer Capital.
Patient and public involvement: Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Provenance and peer review: Not commissioned; externally peer reviewed.
Ethics statements
Patient consent for publication:
Not applicable.
Ethics approval:
Not applicable.
Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin2021; 71:209–49. doi:10.3322/caac.21660•Google Scholar
Mazzaferro V, Regalia E, Doci R, et al. Liver transplantation for the treatment of small hepatocellular carcinomas in patients with cirrhosis. N Engl J Med1996; 334:693–9. doi:10.1056/NEJM199603143341104•Google Scholar
Bruix J, Cirera I, Calvet X, et al. Surgical resection and survival in Western patients with hepatocellular carcinoma. J Hepatol1992; 15:350–5. doi:10.1016/0168-8278(92)90067-y•Google Scholar
Bismuth H, Chiche L, Adam R, et al. Liver resection versus transplantation for hepatocellular carcinoma in cirrhotic patients. Ann Surg1993; 218:145–51. doi:10.1097/00000658-199308000-00005•Google Scholar
Ringe B, Pichlmayr R, Wittekind C, et al. Surgical treatment of hepatocellular carcinoma: experience with liver resection and transplantation in 198 patients. World J Surg1991; 15:270–85. doi:10.1007/BF01659064•Google Scholar
Cucchetti A, Elshaarawy O, Han G, et al. Potentially curative therapies’ for hepatocellular carcinoma: how many patients can actually be cured. Br J Cancer2023; 128:1665–71. doi:10.1038/s41416-023-02188-z•Google Scholar
Reig M, Forner A, Rimola J, et al. BCLC strategy for prognosis prediction and treatment recommendation: the 2022 update. J Hepatol2022; 76:681–93. doi:10.1016/j.jhep.2021.11.018•Google Scholar
Aljumah AA, Kuriry H, AlZunaitan M, et al. Clinical presentation, risk factors, and treatment modalities of hepatocellular carcinoma: a single tertiary care center experience. Gastroenterol Res Pract2016; 2016. doi:10.1155/2016/1989045•Google Scholar
Patel N, Yopp AC, Singal AG, et al. Diagnostic delays are common among patients with hepatocellular carcinoma. J Natl Compr Canc Netw2015; 13:543–9. doi:10.6004/jnccn.2015.0074•Google Scholar
Park J-W, Chen M, Colombo M, et al. Global patterns of hepatocellular carcinoma management from diagnosis to death: the BRIDGE study. Liver Int2015; 35:2155–66. doi:10.1111/liv.12818•Google Scholar
Stravitz RT, Heuman DM, Chand N, et al. Surveillance for hepatocellular carcinoma in patients with cirrhosis improves outcome. Am J Med2008; 121:119–26. doi:10.1016/j.amjmed.2007.09.020•Google Scholar
Bodzin AS, Busuttil RW. Hepatocellular carcinoma: advances in diagnosis, management, and long term outcome. World J Hepatol2015; 7:1157–67. doi:10.4254/wjh.v7.i9.1157•Google Scholar
Lim J, Singal AG. Surveillance and diagnosis of hepatocellular carcinoma. Clin Liver Dis (Hoboken)2019; 13:2–5. doi:10.1002/cld.761•Google Scholar
European Association For The Study Of The Liver, European Organisation For Research And Treatment Of Cancer. Treatment of C. EASL–EORTC clinical practice guidelines: management of hepatocellular carcinoma. J Hepatol2012; 56:908–43. doi:10.1016/j.jhep.2011.12.001•Google Scholar
Xie DY, Ren ZG, Zhou J, et al. Chinese clinical guidelines for the management of hepatocellular carcinoma: updates and insights. Hepatobiliary Surg Nutr2020; 9:452–63. doi:10.21037/hbsn-20-480•Google Scholar
Gao R, Zhao S, Aishanjiang K, et al. Deep learning for differential diagnosis of malignant hepatic tumors based on multi-phase contrast-enhanced CT and clinical data. J Hematol Oncol2021; 14. doi:10.1186/s13045-021-01167-2•Google Scholar
Liu X, Khalvati F, Namdar K, et al. Can machine learning radiomics provide pre-operative differentiation of combined hepatocellular cholangiocarcinoma from hepatocellular carcinoma and cholangiocarcinoma to inform optimal treatment planning. Eur Radiol2021; 31:244–55. doi:10.1007/s00330-020-07119-7•Google Scholar
Li W, Lv X-Z, Zheng X, et al. Machine learning-based ultrasomics improves the diagnostic performance in differentiating focal nodular hyperplasia and atypical hepatocellular carcinoma. Front Oncol2021; 11. doi:10.3389/fonc.2021.544979•Google Scholar
McCarthy J. What is artificial intelligence. 2007; Google Scholar
Alzubaidi L, Zhang J, Humaidi AJ, et al. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data2021; 8. doi:10.1186/s40537-021-00444-8•Google Scholar
Guresen E, Kayakutlu G. Definition of artificial neural networks with comparison to other networks. Procedia Computer Science2011; 3:426–33. doi:10.1016/j.procs.2010.12.071•Google Scholar
Roberts LR, Sirlin CB, Zaiem F, et al. Imaging for the diagnosis of hepatocellular carcinoma: a systematic review and meta-analysis. Hepatology2018; 67:401–21. doi:10.1002/hep.29487•Google Scholar
Min JH, Lee MW, Park HS, et al. Interobserver variability and diagnostic performance of gadoxetic acid–enhanced MRI for predicting microvascular invasion in hepatocellular carcinoma. Radiology2020; 297:573–81. doi:10.1148/radiol.2020201940•Google Scholar
Covert EC, Fitzpatrick K, Mikell J, et al. Intra- and inter-operator variability in MRI-based manual segmentation of HCC lesions and its impact on dosimetry. EJNMMI Phys2022; 9. doi:10.1186/s40658-022-00515-6•Google Scholar
Shen J, Zhang CJP, Jiang B, et al. Artificial intelligence versus clinicians in disease diagnosis: systematic review. JMIR Med Inform2019; 7. doi:10.2196/10010•Google Scholar
Tiyarattanachai T, Apiparakoon T, Marukatat S, et al. The feasibility to use artificial intelligence to aid detecting focal liver lesions in real-time ultrasound: a preliminary study based on videos. Sci Rep2022; 12. doi:10.1038/s41598-022-11506-z•Google Scholar
Kim HY, Lampertico P, Nam JY, et al. An artificial intelligence model to predict hepatocellular carcinoma risk in Korean and caucasian patients with chronic hepatitis B. J Hepatol2022; 76:311–8. doi:10.1016/j.jhep.2021.09.025•Google Scholar
Papanikolaou N, Matos C, Koh DM, et al. How to develop a meaningful radiomic signature for clinical use in oncologic patients. Cancer Imaging2020; 20. doi:10.1186/s40644-020-00311-4•Google Scholar
van Timmeren JE, Cester D, Tanadini-Lang S, et al. “Radiomics in medical Imaging—“How-to” guide and critical reflection”. Insights Imaging2020; 11. doi:10.1186/s13244-020-00887-2•Google Scholar
Miranda J, Horvat N, Fonseca GM, et al. Current status and future perspectives of radiomics in hepatocellular carcinoma. World J Gastroenterol2023; 29:43–60. doi:10.3748/wjg.v29.i1.43•Google Scholar
Hatt M, Parmar C, Qi J, et al. Machine (deep) learning methods for image processing and radiomics. IEEE Trans Radiat Plasma Med Sci2019; 3:104–8. doi:10.1109/TRPMS.2019.2899538•Google Scholar
Chang Y, Lee J-H. Optimal modalities for HCC surveillance in a high-incidence region. Clin Liver Dis (Hoboken)2020; 16:236–9. doi:10.1002/cld.923•Google Scholar
Tzartzeva K, Obi J, Rich NE, et al. Surveillance imaging and alpha fetoprotein for early detection of hepatocellular carcinoma in patients with cirrhosis: a meta-analysis. Gastroenterology2018; 154:1706–18. doi:10.1053/j.gastro.2018.01.064•Google Scholar
Lencioni R, Piscaglia F, Bolondi L, et al. Contrast-enhanced ultrasound in the diagnosis of hepatocellular carcinoma. J Hepatol2008; 48:848–57. doi:10.1016/j.jhep.2008.02.005•Google Scholar
Samoylova ML, Mehta N, Roberts JP, et al. Predictors of ultrasound failure to detect hepatocellular carcinoma. Liver Transpl2018; 24:1171–7. doi:10.1002/lt.25202•Google Scholar
Galle PR, Forner A, Llovet JM, et al. EASL clinical practice guidelines: management of hepatocellular carcinoma. Journal of Hepatology2018; 69:182–236. doi:10.1016/j.jhep.2018.03.019•Google Scholar
Tsuchiya N, Sawada Y, Endo I, et al. Biomarkers for the early diagnosis of hepatocellular carcinoma. World J Gastroenterol2015; 21:10573–83. doi:10.3748/wjg.v21.i37.10573•Google Scholar
Zhang W, Hou S, Chen Y, et al. Deep learning for approaching hepatocellular carcinoma ultrasound screening dilemma: identification of Α-Fetoprotein-negative hepatocellular carcinoma from focal liver lesion found in high-risk patients. Front Oncol2022; 12. doi:10.3389/fonc.2022.862297•Google Scholar
Andrew G. Howard MZ, Chen B, Kalenichenko D, et al. Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv2017; Google Scholar
Lim JH, Kim MJ, Park CK, et al. Dysplastic nodules in liver cirrhosis: detection with triple phase helical dynamic CT. Br J Radiol2004; 77:911–6. doi:10.1259/bjr/56713551•Google Scholar
Ippolito D, Sironi S, Pozzi M, et al. Perfusion CT in cirrhotic patients with early stage hepatocellular carcinoma: assessment of tumor-related vascularization. Eur J Radiol2010; 73:148–52. doi:10.1016/j.ejrad.2008.10.014•Google Scholar
Marrero JA, Kulik LM, Sirlin CB, et al. Diagnosis, staging, and management of hepatocellular carcinoma: 2018 practice guidance by the American Association for the study of liver diseases. Hepatology2018; 68:723–50. doi:10.1002/hep.29913•Google Scholar
Lee YJ, Lee JM, Lee JS, et al. Hepatocellular carcinoma: diagnostic performance of multidetector CT and MR imaging—a systematic review and meta-analysis. Radiology2015; 275:97–109. doi:10.1148/radiol.14140690•Google Scholar
Wang M, Fu F, Zheng B, et al. Development of an AI system for accurately diagnose hepatocellular carcinoma from computed tomography imaging data. Br J Cancer2021; 125:1111–21. doi:10.1038/s41416-021-01511-w•Google Scholar
Arif-Tiwari H, Kalb B, Chundru S, et al. MRI of hepatocellular carcinoma: an update of current practices. Diagn Interv Radiol2014; 20:209–21. doi:10.5152/dir.2014.13370•Google Scholar
Osho A, Rich NE, Singal AG, et al. Role of imaging in management of hepatocellular carcinoma: surveillance, diagnosis, and treatment response. Hepatoma Res2020; 6. doi:10.20517/2394-5079.2020.42•Google Scholar
Hussain SM, Reinhold C, Mitchell DG, et al. Cirrhosis and lesion characterization at MR imaging. Radiographics2009; 29:1637–52. doi:10.1148/rg.296095508•Google Scholar
Oyama A, Hiraoka Y, Obayashi I, et al. Hepatic tumor classification using texture and Topology analysis of non-contrast-enhanced three-dimensional T1-weighted MR images with a radiomics approach. Sci Rep2019; 9. doi:10.1038/s41598-019-45283-z•Google Scholar
Oestmann PM, Wang CJ, Savic LJ, et al. Deep learning–assisted differentiation of pathologically proven atypical and typical hepatocellular carcinoma (HCC) versus non-HCC on contrast-enhanced MRI of the liver. Eur Radiol2021; 31:4981–90. doi:10.1007/s00330-020-07559-1•Google Scholar
Lee YT, Wang JJ, Zhu Y, et al. Diagnostic criteria and LI-RADS for hepatocellular carcinoma. Clin Liver Dis (Hoboken)2021; 17:409–13. doi:10.1002/cld.1075•Google Scholar
Harrell FE, Califf RM, Pryor DB, et al. Evaluating the yield of medical tests. JAMA1982; 247:2543–6. Google Scholar
Jayakumar S, Sounderajah V, Normahani P, et al. Quality assessment standards in artificial intelligence diagnostic accuracy systematic reviews: a meta-research study. Npj Digit Med2022; 5. doi:10.1038/s41746-021-00544-y•Google Scholar
Cruz Rivera S, Liu X, Chan A-W, et al. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. Lancet Digit Health2020; 2:e549–60. doi:10.1016/S2589-7500(20)30219-3•Google Scholar
Jain A, Patel H, Nagalapatti L, et al. Overview and importance of data quality for machine learning tasks. New York, NY, USA, Association for Computing Machinery2020; doi:10.1145/3394486.3406477•Google Scholar
Diaz O, Kushibar K, Osuala R, et al. Data preparation for artificial intelligence in medical imaging: a comprehensive guide to open-access platforms and tools. Phys Med2021; 83:25–37. doi:10.1016/j.ejmp.2021.02.007•Google Scholar
Papadimitroulas P, Brocki L, Christopher Chung N, et al. Artificial intelligence: deep learning in oncological raOncological Radiomics and challenges of interpretability and data harmonization. Phys Med2021; 83:108–21. doi:10.1016/j.ejmp.2021.03.009•Google Scholar
Fatania K, Mohamud F, Clark A, et al. Intensity standardization of MRI prior to radiomic feature extraction for artificial intelligence research in glioma-a systematic review. Eur Radiol2022; 32:7014–25. doi:10.1007/s00330-022-08807-2•Google Scholar
Althnian A, AlSaeed D, Al-Baity H, et al. Impact of dataset size on classification performance: an empirical evaluation in the medical domain. Applied Sciences2021; 11:796. doi:10.3390/app11020796•Google Scholar
Quinton F, Popoff R, Presles B, et al. A tumour and liver automatic segmentation (ATLAS) dataset on contrast-enhanced magnetic resonance imaging for hepatocellular carcinoma. Data2023; 8:79. doi:10.3390/data8050079•Google Scholar
Calderaro J, Seraphin TP, Luedde T, et al. Artificial intelligence for the prevention and clinical management of hepatocellular carcinoma. Journal of Hepatology2022; 76:1348–61. doi:10.1016/j.jhep.2022.01.014•Google Scholar
Xu Y, Zheng B, Liu X, et al. Improving artificial intelligence pipeline for liver malignancy diagnosis using ultrasound images and video frames. Brief Bioinformatics2023; 24. doi:10.1093/bib/bbac569•Google Scholar
Moawad AW, Morshid A, Khalaf AM, et al. Multimodality annotated hepatocellular carcinoma data set including pre- and post-TACE with imaging Segmentation. Sci Data2023; 10. doi:10.1038/s41597-023-01928-3•Google Scholar
Shiraishi J, Sugimoto K, Moriyasu F, et al. Computer-aided diagnosis for the classification of focal liver lesions by use of contrast-enhanced ultrasonography. Medical Physics2008; 35:1734–46. doi:10.1118/1.2900109•Google Scholar
Streba CT, Ionescu M, Gheonea DI, et al. Contrast-enhanced ultrasonography parameters in neural network diagnosis of liver tumors. World J Gastroenterol2012; 18:4427–34. doi:10.3748/wjg.v18.i32.4427•Google Scholar
Huang Q, Pan F, Li W, et al. Differential diagnosis of atypical hepatocellular carcinoma in contrast-enhanced ultrasound using Spatio-temporal diagnostic semantics. IEEE J Biomed Health Inform2020; 24:2860–9. doi:10.1109/JBHI.2020.2977937•Google Scholar
Tiyarattanachai T, Apiparakoon T, Marukatat S, et al. Development and validation of artificial intelligence to detect and diagnose liver lesions from ultrasound images. PLOS ONE2021; 16. doi:10.1371/journal.pone.0252882•Google Scholar
Virmani J, Kumar V, Kalra N, et al. Neural network ensemble based CAD system for focal liver lesions from B-mode ultrasound. J Digit Imaging2014; 27:520–37. doi:10.1007/s10278-014-9685-0•Google Scholar
Mitrea D, Badea R, Mitrea P, et al. Hepatocellular carcinoma automatic diagnosis within CEUS and B-mode ultrasound images using advanced machine learning methods. Sensors (Basel)2021; 21. doi:10.3390/s21062202•Google Scholar
Weston AD, Korfiatis P, Kline TL, et al. Automated abdominal segmentation of CT scans for body composition analysis using deep learning. Radiology2019; 290:669–79. doi:10.1148/radiol.2018181432•Google Scholar
Nayak A, Baidya Kayal E, Arya M, et al. Computer-aided diagnosis of cirrhosis and hepatocellular carcinoma using multi-phase abdomen CT. Int J Comput Assist Radiol Surg2019; 14:1341–52. doi:10.1007/s11548-019-01991-5•Google Scholar
Yamada A, Oyama K, Fujita S, et al. Dynamic contrast-enhanced computed tomography diagnosis of primary liver cancers using transfer learning of pretrained convolutional neural networks: is registration of Multiphasic images necessary. Int J Comput Assist Radiol Surg2019; 14:1295–301. doi:10.1007/s11548-019-01987-1•Google Scholar
Shi W, Kuang S, Cao S, et al. Deep learning assisted differentiation of hepatocellular carcinoma from focal liver lesions: choice of four-phase and three-phase CT imaging protocol. Abdom Radiol (NY)2020; 45:2688–97. doi:10.1007/s00261-020-02485-8•Google Scholar
Mokrane F-Z, Lu L, Vavasseur A, et al. Radiomics machine-learning signature for diagnosis of hepatocellular carcinoma in cirrhotic patients with indeterminate liver nodules. Eur Radiol2020; 30:558–70. doi:10.1007/s00330-019-06347-w•Google Scholar
Kim DW, Lee G, Kim SY, et al. Deep learning–based algorithm to detect primary hepatic malignancy in multiphase CT of patients at high risk for HCC. Eur Radiol2021; 31:7047–57. doi:10.1007/s00330-021-07803-2•Google Scholar
Khan AA, Narejo GB. Analysis of abdominal computed tomography images for automatic liver cancer diagnosis using image processing algorithm. CMIR2019; 15:972–82. doi:10.2174/1573405615666190716122040•Google Scholar
Xu X, Mao Y, Tang Y, et al. Classification of hepatocellular carcinoma and Intrahepatic cholangiocarcinoma based on radiomic analysis. Comput Math Methods Med2022; 2022. doi:10.1155/2022/5334095•Google Scholar
Rocha BA, Ferreira LC, Vianna LGR, et al. Contrast phase recognition in liver computer tomography using deep learning. Sci Rep2022; 12. doi:10.1038/s41598-022-24485-y•Google Scholar
Duc VT, Chien PC, Huyen LDM, et al. Deep learning model with convolutional neural network for detecting and segmenting hepatocellular carcinoma in CT: A preliminary study. Cureus2022; 14. doi:10.7759/cureus.21347•Google Scholar
Kim J, Min JH, Kim SK, et al. Detection of hepatocellular carcinoma in contrast-enhanced magnetic resonance imaging using deep learning Classifier: a multi-center retrospective study. Sci Rep2020; 10. doi:10.1038/s41598-020-65875-4•Google Scholar
Zheng R, Wang L, Wang C, et al. Feasibility of automatic detection of small hepatocellular carcinoma (≤2 cm) in cirrhotic liver based on pattern matching and deep learning. Phys Med Biol2021; 66:085014. doi:10.1088/1361-6560/abf2f8•Google Scholar
Bousabarah K, Letzen B, Tefera J, et al. Automated detection and delineation of hepatocellular carcinoma on Multiphasic contrast-enhanced MRI using deep learning. Abdom Radiol (NY)2021; 46:216–25. doi:10.1007/s00261-020-02604-5•Google Scholar
Stollmayer R, Budai BK, Tóth A, et al. Diagnosis of focal liver lesions with deep learning-based multi-channel analysis of hepatocyte-specific contrast-enhanced magnetic resonance imaging. World J Gastroenterol2021; 27:5978–88. doi:10.3748/wjg.v27.i35.5978•Google Scholar
Cho Y, Han YE, Kim MJ, et al. Computer-aided hepatocellular carcinoma detection on the hepatobiliary phase of gadoxetic acid-enhanced magnetic resonance imaging using a convolutional neural network: feasibility evaluation with multi-sequence data. Comput Methods Programs Biomed2022; 225:107032. doi:10.1016/j.cmpb.2022.107032•Google Scholar