Document Type : Type B: Systematic review and meta-analysis (with high level of evidence).
Authors
1 Mansoura University
2 University of Szeged, Hungary
Abstract
Keywords
Introduction
It is important to digitize a physical slide and subsequently analyze it to ensure the accurate conversion of tissue or specimen slides into a digital format. This will make the extraction of valuable information possible. The key steps involved in this process are sample preparation, slide scanning, digital image storage, image preprocessing, region of interest selection, image segmentation, feature extraction, image analysis, and data interpretation and sharing[1].
The significance of proper sample preparation cannot be overstated, as the quality of the sample directly influences the accuracy and reliability of the subsequent analysis. This process's essential tasks include staining or labeling the sample to enhance specific features of interest. Once the sample is adequately prepared, slide scanners or digital microscopes are used to capture high-resolution images of the entire slide, so that an advanced image analysis can be conducted. Parameters such as the resolution, magnification, and focal plane must be adjusted carefully before the slide is scanned. The digitized images are then systematically stored in digital image repositories or databases. Effective organization and structured storage of the slides are essential to ensure the images can be easily accessed and retrieved as needed. This step is also necessary to maintain proper data management and archival practices [2].
Image preprocessing enables the application of noise reduction, color correction, and background subtraction measures to enhance the quality and clarity of the images. This step contributes to the accuracy of the analysis by minimizing unwanted variations in the images. Because not all parts of a digitized slide may be relevant to the analysis, so specific regions of interest (ROIs) within each digital image must be defined. These ROIs must then be further refined through image segmentation. Image processing techniques can be used to separate identified ROIs from the background of an image or adjacent tissues. This segmentation is vital for isolating the features of interest that require a detailed analysis. Feature extraction entails that relevant characteristics and information are derived from the segmented ROIs. These features may encompass texture, shape, intensity, and other distinctive attributes. These extracted features serve as the basis for the subsequent analysis.
Once the features have been extracted, the core of the analysis can begin. Depending on the research or diagnostic objectives, researchers or software algorithms can use analysis techniques, such as object detection, classification, quantification, or pattern recognition. This step provides quantitative insights into the samples, aids in identifying abnormalities, and supports classification tasks. The analysis results can then be interpreted within the context of the research or diagnostic goals. Quantitative measurements can be made, anomalies can be identified, and samples can be classified based on predetermined criteria. The analysis results, including measurements, interpretations, and visualizations, should be stored in a structured and well-documented format. Proper documentation is essential for traceability, reproducibility, and future reference. Well-documented records are vital for maintaining scientific rigor and high-quality standards. Sharing of the results with relevant stakeholders, researchers, or healthcare professionals is facilitated to support clinical decision-making, further research, and interdisciplinary collaboration[3,4].
Method
Specific software viewers included in this review are 3DHistech’s PANNORAMIC, Roche’s navify®, Clinical Viewer, and InstantViewer, chosen for their prominence and widespread use in digital pathology. Peer-reviewed articles, technical reports, and comparative studies published within the past five years (from 2018 to 2023) were considered to ensure relevance to contemporary advancements and features. PubMed, IEEE Xplore, Google Scholar, and other academic databases were systematically searched using combinations of keywords such as “3DHistech’s PANORAMIC,” “Roche’s navify®,” “Clinical Viewer,” “InstantViewer, ” “digital pathology software, ” “slide digitization,” “image analysis,” and related terms.
Emphasis was placed on software capabilities related to slide digitization, image processing, storage, image analysis algorithms, user interface, integration with other tools, and any unique features contributing to the digitization process in pathology analysis. The review delved into features encompassing noise reduction, color correction, and background subtraction—measures critical for optimizing image clarity and reducing unwanted variations.
The review assessed the user interface of the software viewers, acknowledging the importance of an intuitive and navigable interface. Evaluating ease of use, accessibility of features, and user interaction dynamics provided insights into the software’s friendliness to pathologists and researchers. Official web pages and technical documentation from the manufacturers of the specified software viewers were accessed to gather detailed information, updates, features, and any relevant case studies or comparisons provided by the companies.
Results and Discussion
Frequently used digital slide formats
The digital slide scanning starts with the glass microscope slide containing a tissue sample or specimen. This slide is placed on the scanner for digitization. The scanner is the hardware device responsible for capturing high-resolution digital images of the microscope slide. It typically consists of a precision stage for moving the slide, a high-quality objective lens, a light source, and a digital camera for image capture. The scanner moves the objective lens across the microscope slide in a grid pattern, capturing multiple overlapping images that are later stitched together to create a seamless whole slide image. Whole slide images are stored in specific file formats like SVS, NDPI, or MRXS, which support high-resolution imaging and annotations. Resolution refers to the level of detail in the digital image. In digital slide scanning, a resolution is measured in terms of dots per inch (DPI) or micrometers per pixel (μm/px). Higher resolution results in more detailed images but also larger file sizes.
Exporting digital slides comes in different formats. Famous examples follow:
Digital slides software viewers
Instead of storing these large image files locally on physical servers or computers, the images are securely stored in the cloud. Cloud storage provides several advantages, including scalability, accessibility from anywhere with an internet connection, and robust data backup and recovery. This remote access allows for telepathology, collaboration, and second opinions, which can be invaluable for medical consultations and research[5]. Cloud-based pathology workflows can integrate with artificial intelligence (AI) algorithms and image analysis tools. These technologies can aid in the automated detection of anomalies, faster diagnoses, and the identification of potential biomarkers. The speed at which digital pathology images can be uploaded, retrieved, and analyzed in the cloud can be affected by latency issues. For real-time consultations and diagnoses, image retrieval or analysis delays can be a drawback[6,7].
Examples for digital slides software viewers are 3dhistech’s PANNORAMIC DESK II DW, PANNORAMIC Midi II, PANNORAMIC SCAN II, PANNORAMIC Confocal, PANNORAMIC 250 Flash III, and PANNORAMIC 1000[8]. To view these slides, SlideViewer is a digital microscopy application specifically designed to support microscope examination processes in bioscience. This application has been available since 2013, with the current release being SlideViewer 2.6. It is offered free of charge and caters to research purposes as a Windows application, compatible with file formats such as mrxs, sys, dicom, ndpi, iSyntax, and tiff. SlideViewer offers instant access and storage options through SlideCenter, be it local or cloud/server-based. It allows integration with AI tools through a Python interface, with optional image analysis tools like QuantCenter. The application also includes features like Marker Counter, 3DView for z-Stack slides, TMAtool (optional), comprehensive annotations, snapshot functionality, slide overview, tracking of inspected FOVs, parallel viewing of up to nine slides, color adjustment, and autofluorescence correction at an extra cost. It supports various input devices and is a valuable resource for research purposes. Other digital slide viewers include navify®, Clinical Viewer, and InstantViewer[9].
Roche’s navify® Digital Pathology revolutionizes pathology laboratory workflow by enhancing efficiency through connectivity and automation. This cloud-based pathology workflow software seamlessly connects technicians and pathologists. Navify® is an iteration of Roche’s uPath enterprise software, initially established in 2019, with continuous improvements and updates through 2022. Navify® Digital Pathology supports external algorithms for pathology decision support and allows for comments and integration with external applications. Its unique characteristics include (a) one-click image analysis with compatibility for uPath whole slide image analysis digital pathology algorithms, (b) scalability through the deployment of additional cloud resources, (c) multi-slide canvas viewing for faster annotation and sign-out and (d) user-centric design with an innovative interface to enhance the overall user experience.
Clinical Viewer is a digital microscopy application crafted to support the histopathological diagnostic workflow and microscope examination processes. This application has been integral to the Pannoramic Pathology Management System since 2020. The application operates on a case-based structure and provides comprehensive annotation features, a snapshot function, slide overview, and preview images. It allows tracking of inspected fields of view (FOVs), parallel viewing of up to nine slides, color adjustment, and autofluorescence correction. Various input devices are supported, making it a valuable tool for clinical and research purposes[10].
InstantViewer is the default application for opening slides within SlideCenter. It is a versatile, multi-platform slide viewer application compatible with Windows, Mac OSX, iOS, LINUX, and Android. InstantViewer is accessible free of charge and primarily intended for research purposes. It operates as a web browser-based tool and is also available as an iPad Viewer app. It is compatible with the mrxs file format and allows instant access to slides hosted on the server or in the cloud. InstantViewer provides annotation features, a snapshot function, slide overview, supporting brightfield (BF) and fluorescence (FL) slides. It ensures that slides are available anytime and on various devices, enhancing accessibility and convenience[11].
Table 1 presents a comparative overview of key digital slide viewers used in the field of pathology and research. This comprehensive breakdown encompasses several critical features that impact their utility in digitizing slides and subsequent analysis. Each viewer’s distinct attributes are highlighted, heightening their functionalities, certifications, platform compatibility, and integration capabilities.
Feature |
Navify® |
Clinical Viewer |
SlideViewer |
InstantViewer |
Primary Purpose |
Pathology, Research |
Clinical, Research |
Research |
Research |
Year of Introduction |
2022 |
2020 |
2013 |
2000 |
CE-IVD (IVDR) Certification |
Yes |
Yes |
No |
No |
Operating System |
Windows |
Windows |
Windows |
Windows, Mac OSX, iOS, LINUX, Android |
Supported File Formats |
Various |
mrxs, sys, dicom, |
mrxs, sys, dicom, ndpi, |
mrxs, ndpi, iSyntax, tiff |
Integration with AI Tools |
Yes |
Yes |
Yes |
No |
Case-Based Structure |
Yes |
Yes |
Yes |
No |
Instant Access and Storage |
Yes, via PPMS |
Yes |
Yes, |
Yes |
Annotation Features |
Yes |
Yes |
Yes |
Yes |
Snapshot Function |
Yes |
Yes |
Yes |
Yes |
Parallel Viewing of Slides |
Yes, with limits |
Yes, with limits |
Yes |
Yes |
Color Adjustment and Autofluorescence |
Yes |
Yes |
Yes |
Yes |
Cloud-Based Storage Option |
Yes |
No |
Yes |
Yes |
Integration with LIS |
Yes |
No |
No |
No |
Multi-Platform Compatibility |
No |
No |
No |
Yes (Windows, Mac OSX, iOS, LINUX, Android) |
Image analysis algorithms and open-source libraries
Image analysis algorithms are used to process and extract information from digital images. These algorithms find applications in various fields, including healthcare, computer vision, remote sensing, and more. Many more specialized algorithms are designed for specific tasks and industries such as object detection algorithms, image segmentation algorithms, feature extraction algorithms, texture analysis algorithms, image registration algorithms, and deep learning algorithms14.
Object detection algorithms examples include:
Feature Extraction algorithms examples include:
Texture Analysis algorithms examples include:
Image Registration algorithms examples include:
Deep Learning algorithms examples include:
Edge Detection algorithms examples include:
Thresholding algorithms examples include:
Not all these algorithms can be operated with a user-friendly interface. Some of them require basic or advanced computational knowledge [12]. However, effort is made to render the required codes available on open-source libraries[13–15]. Examples for these libraries are:
It is worth noting that GUIs are not as suitable for fully automated or batch-processing tasks. Writing scripts or using command-line tools is more efficient in such cases. GUI-based applications can be resource-intensive, consuming more memory and processing power than command-line tools. GUIs for image processing and analysis tools are valuable for their ease of use and accessibility, particularly for inexperienced coding users. However, they may have automation, scalability, and flexibility limitations, making them more suitable for specific use cases. The choice between GUI-based and script-based tools depends on the specific requirements of the task and the user’s expertise [14].
The best example of the intersection between research and practice in digital pathology is the algorithm quantification of Programmed Death-Ligand 1 (PD-L1) expression[16,17]. This involves processing immunohistochemistry stained slides using automated image analysis techniques[18,19]. OpenCV is usually used to perform PD-L1 analysis on a tested image. The following Python code suffices to load an image, convert it to grayscale, apply thresholding to segment the PD-L1-positive regions and find contour[20].
import cv2
import numpy as np
# Load the image
image = cv2.imread('pd-l1_image.png')
# Convert the image to grayscale
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
# Perform thresholding to segment the PD-L1-positive regions
# You may need to adjust the threshold value based on your specific images
_, thresholded = cv2.threshold(gray, 128, 255, cv2.THRESH_BINARY)
# Find contours in the thresholded image
contours, _ = cv2.findContours(thresholded, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
# Iterate through the contours and calculate area (a measure of PD-L1 expression)
pd_l1_areas = []
for contour in contours:
area = cv2.contourArea(contour)
pd_l1_areas.append(area)
# Calculate total PD-L1 expression area
total_pd_l1_area = sum(pd_l1_areas)
# Display the total PD-L1 expression area
print(f"Total PD-L1 expression area: {total_pd_l1_area} square pixels")
The specific steps and libraries used depend on the nature and goals of analysis. The code given below works for other libraries.
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
# Load your gene expression data as a DataFrame
# This is a simplified example; you should replace this with your actual data.
data = pd.read_csv('gene_expression_data.csv')
# Assuming you have a column for PDL1 expression
pdl1_expression = data['PDL1']
# Perform basic statistical analysis
mean_expression = pdl1_expression.mean()
median_expression = pdl1_expression.median()
max_expression = pdl1_expression.max()
min_expression = pdl1_expression.min()
print(f"Mean PDL1 expression: {mean_expression}")
print(f"Median PDL1 expression: {median_expression}")
print(f"Max PDL1 expression: {max_expression}")
print(f"Min PDL1 expression: {min_expression}")
# Visualize the PDL1 expression distribution
sns.histplot(pdl1_expression, kde=True)
plt.xlabel('PDL1 Expression')
plt.ylabel('Frequency')
plt.title('PDL1 Expression Distribution')
plt.show()
Conclusion
Digital slides are large files, which can be challenging to store and transmit. While digital slides can offer high-resolution images, the quality may not always match that of traditional slides. Factors such as scanner quality, slide preparation, and compression can affect the image quality. Managing and backing up large WSIs can be costly and require substantial storage space. Transmitting these files over networks can be slow and may require significant bandwidth. Integrating digital slides into the pathology workflow can be complex. Existing laboratory information systems (LIS) and picture archiving and communication systems (PACS) may need to be updated or replaced to support digital pathology.
Notes: None
Acknowledgements: None
Funding resources: None
Conflict of interest: The authors declare no conflict of interest.
The similarity index (using iThenticate) reads 10 %.
AI plagiarism was checked by https://www.zerogpt.com/
Results: Original (written by humans)
Copyright: HUE.