|J Pathol Inform 2018,
A comprehensive study of telecytology using robotic digital microscope and single Z-stack digital scan for fine-needle aspiration-rapid on-site evaluation
Keluo Yao1, Rulong Shen2, Anil Parwani2, Zaibo Li2
1 Department of Pathology, Michigan Medicine, University of Michigan, Ann Arbor, Michigan, USA
2 Department of Pathology, The Ohio State University, Columbus, Ohio, USA
|Date of Submission||10-Oct-2018|
|Date of Acceptance||14-Nov-2018|
|Date of Web Publication||24-Dec-2018|
Dr. Keluo Yao
Department of Pathology, Michigan Medicine, University of Michigan, North Campus Research Complex, 2800 Plymouth Road, Bldg. 60, Suite 1609, Ann Arbor, MI 48109-5062
Source of Support: None, Conflict of Interest: None
| Abstract|| |
Background: The current technology for remote assessment of fine-needle aspiration-rapid on-site evaluation (FNA-ROSE) is limited. Recent advances may provide solutions. This study compared the performance of VisionTek digital microscope (VDM) (Sakura, Japan) and Hamamatsu NanoZoomer C9600-12 single Z-stack digital scan (SZDS) to conventional light microscopy (CLM) for FNA-ROSE. Methods: We assembled sixty FNA cases from the thyroid (n = 16), lymph node (n = 16), pancreas (n = 9), head and neck (n = 9), salivary gland (n = 5), lung (n = 4), and rectum (n = 1) based on a single institution's routine workflow. One Diff-Quik-stained slide was selected for each case. Two board-certified cytopathologists independently evaluated the cases using VDM, SZDS, and CLM. A “washout” period of at least 14 days was placed between the reviews. The results were categorized into satisfactory versus unsatisfactory for adequacy assessment (AA) and unsatisfactory, benign, atypical, suspicious, and malignant for preliminary diagnosis (PD). Results: For AA, the Cohen's kappa statistics (CKS) scores of intermodality agreement (IMA) were 0.74–0.94 (CLM vs. VDM) and 0.86–1 (CLM vs. SZDS). The discordant rates of IMA were 3.3% (4/120) for VDM versus CLM, and 1.7% (2/120) for SZDS versus CLM. For PD, the CKS scores of IMA ranged 0.7–0.93. The overall discordant rates of IMA were 15% (18/120) for CLM versus VDM and 10.8% (13/120) for CLM versus SZDS. The discordant rates of IMA with 2 or higher degrees were 5.8% (7/120) for CLM versus VDM and 1.7% (2/120) for CLM versus SZDS. The average time spent per slide was 270 s for VDM, significantly longer than that for CLM (113 s) or for SZDS (122 s). Conclusions: Our data demonstrate that both VDM and SZDS are suitable to provide AA and reasonable PD evaluation. VDM, however, has a significantly longer turnaround time and worse diagnostic performance. The study demonstrates both the potentials and challenges of using VDM and SZDS for FNA-ROSE.
Keywords: Adequacy assessment, Cohen's kappa statistics, conventional light microscopy, fine-needle aspiration-rapid on-site evaluation, intermodality agreement, NanoZoomer, preliminary diagnosis, single Z-stack digital scan, Telecytology, VisionTek digital microscope
|How to cite this article:|
Yao K, Shen R, Parwani A, Li Z. A comprehensive study of telecytology using robotic digital microscope and single Z-stack digital scan for fine-needle aspiration-rapid on-site evaluation. J Pathol Inform 2018;9:49
|How to cite this URL:|
Yao K, Shen R, Parwani A, Li Z. A comprehensive study of telecytology using robotic digital microscope and single Z-stack digital scan for fine-needle aspiration-rapid on-site evaluation. J Pathol Inform [serial online] 2018 [cited 2019 Jul 22];9:49. Available from: http://www.jpathinformatics.org/text.asp?2018/9/1/49/248454
| Introduction|| |
Given the increasing utilization of cytopathology as a way of obtaining a biopsy, fine-needle aspiration-rapid onsite evaluation (FNA-ROSE) has become a rate-limiting step in the whole process. A well-executed FNA-ROSE is an important quality control step and can significantly impact the diagnostic quality of the obtained biopsy material. Given the dispersed nature of FNA service, many institutions employ some form of telecytology to facilitate FNA-ROSE. One of the most frequently used platforms is a webcam-based solution such as the NetCam (Olympus, Japan). However, most solutions have poor image quality and do not allow the cytopathologist to control the examination process through “driving.” Robotic digital microscopes and whole-slide images overcome these limitations by offering direct region of interest manipulation through mechanical or electronic means. The image qualities are also higher through digital image process and more robust sensors. Here, we explore the performance of using a robotic digital microscope (VisionTek digital microscope [VDM] M6) and single Z-stack digital scan (SZDS) as a possible substitute for FNA-ROSE.
| Methods|| |
Based on the daily workflow of a single institution and College of American Pathologist guideline, we created a panel of sixty cases from the thyroid (n = 16), lymph node (n = 16), pancreas (n = 9), head and neck (n = 9), salivary gland (n = 5), lung (n = 4), and rectum (n = 1). For the purpose of the study, each case was consisted of a single representative slide prepared with Diff-Quik and the entire panel was composed of diverse set of sites, organs, and original diagnoses. Each case contained a brief clinical history, specimen source, and preparation method. The cases were randomized and distributed in batches of four. All cases had been blindly and independently assessed for preliminary diagnosis (PD) and turnaround time by two board-certified cytopathologists (arbitrarily designated as A and B). The adequacy assessments (AAs) were obtained by categorizing the results into satisfactory versus unsatisfactory. The preliminary diagnoses were categorized into unsatisfactory, benign, atypical, suspicious, and malignant. For example, a result of pleomorphic adenoma would be categorized as satisfactory for AA and benign for PD. Cases with lymphocytes and cannot exclude lymphoma without ancillary studies were categorized as satisfactory for adequacy assessment and atypical for diagnostic evaluation. All statistical data were processed by Microsoft Excel 2016 and analyzed using Python scikit-learn 0.19.2.
Instruments and image acquisition
We used three different types of assessment methods: conventional light microscopy (CLM) with glass slides, VDM M6 with glass slides, and SZDS of glass slides produced by Hamamatsu NanoZoomer C9600-12. For CLM, the cytopathologists used their accustomed microscopes (Olympus BX series). For VDM, we used the manufacturer's software and adjusted the gamma setting of the VDM software to “2” to optimize image quality for Diff-Quik stain per the instrument manufacturer [Figure 1]. Still poorly understood, gamma adjustment of digital images brings out more information by enhancing “contrast.” For the SZDS, we used the NDP.view2 (Hamamatsu Photonics, Japan) software and viewed the digital slides on a standard “office-grade” LCD monitor with 1080p resolution and 24-bit color. A washout period of 2 weeks or more was placed between each method for each cytopathologist.
|Figure 1: Pancreatic adenocarcinoma displayed on VisionTek software under default image quality settings (a). The same region after adjusting gamma to “2” (b)|
Click here to view
| Results|| |
[Table 1] contains all the detailed results from AA and PD between CLM, VDM, and SZDS from the two cytopathologists.
|Table 1: All results from cytopathologists, adequacy assessment, preliminary diagnosis, conventional light microscopy VisionTek digital microscope, and single Z-stack digital scan|
Click here to view
For each case, the adequacy was evaluated into either satisfactory or unsatisfactory category, and the Cohen's kappa statistics (CKS) scores were calculated [Table 2] and [Table 3].
|Table 2: Concordance rates and Cohen's kappa statistics scores for interobserver agreement|
Click here to view
|Table 3: Concordance rate and Cohen's kappa statistics scores for intermodality agreement|
Click here to view
For interobserver agreement (IOA), CKS score for CLM was 0.74 with 4 instances of disagreements (lymph node ×3 and pancreas ×1). CKS score for VDM was 0.58 with 6 instances of disagreements (thyroid ×2, lymph node ×3, and pancreas ×1). CKS score for SZDS was 0.74 with 4 instances of disagreements (thyroid ×1, lymph node ×2, and pancreas ×1).
For intermodality agreement (IMA), cytopathologist A achieved higher CKS scores (0.94 and 1) than cytopathologist B (0.74 and 0.86) for both CLM versus VDM and CLM versus SZDS, respectively. For cytopathologist A, only one instance of disagreement occurred on a lymph node specimen for CLM versus VDM and no disagreement occurred for CLM versus SZDS. Cytopathologist B had three instances of disagreement (thyroid ×2 and lymph node ×1) on CLM versus VDM and two instances of disagreement (thyroid ×1 and lymph node ×1) on CLM versus SZDS.
Preliminary diagnostic evaluation
Preliminary diagnoses were categorized into five categories including unsatisfactory, benign, atypical, suspicious, and malignant. The CKS scores were calculated [Table 2] and [Table 3].
For IOA, CKS score for CLM was 0.67 with 13 disagreements including 2 head and neck specimens (2 malignant vs. atypical), 4 lymph node specimens (1 suspicious vs. malignant and 3 unsatisfactory vs. benign), 2 pancreas specimens (1 malignant vs. suspicious and 1 unsatisfactory vs. benign), 2 salivary gland specimens (2 atypical vs. benign), and 3 thyroid specimens (2 suspicious vs. benign and 1 atypical vs. benign). Two disagreements (2/13) had two or more degrees of discordance. The CKS score for VDM was 0.47 with 22 disagreements including 4 head and neck specimens (1 malignant vs. benign, 2 malignant vs. atypical, and 1 atypical vs. benign), 1 lung specimen (malignant vs. suspicious), 7 lymph node specimens (2 malignant vs. benign, 2 suspicious vs. malignant, 1 atypical vs. unsatisfactory, and 2 benign vs. unsatisfactory), 4 pancreas specimens (1 suspicious vs. benign, 1 malignant vs. atypical, 1 malignant vs. suspicious, and 1 benign vs. unsatisfactory), 2 salivary gland specimens (1 suspicious vs. benign and 1 malignant vs. suspicious), and 4 thyroid specimens (1 suspicious vs. benign, 2 benign vs. unsatisfactory, and 1 atypical vs. benign). Nine disagreements (9/22) had two or more degrees of discordance. The CKS score for SZDS was 0.70 with 12 disagreements including 3 head and neck specimens (1 malignant vs. atypical, 1 malignant vs. suspicious, and 1 suspicious vs. atypical), 2 lymph node specimens (both benign vs. unsatisfactory), 2 pancreas specimens (1 suspicious vs. unsatisfactory and 1 malignant vs. suspicious), 1 salivary gland specimen (suspicious vs. benign), and 4 thyroid specimens (2 suspicious vs. benign, 1 suspicious vs. atypical, and 1 benign vs. unsatisfactory). Five disagreements (5/12) had two or more degrees of discordance.
For intermodal agreement, CKS scores ranged from 0.7 to 0.93. Cytopathologist A had 6 instances (CKS score 0.85) of disagreements for CLM versus VDM including 1 lung specimen (malignant vs. suspicious), 4 lymph node specimens (1 malignant vs. benign, 2 malignant vs. suspicious, and 1 benign vs. unsatisfactory), and 1 pancreas specimen (malignant vs. suspicious). One disagreement had two degrees of discordance. There were 3 instances (CKS score 0.93) of disagreements for CLM versus SZDS including 1 head and neck specimen (malignant vs. suspicious), 1 pancreas case (malignant vs. suspicious), and 1 thyroid specimen (suspicious vs. benign). One disagreement (1/3) had two degrees of discordance. For cytopathologist B, there were 12 instances (CKS score 0.70) of disagreements for CLM versus VDM including 2 head and neck specimens (1 malignant vs. benign and 1 atypical vs. benign), 2 lymph node specimens (1 suspicious vs. benign and 1 atypical vs. unsatisfactory), 2 pancreas specimens (1 malignant vs. benign and 1 malignant vs. atypical), 3 salivary gland specimens (1 malignant vs. suspicious, 1 suspicious vs. atypical, and 1 atypical vs. benign), and 3 thyroid specimens (1 suspicious vs. benign and 2 benign vs. unsatisfactory). Six disagreements (6/12) had two or greater degrees of discordance. There were 10 disagreements (CKS score 0.75) for CLM versus SZDS including 1 head and neck specimen (malignant vs. suspicious), 2 lymph node specimens (1 malignant vs. suspicious and 1 benign vs. unsatisfactory), 2 pancreas specimens (1 suspicious vs. benign and 1 malignant vs. suspicious), 2 salivary gland specimens (1 suspicious vs. atypical and 1 atypical vs. benign), and 3 thyroid specimens (2 suspicious vs. atypical and 1 benign vs. unsatisfactory). Only 1 disagreement (1/10) had two degrees of discordance.
Turnaround time analysis
The average time spent per slide was 270 s for VDM (range: 60–1200 s), 113 s for CLM (range: 60–600 s), and 122 s for SZDS (range: 60–300 s). Statistical analysis demonstrated significant statistical difference (P < 0.05) between the turnaround time from VDM and the time from CLM or SZDS.
| Conclusions|| |
The gold standard for FNA-ROSE involves direct assessment of the cytologic preparation on glass slides by cytopathologists using a microscope. However, due to the increased utilization and dispersion of clinical services, many cytology departments face the pressure of meeting the demand for multiple concurrent FNA-ROSEs at different locations. While solutions such as NetCam or its variant of “webcam-” based system are easy to implement and maintain, in practice, they are unsatisfactory due to the inability to “drive” the slide and poor image quality. Based on our experience, most cytopathologists will only depend on the NetCam solution for procedures such as thyroid AA where a PD is not absolutely essential. In most instances, on-site assessment using a traditional microscope is preferred to render a reliable PD.
Although typically expensive, systems such as the VDM or rapidly scanned digital slides can alleviate the shortcomings of “webcam-” type solutions. Our data show that both solutions under ideal conditions have the potential to be just as accurate as the direct examination of the glass slides under CLM. The solutions achieve this feat by increasing image quality through digital image process, better sensors, and the ability to “drive” the slide. VDM has the advantage of being accepted as a solution for the remotely controlled frozen section at many institutions. Rapid digital slide is also being evaluated as a possible alternative.
Based on our data, it appears that VDM and SZDS each offers unique advantages and disadvantages. Because VDM is fundamentally a remotely controlled microscope looking at glass slides, theoretically it should offer superior image quality and flexibility comparable to CLM. Indeed, even the image quality issue presented by the Diff-Quik stain can be alleviated with the correct gamma setting per manufacturer's recommendation [Figure 1]. In addition, the “Z-stack” option is also present since the ability to adjust focal plane is a part of the control offered by the software interface. However, using the system remotely can be a slightly frustrating experience due to the subjective “lag” feeling caused by the delay between the time the cytopathologist executes a command and the time when it is carried out by the instrument. This “lag” can be appreciated by the vastly different turnaround time between VDM and SZDS.
SZDS suffers less from the “lag” because the images have already been captured/stored on the computer/network. However, a significant downside is that unlike VDM or CLM, the images are not immediately available. Moreover, while the Z-stack option is available through some scanners, it may be impractical to implement in the FNA-ROSE setting as the slides take longer to scan and the storage requirements are higher. SZDS, however, can be rapidly scanned and takes up significantly less storage space than Z-stack scanning. However, without Z-stack, there are concerns for the proper visualization of three-dimensional features prevalent in many cytology specimens. It appears that our data suggest that the lack of Z-stack does not significantly impact diagnostic performance, and this finding has been collaborated in the literature previously. Moreover, the advances in slide scanning technology have improved known issues such as uniformity of plane of focus, data storage requirement, and image quality. The sufficiency of improvement is supported by the fact that a small number of institutions have started to pioneer on-site slide scanning for the frozen section as an alternative for the remotely controlled robotic microscope.
It appears that both VDM and SZDS can potentially produce less reliable preliminary diagnosis. Cytopathology inherently suffers from it due to the use of subjective morphologic features and sampling errors., The problems appear to exacerbate on the VDM when used for diagnostic evaluation compared to CLM and SZDS, even with help from “Z-stack.” The potential culprit includes limitation of the field of view and the persistence of image quality problems despite enhancement by software [Figure 1]. In addition, cytopathologist B is known to have less exposure to digital pathology technologies, which can partially explain the noticeable decrease in his/her IMA when compared to cytopathologist A. Furthermore, the increase of interobserver disagreements with two or higher degrees of discordance with VDM and SZDS compared to CLM suggests that experience in microscopic workflow does not always translate into the same interpretative performance with newer digital modality, a finding that has been previously reported.
Our study, to the best of our knowledge, is the first attempt to have a side-by-side performance comparison between glass slide, robotic microscope, and single Z-stack digital slide format for FNA-ROSE. Even though we carefully controlled the variables by blinding the cytopathologists to the diagnoses and applied adequate “washout” period between the different assessment methods, some “carry-over” memory of the cases was inevitable and could have a confounding impact on the data. In addition, while the one slide per case format fits the need of the study, it does not simulate many FNA-ROSE scenarios where evaluating multiple smear slides is necessary, and therefore, performing telecytology using either technology may be time-consuming and difficult. Nonetheless, our data can serve as a guide for possible improvement in the technology for FNA-ROSE.
Financial support and sponsorship
Conflicts of interest
There are no conflicts of interest.
| References|| |
Silverman JF, Finley JL, O'Brien KF, Dabbs DJ, Park HK, Larkin EW, et al.
Diagnostic accuracy and role of immediate interpretation of fine needle aspiration biopsy specimens from various sites. Acta Cytol 1989;33:791-6.
Pantanowitz L, Sinard JH, Henricks WH, Fatheree LA, Carter AB, Contis L, et al.
Validating whole slide imaging for diagnostic purposes in pathology: Guideline from the college of american pathologists pathology and laboratory quality center. Arch Pathol Lab Med 2013;137:1710-22.
Singnoo J, Finlayson GD. Understanding the gamma adjustment of images. In: Color Imaging Conference. Vol. 2010. San Antonio, Texas: Society for Imaging Science and Technology; 2010. p. 134-9.
Bauer TW, Slaw RJ, McKenney JK, Patil DT. Validation of whole slide imaging for frozen section diagnosis in surgical pathology. J Pathol Inform 2015;6:49.
] [Full text]
Evans AJ, Chetty R, Clarke BA, Croul S, Ghazarian DM, Kiehl TR, et al.
Primary frozen section diagnosis by robotic microscopy and virtual slide telepathology: The university health network experience. Semin Diagn Pathol 2009;26:165-76.
Lin O. Telecytology for rapid on-site evaluation: Current status. J Am Soc Cytopathol 2018;7:1-6.
Wright AM, Smith D, Dhurandhar B, Fairley T, Scheiber-Pacht M, Chakraborty S, et al.
Digital slide imaging in cervicovaginal cytology: A pilot study. Arch Pathol Lab Med 2013;137:618-24.
McKay RR, Baxi VA, Montalto MC. The accuracy of dynamic predictive autofocusing for whole slide imaging. J Pathol Inform 2011;2:38.
] [Full text]
Hall TL, Layfield LJ, Philippe A, Rosenthal DL. Sources of diagnostic error in fine needle aspiration of the thyroid. Cancer 1989;63:718-25.
Ballo MS, Shin HJ, Sneige N. Sources of diagnostic error in the fine-needle aspiration diagnosis of Warthin's tumor and clues to a correct diagnosis. Diagn Cytopathol 1997;17:230-4.
Groen R, Abe K, Yoon HS, Li Z, Shen R, Yoshikawa A, et al.
Application of microscope-based scanning software (Panoptiq) for the interpretation of cervicovaginal cytology specimens. Cancer Cytopathol 2017;125:918-25.
[Table 1], [Table 2], [Table 3]