|J Pathol Inform 2018,
Implementing the DICOM standard for digital pathology
Markus D Herrmann1, David A Clunie2, Andriy Fedorov3, Sean W Doyle1, Steven Pieper4, Veronica Klepeis5, Long P Le5, George L Mutter6, David S Milstone6, Thomas J Schultz7, Ron Kikinis3, Gopal K Kotecha1, David H Hwang6, Katherine P Andriole8, A John Iafrate5, James A Brink9, Giles W Boland10, Keith J Dreyer11, Mark Michalski11, Jeffrey A Golden6, David N Louis5, Jochen K Lennerz5
1 MGH and BWH Center for Clinical Data Science, Boston, MA, USA
2 PixelMed Publishing, LLC, Bangor, PA, USA
3 Department of Radiology, Surgical Planning Laboratory, Brigham and Women's Hospital; Harvard Medical School, Boston, MA, USA
4 Isomics, Inc., Cambridge, MA, USA
5 Harvard Medical School; Department of Pathology, Massachusetts General Hospital, Boston, MA, USA
6 Harvard Medical School; Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA
7 Enterprise Medical Imaging, Massachusetts General Hospital, Boston, MA, USA
8 MGH and BWH Center for Clinical Data Science; Harvard Medical School; Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA
9 Harvard Medical School; Department of Radiology, Massachusetts General Hospital, Boston, MA, USA
10 Harvard Medical School; Department of Radiology, Brigham and Women's Hospital, Boston, MA, USA
11 MGH and BWH Center for Clinical Data Science; Harvard Medical School; Department of Radiology, Massachusetts General Hospital, Boston, MA, USA
|Date of Submission||30-Jul-2018|
|Date of Acceptance||06-Aug-2018|
|Date of Web Publication||02-Nov-2018|
Jochen K Lennerz
Department of Pathology, Center for Integrated Diagnostics, Massachusetts General Hospital, Harvard Medical School, 55 Fruit Street, GRJ 1015, Boston, MA 02114
Source of Support: None, Conflict of Interest: None
| Abstract|| |
Background: Digital Imaging and Communications in Medicine (DICOM®) is the standard for the representation, storage, and communication of medical images and related information. A DICOM file format and communication protocol for pathology have been defined; however, adoption by vendors and in the field is pending. Here, we implemented the essential aspects of the standard and assessed its capabilities and limitations in a multisite, multivendor healthcare network. Methods: We selected relevant DICOM attributes, developed a program that extracts pixel data and pixel-related metadata, integrated patient and specimen-related metadata, populated and encoded DICOM attributes, and stored DICOM files. We generated the files using image data from four vendor-specific image file formats and clinical metadata from two departments with different laboratory information systems. We validated the generated DICOM files using recognized DICOM validation tools and measured encoding, storage, and access efficiency for three image compression methods. Finally, we evaluated storing, querying, and retrieving data over the web using existing DICOM archive software. Results: Whole slide image data can be encoded together with relevant patient and specimen-related metadata as DICOM objects. These objects can be accessed efficiently from files or through RESTful web services using existing software implementations. Performance measurements show that the choice of image compression method has a major impact on data access efficiency. For lossy compression, JPEG achieves the fastest compression/decompression rates. For lossless compression, JPEG-LS significantly outperforms JPEG 2000 with respect to data encoding and decoding speed. Conclusion: Implementation of DICOM allows efficient access to image data as well as associated metadata. By leveraging a wealth of existing infrastructure solutions, the use of DICOM facilitates enterprise integration and data exchange for digital pathology.
Keywords: Computational pathology, DICOMweb, image compression, slide scanning, whole slide imaging
|How to cite this article:|
Herrmann MD, Clunie DA, Fedorov A, Doyle SW, Pieper S, Klepeis V, Le LP, Mutter GL, Milstone DS, Schultz TJ, Kikinis R, Kotecha GK, Hwang DH, Andriole KP, Iafrate A J, Brink JA, Boland GW, Dreyer KJ, Michalski M, Golden JA, Louis DN, Lennerz JK. Implementing the DICOM standard for digital pathology. J Pathol Inform 2018;9:37
|How to cite this URL:|
Herrmann MD, Clunie DA, Fedorov A, Doyle SW, Pieper S, Klepeis V, Le LP, Mutter GL, Milstone DS, Schultz TJ, Kikinis R, Kotecha GK, Hwang DH, Andriole KP, Iafrate A J, Brink JA, Boland GW, Dreyer KJ, Michalski M, Golden JA, Louis DN, Lennerz JK. Implementing the DICOM standard for digital pathology. J Pathol Inform [serial online] 2018 [cited 2022 May 29];9:37. Available from: https://www.jpathinformatics.org/text.asp?2018/9/1/37/244882
| Introduction|| |
Tissue-based diagnostics relies heavily on optical (bright field) microscopy. Digital pathology promises advances over this 150-year-old technology. First, scanned slides may be navigated seamlessly on a display at multiple magnifications for diagnosis and research (“virtual microscopy”). Second, different pathologists may remotely review the slides simultaneously in real time (“telepathology”). Third, images may be analyzed by computer algorithms and the resulting quantitative biomarkers can be integrated with clinical data (“computational pathology”)., Despite these potential benefits, widespread adoption of digital pathology in a clinical setting has not yet been achieved. Regulatory hurdles in the United States can, in part, be held historically accountable; however, the US Federal Drug Administration recently cleared the first whole slide imaging system for marketing based on a recent multisite clinical trial (NCT02699970) that demonstrated noninferiority to optical microscopy across a range of use cases. It is now clear that digitized slides provide an acceptable level of clinical performance when compared with conventional light microscopy.
Whole slide imaging applications extend well beyond interactive viewing on screen. In particular, computer vision and machine learning techniques hold great promise to unlock the potential of digital pathology by extending human capabilities with decision-support tools and automating laborious mechanical tasks.,,,,,,, These tools require machine-readable data in a standardized format–not only for the image pixel data but also for their annotations and associated descriptive metadata. In the current digital pathology landscape, whole slide imaging systems store image data in proprietary file formats. While these systems allow interactive viewing, the proprietary nature of data formats and interfaces create vendor lock-in and impede data access. Open-source and commercial software solutions to read proprietary formats have been developed.,, However, these solutions primarily provide access to image pixel data whereas crucial metadata related to the clinical context (and the acquisition process) remain largely inaccessible. Furthermore, proliferation of competing vendor solutions increases the number of proprietary formats, which represents a barrier for interoperability and maintainability. In other words, there is a compelling need for data standardization in digital pathology to facilitate the clinical integration and to support the computational development streams.,
Digital Imaging and Communications in Medicine (DICOM) is the globally accepted standard for communication and management of a wide range of medical images and related information.,, DICOM further supports encoding, storage, and exchange of image annotations as well as quantitative measurements derived from images. While the standard has been extended to support digital pathology, it has seen little adoption in pathology practice. Specifically, the DICOM standard comprises an extensive set of documents that specify various technical aspects of digital pathology. DICOM addresses primarily information technology experts who have the necessary technical expertise to implement it. In contrast, practicing pathologists without a solid computer science background may not instantly appreciate the value of DICOM data models and communication protocols for their everyday work. This disconnect has resulted in an apparent lack of prioritization of interoperability, and vendors lack a compelling return on investment for building DICOM turn-key solutions.
There are several misperceptions among pathologists about the scope, applicability, and suitability of the DICOM standard. For example, DICOM is often perceived as being only an open file format for storage of image pixel data whereas the metadata integration, communication, and data exchange aspects are often disregarded. Recently, the need to achieve interoperability of whole slide imaging between different systems has been emphasized by the Digital Pathology Association as well as DICOM Working Group 26. These groups (in which many vendors participate) recently met for the first time to evaluate the image data exchange using DICOM representations and protocols. Intellectual property obstacles, which previously hindered implementation by vendors, have been resolved., Vendors now generally embrace the standard and agree on implementation details for improved interoperability. These large-scale efforts need to be supplemented by pilot implementations in pathology departments for various practical reasons: (1) the evaluation of capabilities and limitations requires first-hand experience at the user level (especially by content experts in pathology); (2) the demonstration of compatibility with existing clinical systems (e.g., pathology laboratory information systems [LIS], enterprise-wide Picture Archiving and Communication Systems) requires local, laboratory-based proofs of principle; and (3) reliance on external advice and guidelines cannot replace local stakeholder involvement and active definition of resource requirements. Importantly, adoption of the DICOM standard represents an opportunity for pathology to leverage established enterprise medical imaging infrastructure and software solutions. Ultimately, a common data standard will enable convergence between radiology and pathology for multidisciplinary integrated diagnostics.,,
Based on the compelling need for data standardization and interoperability in digital pathology, we initiated a cross-departmental prospective quality improvement project to implement the DICOM standard for digital pathology and outline resource requirements for implementation. The solutions presented here empower pathologists to gain an appreciation of and enable the assessment of the appropriateness of the DICOM standard for pathology practice. In addition, we demonstrate that existing software solutions developed for radiology can be reused for pathology through conformance with the DICOM standard.
| Methods|| |
Study sites, ethics approval
Two pathology laboratories and a clinical data science center within the authors' tertiary healthcare network served as the study sites. The project is part of a prospective and ongoing interdepartmental clinical quality-improvement initiative (institutional checklist, Human Research Committee, version May 25, 2012). Use of deidentified patient samples was performed under institutional review board approval “Feasibility Assessment of a Standard File Format for Digital Pathology” (IRB: 2018P000082); research was performed in accordance with the Declaration of Helsinki.
The primary goal was to assess whether DICOM is a practical format for digital pathology. We generated DICOM files from available pixel and metadata. For pixel data, we used proprietary image files from four different slide-scanning systems (Aperio CS2, Leica Biosystems, Buffalo Grove, IL, USA; Hamamatsu NanoZoomer S60, Hamamatsu Photonics, Boston, MA, USA; Motic EasyScan Pro6, Motic Microscopy, Richmond, British Columbia, Canada; Philips IntelliSite Ultra Fast Scanner, Eindhoven, The Netherlands). For metadata, we extracted the relevant information from two LISs (CoPathPlus™ 6.1MR1, PowerPath 10.0.1.10, Sunquest Information Systems, Tucson, AZ, USA). Secondary goals were: assessment of performance, evaluation of compatibility with existing DICOM software, assessment of pixel compression, pixel load times, evaluation of querying and retrieving data, and tracking of development efforts.
Selection and encoding of DICOM attributes
DICOM defines application-specific representations of images and their metadata in information object definitions (IODs), which are composed of a set of attributes grouped in modules. DICOM defines some attributes as mandatory, those always required to be present or present under specified conditions, and others as optional, which may be included or omitted at the implementer's discretion. For our main routine surgical pathology use case, we chose to implement 114 attributes (93 required or conditionally required, 21 optional) of the Whole Slide Microscopy Image IOD, defined in DICOM PS3.3 IODs. The attributes we selected are listed in [Supplementary Table 1]. The rules common to all applications for encoding, transmitting, and storing these attributes are defined in other parts of DICOM.,,, DICOM also defines an extensive set of controlled terminology for various applications by reference, when possible, to external lexicons, such as the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The coded values to use for attributes of the Specimen module, which includes both identifying and preparation descriptive attributes, are defined by DICOM in templates and context groups (value sets) in PS3.16. Templates and context groups represent generic mechanisms to describe the information in coded form and are cataloged by their corresponding template identifiers (TIDs) and context group identifiers (CIDs), respectively. We used the standard template TID 8001 (Specimen Preparation) to code items of the Specimen Preparation Step Content Item Sequence (FHIR) attribute and a variety of standard context groups to code different concepts contained by items. To code the Preparation Step Type concept, we used the coded values from context group CID 8111 (Specimen Preparation Procedure) and to code specific preparation steps, we used the following standard context groups: CID 8109 (Specimen Collection Procedure) for the collection step, CID 8110 (Specimen Sampling Procedure) for the sampling step, CID 8112 (Specimen Stains) for the staining step, and CID 8113 (Specimen Preparation Steps) for the processing step. Fixatives and embedding media can be applicable to one or more of the above steps, and we accounted for this variability by including values from CID 8114 (Specimen Fixatives) and CID 8115 (Specimen Embedding Media) as appropriate.
Generation of DICOM files
To create DICOM files, we defined a program that consists of the following steps: (1) extract pixel data and pixel-related metadata from proprietary file formats of different whole slide imaging vendors (see above); (2) obtain patient-and specimen-related identifying and descriptive metadata from the LIS (see above); (3) populate DICOM attributes with the obtained data; (4) encode attributes as DICOM data elements; and (5) store DICOM data sets in files on disk. We implemented this program in Python, using the pydicom Python package (version 1.0.2).
Pixel data and pixel-related metadata were extracted from proprietary image file formats using the open slide-python Python package (version 1.1.1),,, which depends on the OpenSlide C library (version 3.4.1)., The OpenSlide library enables accessing regions of pixel data through a uniform active programming interface (API) that abstracts the specific details and encoding issues of the proprietary pixel data organization and representation. Pixel data were extracted through this interface from the original lossy JPEG compressed images and re-encoded in compressed form to enable comparison of different compression methods. Specifically, we compared JPEG (lossy), JPEG-LS (lossless), and JPEG 2000 (lossless) compression methods. JPEG and JPEG 2000 compressions were performed using the Pillow Python package (version 5.1.0) together with the libjpeg-turbo C library for JPEG (version 1.5.3) and the openjpeg C library for JPEG 2000 (version 2.1.2). The C libraries were installed from source with default compiler flags. A lossy quality factor of 95 was used for JPEG. JPEG-LS compression was performed using the CharPyLS Python package interfacing with the CharLS C++ library.
Validation of DICOM files
It is important to note that the DICOM standard does not specify any testing or validation procedures to assess conformance to the standard. However, software tools have been developed for the validation of DICOM files. We used the dciodvfy tool from the dicom3tools package to validate the generated files in an automated manner. The validation tool checks whether metadata are encoded in conformity to the standard; however, it does not attempt to decode and interpret the pixel data. To ensure that the generated DICOM files fully conform with the latest 2018b edition of the DICOM standard and that compressed pixel data can be decoded and displayed correctly, we further performed an in-depth manual review using a variety of DICOM software libraries including pydicom (version 1.0.2),, GDCM (version 2.8.6),, and DCMTK (version v3.6.3).
To assess the performance of our conversion program, we evaluated the conversion times (in minutes per gigabyte) and relative file sizes by submitting separate jobs for each combination of the compression method and image/metadata file pair to the cluster. We tracked total elapsed execution times using the GNU “time” command and measured file sizes using the Unix “du” command. To assess the performance of DICOM as a file format for pixel data access, we considered the reading of individual frame items (within a DICOM series) as a typical use case; for example, the steps necessary to automate pixel data retrieval for building machine-learning applications. Specifically, we emulated pixel data access using a custom-built Python command line program that: (1) reads and decodes metadata of each DICOM image instance of a given series stored in PS3.10 files on disc, (2) identifies the image instance that contains the pixel data frame item for a given tile position within the image pyramid based on the corresponding DICOM metadata attributes, (3) determines the position of the frame item within the Pixel Data element (using the basic offset table item), (4) reads the frame pixel content into memory, and (5) decompresses the pixel data. To assess pixel data access efficiency, we measured the access times across three DICOM series from different vendor-specific file formats and compared performance between compression algorithms using the pydicom Python package (version 1.0.2), as well as the Python interface of the GDCM C++ library (version 2.8.6).
Network storage and retrieval of DICOM data
Effort tracking and data analysis
To estimate resources for DICOM implementation we prospectively tracked project efforts of personnel using Jira. For visualization of results, we used the Pandas Python package (version 0.22.0) and the Plotly Python package (version 2.5.1). For repeated measures, we provide averages ± standard deviation; statistical significance was defined as P ≤ 0.01.
| Results|| |
In the following, we first present the DICOM data models and describe how the standard represents multi-scale, tiled whole slide image pixel data together with related clinical metadata, including detailed descriptions of pathology laboratory workflows. Then, we report the generation of DICOM files from existing vendor files and assess encoding, storage and access performance using different lossy and lossless compression methods. We demonstrate the query as well as retrieval of image data over the web using DICOM RESTful services (DICOMweb) and report our implementation efforts.
DICOM enables modeling of image pixel data together with clinical metadata
Clinical diagnosis by light microscopy requires integration of image and clinical metadata. At the very least, a pathologist needs a slide label and a patient identifier to uniquely assign a diagnosis to a given patient (currently captured in the LIS) and part-type (typically referring to an anatomical region). We consider the integration of additional imaging-derived (e.g., annotations of the slide findings) and imaging-related information (e.g., tumor stage, biopsy sampling approach) invaluable. This level of integration will also allow for reuse of the clinical data for training and testing of machine learning applications, as well as automated data queries. To account for various use cases, the DICOM standard provides the syntax and semantics to describe both pixel and metadata by defining two data models: (1) the Model of the Real World and (2) the Information Model.
The Model of the Real World defines real-world entities and their relationships [Figure 1]a. The relevant entities for digital pathology are patient, study, series, image, container, and specimen. Importantly, the DICOM model follows the nomenclature of laboratory specimens in HL7, which differs slightly from the terminology commonly used in anatomical pathology practice. For example, a DICOM specimen represents any distinct tissue unit that may be subject to processing in the pathology laboratory (and not necessarily just the surgical specimen). In other words, according to the DICOM nomenclature, parts, blocks, and tissue sections may all represent DICOM specimens (of different types). Likewise, a DICOM container refers to any physical object that holds a specimen such as a jar, a syringe, a cassette, or a glass slide. Specifically, the DICOM container that is relevant in the context of digital pathology is most often the glass slide, which holds the imaged tissue section specimen. A DICOM series typically comprises all the digital images of a tissue section specimen mounted on a single physical glass slide container. A slide tray (paper folder, or multiple folders) with numerous physical slides in the real world would typically map to a DICOM study when digitized. In general, a DICOM study is equivalent to one case with one accession number. It is particularly important to mention the slide as a container because it generally also holds a label (with or without a barcode) for identification of the imaged tissue section specimen [Figure 1]a. There is a great variation between sites (laboratories) as to what information this label includes, ranging from identifiers of the patient, study, specimen, and/or container through human or machine-readable descriptive data such as the patient's name or the stain. In current pathology workflows, different sites and systems variously track studies, containers and/or specimens, depending on local standard operating procedures and naming-conventions. While the DICOM standard contains an informative, nonexhaustive description of various use cases that extend beyond current anatomic pathology practice, the priority for the purpose of managing digital images in the clinic is to track the slide container and link it to other information entities in the Model of the Real World (the patient, study, series, image, or specimen) as necessary.
|Figure 1: Representation of digital pathology information in DICOM. (a) Extension of the DICOM model of the real world for specimens. Diagram showing different DICOM information entities and their hierarchical relationships. Entities are color-coded based on information source. Information about Patient entities (shown in blue) is obtained from the electronic medical record, information about Study, Container, and Specimen entities (shown in orange) is obtained from the laboratory information system, and information about Series and Image entities is provided by the microscope/slide scanning system (shown in green). (b) DICOM information model. The VL Whole Slide Microscopy Image composite information object definition class provides attributes to describe real-world information entities. Related attributes are grouped into modules (visualized as boxes). The Pixel Data attribute, which represents the actual image pixel values, is highlighted in bold. (c) DICOM data set. Attributes of an instance of the VL Whole Slide Microscopy Image class are encoded as data elements. The pixel data element is generally the last element in the dataset (shown in grey). The elements preceding the pixel data element encode metadata that are required for interpretation of the image. These meta data elements are often collectively (but inaccurately) referred to as the “header” (blue, orange, green)|
Click here to view
The DICOM Information Model provides a set of IODs to describe the properties of entities of the Model of the Real World. The model uses object-oriented semantics to define IODs (classes) and attributes for a description of real-world entities in the context of distinct imaging modalities. Importantly, IODs do not map to individual entities one to one, but an IOD includes a variety of attributes that hold information about different entities. Specifically, the Visible Light Whole Slide Microscopy Image IOD provides attributes to describe the actual image as well as the related patient, study, series, image, specimen, and container entities [Figure 1]b. For storage and exchange, IODs are instantiated by populating attributes with actual values and are serialized into data sets by encoding each attribute as a separate data element [Figure 1]c. Each data element is identified in the data set by a tag and data elements are ordered in the data set by their tag's numerical value. The encoded dataset represents data in “unnormalized form” in database parlance, which in practice means that all information (including information about the patient, study, series, specimen, and container) is encoded at the level of the image. While this introduces redundancy (the complete information is entirely repeated in every image instance), it assures that each image can still be identified and interpreted safely in an unambiguous and self-describing manner, even when completely separated from any management system.
DICOM allows encoding of multiscale, tiled whole slide images
A digital slide is typically represented as a multiresolution image pyramid. The scanned image, for example, acquired at 0.25 or 0.5 μm resolution (corresponding to ×40 or ×20 objectives), forms the base (highest resolution) level of the pyramid. Other magnifications (higher levels of the pyramid) are derived computationally by successive down-sampling [Figure 2]a. In DICOM, unlike most proprietary formats, each resolution level is represented as a separate instance of the VL Whole Slide Microscopy Image IOD and therefore encoded as a separate DICOM data set and stored in a separate DICOM file. The two-dimensional array of pixel values that represents the image is referred to as the total pixel matrix [Figure 2]b. For improved access performance and to circumvent limitations on total frame size, each matrix is tiled into smaller, continuous, rectangular, equally sized pixel regions along the row and column dimension [Figure 2]c. Each tile is compressed and encoded as a separate frame [Figure 2]d, each of which is encapsulated as one or more fragments (frame items) within the pixel data element. For example, a digital slide with five magnification levels may be represented in DICOM as a series consisting of five image instances. DICOM provides two options for organizing and encoding the tiled pixel matrix: Full and sparse. Which method is used is indicated by the value of the Dimension Organization Type attribute.
|Figure 2: Encoding of whole slide imaging information in DICOM data set. (a) Schematic of multiresolution whole slide image pyramid. The pyramid base level represents the original image that was acquired by the microscope/slide scanning system. Higher pyramid level images (=lower power) are derived through successive downsampling. The frame of reference for images is the slide coordinate system in millimeter units, where the origin is defined as the lower left corner of the upright standing slide. (b) Total pixel matrix. An image is defined as a continuous, rectangular area of pixels. Shown is the image at ×10 resolution. Note that it is rotated 90° (counterclockwise) relative to the slide coordinate system. (c) Tiled image grid. The pixel matrix may be tiled into smaller rectangular, equally sized regions. Tiles are organized such that the first dimension spans the matrix rows and the second dimension the matrix columns, respectively. (d) Image pyramid encoded as series of VL Whole Slide Microscopy Image instances. Each image instance (=pyramid layer =downsampled magnification) is encoded as a separate DICOM data set. (e) DICOM data set of a multiframe image instance with encapsulated Pixel Data element. Each tile is compressed and encoded as a separate Frame item. Frames are implicitly numbered, based on the order of encoding. The dimensional organization of frames in the real world is described by the dimension index sequence attribute (green), which contains a dimension index pointer and a functional group pointer attribute for each dimension. The values of these attributes point to other attributes in the per-frame functional groups sequence attribute (yellow), which encode the actual values for each frame. The frame content sequence attribute hereby describes the relative position of each frame in the tiled image grid whereas the plane position (slide) sequence attribute describes the absolute position of each frame in the slide coordinate system as well as the total pixel matrix. The byte offset to individual frame items in the pixel data element (grey box labeled 1-6) is specified by the basic offset table item (purple), which is itself part of (the first item of) the encapsulated pixel data element|
Click here to view
In the case of full organization, a frame must exist for every tile of the rectangular total pixel matrix; the order in which the tiles are encoded in the pixel data element is predictable. Specifically, for two-dimensional images, the order is such that frames are encoded first along the row direction and then along the column direction of the pixel matrix. Since the position of frames is defined implicitly, the explicit tile coordinates may be omitted. A recipient may need to re-compute them based on column and row dimensions of the tiles and the total pixel matrix. In the case of sparse organization, tile coordinates and position are required to be explicitly recorded for each tile, not all tiles need to be present, and the frame items can be encoded in the pixel data element in any order. The position of each frame item is encoded in the Per-frame Functional Groups Sequence attribute [Figure 2]e. The full organization was recently added to DICOM as a result of preliminary implementation experience that suggested improvement was needed for the most common use case due to the size of the Per-frame Functional Groups Sequence. We chose to implement the full encoding scheme because it improves access performance by reducing overhead when transmitting metadata over the network as well as loading and parsing it into memory. The DICOM standard thereby enables encoding of multiscale, tiled whole-slide images in a standardized fashion.
DICOM enables encoding of pathology laboratory workflow metadata
Documentation of clinical metadata in pathology is complicated by the fact that the imaging subject is not the patient itself, but tissues obtained from the patient. Before digitization, tissues undergo several preparation steps including one or more sampling steps [Figure 3]a. At first glance, this may appear trivial to pathologists because they receive, sample, subsample, process, and stain hundreds of specimens per day. However, capturing and describing this information digitally in a standardized format is by no means trivial, regardless of whether optical or digital microscopy is used, because workflows differ greatly between laboratories and use cases. Here, we applied the terms part, block, and section to refer to different steps in a typical surgical pathology workflow [Figure 3]a. It is important to note, however, that DICOM does not provide designated attributes for these concepts. Instead, it provides a generic mechanism that enables encoding a variety of different laboratory workflows in a general and flexible manner. For example, based on the DICOM Specimen Preparation template, we defined a set of five preparation steps that apply to the vast majority of our routine surgical pathology use cases: Collection (e.g., surgical procedure), receiving, sampling (e.g., grossing or sectioning), processing (e.g., decalcification), and staining (e.g., trichrome stain) [Figure 3]a. The fixative (e.g., formalin) and embedding medium (e.g., paraffin wax) can be specified as part of the appropriate preparation step. This allows, for example, distinguishing specimens received fresh for frozen sectioning (original frozen section) from subsequent sections after fixation and from those received in formalin. Attributes for a description of patient and study (in a typical workflow, assigned during accessioning of the case) are part of the Patient General Study and Patient Study modules. Attributes for description of containers (e.g., glass slides) as well as specimens (e.g., tissue sections) are part of the Specimen module [Figure 3]b. DICOM requires that preparation steps be coded using a defined coding scheme, such as SNOMED CT, to permit the standardized interpretation of specimen-related information beyond the confines of one department and one LIS. Thereby, DICOM enables the encoding of relevant metadata together with the pixel data.
|Figure 3: Encoding of specimen information in DICOM data set. (a) DICOM model of the real world for pathology laboratory workflow. Top: Hierarchical relationships between the patient (blue), study =case (purple), specimen (yellow), and container (green) information entities. Bottom: Sequence of specimen preparation steps (1-6) for a typical surgical pathology encounter. (b) DICOM data set of image instance with encoded laboratory information. Patient and Study entities are described by attributes of the Patient module (blue) and general study and patient study modules (purple), respectively. Specimen and container entities are described by attributes of the Specimen module (yellow). The container used for imaging (glass slide) is described at the root dataset level whereas imaged specimens (tissue sections) are described by the nested specimen description sequence attribute. Laboratory procedures that were performed to prepare a specimen for image acquisition are described by the specimen preparation sequence attribute, which includes a specimen preparation step content item sequence attribute for each performed procedure. The description of a content item is based on the Specimen Preparation template, which provides concepts to encode a procedure as name-value pairs using a specified coding scheme, such as SNOMED CT. Mandatory concepts are the specimen identifier, which describes the identifier of the processed specimen, and processing type, which describes the kind of procedure that was performed. Additional concepts may be included depending on the value of the Processing Type concept. In case of a sampling preparation step (highlighted in red , compare sequence step 3 in a), Sampling method, Parent Specimen Identifier and Parent Specimen Type concepts are included. (c) Schematic of a current pathology report. Top: Identifiable information about patient (blue) and accession (purple), which map to attributes of the patient and general study module. Bottom: Final diagnosis and gross description for each specimen of type “part.” Identifiable information about specimen of type “section” is often not included in the report (i.e., final diagnoses generally lack complete slide-level annotations). SNOMED CT: Systematized Nomenclature of Medicine Clinical Terms|
Click here to view
The level of detail described by DICOM attributes of the Specimen module is generally not reflected in the pathology report: Specifically, a surgical pathology report contains sufficient information to uniquely identify the patient (name, sex, date of birth, medical record number, etc.) and the study (accession number, accession date, etc.) and it provides a gross (and sometimes a histological) description of the parts [Figure 3]c. When viewing the images to render a diagnosis and to create the report, DICOM metadata are crucial to the display software for visualizing the images correctly and providing related information to the pathologist who needs to interpret the image in the appropriate clinical context. The identifiers for the glass slide (container) and the tissue section (specimen) are generally not explicitly provided in current pathology reports, which makes it difficult to map the free text description in the report to the corresponding digital image and the spatial position within the image. In the future, there is considerable opportunity for improvement of such reports, particularly when they are encoded as synoptic (structured) or multimedia reports, at which time inclusion of links and hyperlinks to individual slides, images, and annotations become feasible in an integrated digital imaging environment. The level of detail in the DICOM attributes is also critically important for machine learning workflows, whether it be for the simple task of selecting the appropriate images to route for digital processing, or for more complex parameterization of algorithm behavior. Regardless of the downstream use case, implementing DICOM effectively requires integrating relevant data across different information sources into one data model [Figure 3]c.
DICOM data sets can be generated from existing vendor files
Only very recently have commercially available whole slide imaging systems started to produce images encoded according to the DICOM standard., In the interim, to support the installed based of scanners – only capable of producing proprietary format images, commercial and open-source implementers have begun to develop conversion tools., To evaluate the capabilities and limitations of DICOM for the representation of digital pathology whole slide images, we implemented a process to generate DICOM PS3.10 files based on existing vendor-specific file formats and validated their integrity and standard conformance using established automated validation tools as well as manual expert review as described in the Methods section.
DICOM provides efficient lossy and lossless compression methods
Whole slide imaging data sets are generally relatively large (GB range for individual slides) compared to other medical imaging applications and are therefore usually stored in compressed form. DICOM supports three image compression schemes relevant for visible light microscopy images: JPEG, JPEG-LS, and JPEG 2000. Most systems routinely apply a baseline lossy JPEG compression algorithm, before storing the images in their proprietary format. This allows for reasonable file sizes suitable for rapid access balanced against sufficient image quality for most current diagnostic tasks. Some systems can be configured to allow the user to select alternative compression schemes, including lossy forms of JPEG 2000 and other proprietary formats.
We measured the influence of all three compression schemes supported by DICOM on overall relative data size [Figure 4], left x-axis] and encoding times for VL Whole Slide Microscopy Image instances [Figure 4], right x-axis] generated from original lossy compressed vendor images. Recompressing previously lossy compressed images potentially biases performance measurements; repeated lossy compression also degrades image quality and should not be used in production. Recompression can be avoided by copying the binary pixel data “blobs” directly from original files into DICOM data sets without decoding and re-encoding. However, the recompression approach was necessary for our comparison of different methods since original uncompressed (or lossless compressed images) were not available. The resulting data size after DICOM conversion depends on many parameters including tile size, number of resolution levels, the compression method and in the case of lossy compression, the quality factor or target bitrate. When the same compression method, tile size and number of resolution levels are chosen for DICOM encoding or when the JPEG blocks are simply moved from the TIFF container to the DICOM datasets, the resulting data size will be almost identical [Figure 4], left axis, top]. The conversion time is primarily a factor of the data size and the compression method. As expected, the lossy baseline JPEG method has the overall smallest data size and lossless JPEG-LS and JPEG 2000 compression results in larger files [Figure 4], left x-axis]. Notably, the JPEG-LS method compresses images more efficiently when compared to JPEG 2000 (6.85 ± 2.17 versus 9.90 ± 3.14 min/GB; P < 0.000001, t-test). The baseline JPEG method also has the shortest encoding times [[Figure 4], right x-axis]. Interestingly, JPEG-LS encoding is almost as fast as baseline JPEG (37.03 ± 19.86 min/GB versus 26.58 ± 19.20 min/GB; P > 0.01, t-test) and orders of magnitude faster than JPEG 2000 (37.03 ± 19.86 min/GB versus 405.56 ± 239.52 min/GB; P < 10 − 17, t-test) [Figure 4]. The observed performance gains with JPEG-LS over JPEG 2000 are not surprising given the low complexity of the underlying algorithm and are consistent with the performance of the same schemes on digital photographs.
|Figure 4: Measurements for encoding and storing DICOM data sets in PS3.10 files. Relative file sizes and encoding times for different vendor-specific formats and image compression methods. A set of 16 vendor-specific whole slide image files (4 files per vendor) were converted to DICOM files using lossy (JPEG with quality set to 95) and lossless (JPEG-LS and JPEG 2000) image compression methods. Left: Size of generated DICOM files with respect to the size of the corresponding original image files. The lossy JPEG size is shown to exemplify the difference between lossy and lossless schemes. Right: Program execution times for reading original files, encoding the contained information as DICOM data sets and writing generated data sets to DICOM files on disk. Execution times were normalized with respect to original file sizes for comparison between digital slides. Shown are averages (bars) and standard deviation (error bars) of measurements|
Click here to view
DICOM enables efficient frame-level data access from local files
Most digital pathology applications require efficient access to subsets of pixel data without having to load the entire data set into memory. We evaluated existing, actively maintained software libraries for retrieving individual frames from the DICOM PS3.10 files that we previously generated. As a use case, we considered a machine-learning program that requires the loading of specific frames from files on disk into memory [Figure 5]a. Specifically, in contrast to reading all pixel data into memory, our program reads and interprets the image metadata to locate and to load only relevant subsets of the pixel data. Since many machine-learning algorithms are often orchestrated using the Python programming language, we explored software libraries that provide Python interfaces for reading of metadata as well as pixel data of VL Whole Slide Microscopy Image SOP instances stored in DICOM files. We identified the four libraries, which support loading of JPEG, JPEG-LS, and JPEG 2000 compressed multiframe DICOM images in encapsulated format: GDCM, ITK, SimpleITK, and pydicom, [Figure 5]b. However, only two of the four libraries [Figure 5]b expose interfaces for fetching individual frames without loading the entire pixel data element into memory. We measured the time to retrieve an individual frame for a given position in the slide coordinate system and resolution level using the pydicom library and compared retrieval times between compression methods [Figure 5]c. Our results show that JPEG (average: 0.29 ± 0.02s; P < 0.0001, t-test) and JPEG-LS (average: 0.37 ± 0.05s; P < 0.0002, t-test) allow significantly faster pixel data access when compared to JPEG 2000 (average: 0.65 ± 0.09s). We consider these observations good evidence that DICOM enables efficient retrieval of pixel data from files (e.g., for machine learning applications).
|Figure 5: Retrieving pixel data from DICOM data sets stored in PS3.10 files on disk. (a) Sequence of steps required for loading an individual frame of a multiframe DICOM image from a file on disk into a pixel matrix in memory. First, the file that contains the frame of interest is identified by reading and interpreting metadata (“DICOM header”) of each file. Second, the relative position of the frame within the image is identified using the available metadata. Third, the absolute position of the frame is determined within the encapsulated pixel data element. Fourth, the binary content of the frame item is read into memory. Lastly, the pixel data are decompressed. (b) Software library support for reading images from DICOM files. Selected software libraries available for the Python programming language and their level of support for reading multiframe DICOM images. Only a subset of libraries provides the functionality (check icon) for selectively loading an individual frame into memory. (c) Frame access efficiency for different image compression methods. Shown is the average time (±standard deviation) it takes to identify, read, and decompress an individual frame of a multi-frame DICOM image using an implementation of the algorithm (shown in a) in the pydicom software library|
Click here to view
DICOMweb facilitates remote frame-level data access
Click here to view
Click here to view
DICOM standard can be implemented with reasonable efforts
To obtain a real-world estimate of the time, skills, and budget necessary to implement DICOM for digital pathology, we prospectively tracked our implementation efforts. At time of submission of the manuscript, our efforts encompassed 120 h of documentation review, 273 h of software/algorithm development (including code review, writing documentation and developing tests), and 86 h of consultation with content experts. These numbers will equate to 0.8 full-time equivalents (FTE) for a 120-day period or 0.4 FTE over 1 year if one assumes that person can focus 100% on the DICOM implementation. With respect to the required computational expertise, in our case, the lead computational physician-scientist (MDH) had 4 years of in-depth coding experience in image applications. A project budget estimate accounting for the personnel salary USD $95,687 (expressed as fraction of average salary from publicly available resources for the FTE fraction described), plus computational resources (USD $28,434) amounts to roughly $125,000. Since we had direct access to DICOM experts and received support from the open-source DICOM community, the above estimates may differ in other settings. We provide these data as a reasonable point of reference for similar efforts.
| Discussion|| |
We implemented a pilot of the DICOM standard for whole slide imaging for digital pathology in a multisite, multivendor healthcare network setting. Since a reference implementation is currently lacking, we generated valid DICOM files by combining pixel data from vendor-specific file formats with clinical metadata from LISs. We used the generated data sets to test storage as well as access performance and evaluated overall practicability, placing an emphasis on compatibility with existing software libraries, and available archives.
The complexity of the DICOM standard is initially daunting and its adoption poses several challenges. A common misperception among pathologists is that DICOM is merely yet another file format, and there is uncertainty about what benefits and drawbacks adoption of the standard would entail. We hope we have convinced the reader that, although the standard does include a file format, its scope is much broader than the typical proprietary file format and includes standardized communication of image data and a rich and extensible model describing related identifying and descriptive information between devices and over network. Another misperception is that the standard is static and unchangeable. DICOM has proven to be a living standard that has been continuously adapting to technological change. Most notably, DICOMweb services and the DICOM JSON model have significantly improved data accessibility over the web. One can now understand DICOM as a set of URI templates for RESTful API endpoints and standardized JSON schemas for the corresponding resources. Crucially, these are defined in a manner consistent with the use of DICOM for all other imaging specialties. Our results demonstrate that once implemented, DICOM enables data access and exchange in a practical and vendor-neutral manner. We further show that, by relying on DICOM, one can leverage available medical imaging infrastructure and software systems to store, search, and retrieve whole slide imaging data efficiently.
Despite the obvious advantages of DICOM with respect to interoperability and enterprise integration,,, challenges remain. In the following, we discuss some key open questions that need to be addressed.
First, large whole slide imaging data sets complicate storage and network transmission. DICOM was originally designed for radiology image data sets more than a quarter of a century ago. At the time, data sets were orders of magnitude smaller than whole slide images in pathology. The total length of the compressed Pixel Data element itself is theoretically unlimited. The number of frames (tiles) is somewhat arbitrarily limited to 4 gigabytes; however, this is generally more than sufficient. An optional Basic Offset Table allows the sender to encode an index of the physical byte offsets to the positions of individual frames within the Pixel Data element. In other words, the receiver does not have to build its own index of the frames. However, currently, only 32-bit integers are used in the Basic Offset Table, which means that images larger than ~4 gigabytes have to omit it. There is no need to include the Basic Offset Table for a server since it can easily build its own index to satisfy frame-level retrieval requests; however, it is a significant convenience when attempting to retrieve selected frames from a PS3.10 file stored on disk. Given the typical size of pixel data in pathology, this size limit can be easily exceeded, in particular when lossless rather than lossy compression methods are applied [Figure 4], left axis]. A simple extension of the DICOM standard is in progress that addresses this Basic Offset Table weakness and will allow for an Extended Offset Table with 64-bit pointers as well as length information. That said, the information will remain optional and recipients will need to be prepared to rebuild their own index if necessary.
Concatenations are a DICOM feature that provides a workaround for size-related issues. Specifically, concatenations allow splitting a multiframe DICOM data set into several smaller parts. For example, with a pixel size threshold at 100MB, the number of concatenation parts in our series ranged from 11 to 46. Concatenations offer several other advantages, such as storing large DICOM objects over the web. Unfortunately, existing DICOM archives don't interpret concatenations they receive as a single image instance for the purpose of query and frame-level retrieval. Since each concatenation part still represents a valid DICOM object, archives can handle storage and retrieval of instances that are parts of concatenations; however, this places the burden on client implementations, which need to understand concatenations to retrieve the appropriate frames. A new generation of archive implementations that understand and reassemble concatenations would be highly desirable.
Second, vendors currently do not store files in DICOM format. Converting images from existing vendor proprietary file formats introduces overhead, requires constant maintenance to track variations as well as generate updates for each new model/vendor, and important information about the image acquisition process may be lost. For example, the orientation of images and the absolute position of tiles relative to the slide-based coordinate system are generally not specified in proprietary file formats and need to be assumed. Obviously whole slide scanning systems should store data directly in DICOM format and do so efficiently such that DICOM generation does not become a rate-limiting and error-prone step. We embrace ongoing efforts by whole slide scanner vendors to implement DICOM, and we anticipate that adoption of DICOM will advance digital pathology by providing standardized access to pixel data together with clinical metadata.
Third, choosing the optimal image compression method represents a challenge. Our comparison of the three image compression methods currently supported by DICOM highlights the importance of the choice of compression algorithm for data encoding and decoding performance. Currently, vendors store images in lossy compressed format, sometimes using proprietary compression schemes. Lossless compression would maximize image quality and circumvent potential issues related to recompression; however, its benefits may not justify the considerably larger data size. Our performance measurements indicate that JPEG-LS is considerably faster than JPEG 2000 for lossless compression and decompression, at speeds comparable to lossy, baseline JPEG. Since potential consequences of lossy image compression for machine learning applications are yet unexplored, JPEG-LS may present a potential solution for storing and accessing images efficiently without losing information. However, there is limited support for JPEG-LS in existing enterprise-level or radiology DICOM archives and the appropriate color space transformation to use for lossless schemes like JPEG-LS remains an open question.
Fourth, we emphasize the importance of integrating imaging and information systems. Digital pathology involves more than digitizing slides. It requires the integration of image pixel data with clinical metadata to be useful in pathology practice. Current anatomic pathology LISs are generally designed around graphical user interfaces and optimized toward reporting and billing rather than digital pathology workflows. As a result, these systems often do not expose an application programming interface (API) for programmatic communication of image-related information between systems. Creation of custom middleware represents a solution;, however, integration with imaging systems remains challenging because relevant details may not even be tracked by the information system in structured form. In particular, information about specimen preparation steps is generally not readily available. Interpretation of acquired images in clinical context demands substantially different approaches to tracking specimen-related information, which are less case-centric and place more emphasis on the imaged tissue sections. There are various alternative standards routinely used for communicating information from departmental information systems; to DICOM acquisition devices and clinical laboratory devices, specifically DICOM Modality Worklist (MWL) and various HL7 version 2 messages. Existing profiles for anatomical pathology and laboratory workflows are under investigation by the Integrating the Healthcare Enterprise (IHE) Pathology and Laboratory Medicine (PaLM) group in collaboration with DICOM Working Group 26, for whom standardization of the LIS to scanner interface is a high priority. However, mapping proprietary LIS internal data structures to standard HL7 specimen representations or the DICOM Specimen Preparation template is nontrivial and currently may require customization of the HL7 message triggers, segments, and fields (including private segments) with the vendor, or use of third-party middleware solutions for extraction and transformation of the data. The IHE PaLM group has recognized that the underlying HL7 V2 messages and Domain Analysis Model (DAM) for histopathology and slide specimen description have gaps and need mapping to the model that was defined in DICOM Supplements 122 and 145, and are in the process of submitting a Project Scope Statement from the Orders and Observations Work Group to fill these gaps. The earlier effort by the predecessor of IHE PaLM, IHE Anatomical Pathology, which defined a DICOM MWL-based Anatomical Pathology Workflow profile for communication between anatomic pathology AP-LIS and scanners, has not been adopted. We hope that the next generation of LIS will implement any new HL7 V2 messages and future IHE PaLM technical framework(s) to provide standard-compliant interfaces for data query and retrieval. In the interim, it would be desirable if LIS would use customized HL7 messages and fields to allow automated query and retrieval of specimen information, rather than relying on completely proprietary mechanisms.
Fifth, the operational challenges of incorporating DICOM for digital pathology should not be underestimated. One major difference between implementing whole slide imaging and other technologies into routine practice is the need for expert IT involvement (to establish the necessary IT infrastructure, storage, and personnel). Regardless of the selected storage format, any implementation of digital pathology requires storage of vast amounts of data. If the same compression scheme and parameters are used, a DICOM file containing a multiframe image with encapsulated pixel data element is similar in total size to a TIFF or proprietary format file. The DICOM file serves as a container for compressed pixel data blobs (frame items) and the additional metadata is minimal in size by comparison. Some optimizations may be lost (such as factoring out JPEG tables and only sending them once, which TIFF allows for but DICOM does not); however, these have minimal effect. There are exceptions though, for example, significant size expansion may occur when images are lossy compressed with a proprietary scheme by the scanner vendor and can only be recompressed without further loss using a standard lossless scheme. Accordingly, scanner vendors should be strongly discouraged from using proprietary schemes and only use standard schemes supported by DICOM such as JPEG, JPEG-LS and JPEG 2000. Thus, there is normally no storage size penalty for using DICOM.
The storage requirements remain enormous. For example, the lead authors' clinical facilities are characterized by ~185,000 cases/year with a slide output of 3800–5800/day, which equates to approximately 1,420,000 slides per year. Assuming lossy compressed data storage (e.g., 1–3 GB per slide), we will require ~4.3 petabytes storage space per year. We have established IT divisions and a center for clinical data science with extensive experience in enterprise medical imaging (20+ years); however, digital pathology images are on average 10 times the size of radiology images and will require significantly more resources for image management. One should not overestimate the impact of this factor, however, since radiology too has been faced with periodic challenges involving nontrivial leaps in data volume, such as thin slice CT and breast tomosynthesis; one just needs to be suitably prepared. By relying on DICOM as a data format for storage and exchange of whole slide images, healthcare enterprises can leverage a wealth of already existing medical imaging infrastructure representing billions of dollars of investments and build on the extensive DICOM expertise of IT specialists in hospitals and companies around the world.
Finally, clearly defining the “added clinical value” remains as the key challenge of implementing digital pathology into routine clinical practice. The core vision of digital pathology is that availability of pixel and metadata can unlock the full potential of integrated histopathological data analytics. We foresee that decision-support tools and machine-learning applications will facilitate adoption; however, these solutions require interoperability of systems and effective data access. Once completely realized, the advantages of digital pathology will be clear and fundamentally change how pathologists work. Researchers, practitioners, and vendors are now tasked to collectively navigate the transition from theoretical availability towards a meaningful return on investment. An enterprise-level communication standard (i.e., DICOM) is an essential component of streamlining data workflows to ultimately improve patient care.
The authors would like to thank A. Smith, N. Jones, B. R. Friedman, L. Chen, and P. H. Norwood for expert technical assistance. We thank C. Ayala, J. Floyd, S. Shafieha, and H. I. Paik for help with LIS access and E. Ziegler for technical discussions. We also thank N. Tenenholtz and R. W. Larson for project management as well as K. Burke and J. Reyes for administrative support. We further thank Enterprise Research Infrastructure and Services at Partners Healthcare for their support and for the provision of the ERISOne cluster. A special thanks is extended to J. Gilbertson and J. Kalpathy-Cramer for advice and for being awesome.
Financial support and sponsorship
This work was funded in part by the NIH (RO1 CA225655) to J.K.L, NIH (U24 CA180918, P41 EB015898) to A.Y.F., NIH (P41 EB015902, U24 CA180918, U24 CA199460) to S.P, and the content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Health or any other organization.
Conflicts of interest
There are no conflicts of interest.
| References|| |
Schultz M. Rudolf Virchow. Emerg Infect Dis2008;14:1480-1.
Weinstein RS, Graham AR, Richter LC, Barker GP, Krupinski EA, Lopez AM, et al.
Overview of telepathology, virtual microscopy, and whole slide imaging: Prospects for the future. Hum Pathol 2009;40:1057-69.
Pantanowitz L, Dickinson K, Evans AJ, Hassell LA, Henricks WH, Lennerz JK, et al.
American telemedicine association clinical guidelines for telepathology. J Pathol Inform 2014;5:39.
Louis DN, Feldman M, Carter AB, Dighe AS, Pfeifer JD, Bry L, et al.
Computational pathology: A path ahead. Arch Pathol Lab Med 2016;140:41-50.
Louis DN, Gerber GK, Baron JM, Bry L, Dighe AS, Getz G, et al.
Computational pathology: An emerging definition. Arch Pathol Lab Med 2014;138:1133-8.
Abels E, Pantanowitz L. Current state of the regulatory trajectory for whole slide imaging devices in the USA. J Pathol Inform 2017;8:23.
] [Full text]
Mukhopadhyay S, Feldman MD, Abels E, Ashfaq R, Beltaifa S, Cacciabeve NG, et al.
Whole slide imaging versus microscopy for primary diagnosis in surgical pathology: A multicenter blinded randomized noninferiority study of 1992 cases (Pivotal study). Am J Surg Pathol 2018;42:39-52.
Ehteshami Bejnordi B, Veta M, Johannes van Diest P, van Ginneken B, Karssemeijer N, Litjens G, et al.
Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 2017;318:2199-210.
Gheisari S, Catchpoole DR, Charlton A, Kennedy PJ. Convolutional deep belief network with feature encoding for classification of neuroblastoma histological images. J Pathol Inform 2018;9:17.
] [Full text]
Golden JA. Deep learning algorithms for detection of lymph node metastases from breast cancer: Helping artificial intelligence be seen. JAMA 2017;318:2184-6.
Granter SR, Beck AH, Papke DJ Jr. Straw men, deep learning, and the future of the human microscopist: Response to “Artificial intelligence and the pathologist: Future frenemies?” Arch Pathol Lab Med 2017;141:624.
Litjens G, Bandi P, Ehteshami Bejnordi B, Geessink O, Balkenhol M, Bult P, et al
. 1399 H&E-stained sentinel lymph node sections of breast cancer patients: The CAMELYON dataset. Gigascience 2018;7. doi: 10.1093/gigascience/giy065.
Moskowitz CS. Using free-response receiver operating characteristic curves to assess the accuracy of machine diagnosis of cancer. JAMA 2017;318:2250-1.
Sharma G, Carter A. Artificial intelligence and the pathologist: Future frenemies? Arch Pathol Lab Med 2017;141:622-3.
van Smeden M, Van Calster B, Groenwold RH. Machine learning compared with pathologist assessment. JAMA 2018;319:1725-6.
Clunie DA, Dennison DK, Cram D, Persons KR, Bronkalla MD, Primo HR, et al.
Technical challenges of enterprise imaging: HIMSS-SIIM collaborative white paper. J Digit Imaging 2016;29:583-614.
Garcia-Rojo M, Sanchez A, Bueno G, De Mena D. Standardization of pathology whole slide images according to DICOM 145 supplement and storage in PACs. Diagn Pathol 2016;8:175.
Goode A, Gilbert B, Harkes J, Jukic D, Satyanarayanan M. OpenSlide: A vendor-neutral software foundation for digital pathology. J Pathol Inform 2013;4:27.
] [Full text]
Marques Godinho T, Lebre R, Silva LB, Costa C. An efficient architecture to support digital pathology in standard medical imaging repositories. J Biomed Inform 2017;71:190-7.
Balis UJ. Digital imaging standards and system interoperability. Clin Lab Med 1997;17:315-22.
Roth CJ, Lannum LM, Joseph CL. Enterprise imaging governance: HIMSS-SIIM collaborative white paper. J Digit Imaging 2016;29:539-46.
Roth CJ, Lannum LM, Persons KR. A foundation for enterprise imaging: HIMSS-SIIM collaborative white paper. J Digit Imaging 2016;29:530-8.
Larobina M, Murino L. Medical image file formats. J Digit Imaging 2014;27:200-6.
Fedorov A, Clunie D, Ulrich E, Bauer C, Wahle A, Brown B, et al.
DICOM for quantitative imaging biomarker development: A standards based approach to sharing clinical data and structured PET/CT analysis results in head and neck cancer research. PeerJ 2016;4:e2057.
Singh R, Chubb L, Pantanowitz L, Parwani A. Standardization in digital pathology: Supplement 145 of the DICOM standards. J Pathol Inform 2011;2:23.
] [Full text]
Clunie D, Hosseinzadeh D, Wintell M, De Mena D, Lajara N, Garcia-Rojo M, et al.
Digital imaging and communications in medicine whole slide imaging connectathon at digital pathology association pathology visions 2017. J Pathol Inform 2018;9:6.
Cucoranu IC, Parwani AV, Vepa S, Weinstein RS, Pantanowitz L. Digital pathology: A systematic evaluation of the patent landscape. J Pathol Inform 2014;5:16.
] [Full text]
Lundström CF, Gilmore HL, Ros PR. Integrated diagnostics: The computational revolution catalyzing cross-disciplinary practices in radiology, pathology, and genomics. Radiology 2017;285:12-5.
Sorace J, Aberle DR, Elimam D, Lawvere S, Tawfik O, Wallace WD, et al.
Integrating pathology and radiology disciplines: An emerging opportunity? BMC Med 2012;10:100.
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.4-Service Class Specifications. Report No.: PS3.4. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 04.pdf
. [Last accessed on 2018 Jun 23].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.5 - Data Structures and Encoding. Report No.: PS3.5. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 05.pdf
. [Last accessed on 2018 Jun 23].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.6 - Data Dictionary. Report No.: PS3.6. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 06.pdf
. [Last accessed on 2018 Jun 23].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.10 - Media Storage and File Format for Media Interchange. Report No.: PS3.10. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 10.pdf
. [Last accessed on 2018 Jun 23].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.16 - Content Mapping Resource. Report No.: PS3.16. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 16.pdf
. [Last accessed on 2018 Jun 23].
Mason D. Pydicom: An open source DICOM library. Med Phys 2011;38:3493.
PyPi Website for the Openslide-Python Python Package; 2018. Available from: https://www.openslide.org/
. [Last accessed on 2018 Jun 11].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.8 - Network Communication Support for Message Exchange. Report No.: PS3.8. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 08.pdf
. [Last accessed on 2018 Jun 23].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.7 - Message Exchange. Report No.: PS3.7. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 07.pdf
. [Last accessed on 2018 Jun 23].
Fielding RT. Architectural Styles and the Design of Network-based Software Architectures. PhD Thesis, UC Irvine; 2000.
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.18 - Web Services. Report No.: PS3.18. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: https://www.dicomstandard.org/dicomweb/
. [Last accessed on 2018 Jun 11].
Github Website for the Git Repository of the DCM4CHEE Archive. Miscellaneous; 2018. Available from: https://www.dcm4che.org/
. [Last accessed on 2018 Jun 11].
National Electrical Manufacturers Association (NEMA). Digital Imaging and Communications in Medicine (DICOM) Standard PS3.17 - Explanatory Information. Annex NN Specimen Identification and Management. Report No.: PS3.17. Rosslyn, VA: National Electrical Manufacturers Association (NEMA); 2018. Available from: http://www.dicom.nema.org/medical/dicom/current/output/pdf/part 17.pdf
. [Last accessed on 2018 Jun 23]
Jodogne S. The orthanc ecosystem for medical imaging. J Digit Imaging 2018;31:341-52.
McCormick M, Liu X, Jomier J, Marion C, Ibanez L. ITK: Enabling reproducible research and open science. Front Neuroinform 2014;8:13.
Lowekamp BC, Chen DT, Ibáñez L, Blezek D. The design of simpleITK. Front Neuroinform 2013;7:45.
Genereaux BW, Dennison DK, Ho K, Horn R, Silver EL, O'Donnell K, et al.
DICOMweb™: Background and application of the web standard for medical imaging. J Digit Imaging 2018;31:321-326.
Bidgood WD Jr., Horii SC. Introduction to the ACR-NEMA DICOM standard. Radiographics 1992;12:345-55.
Guo H, Birsa J, Farahani N, Hartman DJ, Piccoli A, O'Leary M, et al.
Digital pathology and anatomic pathology laboratory information system integration to support digital pathology sign-out. J Pathol Inform 2016;7:23.
Isaacs M, Lennerz JK, Yates S, Clermont W, Rossi J, Pfeifer JD, et al.
Implementation of whole slide imaging in surgical pathology: A value added approach. J Pathol Inform 2011;2:39.
] [Full text]
Daniel C, Rojo MG, Klossa J, Della Mea V, Booker D, Beckwith BA, et al.
Standardizing the use of whole slide images in digital pathology. Comput Med Imaging Graph 2011;35:496-505.
IHE Pathology and Laboratory Medicine (PaLM). HL7 Project Scope Statement. IHE Digital Pathology Workflow Metadata Requirements (e.g. DICOM) to Specimen DAM Mapping and HL7 Product Family Use. Available from: http://wiki.hl7.org/images/1/17/HL7_CG_20180904.pdf
. [Last accessed on 2018 Aug 05].
[Figure 1], [Figure 2], [Figure 3], [Figure 4], [Figure 5], [Figure 6], [Figure 7]
|This article has been cited by|
||FedFlow: a federated platform to build secure sharing and synchronization services for health dataflows
| ||Diana Carrizales-Espinoza, Dante D. Sanchez-Gallegos, J. L. Gonzalez-Compean, Jesus Carretero |
| ||Computing. 2022; |
|[Pubmed] | [DOI]|
||Current applications and challenges of artificial intelligence in pathology
| ||Matthew G. Hanna, Maria H. Hanna |
| ||Human Pathology Reports. 2022; 27: 300596 |
|[Pubmed] | [DOI]|
||Interpretable tumor differentiation grade and microsatellite instability recognition in gastric cancer using deep learning
| ||Feng Su, Jianmin Li, Xinya Zhao, Beining Wang, Yajie Hu, Yu Sun, Jiafu Ji |
| ||Laboratory Investigation. 2022; |
|[Pubmed] | [DOI]|
||Tissue Multiplex Analyte Detection in Anatomic Pathology – Pathways to Clinical Implementation
| ||Keith A. Wharton, Douglas Wood, Mael Manesse, Kirsteen H. Maclean, Florian Leiss, Aleksandra Zuraw |
| ||Frontiers in Molecular Biosciences. 2021; 8 |
|[Pubmed] | [DOI]|
||Artificial Intelligence and the Medical Physicist: Welcome to the Machine
| ||Michele Avanzo, Annalisa Trianni, Francesca Botta, Cinzia Talamonti, Michele Stasi, Mauro Iori |
| ||Applied Sciences. 2021; 11(4): 1691 |
|[Pubmed] | [DOI]|
||Shifting Gears in Precision Oncology—Challenges and Opportunities of Integrative Data Analysis
| ||Ka-Won Noh, Reinhard Buettner, Sebastian Klein |
| ||Biomolecules. 2021; 11(9): 1310 |
|[Pubmed] | [DOI]|
||Developing image analysis pipelines of whole-slide images: Pre- and post-processing
| ||Byron Smith, Meyke Hermsen, Elizabeth Lesser, Deepak Ravichandar, Walter Kremers |
| ||Journal of Clinical and Translational Science. 2021; 5(1) |
|[Pubmed] | [DOI]|
||Multispecialty Enterprise Imaging Workgroup Consensus on Interactive Multimedia Reporting Current State and Road to the Future: HIMSS-SIIM Collaborative White Paper
| ||Christopher J. Roth, David A. Clunie, David J. Vining, Seth J. Berkowitz, Alejandro Berlin, Jean-Pierre Bissonnette, Shawn D. Clark, Toby C. Cornish, Monief Eid, Cree M. Gaskin, Alexander K. Goel, Genevieve C. Jacobs, David Kwan, Damien M. Luviano, Morgan P. McBee, Kelly Miller, Abdul Moiz Hafiz, Ceferino Obcemea, Anil V. Parwani, Veronica Rotemberg, Elliot L. Silver, Erik S. Storm, James E. Tcheng, Karen S. Thullner, Les R. Folio |
| ||Journal of Digital Imaging. 2021; 34(3): 495 |
|[Pubmed] | [DOI]|
||The role of machine learning in cardiovascular pathology
| ||Carolyn Glass, Kyle J. Lafata, William Jeck, Roarke Horstmeyer, Colin Cooke, Jeffrey Everitt, Matthew Glass, David Dov, Michael A. Seidman |
| ||Canadian Journal of Cardiology. 2021; |
|[Pubmed] | [DOI]|
||OpenPhi: an interface to access Philips iSyntax whole slide images for computational pathology
| ||Nita Mulliqi, Kimmo Kartasalo, Henrik Olsson, Xiaoyi Ji, Lars Egevad, Martin Eklund, Pekka Ruusuvuori, Olga Vitek |
| ||Bioinformatics. 2021; 37(21): 3995 |
|[Pubmed] | [DOI]|
||Understanding Scanner Utilization With Real-Time DICOM Metadata Extraction
| ||Pradeeban Kathiravelu, Ashish Sharma, Puneet Sharma |
| ||IEEE Access. 2021; 9: 10621 |
|[Pubmed] | [DOI]|
||A Performant Protocol for Distributed Health Records Databases
| ||Micael Pedrosa, Rui Lebre, Carlos Costa |
| ||IEEE Access. 2021; 9: 125930 |
|[Pubmed] | [DOI]|
||NCI Imaging Data Commons
| ||Andrey Fedorov, William J.R. Longabaugh, David Pot, David A. Clunie, Steve Pieper, Hugo J.W.L. Aerts, André Homeyer, Rob Lewis, Afshin Akbarzadeh, Dennis Bontempi, William Clifford, Markus D. Herrmann, Henning Höfener, Igor Octaviano, Chad Osborne, Suzanne Paquette, James Petts, Davide Punzo, Madelyn Reyes, Daniela P. Schacherer, Mi Tian, George White, Erik Ziegler, Ilya Shmulevich, Todd Pihl, Ulrike Wagner, Keyvan Farahani, Ron Kikinis |
| ||Cancer Research. 2021; 81(16): 4188 |
|[Pubmed] | [DOI]|
||A review and guide to creating patient specific 3D printed anatomical models from MRI for benign gynecologic surgery
| ||Teresa E. Flaxman, Carly M. Cooke, Olivier X. Miguel, Adnan M. Sheikh, Sukhbir S. Singh |
| ||3D Printing in Medicine. 2021; 7(1) |
|[Pubmed] | [DOI]|
||Open Health Imaging Foundation Viewer: An Extensible Open-Source Framework for Building Web-Based Imaging Applications to Support Cancer Research
| ||Erik Ziegler, Trinity Urban, Danny Brown, James Petts, Steve D. Pieper, Rob Lewis, Chris Hafey, Gordon J. Harris |
| ||JCO Clinical Cancer Informatics. 2020; (4): 336 |
|[Pubmed] | [DOI]|
||An Efficient Hybrid Fuzzy-Clustering Driven 3D-Modeling of Magnetic Resonance Imagery for Enhanced Brain Tumor Diagnosis
| ||Suresh Kanniappan, Duraimurugan Samiayya, Durai Raj Vincent P M, Kathiravan Srinivasan, Dushantha Nalin K. Jayakody, Daniel Gutiérrez Reina, Atsushi Inoue |
| ||Electronics. 2020; 9(3): 475 |
|[Pubmed] | [DOI]|
||Protecting the Pathology Commons in the Digital Era
| ||Benjamin L. Mazer, Nathan Paulson, John H. Sinard |
| ||Archives of Pathology & Laboratory Medicine. 2020; 144(9): 1037 |
|[Pubmed] | [DOI]|
||Report on computational assessment of Tumor Infiltrating Lymphocytes from the International Immuno-Oncology Biomarker Working Group
| ||Mohamed Amgad, Elisabeth Specht Stovgaard, Eva Balslev, Jeppe Thagaard, Weijie Chen, Sarah Dudgeon, Ashish Sharma, Jennifer K. Kerner, Carsten Denkert, Yinyin Yuan, Khalid AbdulJabbar, Stephan Wienert, Peter Savas, Leonie Voorwerk, Andrew H. Beck, Anant Madabhushi, Johan Hartman, Manu M. Sebastian, Hugo M. Horlings, Jan Hudecek, Francesco Ciompi, David A. Moore, Rajendra Singh, Elvire Roblin, Marcelo Luiz Balancin, Marie-Christine Mathieu, Jochen K. Lennerz, Pawan Kirtani, I-Chun Chen, Jeremy P. Braybrooke, Giancarlo Pruneri, Sandra Demaria, Sylvia Adams, Stuart J. Schnitt, Sunil R. Lakhani, Federico Rojo, Laura Comerma, Sunil S. Badve, Mehrnoush Khojasteh, W. Fraser Symmans, Christos Sotiriou, Paula Gonzalez-Ericsson, Katherine L. Pogue-Geile, Rim S. Kim, David L. Rimm, Giuseppe Viale, Stephen M. Hewitt, John M. S. Bartlett, Frédérique Penault-Llorca, Shom Goel, Huang-Chun Lien, Sibylle Loibl, Zuzana Kos, Sherene Loi, Matthew G. Hanna, Stefan Michiels, Marleen Kok, Torsten O. Nielsen, |
| ||npj Breast Cancer. 2020; 6(1) |
|[Pubmed] | [DOI]|
||Technische, operative und regulatorische Aspekte für die Nutzung der digitalen und rechnergestützten Pathologie
| ||Markus D. Herrmann, Jochen K. Lennerz |
| ||Der Pathologe. 2020; 41(S2): 103 |
|[Pubmed] | [DOI]|