Journal of Pathology Informatics

TECHNICAL NOTE
Year
: 2018  |  Volume : 9  |  Issue : 1  |  Page : 47-

Integration of cancer registry data into the text information extraction system: Leveraging the structured data import tool


Faina Linkov1, Jonathan C Silverstein3, Michael Davis3, Brenda Crocker2, Degan Hao4, Althea Schneider3, Melissa Schwenk5, Sharon Winters3, Joyce Zelnis5, Adrian V Lee4, Michael J Becich6 
1 Department of Obstetrics, Gynecology and Reproductive Sciences; Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
2 UPMC Hillman Cancer Center Information Services, Pittsburgh, Pennsylvania, USA
3 UPMC Network Cancer Registry, Pittsburgh, Pennsylvania, USA
4 Department of Pharmacology and Chemical Biology, UPMC Hillman Cancer Center, The Institute for Precision Medicine, Pittsburgh, Pennsylvania, USA

Correspondence Address:
Dr. Faina Linkov
Department of Obstetrics, Gynecology, University of Pittsburgh School of Medicine, 3380 Blvd of Allies, Suite 343, Pittsburgh, PA 15213
USA

Introduction/Background: Cancer registries in the US collect timely and systematic data on new cancer cases, extent of disease, staging, biomarker status, treatment, survival, and mortality of cancer cases. Existing methodologies for accessing local cancer registry data for research are time-consuming and often rely on the manual merging of data by staff registrars. In addition, existing registries do not provide direct access to these data nor do they routinely provide linkage to discrete electronic health record (EHR) data, reports, or imaging data. Automation of such linkage can provide an impressive data resource and make valuable data available for translational cancer research. Methods: The UPMC Network Cancer Registry collects highly structured, longitudinal data on all reportable cancer patients, from the point of the diagnosis throughout treatment and follow-up/outcomes. Using commercial registry software, we collect data in compliance with standards governed by the North American Association of Central Cancer Registries. This standardization ensures that the data are highly structured with standard coding and collection methods, which support data exchange among central cancer registries and the Centers for Disease Control and Prevention. Results: At the UPMC Hillman Cancer Center and University of Pittsburgh, we explored the feasibility of linking this well-curated, structured cancer registry data with unstructured text (i.e., pathology and radiology reports), using the Text Information Extraction System (TIES). We used the TIES platform to integrate breast cancer cases from the UPMC Network Cancer Registry system and then combine these data with other EHR data as a pilot use case that can be replicated for other cancers. Conclusions: As a result of this integration, we now have a single searchable repository of information for breast cancer patients from the UPMC registry, combined with their pathology and radiology reports. The system that we developed is easily scalable to other health systems and cancer centers.


How to cite this article:
Linkov F, Silverstein JC, Davis M, Crocker B, Hao D, Schneider A, Schwenk M, Winters S, Zelnis J, Lee AV, Becich MJ. Integration of cancer registry data into the text information extraction system: Leveraging the structured data import tool.J Pathol Inform 2018;9:47-47


How to cite this URL:
Linkov F, Silverstein JC, Davis M, Crocker B, Hao D, Schneider A, Schwenk M, Winters S, Zelnis J, Lee AV, Becich MJ. Integration of cancer registry data into the text information extraction system: Leveraging the structured data import tool. J Pathol Inform [serial online] 2018 [cited 2019 Aug 18 ];9:47-47
Available from: http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2018;volume=9;issue=1;spage=47;epage=47;aulast=Linkov;type=0