27–29 May 2024
Geneva
Europe/Zurich timezone

Leveraging semantics for clinical phenotype in a consortium of public hospitals

Not scheduled
15m
Geneva

Geneva

Oral presentation Health and the environment, time for solutions

Description

Introduction:
Electronic Health Records (EHR) contain a wealth of valuable clinical data that can be harnessed for research, decision support, and healthcare improvement. However, reusing EHR data remains challenging due to issues related to their heterogeneous, multi modal, complex nature, and the lack of semantic representation. This abstract presents a pragmatic approach to address these challenges by leveraging formal grammars and semantically rich classifications like SNOMED CT. We demonstrate how manual post-coordinated representations, natural language processing, and graph databases can enhance the utility of EHR data within a consortium of public hospitals.
Methods:
To overcome the limitations of EHR data, we adopt a multi-faceted approach. Categorical data are tackled through manual representation using SNOMED CT formal grammar to create post-coordinated expressions. This allows us to capture the meaning of clinical concepts and reduce informational loss. We establish stratified rules and validation tools to ensure consistency and accuracy in this process. Textual data are addressed through a natural language processing (NLP) pipeline, which extract valuable information from unstructured clinical narratives.
The semantically encoded representations are then integrated into a graph database. This database not only facilitate semantic exploration but also act as a bridge to other data models, enabling interoperability with various systems. Furthermore, it enables similarity comparisons between patients, offering a valuable tool for cohort identification and research.
Results:
Promising results were achieved through these efforts. On categorical data, a coverage of 76% of the instances was achieved on data selected for encoding. Secondly, a scalable industrial pipeline was established, which adhered to industry standards and made use of widely accepted tools, thereby ensuring compatibility and future sustainability. The graph database contains more than 60,000 semantic representations, multiple international classifications and widely used data models representations.
Conclusion:
The semantic enhancement of clinical data is paramount in unlocking their full potential for research, clinical decision support, and healthcare improvement. By enriching EHR data with formal grammar and semantically rich classifications like SNOMED CT, we transition from a data-centric to a semantic-centric world, where the meaning of the data is as crucial as its structure. This paradigm shift empowers healthcare institutions to tap into the wealth of knowledge encapsulated within EHRs.
Semantic enhancement could not only improve data reuse but also facilitate the extraction of meaningful insights. It enables healthcare professionals and researchers to better understand patient profiles, identify hidden patterns, and make more informed decisions. In the long run, this approach has the potential to revolutionize healthcare by fostering a deeper understanding of clinical phenotypes, advancing personalized medicine, and ultimately improving patient outcomes. Therefore, embracing semantics is not merely an option but a necessity in the evolving landscape of healthcare data utilization.

Contact Geneva Health Forum I just want to receive information about the GHF 2024 conference / Je souhaite simplement recevoir des informations sur la conférence GHF 2024

Author

Christophe Gaudet-Blavignac (Hôpitaux Universitaires de Genève)

Co-authors

Dr Julien Ehrsam (Hôpitaux Universitaires de Genève) Mr Adel Bensahla Talet (Hôpitaux Universitaires de Genève) Prof. Christian Lovis (Hôpitaux Universitaires de Genève)

Presentation materials

There are no materials yet.