Privacy-Preserving Text Summarization Using Semantic Similarity with BioBERT and ClinicalBERT for Multiple Medical Documents Leveraging Parallelized High-Performance Computing

Authors

  • Majji Venkata Kishore
  • Prajna Bodapati

DOI:

https://doi.org/10.70135/seejph.vi.4393

Abstract

Medical documents in the healthcare industry produce an enormous volume of textual data. This data is a rich source of insight, but it also raises serious privacy, data-security, and computational-complexity challenges. This research presents a novel framework for privacy-preserving text summarization of multiple medical documents, using semantic similarity algorithms driven by modified BioBERT and ClinicalBERT models and parallelized high-performance computing (HPC). To maximize efficiency, the framework combines distributed computing environments with secure computation techniques, so that sensitive medical data can be summarized while anonymity is maintained. The study shows that the method offers fast, privacy-compliant summarization that protects patient privacy without sacrificing the relevance or semantic accuracy of the information.
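
The general idea described in the abstract (mask identifiers, score sentences by semantic similarity, process documents in parallel) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the function names are hypothetical, a bag-of-words vector stands in for BioBERT or ClinicalBERT embeddings, two regular expressions stand in for the privacy-preserving step, and a local thread pool stands in for an HPC cluster.

```python
import math
import re
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def mask_identifiers(text):
    """Toy de-identification (assumption): mask dates and long digit runs."""
    text = re.sub(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b", "[DATE]", text)
    return re.sub(r"\b\d{6,}\b", "[ID]", text)


def embed(sentence):
    """Stand-in embedding: word counts instead of BioBERT/ClinicalBERT vectors."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(document, k=1):
    """Keep the k sentences most similar to the whole-document vector."""
    clean = mask_identifiers(document)
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", clean) if s.strip()]
    doc_vec = embed(clean)
    ranked = sorted(sentences, key=lambda s: cosine(embed(s), doc_vec), reverse=True)
    top = set(ranked[:k])
    return [s for s in sentences if s in top]  # preserve original sentence order


def summarize_many(documents, k=1, workers=4):
    """Summarize documents in parallel, one task per document."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda d: summarize(d, k), documents))
```

Calling `summarize_many(list_of_texts, k=2)` returns one list of selected sentences per input document, with identifiers already masked. In a realistic setting the `embed` function would be replaced by a transformer encoder and the thread pool by a process pool or cluster scheduler; masking happens before scoring so that raw identifiers never reach the similarity step.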

Published

2025-02-09

How to Cite

Kishore, M. V., & Bodapati, P. (2025). Privacy-Preserving Text Summarization Using Semantic Similarity with BioBERT and ClinicalBERT for Multiple Medical Documents Leveraging Parallelized High-Performance Computing. South Eastern European Journal of Public Health, 1795–1801. https://doi.org/10.70135/seejph.vi.4393

Section

Articles