Enhancing Healthcare Data Interoperability with AI-Driven Synthetic Datasets Using FHIR Standards

Authors

  • Vamshi Paili

Keywords:

synthetic data, FHIR R5, interoperability, privacy, generative models, load testing, profile validation, asynchronous export

Abstract

This article addresses how one can combine AI-generated synthetic medical datasets as well as FHIR standard semantic standards and APIs to better enable cross-system interoperability between health care providers. In order to create data sets with the desired level of analytical utility, the author have as a study objective to demonstrate how the generation of synthetics in the context of FHIR resource and transport via the R5 mechanisms of profiles, subscriptions and asynchronous export to NDJSON removes both legal and technical barriers to accessing clinical data. Because data-sharing is complicated by the risk of regulatory noncompliance; a significant amount of clinical data is contained in unstructured documents; and clinical data grows at an exponential rate, the need for this type of solution has increased. This work is innovative in that it brings together three families of generative models: sequential LLM-like models, GANs and diffusion networks. Additionally, the author use a direct mapping pipeline from generative models to FHIR resources and validate profile IG and automated REST tests using built-in systems. To train the generative models on rare nosologies, and to perform load-testing, the author describe an architecture that enables private, secure access to multiple, representative cohorts, which are used to generate the synthetic data. Key Findings: (1) The use of synthetics inside the FHIR shell ensures compatibility with the current infrastructure while reducing legal barriers. (2) They also allow risk-free testing of extreme scenarios and (3) they also reduce sample bias. A rational and practical strategy for accelerating the implementation of analytics in clinics would include a strategy of generating synthetics in a continuous integration/continuous delivery (CI/CD) environment and (2) implementing mandatory validation through profile management. The author believe that the article will provide valuable assistance to medical AI developers, integration solution architects, IT service managers, and regulatory analysts.

Author Biography

  • Vamshi Paili

    Sr Software Engineer, FEI Systems,Punta Gorda, Florida

References

[1] OneData, “Unlocking Zettabytes of Value: Business Success in the Age of Information Explosion,” OneData, 2025. https://onedata.ai/insights/200-zettabytes-of-data-by-2025/ (accessed Aug. 01, 2025).

[2] M. H. Gabriel, C. Richwine, C. Strawley, W. Barker, and J. Everson, “Interoperable Exchange of Patient Health Information Among U.S. Hospitals: 2023,” May 2024. https://www.ncbi.nlm.nih.gov/books/NBK606033/ (accessed Aug. 01, 2025).

[3] IBM, “Text Mining,” IBM. https://www.ibm.com/think/topics/text-mining (accessed Aug. 02, 2025).

[4] H.-J. Kong, “Managing Unstructured Big Data in Healthcare System,” Healthcare Informatics Research, vol. 25, no. 1, 2019, doi: https://doi.org/10.4258/hir.2019.25.1.1.

[5] S. Alder, “Healthcare Data Breach Statistics,” The HIPAA Journal, May 26, 2025. https://www.hipaajournal.com/healthcare-data-breach-statistics/ (accessed Aug. 04, 2025).

[6] M. Kilgus, “GDPR Enforcement in Life Science & Healthcare,” CMS, May 13, 2025. https://cms.law/en/deu/publication/gdpr-enforcement-tracker-report/life-science-healthcare (accessed Aug. 05, 2025).

[7] G. Iversen, “Epic says its imaging order alerts prevented millions of duplicate scans over one year,” DotMed, 2025. https://www.dotmed.com/news/story/64303 (accessed Aug. 06, 2025).

[8] G. Church, “The deadly cost of duplicate patient records,” OncLive, Nov. 05, 2023. https://www.chiefhealthcareexecutive.com/view/the-deadly-cost-of-duplicate-patient-records-viewpoint (accessed Aug. 07, 2025).

[9] “Resourcelist - FHIR v5.0.0,” HL7. https://www.hl7.org/fhir/resourcelist.html (accessed Aug. 08, 2025).

[10] S. Rodriguez, “Latest FHIR Standard R5 Elevates Data Exchange, Interoperability,” Health IT and EHR, 2023. https://www.techtarget.com/searchhealthit/news/366577881/Latest-FHIR-Standard-R5-Elevates-Data-Exchange-Interoperability (accessed Aug. 09, 2025).

[11] WHO, “SMART Verifiable IPS for Pilgrimage v2.0.3,” WHO. https://smart.who.int/ips-pilgrimage/ (accessed Aug. 10, 2025).

[12] “Validating Resources against the specification and Profiles,” Hl7. https://confluence.hl7.org/plugins/viewsource/viewpagesrc.action?pageId=144967104 (accessed Aug. 11, 2025). [13] A. E. W. Johnson, T. J. Pollard, L. Shen, et al., "MIMIC-III, a freely accessible critical care database," Scientific Data, vol. 3, 160035, 2016. doi: 10.1038/sdata.2016.35.

Downloads

Published

2025-12-26

Issue

Section

Articles

How to Cite

Vamshi Paili. (2025). Enhancing Healthcare Data Interoperability with AI-Driven Synthetic Datasets Using FHIR Standards. International Journal of Computer (IJC), 56(1), 297-308. https://www.ijcjournal.org/InternationalJournalOfComputer/article/view/2474