Reshaping patient and treatment insights with RWE and AI
Current applications of AI in RWE
Enhancing coding and algorithm development
Ryan Fiano, PhD., a researcher at Cencora, explained that by leveraging AI in a recent project, the time involved in building complex algorithms for treatment patterns dropped from several weeks to a week or less. "The efficiency comes from knowing how to prompt the AI and prior experience with turning outcomes into actionable steps,” he said.
This specific expertise is vital because the pre-work required to define common retrospective outcomes—such as Lines of Therapy (LOT), persistence, or treatment switching—is rarely a standardized process; definitions often vary significantly across studies. By combining deep domain knowledge with focused prompting techniques, analysts can equip AI with validated internal code patterns, effectively "teaching" it to apply standard “coding” approaches across studies. For instance, by "showing" the model exactly how the team handles typical rules, such as medication overlaps and gap contingencies to define LOT, analysts can help to ensure that more complex edge cases require only minimal refactoring.
This process, we have found, dramatically reduces the "debug loop"—transforming what was once a multi-day cycle of syntax troubleshooting into a streamlined validation process where analysts focus on verifying clinical intent rather than correcting code structure.
Improving data validation and quality
For example, Dr. Fiano noted that his team was able to leverage prompt templates derived from established quality control practices to systematically generate quality control checkpoints that validate outputs while ensuring consistency.
“This approach transforms manual quality assurance into a scalable, repeatable process integrated seamlessly within AI workflows,” Dr. Fiano noted.
By automating these validation processes, AI allows researchers to focus on higher-value tasks, such as study design and interpretation, while maintaining the rigor required for regulatory and payer submissions.
Challenges and limitations of AI in RWE
Ensuring validity and transparency
Derek Swiger, PharmD, MS, an expert in digital innovation at Cencora, highlighted the importance of building safeguards into workflows:
“At the end of the day, Generative AI tools are black boxes, so we need to be creative with building checks into the process that ensure the outputs are valid and aligned with client needs.”
For example, researchers can validate AI-generated data by comparing it against known benchmarks or using self-checking mechanisms to ensure consistency. These safeguards are essential for maintaining trust in AI-driven RWE.
From human expertise to AI effectiveness
“The value of AI right now lies in combining content expertise with the ability to build effective prompts,” Dr. Fiano explained. “You need to know how to guide the AI to get meaningful results and ensure that those results align with the study objectives.”
This human-in-the-loop approach underscores the importance of retaining critical thinking and domain expertise, even as AI automates many aspects of RWE generation.4,2
While human expertise is integral, so too is accessing high-quality data – a persistent problem given how often data is siloed across institutions or locked behind proprietary systems. Federated learning models, which allow data to remain decentralized while enabling collaborative analysis, offer a potential solution. However, implementing these models requires overcoming technical and regulatory barriers.
Future trends and opportunities
Dr. Fiano shared his perspective on the potential of federated learning. “Federated frameworks enable the analysis of dispersed datasets as if they were coalesced in one place, creating a unified virtual cohort that overcomes single-institution limitations without ever physically centralizing sensitive patient records.”
Rather than pooling identifiable records, federated architectures execute standardized analytic queries locally at each data partner, returning only aggregate results or summary statistics. Europe's DARWIN EU network exemplifies this at scale. As of December 2025, it has initiated over 100 studies across 32 data partners covering approximately 188 million patients,6 all coordinated through a federated approach built on the OMOP Common Data Model (CDM).
An earlier initiative was the European Health Data & Evidence Network (EHDEN), which was focused on gathering broad insights and evidence from real-world clinical data. The project has since transitioned to the Netherlands-based EHDEN Foundation, which is focused on building the infrastructure for a federated network across Europe.7
In the United States, the Food and Drug Administration’s (FDA) recent policy shift to accept de-identified RWE for certain regulatory submissions further validates federated evidence as a credible alternative to centralized data warehouses.8
For rare diseases or niche populations where single institutions lack statistical power, federated analytics enables researchers to generate robust comparative effectiveness evidence while maintaining strict data governance—each institution retains custody of its data, and re-identification risk is mitigated through architectural controls that separate tokens from quasi-identifiers.9 For example, federated queries can now link genomic data, claims, and mortality registries across multiple institutions to study treatment patterns and long-term outcomes in ultra-rare diseases where individual sites may have fewer than 50 patients, but the federated network collectively reaches sample sizes sufficient for causal inference and regulatory-grade evidence.10
Conclusion
AI will play an increasingly important role in the development of RWE solutions as companies seek to navigate the complexities of evidence generation and achieve their market access goals.
Ansvarserklæring:
Informasjonen i denne artikkelen utgjør ikke juridisk rådgivning. Cencora, Inc. oppfordrer leserne på det sterkeste til å gjennomgå tilgjengelig informasjon relatert til emnene som diskuteres og stole på sin egen erfaring og ekspertise når de tar beslutninger relatert til dette.
Ta kontakt med teamet vårt
Sources
1. Towards responsible artificial intelligence in healthcare-getting real about real-world data and evidence. J Am Med Inform Assoc., Nov 2025. https://pmc.ncbi.nlm.nih.gov/articles/PMC12626219/
2. Opportunities and Challenges for AI-Based Analysis of RWD in Pharmaceutical R&D: A Practical Perspective, Künstliche Intelligenz, Oct 2023. https://link.springer.com/article/10.1007/s13218-023-00809-6#:~:text=analysis%20approach,illustrate%20challenges%20and%20methodological%20considerations
3. What is black box AI? IBM, Oct 2024. https://www.ibm.com/think/topics/black-box-ai
4. Towards responsible artificial intelligence in healthcare—getting real about real-world data and evidence, J Am Med Inform Assoc., Nov 2025. https://pmc.ncbi.nlm.nih.gov/articles/PMC12626219/
5. Darwin EU. https://www.darwin-eu.org/
6. Big data highlights, EMA, Dec 2025. https://ec.europa.eu/newsroom/ema/newsletter-archives/70356
7. European Health Data and Evidence Network (EHDEN): Shaping the future of health data in Europe, European Union. https://data.europa.eu/en/news-events/news/european-health-data-and-evidence-network-ehden-shaping-future-health-data-europe
8. FDA Eliminates Major Barrier to Using Real-World Evidence in Drug and Device Application Reviews, FDA, Dec 2025. https://www.fda.gov/news-events/press-announcements/fda-eliminates-major-barrier-using-real-world-evidence-drug-and-device-application-reviews
9. Tokenization techniques for privacy-preserving healthcare data: tokenization nuts and bolts., Front Drug Saf Regul., Dec 2025. https://pubmed.ncbi.nlm.nih.gov/41488932/
10. Privacy-by-Design with Federated Learning will drive future Rare Disease Research, Journal of Neuromuscular Diseases, Dec 2024. https://journals.sagepub.com/doi/10.1177/22143602241296276
