Clinical Intelligence Research Press Clinical Intelligence Research Press

Search

Search results:
A Large Language Model Integration Architecture for Clinical Decision Infrastructure
The integration of large language models (LLMs) into clinical decision infrastructures represents a transformative shift in healthcare delivery, enabling enhanced reasoning, data synthesis, and adaptive support for clinicians. This conceptual manuscript proposes a novel architecture, termed the adaptive LLM-orchestrated clinical ecosystem (ALOCE), designed to seamlessly embed LLMs within existing electronic health record (EHR) systems, interoperability frameworks, and governance protocols. By delineating a multi-layered structure encompassing data ingestion, semantic processing, decision augmentation, and continuous monitoring, ALOCE addresses key challenges such as data silos, ethical AI deployment, and real-time adaptability in clinical environments. Drawing on theoretical foundations from AI governance and healthcare informatics, the architecture incorporates feedback topologies for drift detection and ethical alignment, ensuring robustness in diverse clinical workflows. Conceptual formulas are introduced to model risk propagation across layers, decision confidence thresholds, and governance load balancing, providing interpretive tools for system designers. The manuscript synthesizes recent literature on clinical AI architectures, highlighting interoperability standards like FHIR and the role of LLMs in augmenting human decision-making without empirical validation. Ultimately, this work outlines a blueprint for scalable, ethical LLM integration, fostering improved patient outcomes through intelligent infrastructure orchestration. While theoretical, the implications extend to policy, deployment strategies, and future research in AI-driven healthcare systems.
Journal of Artificial Intelligence for Healthcare Systems
Original Research | Open access | 20 January 2025 | Article: 35

Large Language Models in Clinical Contexts: Infrastructure, Oversight, and Risk Dynamics
The integration of large language models (LLMs) into clinical healthcare systems represents a transformative shift in how data analytics, decision support, and operational infrastructure are conceptualized and deployed. This narrative review synthesizes recent advancements in LLMs within healthcare, focusing on their roles in enhancing clinical analytics, infrastructural frameworks, and oversight mechanisms while addressing inherent risk dynamics. Drawing from peer-reviewed literature, we examine how LLMs facilitate the processing of vast unstructured clinical data, such as electronic health records and patient narratives, to generate actionable insights that inform diagnostics, treatment planning, and resource allocation. Key infrastructural elements include scalable deployment pipelines that integrate LLMs with existing hospital information systems, enabling real-time analytics and predictive modeling without disrupting legacy workflows. Oversight is emphasized through regulatory frameworks that ensure ethical deployment, data privacy compliance, and bias mitigation, as LLMs amplify risks related to misinformation, algorithmic opacity, and equitable access in diverse clinical settings. Risk dynamics are explored in terms of model hallucinations, dependency on training data quality, and potential for exacerbating healthcare disparities if not properly governed. The review highlights systems-level analytics where LLMs contribute to closed-loop healthcare ecosystems, from data ingestion and inference to feedback-driven recalibration, fostering adaptive intelligence in clinical decision-making. For instance, LLMs have been adapted for tasks like text summarization, diagnostic reasoning, and patient communication, outperforming traditional methods in efficiency while requiring robust validation to maintain clinical fidelity. We underscore the need for interdisciplinary collaboration between clinicians, data scientists, and policymakers to harness LLMs' potential in optimizing healthcare delivery. By synthesizing cross-study evidence, this review proposes an original interpretive framework for LLM-enabled healthcare systems, structured around data-model-deployment-governance cycles, to guide future implementations. Ultimately, while LLMs promise enhanced analytics and infrastructural resilience, their clinical adoption demands vigilant oversight to balance innovation with patient safety and ethical integrity. This synthesis not only maps the current landscape but also identifies infrastructural gaps in scaling LLMs for equitable, high-stakes clinical environments, paving the way for more resilient healthcare analytics paradigms.
Journal of Artificial Intelligence for Healthcare Systems
Review | Open access | 20 July 2025 | Article: 41

Retrieval-Augmented Generation for Real-Time Clinical Question Answering: A Framework Integrating Electronic Health Records and Clinical Guidelines
Clinicians often need rapid, evidence-based answers that integrate patient-specific electronic health records (EHRs) with clinical guidelines, but existing decision support tools are limited in real-time personalization. While large language models (LLMs) offer strong medical reasoning, they are prone to hallucinations and lack direct access to local EHR data, making them unsafe for standalone clinical use; meanwhile, traditional retrieval systems cannot synthesize coherent, context-aware responses. This paper proposes a retrieval-augmented generation (RAG) framework that combines dual-source retrieval from both institutional EHRs and clinical guideline databases. The system includes an EHR indexer, a guideline repository, a semantic retriever, an LLM-based generator, and a safety filter for hallucination mitigation. By grounding outputs in retrieved patient data and evidence-based recommendations, the model improves factual reliability, explainability, and clinical trustworthiness. Overall, the framework enables safe, real-time clinical question answering by integrating LLM reasoning with verified medical sources, with future validation planned on public EHR and guideline datasets.
Journal of Artificial Intelligence for Healthcare Systems
Original Research | Open access | 20 January 2025 | Article: 100

Parameter-Efficient Fine-Tuning of Large Language Models for Automated Discharge Summary Generation from Daily Progress Notes and Laboratory Results
Hospital discharge summaries are critical for care transitions, directly impacting readmission prevention and medication reconciliation, yet physicians spend 15-30 minutes per patient drafting these documents, contributing substantially to documentation burden and professional burnout. Manual summarization of daily progress notes and laboratory results is repetitive, time-consuming, and error-prone, as clinicians must sift through lengthy unstructured notes across multiple hospital days while identifying salient events and trends. We propose a large language model with parameter-efficient fine-tuning for automated discharge summary generation that processes chronologically ordered daily progress notes alongside time-series laboratory results to produce structured discharge documentation. The framework consists of a base LLM augmented with LoRA adapters, a progress note encoder for section segmentation, a laboratory result integrator that computes trend indicators, and a summary generator that produces sectioned discharge output. Parameter-efficient fine-tuning enables domain adaptation to clinical text with minimal computational resources, preserving patient-specific information while reducing hallucination through retrieval of key factual details from the input notes. This framework offers a practical pathway to reduced documentation burden and improved discharge quality, with potential for widespread deployment across health systems given the modest computational requirements of PEFT approaches.
Journal of Artificial Intelligence for Healthcare Systems
Original Research | Open access | 20 January 2025 | Article: 101

Large Language Models in Clinical Medicine from 2017 to 2025: A Systematic Review of Performance on Medical Licensing Examinations, Clinical Documentation, Decision Support, and Safety Concerns
Large language models (LLMs) have rapidly advanced since the transformer architecture was introduced in 2017, with systems such as GPT-3, GPT-4, Med-PaLM, and Claude increasingly explored for applications in medical education, clinical documentation, decision support, and patient communication, raising both optimism and concerns regarding safety and reliability. This systematic review synthesizes evidence across studies retrieved from PubMed, arXiv, ACL Anthology, IEEE Xplore, and Google Scholar that empirically evaluated LLMs in clinical settings using quantitative performance metrics, with risk of bias assessed using an adapted PROBAST framework for machine learning research. Findings show that LLMs achieve 60–90% accuracy on USMLE-style examinations, with leading models such as GPT-4 and Med-PaLM 2 reaching or surpassing passing thresholds, while in clinical documentation tasks they can reduce physician workload by approximately 30–50% in generating outputs such as discharge summaries, though human review remains consistently required. Performance in clinical decision support is more variable and specialty-dependent, and hallucination rates ranging from 5–30% have been reported, alongside persistent issues of bias and overconfidence in incorrect outputs. Overall, while LLMs demonstrate strong capabilities in structured medical knowledge tasks and documentation support, current limitations including hallucinations, bias, and lack of prospective clinical validation prevent safe autonomous deployment, making clinician oversight and robust safety safeguards essential for any clinical use.
Journal of Artificial Intelligence for Healthcare Systems
Review | Open access | 20 January 2026 | Article: 121

Large Language Model with Retrieval-Augmented Generation and Chain-of-Thought Reasoning for Differential Diagnosis Generation from Emergency Department Triage Notes and Vital Signs
This article proposes a conceptual framework for a diagnostic support system in emergency departments that leverages large language models, retrieval-augmented generation, and chain-of-thought reasoning. By combining triage notes and vital signs, the system generates a ranked differential diagnosis list to assist clinicians without replacing their judgment. The framework includes components like a triage note encoder, a vital sign encoder, a retrieval module, and a diagnosis ranker, using evidence from clinical guidelines, curated references, and de-identified prior cases. The approach grounds the model in authoritative knowledge while ensuring transparency and explainability in the diagnostic process. However, prospective validation, integration into workflows, and clinician oversight are crucial before implementation to ensure safety and effectiveness.
Journal of Artificial Intelligence for Healthcare Systems
Original Research | Open access | 20 July 2026 | Article: 129

Large Language Models for Clinical Trial Patient Screening and Recruitment: A Systematic Review of Zero-Shot, Few-Shot, and Fine-Tuned Approaches for Matching Eligibility Criteria to Electronic Health Records
Clinical trial recruitment is hindered by slow, costly, and labor-intensive processes, particularly due to the complexity of eligibility criteria often written in free text. This systematic review examines the use of large language models (LLMs) for matching clinical trial eligibility criteria to electronic health records (EHR). It evaluates zero-shot, few-shot, and fine-tuned LLM approaches, comparing their strengths, limitations, and deployment readiness in supporting patient-trial matching. Thirty-three studies published from 2017 to 2026 were included, with findings showing that zero-shot prompting is most adaptable for simple criteria, few-shot prompting offers consistent reasoning for ambiguous criteria, and fine-tuned models excel in task-specific performance but require labeled data and are less portable. The review concludes that no single approach is optimal for all trial screening tasks, and hybrid workflows combining various methods with human verification are most suitable for clinical use.
Journal of Artificial Intelligence for Healthcare Systems
Review | Open access | 20 July 2026 | Article: 141
Filters
Clear All

Subject
AI-driven Diagnostics Artificial Intelligence in Health Informatics Artificial Intelligence in Healthcare Big Data in Healthcare Clinical Data Mining Clinical Decision Support Systems Clinical Informatics Computer Vision Connected Health Systems Deep Learning Digital Health Digital Healthcare Innovation Digital Transformation in Healthcare Electronic Health Records Ethical AI in Healthcare Explainable AI Health Data Analytics Health Data Privacy Health Informatics Health Information Management Health Information Systems Health System Optimization Health Technology Assessment Healthcare Data Science Healthcare Informatics Healthcare Information Security Healthcare Management Healthcare Management Information Systems Intelligent Medical Systems Internet of Medical Things (IoMT) Interoperability in Healthcare Systems Machine Learning Medical Data Analytics Medical Data Management Medical Imaging Mobile Health (mHealth) Natural Language Processing Precision Medicine Predictive Analytics Remote Patient Monitoring Smart Healthcare Systems Telemedicine Wearable Health Technologies e-Health




Access type