Centre for the Fourth Industrial Revolution Network 2023-2024

Page 18 of 45 · 2024 · WEF_Centre_for_the_Fourth_Industrial_Revolution_Network_2023-2024.pdf

18 Action: The participants running the pilot developed a tool that leverages GPT-4 as a knowledge base and uses an Mbaza English-Kinyarwanda translation model to empower CHWs in Rwanda. This pilot focused specifically on malnutrition, which remains a critical challenge in Rwanda.Case study 5: LLM tool for community health workers Rwanda Context: In Rwanda, community health workers (CHWs) are the backbone of primary healthcare. Yet, CHWs’ lack of clinical training leads to delayed diagnoses, preventable deaths and extreme strain on the public health system. Large language models (LLMs) are showing significant potential in closing knowledge gaps in healthcare, but for this to be deployed in Rwanda, LLMs must converse in Kinyarwanda. Result: Preliminary assessments have yielded promising results, underscoring the potential of LLMs to drive positive change in Rwanda’s healthcare landscape. Figure 1: Network outcomes Baseline 8% accuracy, as evaluated by professional linguists (that is, it did not comprehend Kinyarwanda at all; did not evaluate with healthcare workers).55% accuracy, as evaluated by professional linguists To note: GPT-4’s comprehension of Kinyarwanda has improved dramatically over the last three months. Open-source models still lack comprehension.71% functional accuracy, as evaluated by professional healthcare workers (that is, responses are comprehensible but have grammatical errors). After pilot Next steps: The next phase of rolling out this LLM tool includes two steps: 1 Evaluation of LLM datasets by collecting questions from CHWs and responses from both clinicians and LLMs. This dataset will be used to evaluate the performance of LLMs in a clinical setting.2 Conducting a silent trial of an AI-enhanced CHW app in Rwanda, with a primary focus on evaluating the performance of LLMs within the CHW setting. The trial will assess the LLMs’ ability to provide accurate and culturally relevant decision support for diagnosis and referral without altering CHW behaviour.
Ask AI what this page says about a topic: