search
HomeBackend DevelopmentPython Tutoriald[IA]gnosis: developing RAG applications with IRIS for Healt

With the introduction of vector data types and the Vector Search functionality in IRIS, a whole world of possibilities opens up for the development of applications and an example of these applications is the one that I recently saw published in a public contest by the Ministry of Health from Valencia in which they requested a tool to assist in ICD-10 coding using AI models.

How could we implement an application similar to the one requested? Let's see what we would need:

  1. List of ICD-10 codes, which we will use as context for our RAG application to search for diagnoses within the plain texts.
  2. A trained model that vectorizes the texts in which we are going to look for equivalences in the ICD-10 codes.
  3. The Python libraries necessary for the ingestion and vectorization of ICD-10 codes and texts.
  4. A friendly front-end that supports texts on which we look for possible diagnoses.
  5. Orchestration of requests received from the front-end.

What does IRIS provide us to cover the above needs?

  1. CSV import, either using the RecordMapper functionality or directly using Embedded Python.
  2. Embedded Python allows us to implement the Python code necessary to generate the vectors using the selected model.
  3. Publish REST APIs that will be invoked from the front-end application.
  4. Interoperability productions that allow tracking of information within IRIS.

Well, we only have to see the developed example:

d[IA]gnosis

Associated with this article you have access to the developed application, in the next articles we will see in detail how we implement each of the functionalities, from the use of the model, the storage of the vectors and the use of vector searches.

Let's review the application:

Importing ICD-10 codes

d[IA]gnosis: developing RAG applications with IRIS for Healt

From the configuration screen we are told the format that the CSV file must comply with the ICD-10 codes that we are going to import. The loading and vectorization process consumes a lot of time and resources, which is why the deployment of the Docker container configures not only the RAM memory usable by Docker but also the disk memory in case the requirements exceed the allocated RAM:

  # iris
  iris:
    init: true
    container_name: iris
    build:
      context: .
      dockerfile: iris/Dockerfile
    ports:
      - 52774:52773
      - 51774:1972
    volumes:
    - ./shared:/shared
    environment:
    - ISC_DATA_DIRECTORY=/shared/durable
    command: --check-caps false --ISCAgent false
    mem_limit: 30G
    memswap_limit: 32G

The file with the ICD-10 codes is available in the project path /shared/cie10/icd10.csv, once 100% is reached the application will be ready to be used.

In our application we have defined two different functionalities for diagnostic coding, one based on HL7 messages received in the system and another based on plain texts.

Diagnostic capture from HL7

The project contains some HL7 messages prepared for testing, it is only necessary to copy the /shared/hl7/messagesa01_en.hl7 file to the /shared/HL7In folder and the associated production will be responsible for extracting the diagnosis from it to display it in the web application:

d[IA]gnosis: developing RAG applications with IRIS for Healt

From the diagnosis requests screen we can see all the diagnoses received via HL7 messaging. To code them to ICD-10 we only need to click on the magnifying glass to show a list of those ICD-10 codes closest to the diagnosis received:

d[IA]gnosis: developing RAG applications with IRIS for Healt

Once selected, we will see the diagnosis and its associated ICD-10 code in the list. By clicking on the button with the envelope icon, a message will be generated using the original and including the new one selected within the diagnosis segment:

MSH|^~\&|HIS|HULP|EMPI||||ADT^A08|592956|P|2.5.1
EVN|A01|
PID|||1556655212^^^SERMAS^SN~922210^^^HULP^PI||GARCÍA PÉREZ^JUAN^^^||20150403|M|||PASEO PEDRO ÁLVAREZ 195 1 CENTRO^^LEGANÉS^MADRID^28379^SPAIN||555283055^PRN^^JUAN.GARCIA@YAHOO.COM|||||||||||||||||N|
PV1||N
DG1|1||O10.91^Unspecified pre-existing hypertension complicating pregnancy^CIE10-ES|Gestational hypertension||A||

This message can be found in the path /shared/HL7Out

Screenshots of diagnoses in plaintext

From the Text Analyzer option, the user can include plain text on which an analysis process will be carried out. The application will search in tuples of 3 lemmatized words (eliminating articles, pronouns and other less relevant words). Once analyzed, the system will show us the relevant underlined text and the possible diagnoses located:

d[IA]gnosis: developing RAG applications with IRIS for Healt

Once the analysis has been carried out, it can be consulted at any time from the analysis history.

Analysis history

All analyzes carried out are recorded and can be consulted at any time, being able to view all possible ICD-10 codes available:

In the next article...

We will see how, using Embedded Python, we use a specific LLM model for the vectorization of both the ICD-10 codes that we will use as context and the free texts.

If you have any questions or suggestions, do not hesitate to write a comment on the article.

The above is the detailed content of d[IA]gnosis: developing RAG applications with IRIS for Healt. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Are Python lists dynamic arrays or linked lists under the hood?Are Python lists dynamic arrays or linked lists under the hood?May 07, 2025 am 12:16 AM

Pythonlistsareimplementedasdynamicarrays,notlinkedlists.1)Theyarestoredincontiguousmemoryblocks,whichmayrequirereallocationwhenappendingitems,impactingperformance.2)Linkedlistswouldofferefficientinsertions/deletionsbutslowerindexedaccess,leadingPytho

How do you remove elements from a Python list?How do you remove elements from a Python list?May 07, 2025 am 12:15 AM

Pythonoffersfourmainmethodstoremoveelementsfromalist:1)remove(value)removesthefirstoccurrenceofavalue,2)pop(index)removesandreturnsanelementataspecifiedindex,3)delstatementremoveselementsbyindexorslice,and4)clear()removesallitemsfromthelist.Eachmetho

What should you check if you get a 'Permission denied' error when trying to run a script?What should you check if you get a 'Permission denied' error when trying to run a script?May 07, 2025 am 12:12 AM

Toresolvea"Permissiondenied"errorwhenrunningascript,followthesesteps:1)Checkandadjustthescript'spermissionsusingchmod xmyscript.shtomakeitexecutable.2)Ensurethescriptislocatedinadirectorywhereyouhavewritepermissions,suchasyourhomedirectory.

How are arrays used in image processing with Python?How are arrays used in image processing with Python?May 07, 2025 am 12:04 AM

ArraysarecrucialinPythonimageprocessingastheyenableefficientmanipulationandanalysisofimagedata.1)ImagesareconvertedtoNumPyarrays,withgrayscaleimagesas2Darraysandcolorimagesas3Darrays.2)Arraysallowforvectorizedoperations,enablingfastadjustmentslikebri

For what types of operations are arrays significantly faster than lists?For what types of operations are arrays significantly faster than lists?May 07, 2025 am 12:01 AM

Arraysaresignificantlyfasterthanlistsforoperationsbenefitingfromdirectmemoryaccessandfixed-sizestructures.1)Accessingelements:Arraysprovideconstant-timeaccessduetocontiguousmemorystorage.2)Iteration:Arraysleveragecachelocalityforfasteriteration.3)Mem

Explain the performance differences in element-wise operations between lists and arrays.Explain the performance differences in element-wise operations between lists and arrays.May 06, 2025 am 12:15 AM

Arraysarebetterforelement-wiseoperationsduetofasteraccessandoptimizedimplementations.1)Arrayshavecontiguousmemoryfordirectaccess,enhancingperformance.2)Listsareflexiblebutslowerduetopotentialdynamicresizing.3)Forlargedatasets,arrays,especiallywithlib

How can you perform mathematical operations on entire NumPy arrays efficiently?How can you perform mathematical operations on entire NumPy arrays efficiently?May 06, 2025 am 12:15 AM

Mathematical operations of the entire array in NumPy can be efficiently implemented through vectorized operations. 1) Use simple operators such as addition (arr 2) to perform operations on arrays. 2) NumPy uses the underlying C language library, which improves the computing speed. 3) You can perform complex operations such as multiplication, division, and exponents. 4) Pay attention to broadcast operations to ensure that the array shape is compatible. 5) Using NumPy functions such as np.sum() can significantly improve performance.

How do you insert elements into a Python array?How do you insert elements into a Python array?May 06, 2025 am 12:14 AM

In Python, there are two main methods for inserting elements into a list: 1) Using the insert(index, value) method, you can insert elements at the specified index, but inserting at the beginning of a large list is inefficient; 2) Using the append(value) method, add elements at the end of the list, which is highly efficient. For large lists, it is recommended to use append() or consider using deque or NumPy arrays to optimize performance.

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment