GitHub - ErasmusMC-Bioinformatics/AI4HIV-LangZhou: Code repo for the HIV screening project with Erasmus MC

This repository contains code for my Master thesis: Automated HIV Screening on Dutch EHR with Large Language Models, at Erasmus University Medical Center and Vrije Universiteit Amsterdam.

The ErasmusHIV dataset and other data used in the thesis are not openly available due to their sensitive nature.

Abstract

Efficient screening and early diagnosis of HIV are critical for reducing onward transmission. Although large-scale laboratory testing is not feasible, the widespread adoption of Electronic Health Records (EHRs) offers new opportunities to address this challenge. Existing research primarily focuses on applying machine learning methods to structured data, such as patient demographics, for improving HIV diagnosis. However, these approaches often overlook unstructured text data such as clinical notes, which potentially contain valuable information relevant to HIV risk. In this study, we propose a novel pipeline that leverages Large Language Model (LLM) to analyze unstructured EHR text and determine a patient’s eligibility for further HIV testing. Experimental results on clinical data from Erasmus MC demonstrate that our pipeline achieves high accuracy while maintaining a low false negative rate.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
visualization		visualization
.gitignore		.gitignore
Gemma.py		Gemma.py
README.md		README.md
benchmark_creation.py		benchmark_creation.py
experimental.py		experimental.py
filter.py		filter.py
fine_tune.py		fine_tune.py
fine_tune_simple.py		fine_tune_simple.py
merge.py		merge.py
model_list.py		model_list.py
prompt.py		prompt.py
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Abstract

About

Uh oh!

Languages

ErasmusMC-Bioinformatics/AI4HIV-LangZhou

Folders and files

Latest commit

History

Repository files navigation

Abstract

About

Resources

Uh oh!

Stars

Watchers

Forks

Languages