Scientific Challenge 2025
Predicting Length of Stay and Patient Outcome

Challenge Description

Welcome to the IFMBE Scientific Challenge Competition at the IUPESM World Congress on Medical Physics and Biomedical Engineering 2025 (IUPESM 2025). In this edition, participants are required to predict patients' length of stay and respective clinical outcome based on a set of parameters collected at the time of hospital admission.

The development and validation of practical and reliable risk assessment tools is central for clinical decision-making. At an early stage, they can be used to assist clinicians in stratification strategies, guide the type of intervention, optimize hospital bed monitoring, and matching the intensity of therapy with the individual patient’s risk. A precise risk assessment model or score is decisive in the identification of patients at high risk, in whom invasive strategies may improve the outcome and to identify patients at low risk, to whom little to no benefits are expected from potentially hazardous and costly treatments.

In the present challenge participants should develop a risk tool to estimate the length of stay (in days) and the type of hospital discharge (death or survival).

How to Participate

Participants must follow these steps to join the challenge:

1| Challenge Registration

2| Conference Submission

Important Notes:

Dataset Description

LAIS

The dataset was collected under the Project Regula SESAP-RN/FUNCERN, grant number 69/2021, from October 2021 to January 2024, by the of Technological Innovation in Health (LAIS) at the Federal University of Rio Grande do Norte (UFRN) in cooperation with the Secretary of Public Health of Rio Grande do Norte, Brazil.

For this challenge, a subset of N = 13.415 records was considered, each containing 20 input features and two target variables (outputs). The features are categorized into four main groups of information: i) Request, ii) Admission, iii) Patient data, and iv) Clinical information (collected at hospital admission) (see dataset description).

Group Feature Description / Units
RequestRequest dateDate on which a bed request was registered
RequestRequest typeAge group of bed requested for the patient: {Adult, Pediatric}
RequestRequested bed typeType of occupancy: {Ward, Intensive Care Unit}
AdmissionAdmission dateDate when patient was admitted
AdmissionAdmission bed typeType of bed: {Ward, Intensive Care Unit}
AdmissionAdmission Health unitHospital admitting the patient
PatientGender{Female, Male}
PatientAgePatient age
PatientPatient’s federal unitAcronym for the federal unit
ClinicalICD codeICD-10 diagnosis code
ClinicalBlood pressureSystolic/Diastolic (mmHg)
ClinicalGlasgow Coma ScaleScale from 3 to 15
ClinicalHematocritUnits (%)
ClinicalHemoglobinUnits g/dL
ClinicalLeukocytesUnitscells/mm³
ClinicalLymphocytesUnits (%)
ClinicalUreaUnits mg/dL
ClinicalCreatinineUnits mg/dL
ClinicalPlateletsUnits 10³/μL
ClinicalDiuresisUnits mL/day

Note: Some of the variable values in the dataset may be inaccurate. The reliability and accuracy of the data is not guaranteed.

The target variables are:

Group Feature Description / Units
TargetOutcome typePossible values are: {Death, Survival}
TargetLength of stayNumber of days in hospital

Disclaimer: The dataset used in this challenge is for research purposes only and should not be used for commercial applications.

Evaluation Criteria

Model performance is assessed based on three scores:

As a result, the team with the lowest GLscore will win the competition.

Final submissions must include a report and an executable or prediction file.

Dataset Files and Examples (Training / Validation / Testing)

1| Description

For reading the training and testing datasets please consider the following folder structure.

…\IUPESM2025\trainData.csv
…\IUPESM2025\valData.csv
…\IUPESM2025\testData.csv

Note: The trainData.csv and valData.csv file will be provided in CSV format.

The testData.csv dataset has the same structure as valData.csv but contains a different number of examples (records). The testData.csv file will not be available to participants and will be used to compute the global score.

2| Example in Python

We assume that your main program is in the same folder as the data files.

2.1| Training and Validation

Using the trainData.csv and valData.csv you should develop a model to estimate the length of stay and the type of discharge. The result of your program must consists of the model used and two vectors of dimension N, in csv format, to be save in the same folder as:


            import pandas as pd
            import numpy as np
            from sklearn.linear_model import LogisticRegression, LinearRegression
            
            # Data Loading
            df = pd.concat([pd.read_csv("train_dataset.csv"),
                            pd.read_csv("val_dataset.csv")])

            # Input features and target variables
            X = df[features]
            y_DT = df['Outcome']
            y_LS = df['LengthOfStay']
            
            # Model Training and Validation
            model_DT = LogisticRegression().fit(X, y_DT)
            model_LS = LinearRegression().fit(X, y_LS)
            
            # Models Results
            np.savetxt("DTestimation.csv", model_DT.predict(X), fmt='%d', delimiter=',')
            np.savetxt("LSestimation.csv", model_LS.predict(X).round(0), fmt='%d', delimiter=',')
            

2.2| Testing

Your program should be placed in the ..\IPUEMS25\ folder, the same folder where the testData.csv data file is located.


            import pandas as pd
            import numpy as np
            import joblib
            
            # Data Loading
            df = pd.read_csv("test_dataset.csv")
            X = df[features]
            
            # Model Loading
            model_DT = joblib.load("discharge_model.pkl")
            model_LS = joblib.load("lengthstay_model.pkl")
            
            # Model Predictions
            np.savetxt("DTestimation.csv", model_DT.predict(X), fmt='%d', delimiter=',')
            np.savetxt("LSestimation.csv", model_LS.predict(X), fmt='%d', delimiter=',')
            

Contact Information

If you have any questions, contact us at:

Email: sciupesm2025@dei.uc.pt

Official Website: IUPESM 2025

Frequently Asked Questions (FAQ)

Anyone affiliated with an academic or research institution can participate. Teams can have up to 5 members.

Send an email to sciupesm2025@dei.uc.pt with your team name, full names of all members, institutional emails & affiliations, and confirmation of agreement to participate.

The registration deadline is July 18, 2025.

The data becomes available as soon as the team's application is accepted. Participants will receive the data by email.

Submissions will be evaluated using three scores: DTscore (based on F1-score for outcome prediction), LSscore (mean absolute error for length of stay), and GLscore (the sum of both).

Good Luck!