An open-source toolkit for visualizing benchmarking results
The mission of Rankings Reloaded is to offer an open-source toolkit for robust and accurate uncertainty analysis and visualization of algorithm performance. Rankings Reloaded enables researchers to conduct fair benchmarking by revealing each algorithm’s true strengths and weaknesses.
The rapidly evolving field of machine learning (ML) is marked by ever faster development of new algorithms. Given this pace, robust and reliable validation of algorithm performance is becoming increasingly important. International benchmarking competitions (“challenges”) have become the gold standard for benchmarking in ML, but are subject to frequent flaws in analysis and reporting [1]. Rankings Reloaded was developed to address these issues and empower researchers to conduct meaningful performance comparisons while avoiding common pitfalls.
Rankings Reloaded is a user-friendly, ready-to-use, open-source framework for comprehensive uncertainty analysis in algorithm benchmarking. Building upon challengeR [2], Rankings Reloaded helps researchers identify strengths and weaknesses of algorithms in both individual benchmarking experiments and large-scale challenges, supporting single-task as well as multi-task scenarios. By eliminating the need for complex installations, it makes powerful analyses accessible to developers unfamiliar with the R language.
About Rankings Reloaded

1. Upload your score data in CSV format (sample data). The data must contain results for every case (image); a minimal completeness check is sketched after these steps.
2. Choose your ranking method from the metric-based, case-based, or significance ranking options.
3. Choose whether to apply bootstrapping, which is used to investigate ranking uncertainty (recommended); the ranking and bootstrap sketch after these steps illustrates the idea.
4. Provide a few final details needed for your report. Then you are ready to download it.
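A natural layout for such score data is a long-format table with one metric value per algorithm and case. The column names used below (task, case_id, algorithm, metric_value) and the file name scores.csv are illustrative assumptions rather than the tool's required schema, so follow the provided sample data for the exact format. A minimal Python sketch for checking that every algorithm reports a result for every case:

    import pandas as pd

    # Illustrative column names; follow the tool's sample data for the exact schema.
    scores = pd.read_csv("scores.csv")  # columns assumed: task, case_id, algorithm, metric_value

    # Every algorithm should report a metric value for every case of every task.
    for task, task_df in scores.groupby("task"):
        all_cases = set(task_df["case_id"])
        for algorithm, algo_df in task_df.groupby("algorithm"):
            missing = all_cases - set(algo_df["case_id"])
            if missing:
                print(f"{task}/{algorithm}: missing {len(missing)} case(s), e.g. {sorted(missing)[:5]}")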
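The ranking options differ mainly in when aggregation happens: metric-based ranking aggregates the metric across cases and then ranks the algorithms ("aggregate then rank"), whereas case-based ranking ranks the algorithms on each case and then aggregates the per-case ranks ("rank then aggregate"); significance ranking instead derives the ranking from pairwise statistical tests. The sketch below illustrates the first two options and a simple bootstrap over cases for probing ranking stability. It is a conceptual illustration only, not the tool's implementation; it reuses the illustrative column names from the previous sketch and assumes that higher metric values are better.

    import numpy as np
    import pandas as pd

    rng = np.random.default_rng(0)
    scores = pd.read_csv("scores.csv")                   # illustrative schema, as above
    task_df = scores[scores["task"] == "segmentation"]   # single-task example; task name is hypothetical

    def metric_based_ranking(df):
        # "Aggregate then rank": average the metric per algorithm, then rank the averages.
        mean_metric = df.groupby("algorithm")["metric_value"].mean()
        return mean_metric.rank(ascending=False)          # rank 1 = best (highest) mean metric

    def case_based_ranking(df):
        # "Rank then aggregate": rank algorithms within each case, then average the per-case ranks.
        per_case_rank = df.groupby("case_id")["metric_value"].rank(ascending=False)
        mean_rank = per_case_rank.groupby(df["algorithm"]).mean()
        return mean_rank.rank(ascending=True)             # rank 1 = best (lowest) mean rank

    def bootstrap_rank_distribution(df, ranking_fn, n_boot=1000):
        # Resample test cases with replacement and re-rank, to see how stable the ranking is.
        case_groups = {c: g for c, g in df.groupby("case_id")}
        case_ids = list(case_groups)
        ranks = []
        for _ in range(n_boot):
            drawn = rng.choice(case_ids, size=len(case_ids), replace=True)
            parts = []
            for i, c in enumerate(drawn):
                part = case_groups[c].copy()
                part["case_id"] = f"bootstrap_case_{i}"   # keep repeated draws of a case distinct
                parts.append(part)
            ranks.append(ranking_fn(pd.concat(parts, ignore_index=True)))
        return pd.DataFrame(ranks)                        # rows: bootstrap samples, columns: algorithms

    boot_ranks = bootstrap_rank_distribution(task_df, metric_based_ranking)
    print(boot_ranks.describe())     # spread of each algorithm's rank across resamples
    print((boot_ranks == 1).mean())  # fraction of resamples in which each algorithm ranks first (ties ignored)

In practice you do not need to write any of this yourself: the report generated by Rankings Reloaded performs and visualizes this kind of uncertainty analysis for you. The sketch is only meant to convey what the ranking and bootstrapping options estimate.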
"Rankings Reloaded" is based on the publication "Methods and Open Source Toolkit for Analyzing and Visualizing Challenge Results." The goal of the paper is to suggest methods and provide an open-source framework for systematically analyzing and visualizing benchmarking results, including ranking uncertainty analysis. This approach aims to offer valuable insights for challenge organizers, participants, and individual researchers, helping them understand algorithm performance and validate datasets more intuitively. The paper covers various analysis and visualization techniques.
Please cite our paper if you use our online tool. For more information: