Linkedin

Soraia Alarcão and Manuel J. Fonseca work published in Multimedia Tools and Applications journal

Date: 26/08/2022

The paper “ExpertosLF: dynamic late fusion of CBIR systems using online learning with relevance feedback”, authored by LASIGE’s PhD student Soraia M. Alarcão has been published in the Multimedia Tools and Applications journal, a top-ranked journal (h5-index 87; Scimago Q1). This paper is co-authored by Vânia Mendonça (INESC-ID, Instituto Superior Técnico/ULisboa), Carolina Maruta (LEL, Faculdade de Medicina/ULisboa), and LASIGE’s integrated researcher Manuel J. Fonseca.

One of the main challenges in CBIR systems is to choose discriminative and compact features among dozens, to represent the images under comparison. Over the years, a great effort has been made to combine multiple features, mainly using early, late, and hierarchical fusion techniques. Unveiling the perfect combination of features is highly domain-specific and dependent on the type of image. Thus, the process of designing a CBIR system for new datasets or domains involves a huge experimentation overhead, leading to multiple fine-tuned CBIR systems. It would be desirable to dynamically find the best combination of CBIR systems without needing to go through such extensive experimentation and without requiring previous domain knowledge.

In this paper, the authors propose ExpertosLF, a model-agnostic interpretable late fusion technique based on online learning with expert advice, which dynamically combines CBIR systems without knowing a priori which ones are the best for a given domain. At each query, ExpertosLF takes advantage of user’s feedback to determine each CBIR contribution in the ensemble for the following queries. ExpertosLF produces an interpretable ensemble that is independent of the dataset and domain. Moreover, ExpertosLF is designed to be modular, and scalable.

Experiments on 13 benchmark datasets from the Biomedical, Real, and Sketch domains revealed that: (i) ExpertosLF surpasses the performance of state-of-the-art late-fusion techniques; (ii) it successfully and quickly converges to the performance of the best CBIR sets across domains without any previous domain knowledge (in most cases, fewer than 25 queries need to receive human feedback).

The paper is available here.