Visual Graph Analysis for Quality Assessment of Manually Labelled Documents Image DatabaseReport as inadecuate




Visual Graph Analysis for Quality Assessment of Manually Labelled Documents Image Database - Download this document for free, or read online. Document in PDF available to download.

1 LaBRI - Laboratoire Bordelais de Recherche en Informatique

Abstract : The context of this paper is the labelling of a document image database in an industrial process. Our work focuses on the quality assessment of a given labelled database. In most practical cases, a database is manually labelled by an operator who has to browse sequentially the images presented as thumbnails until the whole database is labelled. This task is very repetitive; moreover the filing plan defining the names and number of classes is often incomplete, which leads to many labelling errors. The question is then to certify if the quality of a labelled batch is good enough to globally accept it. Our objective is to ease and speed up that evaluation that needs up to 1.5 more times than the labelling work itself. We propose an interactive tool for visualizing the data as a graph. That graph enhances similarities between documents as well as the labelling quality. We define criteria on the graph that characterize the three types of errors an operator can do: an image is mislabelled, one class should be split in more pertinent subclasses, several classes should be merged in another. This allows us to focus the operator attention on potential errors. He can then count the errors encountered while auditing the database and assess or not the global labelling quality.





Author: Romain Giot - Romain Bourqui - Nicholas Journet - Anne Vialard -

Source: https://hal.archives-ouvertes.fr/



DOWNLOAD PDF




Related documents