Abstract: This paper addresses the integration of tags into the term weighting function for focused XML retrieval. Our model takes into account a specific kind of structural information: tags that represent logical structure (title, section, etc.) as well as tags related to formatting (bold font, centered text, etc.).

We first take into account the influence of tags by estimating, for each tag, the probability that it distinguishes the most relevant terms. These tag weights are then incorporated into the term weighting function using several combining schemes. Experiments on a large collection during the INEX 2008 XML IR evaluation campaign (INitiative for the Evaluation of XML Retrieval) showed that using tags leads to improvements in focused retrieval.
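
As an illustration of the two steps described above, the following is a minimal sketch and not the model evaluated in the paper. It assumes a training set of term occurrences labelled as relevant or not, estimates a smoothed per-tag probability of relevance, and then scales a base term weight by the weights of the enclosing tags. The function names, the Laplace smoothing, the neutral default of 0.5 for unseen tags, and the three combining schemes ("mean", "product", "max") are all hypothetical choices made for this example.

```python
from collections import Counter
from math import prod


def estimate_tag_weights(observations, smoothing=1.0):
    """Estimate, for each tag, a smoothed probability that a term
    occurring inside that tag is relevant.

    observations: iterable of (tag, is_relevant) pairs, one per term
    occurrence in a (hypothetical) labelled training collection.
    """
    relevant = Counter()
    total = Counter()
    for tag, is_relevant in observations:
        total[tag] += 1
        if is_relevant:
            relevant[tag] += 1
    # Laplace smoothing keeps rarely seen tags away from 0 and 1.
    return {
        tag: (relevant[tag] + smoothing) / (total[tag] + 2 * smoothing)
        for tag in total
    }


def combine(base_weight, tag_weights, tags, scheme="mean"):
    """Scale a base term weight by the weights of the tags enclosing it.

    The schemes below are illustrative assumptions, not the combining
    schemes used in the paper. Unseen tags get a neutral weight of 0.5.
    """
    factors = [tag_weights.get(tag, 0.5) for tag in tags]
    if not factors:
        return base_weight  # term not enclosed by any considered tag
    if scheme == "mean":
        return base_weight * sum(factors) / len(factors)
    if scheme == "product":
        return base_weight * prod(factors)
    if scheme == "max":
        return base_weight * max(factors)
    raise ValueError(f"unknown combining scheme: {scheme}")


# Toy labelled occurrences: terms in <title> are mostly relevant,
# terms in <b> are not.
observations = [("title", True)] * 3 + [("title", False)] + [("b", False)] * 2
weights = estimate_tag_weights(observations)
score = combine(2.0, weights, ["title", "b"], scheme="mean")
```

In a real system the base weight would come from the underlying retrieval model, and the choice of combining scheme would be validated experimentally.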

