Clustering and Relational Ambiguity: from Text Data to Natural DataReport as inadecuate




Clustering and Relational Ambiguity: from Text Data to Natural Data - Download this document for free, or read online. Document in PDF available to download.

1 INRA

Abstract : Text data is often seen as -take-away- materials with little noise and easy to process information. Main questions are how to get data and transform them into a good document format. But data can be sensitive to noise oftenly called ambiguities. Ambiguities are aware from a long time, mainly because polysemy is obvious in language and context is required to remove uncertainty. I claim in this paper that syntactic context is not suffisant to improve interpretation. In this paper I try to explain that firstly noise can come from natural data themselves, even involving high technology, secondly texts, seen as verified but meaningless, can spoil content of a corpus; it may lead to contradictions and background noise.

keyword : computational linguistics





Author: Nicolas Turenne -

Source: https://hal.archives-ouvertes.fr/



DOWNLOAD PDF




Related documents