Object structure


On Asymmetric Matching between Sets


Raport Badawczy = Research Report ; RB/35/2014


Krawczak, Maciej. Autor ; Szkatuła, Grażyna Maria. Autor


Instytut Badań Systemowych. Polska Akademia Nauk ; Systems Research Institute. Polish Academy of Sciences

Place of publishing:


Date issued/created:



20 pages ; 21 cm ; Bibliography p. 20

Subject and Keywords:

Jaccard coefficient ; Sets’ perturbation ; Perturbacja zbiorów ; Tversky index ; Symbolic data analysis ; Indeks tverskego ; Analiza danych symbolicznych ; Sets' matching ; Dopasowanie zbiorów


A comparison of two objects may be viewed as an attempt to determine the degree to which they are similar or different in a given sense. Defining a good measure of proximity, or else similarity or dissimilarity between objects is very important in practical tasks as well as theoretical achievements. Each object is usually represented as a point in Cartesian coordinates, and therefore the distance between points reflects similarities between respective objects. In general, the space is assumed to be Euclidean, and a distance assigns a nonnegative number. From another point of view the concept of symmetry underlies essentially all theoretical treatment of similarity. Tversky (1977) provides empirical evidence of asymmetric similarities and argues that similarity should not be treated as a symmetric relation. According to Tversky’s consideration, an object is described by sets of features instead of geometric points in a metric space. In this paper we propose the new measure of remoteness between sets of nominal values. Instead of considering distance between two sets, we introduce the measures of perturbation of one set by another. The consideration is based on set-theoretic operations and the proposed measure describes changes of the second set after adding the first set to it, or vice versa. The measure of sets’ perturbation returns a value from [0, 1], and it must be emphasized that this measure is not symmetric in general. The difference between 1 and the sum of these two measures of perturbation of a pair of sets can be understood as Jaccard’s extended similarity measure. In this paper several mathematical properties of the measure of sets’ perturbation are studied, and interpretation of proximity is explained by the comparison of selected measures.


Raport Badawczy = Research Report

Resource type:


Detailed Resource Type:






Language of abstract:



Creative Commons Attribution BY 4.0 license

Terms of use:

Copyright-protected material. [CC BY 4.0] May be used within the scope specified in Creative Commons Attribution BY 4.0 license, full text available at: ; -

Digitizing institution:

Systems Research Institute of the Polish Academy of Sciences

Original in:

Library of Systems Research Institute PAS

Projects co-financed by:

Operational Program Digital Poland, 2014-2020, Measure 2.3: Digital accessibility and usefulness of public sector information; funds from the European Regional Development Fund and national co-financing from the state budget.



External content

This content is hosted outside the digital library.

Click the link below to view the content.

https://www.ibspan.waw.pl/~alex/OZwRCIN/WA777_137247_RB-2014-35_On Asymmetric Matching between Sets_content.pdf


Citation style: