Interactive MMR

Satvik Chekuri, Yuan Li, Daniel Manesh, and Syed Muhammad Farhan

Extractive

accuracy

diversity

Worflow

Assume that we are given a database of 5 documents d_i and a query q, and we calculated, given a symmetrical similarity measure, the similarity values as below. Further assume that λ is given by the user to be 0.5:

Initially our result set S is empty. Therefore the second half of the equation, which is the max pairwise similarity within S, will be zero. For the first iteration, MMR equation reduces to:
MMR = arg max (sim (d_i, q))
d₁ has the maximum similarity with q, therefore we pick it and add it to S. Now, S = {d₁}.
Since S = {d₁}, finding the maximum distance to an element in S to a given d_i is simply sim(d₁,d_i).
For d₂:
     sim(d₁, d₂) = 0.11
     sim (d₂, q) = 0.90
     Then MMR = 0.90 – (1-λ)0.11 = 0.4225
Similarly MMR values for d₃, d₄, d₅ are 0.135, -0.35 and 0.19 respectively. Since d₂ has the maximum MMR, we add it to S. Now S = {d₁, d₂}.
This time S = {d₁, d₂}. We should find max of sim (d_i, d₁) and sim (d_i, d₂) for the second part of the equation.
For d₃:
     max{sim (d₁, d₃), sim (d₂, d₃)} = max {0.23, 0.29} = 0.29
     sim (d₃, q) = 0.50
     Then MMR = 0.5*0.5 - 0.5*0.29 = -0.0725
Similarly MMR values for d₄ and d₅ are -0.35 and 0.06 respectively. Since d₂ has the maximum MMR, we add it to S. Now S = {d₁, d₂}.
d₃ has the maximum MMR, therefor S = {d₁, d₂, d₃}.
If we didn't have diversity at all (λ = 1), then our S would have been {d₁, d₂, d₅}.

Contact: {satvikchekuri, yli92, danielmanesh, syedfarhan}@vt.edu