[Tickets #10420] Re: Face similarity indexing is incorrect/broken

bugs at horde.org bugs at horde.org
Sat Sep 3 05:43:30 UTC 2011


DO NOT REPLY TO THIS MESSAGE. THIS EMAIL ADDRESS IS NOT MONITORED.

Ticket URL: http://bugs.horde.org/ticket/10420
------------------------------------------------------------------------------
  Ticket             | 10420
  Updated By         | Michael Rubinsky <mrubinsk at horde.org>
-Summary            | wrong column names and non-standard compliant queries
+Summary            | Face similarity indexing is incorrect/broken
  Queue              | Ansel
  Version            | Git master
  Type               | Bug
-State              | Feedback
+State              | Assigned
  Priority           | 1. Low
-Milestone          |
+Milestone          | 2.0
  Patch              |
-Owners             |
+Owners             | Michael Rubinsky
------------------------------------------------------------------------------


Michael Rubinsky <mrubinsk at horde.org> (2011-09-03 05:43) wrote:

I've fixed most of the issues you have brought up in this ticket.  
However, the query dealing with the vector indexes I have left alone  
for now. The person that originally wrote this part of the code has  
been gone for a while now, but it looks like the way the index is  
created is completely incorrect e.g.,

Given a vector such as [abcdefgh] with a word length of 2, the  
*correct* words should be:
[ab] [bc] [cd] [de] [ef] [fg] [gh]

not

[ab] [cd] [ef] [gh]

Also, these words need to be stored composed with the index position,  
in the same field (See  
http://download.pureftpd.org/pub/pure-ftpd/misc/libpuzzle/doc/README  
for more technical detail if interested).

Combine this with the fact that the images that are being compared are  
very small (just the face portion), and the current query really has  
very little chance of finding a similar face, unless it almost the  
exact same image - after that, it's just pure luck.

This should be fixed before Ansel 2.0





More information about the bugs mailing list