[Tickets #10420] Re: Face similarity indexing is incorrect/broken
bugs at horde.org
Sat Sep 3 05:43:30 UTC 2011
DO NOT REPLY TO THIS MESSAGE. THIS EMAIL ADDRESS IS NOT MONITORED.
Ticket URL: http://bugs.horde.org/ticket/10420
------------------------------------------------------------------------------
Ticket | 10420
Updated By | Michael Rubinsky <mrubinsk at horde.org>
-Summary | wrong column names and non-standard compliant queries
+Summary | Face similarity indexing is incorrect/broken
Queue | Ansel
Version | Git master
Type | Bug
-State | Feedback
+State | Assigned
Priority | 1. Low
-Milestone |
+Milestone | 2.0
Patch |
-Owners |
+Owners | Michael Rubinsky
------------------------------------------------------------------------------
Michael Rubinsky <mrubinsk at horde.org> (2011-09-03 05:43) wrote:
I've fixed most of the issues you have brought up in this ticket.
However, the query dealing with the vector indexes I have left alone
for now. The person who originally wrote this part of the code has
been gone for a while now, but it looks like the way the index is
created is completely incorrect. For example:
Given a vector such as [abcdefgh] with a word length of 2, the
*correct* words should be:
[ab] [bc] [cd] [de] [ef] [fg] [gh]
not
[ab] [cd] [ef] [gh]
Also, these words need to be stored composed with the index position,
in the same field (See
http://download.pureftpd.org/pub/pure-ftpd/misc/libpuzzle/doc/README
for more technical detail if interested).
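
A minimal sketch of the word generation described above, written in Python
for illustration rather than in Ansel's own PHP. The function names and the
"position:word" storage format are assumptions made for this example; they
are not taken from Ansel or libpuzzle.

```python
def vector_words(vector, word_length):
    """Return (position, word) pairs for every overlapping word in `vector`.

    The window advances one element at a time, so a vector of length n
    yields n - word_length + 1 words (none if the vector is too short).
    """
    if word_length <= 0:
        raise ValueError("word_length must be positive")
    return [
        (position, vector[position:position + word_length])
        for position in range(len(vector) - word_length + 1)
    ]


def compose(position, word):
    """Join position and word into one field (hypothetical storage format)."""
    return f"{position}:{word}"


words = vector_words("abcdefgh", 2)
print([word for _, word in words])
# ['ab', 'bc', 'cd', 'de', 'ef', 'fg', 'gh']
print([compose(position, word) for position, word in words])
# ['0:ab', '1:bc', '2:cd', '3:de', '4:ef', '5:fg', '6:gh']
```

Storing the position with the word means that a lookup only matches when the
same word occurs at the same offset in both vectors.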
Combine this with the fact that the images being compared are
very small (just the face portion), and the current query has
very little chance of finding a similar face unless it is almost the
exact same image; beyond that, it's just pure luck.
This should be fixed before Ansel 2.0.