[rules-dev] soundslike: report on phonetic matching

Wolfgang Laun wolfgang.laun at gmail.com
Fri Oct 15 11:10:30 EDT 2010


On 15 October 2010 15:22, jschmied <nabble at juergenschmied.de> wrote:

>
> Hi!
>
> I have way better results with DoubleMetaphone. It's available in apache
> commons codec.
>

Indeed, although testing for 100 different contexts of the letter 'C' alone
sounds quite heavy.


>
> You need a two step approach:
>
> First select the canidates with phonetic keys (Soundex/DoubleMetaphone)
> Then filter all canidates with the original search value with something
> like
> JaroWinkler and a defined threshold.
>

But the "soundslike" operator in Drools is a binary relational operator. I
cannot relate your two-step approach to that. Drools currently has no
intent of exposing the word-to-key algorithm. But who knows - if there's
good cause...

-W


>
> juergen
> --
> View this message in context:
> http://drools-java-rules-engine.46999.n3.nabble.com/soundslike-report-on-phonetic-matching-tp1707485p1707757.html
> Sent from the Drools - Dev mailing list archive at Nabble.com.
> _______________________________________________
> rules-dev mailing list
> rules-dev at lists.jboss.org
> https://lists.jboss.org/mailman/listinfo/rules-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.jboss.org/pipermail/rules-dev/attachments/20101015/d4428296/attachment.html 


More information about the rules-dev mailing list