[hibernate-dev] RE: Hibernate Search Spell Checker

John Griffin jgriffin at thebluezone.net
Sat Sep 13 16:22:49 EDT 2008


Leave the sf.getDictionary("main-dictionary").spellcheck(word, suggestion,
entity, properties...); until a later version if at all. I'll look at your
suggestions and compare them to how I was thinking of going about it. I
believe it should be controlled exclusively by annotations and the multiple
fields and dictionaries is in line with my thoughts.

Couldn't possibly be done for the book so I'll put a NOTE in saying we're
working on a better :>) solution for Hibernate Search. I really want to work
on this since I've been away from the project for so long. It WILL take more
than 3 or 4 days since I have limited time for working on the source.

Do you want me to put a Jira issue in on this?
Did you get my full statement on section 7.3?

John Griffin
http://thediningphilosopher.blogspot.com

-----Original Message-----
From: hibernate-dev-bounces at lists.jboss.org
[mailto:hibernate-dev-bounces at lists.jboss.org] On Behalf Of
hibernate-dev-request at lists.jboss.org
Sent: Saturday, September 13, 2008 10:00 AM
To: hibernate-dev at lists.jboss.org
Subject: hibernate-dev Digest, Vol 27, Issue 8

Send hibernate-dev mailing list submissions to
	hibernate-dev at lists.jboss.org

To subscribe or unsubscribe via the World Wide Web, visit
	https://lists.jboss.org/mailman/listinfo/hibernate-dev
or, via email, send a message with subject or body 'help' to
	hibernate-dev-request at lists.jboss.org

You can reach the person managing the list at
	hibernate-dev-owner at lists.jboss.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of hibernate-dev digest..."


Today's Topics:

   1. Spellchecker in HSearch (Emmanuel Bernard)


----------------------------------------------------------------------

Message: 1
Date: Sat, 13 Sep 2008 10:19:41 -0400
From: Emmanuel Bernard <emmanuel at hibernate.org>
Subject: [hibernate-dev] Spellchecker in HSearch
To: John Griffin <griffij at sosstaffing.com>,	John Griffin
	<jgriffin at thebluezone.net>
Cc: hibernate-dev at lists.jboss.org
Message-ID: <878228FD-9EC3-4948-A363-D9C4F41D92F9 at hibernate.org>
Content-Type: text/plain; charset="us-ascii"

OK I looked at the SpellChecker code from the Lucene contrib) and  
there is no trivial way to integrate it more nicely into Hibernate  
Search. They did not really plan for flexibility.

But it's doable to fork this code and make a spellchecker feature in  
Hibernate Search. Here are a few limitations they have that we need to  
remove. Here is a proposal.
  - more than one field and even more than one index should be the  
source of the dictionary
  - declaring which field is part of a dictionary should be  
declarative @AddToDictionary(name="main-dictionary") (with probably an  
optional field name(s))
  - for a given dictionary name, we need to associate an index and  
probably a field prefix (to share the same index for all  
dictionaries). This couold be by annotations or in the configuration  
(not sure)
  - this index should be kept like any other DirectoryProvider in the  
HSearch intrastructure (we could then reuse ReaderProvider internally  
and don't have leaks).
  - we will probably add a manual dictionary reindex method and the  
SpellChecker API should be in the SearchFactory.
     sf.getDictionary("main-dictionary").spellcheck(word, suggestion);
     sf.getDictionary("main-dictionary").spellcheck(word, suggestion,  
entity, properties...); //not sure how useful is this one really
     sf.getDictionary("main-dictionary").reindex();

Of course this needs to be refined.

This is probably 3 or 4 days of work but unfortunately we cannot do  
that for the book it seems. I would be wort mentioning in the section  
that a more integrated solution in on it's way.

--
Emmanuel Bernard
http://in.relation.to/Bloggers/Emmanuel | http://blog.emmanuelbernard.com 
  | http://twitter.com/emmanuelbernard
Hibernate Search in Action (http://is.gd/Dl1)

On  Sep 12, 2008, at 09:15, John Griffin wrote:

>
> Here's the file as a zip. Maybe the pdf wasn't passing.
>
> Also, Nermina pointed out to me that I skipped section 7.3. So ANY  
> reference
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://lists.jboss.org/pipermail/hibernate-dev/attachments/20080913/f073f115
/attachment-0001.html

------------------------------

_______________________________________________
hibernate-dev mailing list
hibernate-dev at lists.jboss.org
https://lists.jboss.org/mailman/listinfo/hibernate-dev


End of hibernate-dev Digest, Vol 27, Issue 8
********************************************




More information about the hibernate-dev mailing list