[hibernate-issues] [Hibernate-JIRA] Created: (HSEARCH-867) input stream support

adam (JIRA) noreply at atlassian.com
Tue Aug 23 20:56:02 EDT 2011


input stream support
--------------------

                 Key: HSEARCH-867
                 URL: http://opensource.atlassian.com/projects/hibernate/browse/HSEARCH-867
             Project: Hibernate Search
          Issue Type: Improvement
          Components: analyzer, integration
    Affects Versions: 3.4.0.Final
            Reporter: adam
            Priority: Minor


The current hibernate search functionality is not optimized for dealing with large text contents.  Two use cases:

1. indexing an external PDF that's 100MB where an @Field is set on a getter
2. indexing a @Lob field

in both cases, the method must return a string, or a base class, which might mean that you have an InputStream that's 50MB, which gets concatenated into a string, and then passed to an analyzer bundled into a Reader object.  I'm unclear what HibernateSearch is doing when the getter for the @Field annotation is called, but it would be ideal if it could use a reader instead of a string 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        


More information about the hibernate-issues mailing list