tag:blogger.com,1999:blog-8623074010562846957.post1676984314378184332..comments2023-09-01T03:38:08.236-04:00Comments on Changing Bits: Lucene's new analyzing suggester Michael McCandlesshttp://www.blogger.com/profile/04277432937861334672noreply@blogger.comBlogger22125tag:blogger.com,1999:blog-8623074010562846957.post-6594468081180854582015-03-06T17:29:25.211-05:002015-03-06T17:29:25.211-05:00Hi mschipperheyn,
Could you ask this on Lucene...Hi mschipperheyn,<br /><br />Could you ask this on Lucene's user list (java-user@lucene.apache.org)?Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-48921345562243704072015-03-06T13:02:19.078-05:002015-03-06T13:02:19.078-05:00Can you comment on ThreadSafety? Can we just insta...Can you comment on ThreadSafety? Can we just instantiate a Suggester and then build, lookup, etc against a stored instance?mschipperheynhttps://www.blogger.com/profile/18084555874351799273noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-35325252880169467172015-03-01T13:06:40.315-05:002015-03-01T13:06:40.315-05:00The surface form is separately stored (as an FST o...The surface form is separately stored (as an FST output), even though matching is done based on the analyzed form.Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-61264516221098188562015-02-27T11:41:44.298-05:002015-02-27T11:41:44.298-05:00How can the AnalyzerSuggester return the original ...How can the AnalyzerSuggester return the original fieldvalue if the lookup is done by the analyzed value? Dependiing on the analyzer this is not a unique operation. Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-26702310188560584502014-08-25T04:09:09.713-04:002014-08-25T04:09:09.713-04:00Thanks for very good sample :)Thanks for very good sample :)Phu Hoang https://www.blogger.com/profile/13715421801057838107noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-24805654370692812022014-02-19T19:34:40.097-05:002014-02-19T19:34:40.097-05:00If you have suggestions in your index, as e.g. tex...If you have suggestions in your index, as e.g. text and weight etc. as stored fields in your documents, you can use the DocumentDictionary class to enumerate the suggestions from your documents. You pass that to AnalyzingSuggester.build to build the suggester.Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-69767321221115359642014-02-19T15:55:39.139-05:002014-02-19T15:55:39.139-05:00I'm also curious about source code. I'm h...I'm also curious about source code. I'm having trouble figuring out how to load an existing index to have the Suggester use (if that's even possible).Troy Dotyhttps://www.blogger.com/profile/06393759141266328336noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-37519848862873162962013-07-16T22:38:08.035-04:002013-07-16T22:38:08.035-04:00Hello, Puneet
Thanks for the great demo.
I am con...Hello, Puneet<br /><br />Thanks for the great demo.<br />I am confuse for the Japanese autocomplete and I see your demo can support the Japanese autocomplete.<br />Could you please share the configurations?<br />Thanks a lot.<br /><br />And, Mike, thanks for this article.Anonymoushttps://www.blogger.com/profile/05203590041348795900noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-17751999786494338642013-06-24T03:43:40.801-04:002013-06-24T03:43:40.801-04:00Any idea if the source is available for this demo?...Any idea if the source is available for this demo?Rickhttps://www.blogger.com/profile/04501722603473770946noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-85917454131368934862013-05-02T13:44:54.380-04:002013-05-02T13:44:54.380-04:00Puneet,
Nice demo! Thanks for sharing.Puneet,<br /><br />Nice demo! Thanks for sharing.Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-33930739642518373122013-05-02T10:18:15.279-04:002013-05-02T10:18:15.279-04:00http://luceneautosuggester-lucene.rhcloud.com/.. S...http://luceneautosuggester-lucene.rhcloud.com/.. Sample lucene auto suggester demo using analyzing suggester and fuzzy suggester.. Lucene autosuggester is awesome...Puneethttps://www.blogger.com/profile/16737734185513178048noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-90582812219200857842013-02-07T18:52:16.028-05:002013-02-07T18:52:16.028-05:00Now that I think about it, it makes sense why that...Now that I think about it, it makes sense why that didn't work ... I was quering on a non-stored, indexed (copy)field. So that's obviously the reason I got back the indexed value.Anonymoushttps://www.blogger.com/profile/13304656870072811291noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-46682335049644231042013-02-07T18:31:57.310-05:002013-02-07T18:31:57.310-05:00Phew, thanks for bringing closure Sebastian!Phew, thanks for bringing closure Sebastian!Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-86176746589414004252013-02-07T18:05:11.268-05:002013-02-07T18:05:11.268-05:00So to clarify it, AnalyzerSuggester needs to be us...So to clarify it, AnalyzerSuggester needs to be used on the field directly (no copyField or something), then it's working :)<br />So to query on the field "name" with filters specified in "text_asug" (lowercase, ascii,...) one would use:<br /><br />http://pastebin.com/tN9yXHB0Anonymoushttps://www.blogger.com/profile/13304656870072811291noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-24912833570605934782013-02-07T09:36:10.665-05:002013-02-07T09:36:10.665-05:00Hi Sebastian,
Can you send an email to solr-user@...Hi Sebastian,<br /><br />Can you send an email to solr-user@lucene.apache.org with these details? I'm not sure what's going on. That test case sure looks like it should work (ie return Têst Námè not test name).Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-44593590714133888312013-02-07T07:58:35.666-05:002013-02-07T07:58:35.666-05:00Hey Mike,
here's a pastebin: http://pastebin....Hey Mike,<br /><br />here's a pastebin: http://pastebin.com/20vSGJ1a<br /><br />After that I feed the document and do:<br />http://localhost:8080/solr/wiki/autosuggest?q=asug:test&spellcheck.build=true<br /><br />I get the "right outcome" for every possible query I tried, e.g. Têst, tést, TÈST,... Only problem is, that this seems to return the stored index value ("test name"), rather than the stored field value ("Têst Námè").<br /><br />Thanks!Anonymoushttps://www.blogger.com/profile/13304656870072811291noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-35715389583853202032013-02-07T06:53:09.932-05:002013-02-07T06:53:09.932-05:00Hi Sebastian,
That's very odd ... it should a...Hi Sebastian,<br /><br />That's very odd ... it should always return the original field value, not the analyzed form. Can you make a small set of names showing the issue?Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-55233185221219382272013-02-07T06:24:43.981-05:002013-02-07T06:24:43.981-05:00Hello Mike,
I've been playing around with this...Hello Mike,<br />I've been playing around with this the last days and I think I almost got it working - maybe you know what needs to be changed :)<br /><br />I have a copyField "asug", which copies from "name" (this has accents in it), it is a custom fieldType with a KeywordTokenizer (since I want the whole term to be returned), lowercase and ascii-folding, for both index and query.<br /><br />It seems that it always returns the indexed value, rather than the actual field value - so "ame.." gives me back "amelie".<br /><br />I randomly got the right value back by feeding without index-analyzers and restarting with index-analyzers. Any ideas?<br />Anonymoushttps://www.blogger.com/profile/13304656870072811291noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-978903034637446732013-01-30T13:28:23.148-05:002013-01-30T13:28:23.148-05:00Hi Aditya,
I don't know of any examples ... b...Hi Aditya,<br /><br />I don't know of any examples ... but maybe look at its unit test? https://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_4_1/lucene/suggest/src/test/org/apache/lucene/search/suggest/analyzing/AnalyzingSuggesterTest.javaMichael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-27400790315092381652013-01-30T01:02:09.765-05:002013-01-30T01:02:09.765-05:00Can you please point me to an example?Can you please point me to an example?Adityanoreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-43926005671453318012012-12-30T06:54:04.017-05:002012-12-30T06:54:04.017-05:00Hi orzech,
lookup() is thread safe!Hi orzech,<br /><br />lookup() is thread safe!Michael McCandlesshttps://www.blogger.com/profile/04277432937861334672noreply@blogger.comtag:blogger.com,1999:blog-8623074010562846957.post-22316305656834553392012-12-27T07:38:38.218-05:002012-12-27T07:38:38.218-05:00Thank you for this article! I've just started ...Thank you for this article! I've just started using Lucene's suggesters to implement autocomplete feature in my project. So far everything works great. <br /><br />Though, I'd be thankful if you could tell me if AnalyzingSuggester and FuzzySuggester are thread-safe when using lookup() method? I couldn't find this information anywhere.<br /><br />Kind regardsorzechhttps://www.blogger.com/profile/01093100890824433420noreply@blogger.com