
Why is index-time synonym expansion considered better than query-time synonym expansion?


For a variety of reasons, it is usually better to do this at index-time:
• A synonym containing multiple words (example: i pod) isn’t recognized correctly at query-time because the query parser tokenizes on whitespace.
• The IDF component of Lucene’s scoring algorithm will be much higher for documents matching a rarely occurring synonym than for documents matching its more common equivalents. This reduces the scoring effectiveness.
• Prefix, wildcard, and fuzzy queries aren’t analyzed, and thus won’t match synonyms.

Why is index-time synonym expansion considered less flexible than query-time synonym expansion?

Index-time synonym expansion is less flexible than query-time synonym expansion because changes to the synonyms require a complete re-index to take effect. Moreover, the index will get larger if you do index-time expansion.

It’s plausible to imagine the issues above being rectified at some point. However, until then, index-time is usually best.
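To make the multi-word and IDF points concrete, here is a toy Python sketch. It does not use Lucene or Solr at all: the synonym group, the helper names, the sample documents, and the simplified `log(N / df)` formula are all assumptions made up for illustration, not Lucene's actual analyzer or scoring code.

```python
import math

# Hypothetical synonym group; all variants are treated as equivalent.
SYNONYMS = [("ipod", "i-pod", "i pod")]

def tokenize(text):
    """Split on whitespace, as the answer says the query parser does."""
    return text.lower().split()

def contains_phrase(tokens, phrase):
    """True if the words of `phrase` occur contiguously in `tokens`."""
    words = phrase.split()
    n = len(words)
    return any(tokens[i:i + n] == words for i in range(len(tokens) - n + 1))

def expand_per_token(tokens):
    """Query-time style: each whitespace-separated token is looked up alone."""
    terms = set()
    for tok in tokens:
        group = next((g for g in SYNONYMS if tok in g), (tok,))
        terms.update(group)
    return terms

def expand_whole_text(text):
    """Index-time style: the expansion step sees the full token stream."""
    tokens = tokenize(text)
    terms = set(tokens)
    for group in SYNONYMS:
        if any(contains_phrase(tokens, variant) for variant in group):
            terms.update(group)
    return terms

def idf(term, docs):
    """Simplified inverse document frequency: log(N / df). Not Lucene's formula."""
    df = sum(1 for terms in docs if term in terms)
    return math.log(len(docs) / df)

# Made-up corpus: the common spelling appears 8 times, the rare one once.
raw_docs = ["ipod"] * 8 + ["i-pod"] + ["zune"]
plain = [set(tokenize(d)) for d in raw_docs]          # no expansion at index time
expanded = [expand_whole_text(d) for d in raw_docs]   # synonyms added at index time

print("query-time terms for 'i pod':", expand_per_token(tokenize("i pod")))
print("index-time terms for 'i pod case':", expand_whole_text("i pod case"))
print("idf without expansion:", idf("ipod", plain), idf("i-pod", plain))
print("idf with expansion:   ", idf("ipod", expanded), idf("i-pod", expanded))
```

In this sketch the per-token lookup never sees "i pod" as a unit, so the query gains no synonyms, while the whole-text expansion does. Likewise, without index-time expansion the rare spelling gets a much larger IDF than the common one, and once every variant is indexed for every matching document the two values become equal.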
