A new InnoDB variable to control whether InnoDB FTS should ignore stopword list
https:/
Description:
Ngram indexes also check the stopwords list, to see if any indexed element *contain* one of the words on that list. This looks good and it is the normal behaviour, but I don't think that the default table is suitable to use it with ngram.
For example, any item that contains 'a' or 'i' will be ignored. So for example, if you have word "east", you cannot search for "ea" because it has been ignored.
Ngram should have a different default list of stopwords, or an empty list.
Suggestion:
Introduce a new 'innodb_
Notes:
Please also notice that although this variable is introduced to resolve ngram issues, it affects non-ngram FTS as well. It has absolutely the same meaning - if it's enabled, FTS won't be checking if current token is a stopword when building/updating an FTS index. However, being a stopword doesn't just mean to be a one of the predefined words from the list. Tokens shorter than 'innodb_
See also:
https:/
https:/
Blueprint information
- Status:
- Complete
- Approver:
- Laurynas Biveinis
- Priority:
- High
- Drafter:
- Yura Sorokin
- Direction:
- Approved
- Assignee:
- Yura Sorokin
- Definition:
- Approved
- Series goal:
- Accepted for 5.7
- Implementation:
-
Implemented
- Milestone target:
-
5.7.20-18
- Started by
- Yura Sorokin
- Completed by
- Yura Sorokin