Issue search problems #2
Labels
No Label
Service
Buildbot
Service
Chat
Service
Gitea
Service
Translate
Type
Bug
Type
Config
Type
Deployment
Type
Feature
Type
Setup
No Milestone
No project
No Assignees
4 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: infrastructure/blender-projects-platform#2
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Tasks
Analysis
Also see #3 regarding this issue.
When searching for anything, it would seem that the results-count is constrained to 50 max.
This is true for the 'ISSUE_INDEXER_TYPE = db' as well as the 'ISSUE_INDEXER_TYPE = bleve' settings; however there are significant differences in the details.
db
With 'db', the total amount of results is 50.. So 48 open/2 closed, or 16 open/34 closed... etc..
This is true for extremely common search-terms too, like 'linux', 'radeon','crash', etc...
No paging is provided for results beyond that.
Lunny reports on blender.chat that this is by design . The 'code search' result-page does have pagination. We might need to revisit this.
bleve
The 'bleve' engine does something similar, but it seems to 'forget' results it did return earlier.
The size of the 'issues.bleve' directory also seems to slowly grow during indexing, and then suddenly shrink sharply. This cycle repeats a few times while indexing is going on.
First reports seem to indicate this MIGHT be due to concurrent indexer-processes trying to manage/fill the issues.bleve cache... corrupting/overwriting it. This needs checking.
Originally created by Arnd.
While we are waiting for this to be fixed, are issues going to be indexed by search engines? Seems that's disabled now:
https://projects.blender.org/robots.txt
Is there a plan to look into allowing Google to index? Should there a separate issue for this? In general some kind of working search is important for triagers to do their work.
Search results pagination is limited to 50 reultsto Issue search problemsThe db engine search works poorly, it's not proper text search but just a simple LIKE comparison.
Maybe we just need to try setting up elastic search for issues rather than getting db or bleve working better, both of which we know are not great.
Setting up elastic search for both issue and repo-indexing seems to be the logical thing to try out next. It will require some thought around how to properly support this so that we dont run into trouble, performance and disk-space wise.
Also, given that all these changes of indexers/parameters are requiring a restart of GITEA, it's not a good idea to test this on production.
I will invest time into getting a 'test.projects.blender.org' instance up that will aid in us being able to test intended changes of styling, webhooks and gitea itself (app-settings, etc).
@lunny's WIP pagination PR: https://github.com/go-gitea/gitea/pull/22704
@fsiddi's alternative idea: https://github.com/go-gitea/gitea/issues/20665
Adding a new search engine would be via interface, an example of one can be found here: https://sourcegraph.com/github.com/go-gitea/gitea/-/blob/modules/indexer/issues/
WIP meilisearch PR: https://github.com/go-gitea/gitea/pull/23136
WIP due to missing docs, and pending maintainer review
Support for meilisearch landed in Gitea
main
last week, and is also inblender-merged-develop
now.Status update
Fixed:
Waiting:
I think we can consider this resolved now.