Commit graph

27 commits

Author SHA1 Message Date
KMY(雪あすか)
a8fbcb3fb6
Change: #532 ElasticSearch設定の外出し (#650)
* Change: #532 ElasticSearch設定の外出し

* バージョンチェック

* 起動時にエラー
2024-03-12 12:11:13 +09:00
KMY(雪あすか)
d864399a09
Fix: #370 ダブルクオートを使った日本語検索が効かない (#448)
* Fix: #370 ダブルクオートを使った日本語検索が効かない

* Add test for multiple words

* Improve test

* 入れ替え

* 単語検索のテスト
2024-01-12 10:27:35 +09:00
KMY
597485f47a Merge remote-tracking branch 'parent/main' into upstream-20231128 2023-11-28 09:44:42 +09:00
Claire
f1657e6d62
Clamp dates when serializing to Elasticsearch API (#28081) 2023-11-27 13:25:54 +00:00
S.H
866f93b90e
Fix/elasticsearch plugin ci failure (#297)
* Revert "Fix search test"

This reverts commit f638163eca.

* Add Elasticsearch CI setting

* Revert development settings for public_statuses_index and statuses_index
2023-11-18 17:13:51 +09:00
KMY
5fb6bce744 Add my reacted statuses search 2023-09-15 18:50:45 +09:00
KMY
f638163eca Fix search test 2023-09-10 18:43:02 +09:00
KMY
621a41b670 Merge remote-tracking branch 'parent/main' into kb_migration 2023-09-01 18:05:03 +09:00
Claire
6b58cfd8dd
Fix searching by username by reverting account verbatim tokenizer to standard (#26739) 2023-08-31 15:35:58 +02:00
KMY
05a022448b Merge remote-tracking branch 'parent/main' into kb_migration 2023-08-29 13:25:48 +09:00
Eugen Rochko
7bd5ebb0c5
Fix multiple issues with status index mappings (#26686) 2023-08-28 11:36:17 +02:00
KMY
22427c4669 Improve sudachi settings 2023-08-28 10:23:11 +09:00
KMY
7c2abfbd10 Add domain attribute to other chewy indices 2023-08-26 19:00:57 +09:00
KMY
fbb82b740b Merge remote-tracking branch 'parent/main' into kb_migration 2023-08-25 12:19:55 +09:00
jsgoldstein
30c191aaa0
Add new public status index (#26344)
Co-authored-by: Eugen Rochko <eugen@zeonfederated.com>
Co-authored-by: Claire <claire.github-309c@sitedethib.com>
2023-08-24 16:40:04 +02:00
KMY
61e4be24e1 Merge commit '3a8370e1f4' into kb_migration 2023-08-15 17:58:03 +09:00
Claire
f5778caa3a
Add ES_PRESET option to customize numbers of shards and replicas (#26483)
Co-authored-by: Eugen Rochko <eugen@zeonfederated.com>
2023-08-14 17:46:16 +02:00
KMY
1fb5269501 Merge remote-tracking branch 'parent/main' into kb_migration 2023-08-09 08:53:40 +09:00
Eugen Rochko
72423bc8f6
Change account search tokenizer and queries (#26378) 2023-08-08 09:09:14 +02:00
KMY
850c4dfb3c Merge commit '6268188543' into kb_migration 2023-07-04 17:44:40 +09:00
jsgoldstein
4581a528f7
Change account search to match by text when opted-in (#25599)
Co-authored-by: Eugen Rochko <eugen@zeonfederated.com>
2023-06-29 13:05:21 +02:00
KMY
2a9fcced3b Fix count to zero as hiding account's counts 2023-05-20 21:29:44 +09:00
Eugen Rochko
a9b64b24d6
Change algorithm of tootctl search deploy to improve performance (#18463) 2022-05-22 22:16:43 +02:00
Eugen Rochko
679b7158e3
Change search indexing to use batches to minimize resource usage (#18451) 2022-05-18 23:29:14 +02:00
Takeshi Umeda
3419d3ec84
Bump chewy from 5.2.0 to 7.2.3 (supports Elasticsearch 7.x) (#16915)
* Bump chewy from 5.2.0 to 7.2.2

* fix style (codeclimate)

* fix style

* fix style

* Bump chewy from 7.2.2 to 7.2.3
2021-11-18 22:02:08 +01:00
Eugen Rochko
70da6d6630
Fix accounts search by full/partial display name and others (#11580)
- Restrict followers counts to local users to minimize local advantage
- Fix emoji shortcodes causing error in search
- Fix search syntax parse errors not being caught
2019-08-16 13:00:30 +02:00
Eugen Rochko
8fdff2748f
Add more accurate account search (#11537)
* Add more accurate account search

When ElasticSearch is available, a more accurate search is implemented:

- Using edge n-gram index for acct and display name
- Using asciifolding and cjk width normalization on display names
- Using Gaussian decay on account activity for additional scoring (recency)
- Using followers/friends ratio for additional scoring (spamminess)
- Using followers number for additional scoring (size)

The exact match precedence only takes effect when the input conforms
to the username format and the username part of it is complete, i.e.
when the user started typing the domain part.

* Support single-letter usernames

* Fix tests

* Fix not picking up account updates

* Add weights and normalization for scores, skip zero terms queries

* Use local counts for accounts index, adjust search parameters

* Fix mistakes

* Using updated_at of accounts is inadequate for remote accounts
2019-08-16 01:24:03 +02:00