14 Commits

Author SHA1 Message Date
David Robertson
e1bc972ff7
Update the to-device table 2023-05-02 18:16:14 +01:00
Kegan Dougal
a6c3f8f3fc When a device is deleted, remove all device data with it (to-device events, device lists) 2023-03-01 16:56:04 +00:00
Kegan Dougal
a7eed93722 Add comprehensive regression test for GlobalSnapshot(); ensure we clear db conns when tests end 2023-01-18 14:54:26 +00:00
Kegan Dougal
f80dc00eaf Log org.matrix.msgid fields if they are present 2022-12-21 11:28:43 +00:00
Kegan Dougal
93aaf4dc3d Include symbols 2022-12-21 10:58:57 +00:00
Kegan Dougal
a0e849c188 And bulk because why not 2022-12-21 10:56:25 +00:00
Kegan Dougal
f428ea6ea4 Add table tests for to-device message persistence 2022-12-21 10:53:05 +00:00
Kegan Dougal
be8543a21a add extensions for typing and receipts; bugfixes and additional perf improvements
Features:
 - Add `typing` extension.
 - Add `receipts` extension.
 - Add comprehensive prometheus `/metrics` activated via `SYNCV3_PROM`.
 - Add `SYNCV3_PPROF` support.
 - Add `by_notification_level` sort order.
 - Add `include_old_rooms` support.
 - Add support for `$ME` and `$LAZY`.
 - Add correct filtering when `*,*` is used as `required_state`.
 - Add `num_live` to each room response to indicate how many timeline entries are live.

Bug fixes:
 - Use a stricter comparison function on ranges: fixes an issue whereby UTs fail on go1.19 due to change in sorting algorithm.
 - Send back an `errcode` on HTTP errors (e.g expired sessions).
 - Remove `unsigned.txn_id` on insertion into the DB. Otherwise other users would see other users txn IDs :(
 - Improve range delta algorithm: previously it didn't handle cases like `[0,20] -> [20,30]` and would panic.
 - Send HTTP 400 for invalid range requests.
 - Don't publish no-op unread counts which just adds extra noise.
 - Fix leaking DB connections which could eventually consume all available connections.
 - Ensure we always unblock WaitUntilInitialSync even on invalid access tokens. Other code relies on WaitUntilInitialSync() actually returning at _some_ point e.g on startup we have N workers which bound the number of concurrent pollers made at any one time, we need to not just hog a worker forever.

Improvements:
 - Greatly improve startup times of sync3 handlers by improving `JoinedRoomsTracker`: a modest amount of data would take ~28s to create the handler, now it takes 4s.
 - Massively improve initial initial v3 sync times, by refactoring `JoinedRoomsTracker`, from ~47s to <1s.
 - Add `SlidingSyncUntil...` in tests to reduce races.
 - Tweak the API shape of JoinedUsersForRoom to reduce state block processing time for large rooms from 63s to 39s.
 - Add trace task for initial syncs.
 - Include the proxy version in UA strings.
 - HTTP errors now wait 1s before returning to stop clients tight-looping on error.
 - Pending event buffer is now 2000.
 - Index the room ID first to cull the most events when returning timeline entries. Speeds up `SelectLatestEventsBetween` by a factor of 8.
 - Remove cancelled `m.room_key_requests` from the to-device inbox. Cuts down the amount of events in the inbox by ~94% for very large (20k+) inboxes, ~50% for moderate sized (200 events) inboxes. Adds book-keeping to remember the unacked to-device position for each client.
2022-12-14 18:53:55 +00:00
Kegan Dougal
502d3b5852 Flesh out test cases for to-device events 2021-12-14 14:38:39 +00:00
Kegan Dougal
0e021eb560 Pass to-device messages through to the client
- Treat to-device messages as opaque JSON blobs
- Add basic integration test to ensure the messages make it from v2 to v3.
2021-12-14 11:51:47 +00:00
Kegan Dougal
8c27fbb877 Implement to_device message cleanup after all sessions have ACKed the message 2021-08-05 12:50:03 +01:00
Kegan Dougal
9c967ffe2a Implement to_device limits
- Fix a bug where copies of `Token` didn't copy the position (swap to actual arrays not slices)
- Modify the interface for `DataInRange` to allow it to return an `upTo` value. Required because
  requesting positions 10-50 when the limit is 20 may return events between 10-30, meaning the
  to position needs updating else the events between 30-50 will be lost.
2021-08-05 10:54:04 +01:00
Kegan Dougal
5b4b1a10ed Glue to_device stuff into the v3 handler, refactor to prevent import cycles 2021-08-03 12:06:09 +01:00
Kegan Dougal
0075b46bc9 Flesh out to_device_table with tests, update gjson dep 2021-08-03 09:33:38 +01:00