draft

Signed-off-by: Timo K <toger5@hotmail.de>
1 month ago · 9cbe4481b7
parent fe587a7d33
commit 9cbe4481b7
1 changed files with 150 additions and 0 deletions
--- a/proposals/4143-MatrixRTC.md
+++ b/proposals/4143-MatrixRTC.md
@ -0,0 +1,150 @@
+# MSC4143: MatrixRTC
+
+This MSC defines the modules with which the matrix real time system is build with.
+
+The MatrixRTC specification is separated into different modules.
+
+- The MatrixRTC room state that defines the state of the real time application.\
+ It is the source of truth for:
+  - Who is part of a session
+  - Who is connected via what technology/backend
+  - Metadata per device used by other participants to decide whether the streams
+  from this source are of interest / need to be subscribed.
+- The RTC backend.
+  - It defines how to connect the participating peers.
+  - Livekit is the standard for this as of writing.
+  - Defines how to connect to a server/other peers, how to update the connection,
+  how to subscribe to different streams...
+  - Another planned backend is a full mesh implementation based on MSC3401.
+- The RTCSession types (application) have their own per application spec.
+  - Calls can be done with an application of type `m.call` see (TODO: link call msc)
+  - The application defines all the details of the RTC experience:
+    - How to interpret the metadata of the member events.
+    - What streams to connect to.
+    - What data in which format to sent over the RTC channels.
+
+This MSC will focus on the matrix room state which can be seen as the most high
+level signalling of a call:
+
+## Proposal
+
+Each RTC session is made out of a collection of `m.rtc.member` events.
+Each `m.rtc.member` event defines the application type: `application`
+and a `call_id`. And is stored in a state event of type `m.rtc.member`.
+The first element of the state key is the `userId` and the second the `deviceId`.
+(see [this proposal for state keys](https://github.com/matrix-org/matrix-spec-proposals/pull/3757#issuecomment-2099010555)
+for context about second/first state key.)
+
+### The MatrixRTC room state
+
+Everything required for working MatrixRTC
+(current session, sessions history, join/leave events, ...) only
+require one event type.
+
+A complete `m.rtc.member` state event looks like this:
+
+```json
+// event type: m.rtc.member
+// event key: ["@user:matrix.domain", "DEVICEID"]
+{
+  "m.application": "m.my_session_type",
+  "m.call_id": "",
+  "focus_active": {...FOCUS_A},
+  "foci_preferred": [
+    {...FOCUS_1},
+    {...FOCUS_2}
+  ]
+}
+```
+
+giving us the information, that user: `@user:matrix.domain` with device `DEVICEID`
+is part of an RTCSession of type `m.call` in the scope/sub-session `""` (empty
+string as call id) connected over `FOCUS_A`. This is all information that is needed
+for another room member to detect the running session and join it.
+
+There is **no event** to represent a session. This event would include shared
+information where it is not trivial to decide who has authority over it.
+Instead the session is a computed value based on `m.rtc.member` events.
+The list of events with the same `m.application` and `m.call_id` represent one session.
+This array allows to compute fields like participant count, start time ...
+
+Sending an empty `m.rtc.member` event represents a leave action.
+Sending a well formatted `m.rtc.member` represents a join action.
+
+Based on the value of `m.application`, the event might include additional parameters
+required to provide additional session parameters.
+
+> A thirdRoom like experience could include the information of an approximate position
+on the map, so that clients can omit connecting to participants that are not in their
+area of interest.
+
+#### Historic sessions
+
+Since there is no singe entry for a historic session (because of the owner ship discussion),
+historic sessions need to be computed and most likely cached on the client.
+
+Each state event can either mark a join or leave:
+
+- join: `prev_state.m.application != current_state.m.application` &&
+ `prev_state.m.call_id != current_state.m.call_id` &&
+ `current_state.m.application != undefined`
+  (where an empty `m.rtc.member` event would imply `state.m.application == undefined`)
+- leave: `prev_state.m.application != current_state.m.application` &&
+ `prev_state.m.call_id != current_state.m.call_id` &&
+ `current_state.m.application == undefined`
+
+Based on this one can find user sessions. (The range between a join and a leave
+event) of specific times.
+The collection of all overlapping user sessions with the same `call_id` and
+`application` define one MatrixRTC history event.
+
+### The RTC backend
+
+`foci_active` and `foci_preferred` are used to communicate
+
+- how a user is connected to the session (`foci_active`)
+- what connection method this user knows about would like to connect with.
+
+The only enforced parameter of a `foci_preferred` or `foci_active` is `type`.
+Based on the focus type a different amount of parameters might be needed to,
+communicate how to connect to other users.
+`foci_preferred` and `foci_active` can have different parameters so that it is,
+possible to use a combination of the two to figure our that everyone is connected
+with each other.
+
+Only users with the same type can connect in one session. If a frontend does
+not support the used type they cannot connect.
+
+Each focus type will get its own MSC in which the detailed procedure to get from
+the foci information to working webRTC connections to the streams of all the
+participants is explained.
+
+- [`livekit`](www.example.com) TODO: create `livekit` focus MSC and add link here.
+- [`full_mesh`](https://github.com/matrix-org/matrix-spec-proposals/pull/3401)
+TODO: create `full-mesh` focus MSC based on[MSC3401](https://github.com/matrix-org/matrix-spec-proposals/pull/3401)
+and add link here.
+
+### The RTCSession types (application)
+
+Each session type might have its own specification in how the different streams
+are interpreted and even what focus type to use. This makes this proposal extremely
+flexible. A Jitsi conference could be added by introducing a new `m.application`
+and a new focus type and would be MatrixRTC compatible. It would not be compatible
+with applications that do not use the Jitsi focus but clients would know that there
+is an ongoing session of unknown type and unknown focus and could display/represent
+this in the user interface.
+
+To make it easy for clients to support different RTC session types, the recommended
+approach is to provide a matrix widget for each session type, so that client developers
+can use the widget as the first implementation if they want to support this RTC
+session type.
+
+Each application should get its own MSC in which the all the additional
+fields are explained and how the communication with the possible foci is
+defined:
+
+- [`m.call`](www.example.com) TODO: create `m.call` MSC and add link here.
+
+## Potential issues
+## Alternatives
+## Security considerations