initial version of event relationship MSC

6 years ago · 9e8460f58c
parent b770cfee8b
commit 9e8460f58c
1 changed files with 255 additions and 0 deletions
--- a/proposals/xxxx-event-relationships.md
+++ b/proposals/xxxx-event-relationships.md
@ -0,0 +1,255 @@
 # MSCxxxx: Event relationships
 It's common to want to send events in Matrix which relate to existing events -
 for instance, reactions, edits and even replies/threads.
 This proposal is one in a series of proposals that defines a mechanism for
 events to relate to each other.  Together, these proposals replace
 [MSC1849](https://github.com/matrix-org/matrix-doc/pull/1849).
 * This proposal defines a standard shape for indicating events which relate to
  other events.
 * [MSCxxxx](https://github.com/matrix-org/matrix-doc/pull/xxxx) defines APIs to
  let the server calculate the aggregations on behalf of the client, and so
  bundle the related events with the original event where appropriate.
 * [MSCxxxx](https://github.com/matrix-org/matrix-doc/pull/xxxx) defines how
  users can edit messages using this mechanism.
 * [MSCxxxx](https://github.com/matrix-org/matrix-doc/pull/xxxx) defines how
  users can annotate events, such as reacting to events with emoji, using this
  mechanism.
 ## Proposal
 This proposal introduces the concept of relations, which can be used to
 associate new information with an existing event.
 Relations are any event which have an `m.relates_to` field in their
 contents. The `m.relates_to` field must include a `rel_type` field that
 gives the type of relationship being defined, and the `event_id` field that
 gives the event which is the target of the relation.  All the information about
 the relationship lives under the `m.relates_to` key.
 If it helps, you can think of relations as a "subject verb object" triple,
 where the subject is the relation event itself; the verb is the `rel_type`
 field of the `m.relates_to` and the object is the `event_id` field.
 We consciously do not support multiple different relations within a single event,
 in order to keep the API simple, and in the absence of identifiable use cases.
 Instead, one would send multiple events, each with its own `m.relates_to`
 defined.
 A `rel_type` of `m.reference` is defined for future handling replies and
 threading. This let you define an event which references an existing
 event. When aggregated, this currently doesn't do anything special, but in
 future could bundle chains of references (i.e. threads). These do not yet
 replace the [reply mechanism currently defined in the spec](https://matrix.org/docs/spec/client_server/latest#rich-replies).
 For instance, an `m.room.message` which references an existing event
 would look like:
 ```json
 {
    "type": "m.room.message",
    "content": {
        "body": "i <3 shelties",
        "m.relates_to": {
            "rel_type": "m.reference",
            "event_id": "$another_event_id"
        }
    }
 }
 ```
 Different subtypes of references could be defined through additional fields on
 the `m.relates_to` object, to distinguish between replies, threads, etc.
 This MSC doesn't attempt to define these subtypes.
  XXX: do we want to support multiple parents for a m.reference event, if a
  given event references different parents in different ways?
 ### Sending relations
 Related events are normal Matrix events, and can be sent by the normal /send
 API.
 The server should postprocess relations if needed before sending them into a
 room, as defined by the relationship type. For example, a relationship type
 might only allow a user to send one related message to a given event.
 Similar to membership events, a convenience API is also provided to highlight
 that the server may post-process the event, and whose URL structures the
 semantics of the relation being sent more clearly:
 ```
 PUT /_matrix/client/r0/rooms/{roomId}/send_relation/{parent_id}/{relation_type}/{event_type}/{txn_id}[?key={relation_key}]
 {
    // event contents
 }
 ```
 The `parent_id` is the ID of the event being referenced. In other words, it is
 the `event_id` field that will be in the `m.relates_to` object.
 The `relation_key` is for relationships that have a `key` property.
 The endpoint does not have any trailing slashes.
 ### Receiving relations
 Relations are received during non-gappy incremental syncs (that is, syncs
 called with a `since` token, and that have `limited: false` in the portion of
 response for the given room) as normal discrete Matrix events.
 [MSCxxxx](https://github.com/matrix-org/matrix-doc/pull/xxxx) defines ways in
 which the server may aid clients in processing relations by aggregating the
 events.
 ### End to end encryption
 Since the server has to be able to bundle related events, structural
 information about relations cannot be encrypted end-to-end, and so the
 `m.relates_to` field should not be included in the ciphertext.
 A future MSC may define a method for encrypting certain parts of the
 `m.relates_to` field that may contain sensitive information.
 ### Redactions
 Relations may be redacted like any other event. In the case of `m.reference` it
 removes the referencing event.
 The `m.relates_to`.`rel_type` and `m.relates_to`.`event_id` fields should
 be preserved over redactions, so that clients can distinguish redacted edits
 from normal redacted messages, and maintain reply ordering.
  FIXME: synapse doesn't do this yet
  XXX: Does this require a new room version?
 ## Edge Cases
 Can you reply (via m.references) to a [reaction](https://github.com/matrix-org/matrix-doc/pull/xxxx)/[edit](https://github.com/matrix-org/matrix-doc/pull/xxxx)?
 * Yes, at the protocol level.  But you shouldn't expect clients to do anything
   useful with it.
 * Replying to a reaction should be treated like a normal message and have the
   reply behaviour ignored.
 * Replying to an edit should be treated in the UI as if you had replied to
   the original message.
 What does it mean to call /context on a relation?
 * We should probably just return the root event for now, and then refine it in
   future for threading?
 * XXX: what does synapse do here?
 Do we need to support retrospective references?
 * For something like "m.duplicate" to retrospectively declare that one event
   dupes another, we might need to bundle two-levels deep (subject+ref and then
   ref+target).  We can cross this bridge when we get there though, as a 4th
   aggregation type
 ## Potential issues
 ### Federation considerations
 We have a problem with resynchronising relations after a gap in federation:
 We have no way of knowing that an edit happened in the gap to one of the events
 we already have.  So, we'll show inconsistent data until we backfill the gap.
 * We could write this off as a limitation.
 * Or we could make *ALL* relations a DAG, so we can spot holes at the next
   relation, and go walk the DAG to pull in the missing relations?  Then, the
   next relation for an event could pull in any of the missing relations.
   Socially this probably doesn't work as reactions will likely drop off over
   time, so by the time your server comes back there won't be any more reactions
   pulling the missing ones in.
 * Could we also ask the server, after a gap, to provide all the relations which
   happened during the gap to events whose root preceeded the gap.
   * "Give me all relations which happened between this set of
     forward-extremities when I lost sync, and the point i've rejoined the DAG,
     for events which preceeded the gap"?
   * Would be hard to auth all the relations which this api coughed up.
     * We could auth them based only the auth events of the relation, except we
       lose the context of the nearby DAG which we'd have if it was a normal
       backfilled event.
     * As a result it would be easier for a server to retrospectively lie about
       events on behalf of its users.
     * This probably isn't the end of the world, plus it's more likely to be
       consistent than if we leave a gap.
       * i.e. it's better to consistent with a small chance of being maliciously
         wrong, than inconsistent with a guaranteed chance of being innocently
         wrong.
   * We'd need to worry about pagination.
   * This is probably the best solution, but can also be added as a v2.
 ## Tradeoffs
 ### Event shape
 Shape of
 ```json
 "content": {
    "m.relates_to": {
        "m.reference": {
            "event_id": "$another:event.com"
        }
    }
 }
 ```
 versus
 ```json
 "content": {
    "m.relates_to": {
        "rel_type": "m.reference",
        "event_id": "$another:event.com"
    }
 }
 ```
 The reasons to go with `rel_type` is:
 * we don't need the extra indirection to let multiple relations apply to a given pair of
   events, as that should be expressed as separate relation events.
 * if we want 'adverbs' to apply to 'verbs' in the subject-verb-object triples which
   relations form, then we apply it as mixins to the relation data itself rather than trying
   to construct subject-verb-verb-object sentences.
 * so, we should pick a simpler shape rather than inheriting the mistakes of m.in_reply_to
   and we have to keep ugly backwards compatibility around for m.in_reply_to
   but we can entirely separately worry about migrating replies to new-style-aggregations in future
   perhaps at the same time as doing threads.
 ## Historical context
 pik's MSC441 has:
 Define the JSON schema for the aggregation event, so the server can work out
 which fields should be aggregated.
 ```json
 "type": "m.room._aggregation.emoticon",
 "content": {
    "emoticon": "::smile::",
    "msgtype": "?",
    "target_id": "$another:event.com"
 }
 ```
 These would then aggregated, based on target_id, and returned as annotations on
 the source event in an `aggregation_data` field:
 ```json
 "content": {
    ...
    "aggregation_data": {
        "m.room._aggregation.emoticon": {
            "aggregation_data": [
                {
                    "emoticon": "::smile::",
                    "event_id": "$14796538949JTYis:pik-test",
                    "sender": "@pik:pik-test"
                }
            ],
            "latest_event_id": "$14796538949JTYis:pik-test"
        }
    }
 }
 ```