s/mxid/user id/, and other PR feedback.

pull/359/head
Richard van der Hoff 8 years ago
parent e628edfdc6
commit 001db45046

@ -150,13 +150,13 @@ Users
~~~~~ ~~~~~
Each client is associated with a user account, which is identified in Matrix Each client is associated with a user account, which is identified in Matrix
using a unique identifier, or "MXID". This ID is namespaced to the homeserver using a unique identifier, or "user ID". This ID is namespaced to the homeserver
which allocated the account and has the form:: which allocated the account and has the form::
@localpart:domain @localpart:domain
See the `Identifier Grammar`_ section for full details of the structure of See the `Identifier Grammar`_ section for full details of the structure of
an MXID. user IDs.
Events Events
~~~~~~ ~~~~~~
@ -317,14 +317,14 @@ that are in the room that can be used to join via.
Identity Identity
~~~~~~~~ ~~~~~~~~
Users in Matrix are identified via their matrix user ID (MXID). However, Users in Matrix are identified via their Matrix user ID. However,
existing 3rd party ID namespaces can also be used in order to identify Matrix existing 3rd party ID namespaces can also be used in order to identify Matrix
users. A Matrix "Identity" describes both the user ID and any other existing IDs users. A Matrix "Identity" describes both the user ID and any other existing IDs
from third party namespaces *linked* to their account. from third party namespaces *linked* to their account.
Matrix users can *link* third-party IDs (3PIDs) such as email addresses, social Matrix users can *link* third-party IDs (3PIDs) such as email addresses, social
network accounts and phone numbers to their user ID. Linking 3PIDs creates a network accounts and phone numbers to their user ID. Linking 3PIDs creates a
mapping from a 3PID to a user ID. This mapping can then be used by Matrix mapping from a 3PID to a user ID. This mapping can then be used by Matrix
users in order to discover the MXIDs of their contacts. users in order to discover the user IDs of their contacts.
In order to ensure that the mapping from 3PID to user ID is genuine, a globally In order to ensure that the mapping from 3PID to user ID is genuine, a globally
federated cluster of trusted "Identity Servers" (IS) are used to verify the 3PID federated cluster of trusted "Identity Servers" (IS) are used to verify the 3PID
and persist and replicate the mappings. and persist and replicate the mappings.
@ -376,12 +376,14 @@ be reached by other homeservers. The complete grammar is::
where ``host`` is as defined by `RFC3986, section 3.2.2 where ``host`` is as defined by `RFC3986, section 3.2.2
<https://tools.ietf.org/html/rfc3986#section-3.2.2>`_. <https://tools.ietf.org/html/rfc3986#section-3.2.2>`_.
.. NOTE:: Examples of valid server names are:
The RFC3986 specification of a "host", allows IPv4 literals (``1.2.3.4``), and * ``matrix.org``
IPv6 literals (``[1234:5678::abcd]``), as well as registered domain * ``matrix.org:8888``
names. Similarly, all of these formats are valid in Matrix server names and * ``1.2.3.4`` (IPv4 literal)
identifiers. * ``1.2.3.4:1234`` (IPv4 literal with explicit port)
* ``[1234:5678::abcd]`` (IPv6 literal)
* ``[1234:5678::abcd]:5678`` (IPv6 literal with explicit port)
Common Identifier Format Common Identifier Format
@ -393,112 +395,110 @@ the form::
&localpart:domain &localpart:domain
where ``&`` represents a 'sigil' character; ``domain`` is the server name of where ``&`` represents a 'sigil' character; ``domain`` is the `server name`_ of
the homeserver which allocated the identifier, and ``localpart`` is an the homeserver which allocated the identifier, and ``localpart`` is an
identifier allocated by that homeserver. identifier allocated by that homeserver.
The sigil characters are as follows: The sigil characters are as follows:
* ``@``: User ID (MXID) * ``@``: User ID
* ``!``: Room ID * ``!``: Room ID
* ``$``: Event ID * ``$``: Event ID
* ``#``: Room alias * ``#``: Room alias
In some cases (such as Room IDs and Event IDs), the ``domain`` is present only
for namespacing, to avoid clashes of identifiers between different
homeservers. In other cases (User IDs and Room aliases), it defines the
authoritative homeserver for contacting the user or room in question.
The precise grammar defining the allowable format of an identifier depends on The precise grammar defining the allowable format of an identifier depends on
the type of identifier. the type of identifier.
User Identifiers User Identifiers
++++++++++++++++ ++++++++++++++++
Users within Matrix are uniquely identified by their MXID. The MXID is Users within Matrix are uniquely identified by their Matrix user ID. The user
namespaced to the homeserver which allocated the account and has the form:: ID is namespaced to the homeserver which allocated the account and has the
form::
@localpart:domain @localpart:domain
The ``localpart`` of an MXID is an opaque identifier for that user. It MUST NOT The ``localpart`` of a user ID is an opaque identifier for that user. It MUST
be empty, and MUST contain only the characters ``a-z``, ``0-9``, ``.``, ``_``, NOT be empty, and MUST contain only the characters ``a-z``, ``0-9``, ``.``,
``=``, and ``-``. ``_``, ``=``, and ``-``.
The ``domain`` of an MXID is the server name of the homeserver which allocated The ``domain`` of a user ID is the `server name`_ of the homeserver which
the account. allocated the account.
The length of an MXID, including the ``@`` sigil and the domain, MUST NOT The length of a user ID, including the ``@`` sigil and the domain, MUST NOT
exceed 255 characters. exceed 255 characters.
The complete grammar for a legal MXID is:: The complete grammar for a legal user ID is::
mxid = "@" mxid_localpart ":" server_name user_id = "@" user_id_localpart ":" server_name
mxid_localpart = 1*mxid_char user_id_localpart = 1*user_id_char
mxid_char = DIGIT user_id_char = DIGIT
/ %x61-7A ; a-z / %x61-7A ; a-z
/ "-" / "." / "=" / "_" / "-" / "." / "=" / "_"
.. admonition:: Rationale .. admonition:: Rationale
A number of factors were considered when defining the allowable characters A number of factors were considered when defining the allowable characters
for an MXID. for a user ID.
Firstly, we chose to exclude characters outside the basic US-ASCII character Firstly, we chose to exclude characters outside the basic US-ASCII character
set. MXIDs are primarily intended for use as an identifier at the protocol set. User IDs are primarily intended for use as an identifier at the protocol
level, and their use as a human-readable handle is of secondary level, and their use as a human-readable handle is of secondary
benefit. Furthermore, they are useful as a last-resort differentiator between benefit. Furthermore, they are useful as a last-resort differentiator between
users with similar display names. Allowing the full unicode character set users with similar display names. Allowing the full unicode character set
would make very difficult for a human to distinguish two similar MXIDs. The would make very difficult for a human to distinguish two similar user IDs. The
limited character set used has the advantage that even a user unfamiliar with limited character set used has the advantage that even a user unfamiliar with
the Latin alphabet should be able to distinguish similar MXIDs manually, if the Latin alphabet should be able to distinguish similar user IDs manually, if
somewhat laboriously. somewhat laboriously.
We chose to disallow upper-case characters because we do not consider it We chose to disallow upper-case characters because we do not consider it
valid to have two MXIDs which differ only in case: indeed it should be valid to have two user IDs which differ only in case: indeed it should be
possible to reach ``@user:matrix.org`` as ``@USER:matrix.org``. However, possible to reach ``@user:matrix.org`` as ``@USER:matrix.org``. However,
MXIDs are necessarily used in a number of situations which are inherently user IDs are necessarily used in a number of situations which are inherently
case-sensitive (notably in the ``state_key`` of ``m.room.member`` case-sensitive (notably in the ``state_key`` of ``m.room.member``
events). Forbidding upper-case characters (and requiring homeservers to events). Forbidding upper-case characters (and requiring homeservers to
downcase usernames when creating MXIDs for new users) is a relatively simple downcase usernames when creating user IDs for new users) is a relatively simple
way to ensure that ``@USER:matrix.org`` cannot refer to a different user to way to ensure that ``@USER:matrix.org`` cannot refer to a different user to
``@user:matrix.org``. ``@user:matrix.org``.
Finally, we decided to restrict the allowable punctuation to a very basic set Finally, we decided to restrict the allowable punctuation to a very basic set
to ensure that the identifier can be used as-is in as wide a number of to ensure that the identifier can be used as-is in as wide a number of
situations as possible, without requiring escaping. For instance, allowing situations as possible, without requiring escaping. For instance, allowing
"%" or "/" would make it harder to use an MXID in a URI. "*" is used as a "%" or "/" would make it harder to use a user ID in a URI. "*" is used as a
wildcard in some APIs (notably the filter API), so it also cannot be a legal wildcard in some APIs (notably the filter API), so it also cannot be a legal
MXID character. user ID character.
The length restriction is derived from the limit on the length of the The length restriction is derived from the limit on the length of the
``sender`` key on events; since the MXID appears in every event sent by the ``sender`` key on events; since the user ID appears in every event sent by the
user, it is limited to ensure that the MXID does not dominate over the actual user, it is limited to ensure that the user ID does not dominate over the actual
content of the events. content of the events.
Historical MXIDs Matrix user IDs are sometimes informally referred to as MXIDs.
<<<<<<<<<<<<<<<<
Historical User IDs
<<<<<<<<<<<<<<<<<<<
Older versions of this specification were more tolerant of the characters Older versions of this specification were more tolerant of the characters
permitted in MXID localparts. There are currently active users whose MXIDs do permitted in user ID localparts. There are currently active users whose user
not conform to the permitted character set, and a number of rooms whose history IDs do not conform to the permitted character set, and a number of rooms whose
includes events with a ``sender`` which does not conform. In order to handle history includes events with a ``sender`` which does not conform. In order to
these rooms successfully, clients and servers MUST accept MXIDs with localparts handle these rooms successfully, clients and servers MUST accept user IDs with
from the expanded character set:: localparts from the expanded character set::
extended_mxid_char = %x21-7E extended_user_id_char = %x21-7E
Mapping from other character sets Mapping from other character sets
<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<< <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
In certain circumstances it will be desirable to map from a wider character set In certain circumstances it will be desirable to map from a wider character set
onto the limited character set allowed in an MXID localpart. Examples include a onto the limited character set allowed in a user ID localpart. Examples include
homeserver creating an MXID for a new user based on their chosen login ID, or a a homeserver creating a user ID for a new user based on their chosen login ID,
bridge mapping user ids from another protocol. or a bridge mapping user ids from another protocol.
Implmentations are free to do this mapping however they choose. Since the MXID Implmentations are free to do this mapping however they choose. Since the user
is opaque except to the implementation which created it, the only requirement ID is opaque except to the implementation which created it, the only
is that the implemention can perform the mapping consistently. However, we requirement is that the implemention can perform the mapping
suggest the following algorithm: consistently. However, we suggest the following algorithm:
1. Encode character strings as UTF-8. 1. Encode character strings as UTF-8.
@ -528,16 +528,18 @@ A room has exactly one room ID. A room ID has the format::
!opaque_id:domain !opaque_id:domain
An event thas exactly one event ID. An event ID has the format:: An event has exactly one event ID. An event ID has the format::
$opaque_id:domain $opaque_id:domain
The ``domain`` of a room/event ID is the server name of the homeserver which created The ``domain`` of a room/event ID is the `server name`_ of the homeserver which
the room/event. Note that the domain is used only for namespacing - there is no created the room/event. The domain is used only for namespacing to avoid the
risk of clashes of identifiers between different homeservers. There is no
implication that the room or event in question is still available at the implication that the room or event in question is still available at the
corresponding homeserver. corresponding homeserver.
Event IDs and Room IDs are case-sensitive. They are not mant to be human readable. Event IDs and Room IDs are case-sensitive. They are not meant to be human
readable.
.. TODO-spec .. TODO-spec
What is the grammar for the opaque part? https://matrix.org/jira/browse/SPEC-389 What is the grammar for the opaque part? https://matrix.org/jira/browse/SPEC-389
@ -549,10 +551,12 @@ A room may have zero or more aliases. A room alias has the format::
#room_alias:domain #room_alias:domain
The ``domain`` of a room alias is the server of the homeserver which created The ``domain`` of a room alias is the `server name`_ of the homeserver which
the alias. Other servers may contact this homeserver to look up the alias. created the alias. Other servers may contact this homeserver to look up the
alias.
Room aliases MUST NOT exceed 255 bytes (including the ``#`` sigil and the domain). Room aliases MUST NOT exceed 255 bytes (including the ``#`` sigil and the
domain).
.. TODO-spec .. TODO-spec
- Need to specify precise grammar for Room Aliases. https://matrix.org/jira/browse/SPEC-391 - Need to specify precise grammar for Room Aliases. https://matrix.org/jira/browse/SPEC-391

Loading…
Cancel
Save