mitogen/docs/ansible.rst


.. image:: images/ansible/cell_division.png
    :align: right

Mitogen for Ansible
===================

An extension to `Ansible`_ is included that implements connections over
Mitogen, replacing embedded shell invocations with pure-Python equivalents
invoked via highly efficient remote procedure calls to persistent interpreters
tunnelled over SSH. No changes are required to target hosts.

The extension is approaching stability and real-world usage is encouraged. `Bug
reports`_ are welcome: Ansible is huge, and only wide testing will ensure
soundness.

.. _Ansible: https://www.ansible.com/

.. _Bug reports: https://goo.gl/yLKZiJ


Overview
--------

**Expect a 1.25x - 7x speedup** and a **CPU usage reduction of at least 2x**,
depending on network conditions, modules executed, and time already spent by
targets on useful work. Mitogen cannot improve a module once it is executing,
it can only ensure the module executes as quickly as possible.

* **One connection is used per target**, in addition to one sudo invocation per
  user account. This is much better than SSH multiplexing combined with
  pipelining, as significant state can be maintained in RAM between steps, and
  system logs aren't spammed with repeat authentication events.

* **A single network roundtrip is used** to execute a step whose code already
  exists in RAM on the target. Eliminating multiplexed SSH channel creation
  saves 4 ms runtime per 1 ms of network latency for every playbook step.

* **Processes are aggressively reused**, avoiding the cost of invoking Python
  and recompiling imports, saving 300-800 ms for every playbook step.

* Code is ephemerally cached in RAM, **reducing bandwidth usage by an order
  of magnitude** compared to SSH pipelining, with around 5x fewer frames
  traversing the network in a typical run.

* **Fewer writes to the target filesystem occur**. In typical configurations,
  Ansible repeatedly rewrites and extracts ZIP files to multiple temporary
  directories on the target. Security issues relating to temporary files in
  cross-account scenarios are entirely avoided.

The effect is most potent on playbooks that execute many **short-lived
actions**, where Ansible's overhead dominates the cost of the operation, for
example when executing large ``with_items`` loops to run simple commands or
write files.


Installation
------------

1. Thoroughly review :ref:`noteworthy_differences` and :ref:`changelog`.
2. Download and extract |mitogen_url|.
3. Modify ``ansible.cfg``:

   .. parsed-literal::

        [defaults]
        strategy_plugins = /path/to/mitogen-|mitogen_version|/ansible_mitogen/plugins/strategy
        strategy = mitogen_linear

   The ``strategy`` key is optional. If omitted, the
   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable can be set on a
   per-run basis. Like ``mitogen_linear``, the ``mitogen_free`` strategy exists
   to mimic the ``free`` strategy.

4. If targets have a restrictive ``sudoers`` file, add a rule like:

   ::

       deploy = (ALL) NOPASSWD:/usr/bin/python -c*

5. Subscribe to the `mitogen-announce mailing list
   <https://www.freelists.org/list/mitogen-announce>`_ to stay updated with new
   releases and important bug fixes.


Demo
~~~~

This demonstrates Ansible running a subset of the Mitogen integration tests
concurrent to an equivalent run using the extension.

.. raw:: html

    <iframe src="https://player.vimeo.com/video/283272293?title=0&byline=0&portrait=0" width="720" height="439" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>


Testimonials
~~~~~~~~~~~~

* "With mitogen **my playbook runtime went from 45 minutes to just under 3
  minutes**. Awesome work!"

* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
  minutes**. Thanks!"

* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
  anymore** :)"

* "**Works like a charm**, thank you for your quick response"

* "I tried it out. **He is not kidding about the speed increase**."

* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
  can't quite believe it."

* "Enabling the mitogen plugin in ansible feels like switching from floppy to SSD"


.. _noteworthy_differences:

Noteworthy Differences
----------------------

* Ansible 2.3-2.6 are supported along with Python 2.6, 2.7 or 3.6. Verify your
  installation is running one of these versions by checking ``ansible
  --version`` output.

* The Ansible ``raw`` action executes as a regular Mitogen connection,
  precluding its use for installing Python on a target. This will be addressed
  soon.

* The ``doas``, ``su`` and ``sudo`` become methods are available. File bugs to
  register interest in more.

* The `docker <https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_,
  `jail <https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_,
  `local <https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_,
  `lxc <https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_,
  `lxd <https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_,
  and `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
  built-in connection types are supported, along with Mitogen-specific
  :ref:`machinectl <machinectl>`, :ref:`mitogen_doas <doas>`,
  :ref:`mitogen_su <su>`, :ref:`mitogen_sudo <sudo>`, and :ref:`setns <setns>`
  types. File bugs to register interest in others.

* Local commands execute in a reuseable interpreter created identically to
  interpreters on targets. Presently one interpreter per ``become_user``
  exists, and so only one local action may execute simultaneously.

  Ansible usually permits up to ``forks`` simultaneous local actions. Any
  long-running local actions that execute for every target will experience
  artificial serialization, causing slowdown equivalent to `task_duration *
  num_targets`. This will be fixed soon.

* "Module Replacer" style modules are not supported. These rarely appear in
  practice, and light web searches failed to reveal many examples of them.

* Ansible permits up to ``forks`` connections to be setup in parallel, whereas
  in Mitogen this is handled by a fixed-size thread pool. Up to 16 connections
  may be established in parallel by default, this can be modified by setting
  the ``MITOGEN_POOL_SIZE`` environment variable.

* The ``ansible_python_interpreter`` variable is parsed using a restrictive
  :mod:`shell-like <shlex>` syntax, permitting values such as ``/usr/bin/env
  FOO=bar python``, which occur in practice. Ansible `documents this
  <https://docs.ansible.com/ansible/latest/user_guide/intro_inventory.html#ansible-python-interpreter>`_
  as an absolute path, however the implementation passes it unquoted through
  the shell, permitting arbitrary code to be injected.

* Performance does not scale linearly with target count. This will improve over
  time.

* SSH and ``become`` are treated distinctly when applying timeouts, and
  timeouts apply up to the point when the new interpreter is ready to accept
  messages. Ansible has two timeouts: ``ConnectTimeout`` for SSH, applying up
  to when authentication completes, and a separate parallel timeout up to when
  ``become`` authentication completes.

  For busy targets, Ansible may successfully execute a module where Mitogen
  would fail without increasing the timeout. For sick targets, Ansible may hang
  indefinitely after authentication without executing a command, for example
  due to a stuck filesystem IO appearing in ``$HOME/.profile``.


New Features & Notes
--------------------


Connection Delegation
~~~~~~~~~~~~~~~~~~~~~

.. image:: images/jumpbox.png
    :align: right

Included is a preview of **Connection Delegation**, a Mitogen-specific
implementation of `stackable connection plug-ins`_. This enables connections
via a bastion, or container connections delegated via their host machine, where
reaching the host may entail further delegation.

.. _Stackable connection plug-ins: https://github.com/ansible/proposals/issues/25

Unlike with SSH forwarding Ansible has complete visibility of the final
topology, declarative configuration via static/dynamic inventory is possible,
and data can be cached and re-served, and code executed on every intermediary.

For example when targeting Docker containers on a remote machine, each module
need only be uploaded once for the first task and container that requires it,
then cached and served from the SSH account for every future task in any
container.

.. raw:: html

    <div style="clear: both;"></div>


.. caution::

    Connection delegation is a work in progress, bug reports are welcome.

    * Delegated connection setup is single-threaded; only one connection can be
      constructed in parallel per intermediary.

    * Inferring the configuration of intermediaries may be buggy, manifesting
      as duplicate connections between hops, due to not perfectly replicating
      the configuration Ansible would normally use for the intermediary.

    * Automatic tunnelling of SSH-dependent actions, such as the
      ``synchronize`` module, is not yet supported. This will be added in the
      0.3 series.

To enable connection delegation, set ``mitogen_via=<inventory name>`` on the
command line, or as host and group variables.

.. code-block:: ini

    # Docker container on web1.dc1 is reachable via web1.dc1.
    [app-containers.web1.dc1]
    app1.web1.dc1 ansible_host=app1 ansible_connection=docker mitogen_via=web1.dc1

    # Web servers in DC1 are reachable via bastion.dc1
    [dc1]
    web1.dc1
    web2.dc1
    web3.dc1

    [dc1:vars]
    mitogen_via = bastion.dc1

    # Web servers in DC2 are reachable via bastion.dc2
    [dc2]
    web1.dc2
    web2.dc2
    web3.dc2

    [dc2:vars]
    mitogen_via = bastion.dc2

    # Prod bastions are reachable via a magic account on a
    # corporate network gateway.
    [bastions]
    bastion.dc1 mitogen_via=prod-ssh-access@corp-gateway.internal
    bastion.dc2 mitogen_via=prod-ssh-access@corp-gateway.internal

    [corp-gateway]
    corp-gateway.internal


File Transfer
~~~~~~~~~~~~~

Normally `sftp(1) <https://linux.die.net/man/1/sftp>`_ or
`scp(1) <https://linux.die.net/man/1/scp>`_ are used to copy files by the
`assemble <http://docs.ansible.com/ansible/latest/modules/assemble_module.html>`_,
`copy <http://docs.ansible.com/ansible/latest/modules/copy_module.html>`_,
`patch <http://docs.ansible.com/ansible/latest/modules/patch_module.html>`_,
`script <http://docs.ansible.com/ansible/latest/modules/script_module.html>`_,
`template <http://docs.ansible.com/ansible/latest/modules/template_module.html>`_, and
`unarchive <http://docs.ansible.com/ansible/latest/modules/unarchive_module.html>`_
actions, or when uploading modules with pipelining disabled. With Mitogen
copies are implemented natively using the same interpreters, connection tree,
and routed message bus that carries RPCs.

This permits direct streaming between endpoints regardless of execution
environment, without necessitating temporary copies in intermediary accounts or
machines, for example when ``become`` is active, or in the presence of
connection delegation. It also avoids the need to securely share temporary
files between accounts and machines.

As the implementation is self-contained, it is simple to make improvements like
prioritizing transfers, supporting resume, or displaying progress bars.


Safety
^^^^^^

Transfers proceed to a hidden file in the destination directory, with content
and metadata synced using `fsync(2) <https://linux.die.net/man/2/fsync>`_ prior
to rename over any existing file. This ensures the file remains consistent at
all times, in the event of a crash, or when overlapping `ansible-playbook` runs
deploy differing file contents.

The `sftp(1) <https://linux.die.net/man/1/sftp>`_ and `scp(1)
<https://linux.die.net/man/1/sftp>`_ tools may cause undetected data corruption
in the form of truncated files, or files containing intermingled data segments
from overlapping runs. As part of normal operation, both tools expose a window
where readers may observe inconsistent file contents.


Performance
^^^^^^^^^^^

One roundtrip initiates a transfer larger than 124 KiB, while smaller transfers
are embedded in a 0-roundtrip remote call. For tools operating via SSH
multiplexing, 4 roundtrips are required to configure the IO channel, in
addition to the time to start the local and remote processes.

An invocation of ``scp`` with an empty ``.profile`` over a 30 ms link takes
~140 ms, wasting 110 ms per invocation, rising to ~2,000 ms over a 400 ms
UK-India link, wasting 1,600 ms per invocation.


Interpreter Reuse
~~~~~~~~~~~~~~~~~

Python interpreters are aggressively reused to execute modules. While this
works well, it violates an unwritten assumption, and so it is possible an
earlier module execution could cause a subsequent module to fail, or for
unrelated modules to interact poorly due to bad hygiene, such as
monkey-patching that becomes stacked over repeat invocations.

Before reporting a bug relating to a misbehaving module, please re-run with
``-e mitogen_task_isolation=fork`` to see if the problem abates. This may be
set per-task, paying attention to the possibility an earlier task may be the
true cause of a failure.

.. code-block:: yaml

    - name: My task.
      broken_module:
        some_option: true
      vars:
        mitogen_task_isolation: fork

If forking solves your problem, **please report a bug regardless**, as an
internal list can be updated to prevent others bumping into the same problem.


Interpreter Recycling
~~~~~~~~~~~~~~~~~~~~~

There is a per-target limit on the number of interpreters. Once 20 exist, the
youngest is terminated before starting any new interpreter, preventing
situations like below from triggering memory exhaustion.

.. code-block:: yaml

    - hosts: corp_boxes
      vars:
        user_directory: [
          # 10,000 corporate user accounts
        ]
      tasks:
        - name: Create user bashrc
          become: true
          vars:
            ansible_become_user: "{{item}}"
          copy:
            src: bashrc
            dest: "~{{item}}/.bashrc"
          with_items: "{{user_directory}}"

The youngest is chosen to preserve useful accounts like ``root`` and
``postgresql`` that often appear early in a run, however it is simple to
construct a playbook that defeats this strategy. A future version will key
interpreters on the identity of their creating task, avoiding useful account
recycling in every scenario.

To modify the limit, set the ``MITOGEN_MAX_INTERPRETERS`` environment variable.


Standard IO
~~~~~~~~~~~

Ansible uses pseudo TTYs for most invocations to allow it to type interactive
passwords, however pseudo TTYs are disabled where standard input is required or
``sudo`` is not in use. Additionally when SSH multiplexing is enabled, a string
like ``Shared connection to localhost closed\r\n`` appears in ``stderr`` of
every invocation.

Mitogen does not naturally require either of these, as command output is always
embedded within framed messages, and it can simply call :py:func:`pty.openpty`
in any location an interactive password must be typed.

A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
merged together into a single ``stdout`` variable, with carriage returns
inserted in the output by the TTY layer. However ugly, the extension emulates
this precisely, to avoid breaking playbooks that expect text to appear in
specific variables with a particular linefeed style.


.. _ansible_tempfiles:

Temporary Files
~~~~~~~~~~~~~~~

Ansible creates a variety of temporary files and directories depending on its
operating mode.

In the best case when pipelining is enabled and no temporary uploads are
required, for each task Ansible will create one directory below a
system-supplied temporary directory returned by :func:`tempfile.mkdtemp`, owned
by the target account a new-style module will execute in.

In other cases depending on the task type, whether become is active, whether
the target become user is privileged, whether the associated action plugin
needs to upload files, and whether the associated module needs to store files,
Ansible may:

* Create a directory owned by the SSH user either under ``remote_tmp``, or a
  system-default directory,
* Upload action dependencies such as non-new style modules or rendered
  templates to that directory via `sftp(1) <https://linux.die.net/man/1/sftp>`_
  or `scp(1) <https://linux.die.net/man/1/scp>`_.
* Attempt to modify the directory's access control list to grant access to the
  target user using `setfacl(1) <https://linux.die.net/man/1/setfacl>`_,
  requiring that tool to be installed and a supported filesystem to be in use,
  or for the ``allow_world_readable_tmpfiles`` setting to be  :data:`True`.
* Create a directory owned by the target user either under ``remote_tmp``, or
  a system-default directory, if a new-style module needs a temporary directory
  and one was not previously created for a supporting file earlier in the
  invocation.

In summary, for each task Ansible may create one or more of:

* ``~ssh_user/<remote_tmp>/...`` owned by the login user,
* ``$TMPDIR/ansible-tmp-...`` owned by the login user,
* ``$TMPDIR/ansible-tmp-...`` owned by the login user with ACLs permitting
  write access by the become user,
* ``~become_user/<remote_tmp>/...`` owned by the become user,
* ``$TMPDIR/ansible_<modname>_payload_.../`` owned by the become user,
* ``$TMPDIR/ansible-module-tmp-.../`` owned by the become user.

A directory must exist to maintain compatibility with Ansible, as many modules
introspect :data:`sys.argv` to find a directory where they may write files,
however only one directory exists for the lifetime of each interpreter, its
location is consistent for each target account, and it is always privately
owned by that account.

The paths below are tried until one is found that is writeable and lives on a
filesystem with ``noexec`` disabled:

1. ``$variable`` and tilde-expanded ``remote_tmp`` setting from
   ``ansible.cfg``
2. ``$variable`` and tilde-expanded ``system_tmpdirs`` setting from
   ``ansible.cfg``
3. ``TMPDIR`` environment variable
4. ``TEMP`` environment variable
5. ``TMP`` environment variable
6. ``/tmp``
7. ``/var/tmp``
8. ``/usr/tmp``
9. Current working directory

As the directory is created once at startup, and its content is managed by code
running remotely, no additional network roundtrips are required to manage it
for each task requiring temporary storage.


.. _ansible_process_env:

Process Environment Emulation
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Since Ansible discards processes after each module invocation, follow-up tasks
often (but not always) receive a new environment that will usually include
changes made by previous tasks. As such modifications are common, for
compatibility the extension emulates the existing behaviour as closely as
possible.

Some scenarios exist where emulation is impossible, for example, applying
``nsswitch.conf`` changes when ``nscd`` is not in use. If future scenarios
appear that cannot be solved through emulation, the extension will be updated
to automatically restart affected interpreters instead.


DNS Resolution
^^^^^^^^^^^^^^

Modifications to ``/etc/resolv.conf`` cause the glibc resolver configuration to
be reloaded via `res_init(3) <https://linux.die.net/man/3/res_init>`_. This
isn't necessary on some Linux distributions carrying glibc patches to
automatically check ``/etc/resolv.conf`` periodically, however it is necessary
on at least Debian and BSD derivatives.


``/etc/environment``
^^^^^^^^^^^^^^^^^^^^

When ``become: true`` is active or SSH multiplexing is disabled, modifications
by previous tasks to ``/etc/environment`` and ``$HOME/.pam_environment`` are
normally reflected, since the content of those files is reapplied by `PAM
<https://en.wikipedia.org/wiki/Pluggable_authentication_module>`_ via `pam_env`
on each authentication of ``sudo`` or ``sshd``.

Both files are monitored for changes, and changes are applied where it appears
safe to do so:

* New keys are added if they did not otherwise exist in the inherited
  environment, or previously had the same value as found in the file before it
  changed.

* Given a key (such as ``http_proxy``) added to the file where no such key
  exists in the environment, the key will be added.

* Given a key (such as ``PATH``) where an existing environment key exists with
  a different value, the update or deletion will be ignored, as it is likely
  the key was overridden elsewhere after `pam_env` ran, such as by
  ``/etc/profile``.

* Given a key removed from the file that had the same value as the existing
  environment key, the key will be removed.


How Modules Execute
~~~~~~~~~~~~~~~~~~~

Ansible usually modifies, recompresses and reuploads modules every time they
run on a target, work that must be repeated by the controller for every
playbook step.

With the extension any modifications are done on the target, allowing pristine
copies of modules to be cached, reducing the necessity to re-transfer modules
for each invocation. Unmodified modules are uploaded once on first use and
cached in RAM for the remainder of the run.

**Binary**
    Native executables detected using a complex heuristic. Arguments are
    supplied as a JSON file whose path is the sole script parameter.

**Module Replacer**
    Python scripts detected by the presence of
    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
    is not yet supported.

**New-Style**
    Python scripts detected by the presence of ``from ansible.module_utils.``
    appearing in their source. Arguments are supplied as JSON written to
    ``sys.stdin`` of the target interpreter.

**JSON_ARGS**
    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
    in the script source. The interpreter directive (``#!interpreter``) is
    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
    if one is set. Arguments are supplied as JSON mixed into the script as a
    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.

**WANT_JSON**
    Detected by the presence of ``WANT_JSON`` appearing in the script source.
    The interpreter directive is adjusted as above. Arguments are supplied as a
    JSON file whose path is the sole script parameter.

**Old Style**
    Files not matching any of the above tests. The interpreter directive is
    adjusted as above. Arguments are supplied as a file whose path is the sole
    script parameter. The format of the file is ``"key=repr(value)[
    key2=repr(value2)[ ..]] "``.


Runtime Patches
~~~~~~~~~~~~~~~

Three small runtime patches are employed in ``strategy.py`` to hook into
desirable locations, in order to override uses of shell, the module executor,
and the mechanism for selecting a connection plug-in. While it is hoped the
patches can be avoided in future, for interesting versions of Ansible deployed
today this simply is not possible, and so they continue to be required.

The patches are concise and behave conservatively, including by disabling
themselves when non-Mitogen connections are in use. Additional third party
plug-ins are unlikely to attempt similar patches, so the risk to an established
configuration should be minimal.


Flag Emulation
~~~~~~~~~~~~~~

Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
equivalent semantics. This allows:

* robust support for common ``ansible.cfg`` tricks without reconfiguration,
  such as forwarding SSH agents across ``sudo`` invocations,
* reporting on conflicting flag combinations,
* reporting on unsupported flag combinations,
* internally special-casing certain behaviour (like recursive agent forwarding)
  without boring the user with the details,
* avoiding opening the extension up to untestable scenarios where users can
  insert arbitrary garbage between Mitogen and the components it integrates
  with,
* precise emulation by an alternative implementation, for example if Mitogen
  grew support for Paramiko.


Connection Types
----------------

Matching Ansible, connection variables are treated on a per-task basis, causing
establishment of additional reuseable interpreters as necessary to match the
configuration of each task.


.. _doas:

Doas
~~~~

``doas`` can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_become_exe``: path to ``doas`` binary.
* ``ansible_become_user`` (default: ``root``)
* ``ansible_become_pass`` (default: assume passwordless)
* ansible.cfg: ``timeout``

When used as the ``mitogen_doas`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to use.
* ``ansible_password``: password to use.
* ``ansible_python_interpreter``


.. _method-docker:

Docker
~~~~~~

Like `docker
<https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_ except
connection delegation is supported.

* ``ansible_host``: Name of Docker container (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.


FreeBSD Jail
~~~~~~~~~~~~

Like `jail
<https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_ except
connection delegation is supported.

* ``ansible_host``: Name of jail (default: inventory hostname).
* ``ansible_user``: Name of user within the jail to execute as.


Local
~~~~~

Like `local
<https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_ except
connection delegation is supported.

* ``ansible_python_interpreter``


Process Model
^^^^^^^^^^^^^

Ansible usually executes local connection commands as a transient subprocess of
the forked worker executing a task. With the extension, the local connection
exists as a persistent subprocess of the connection multiplexer.

This means that global state mutations made to the top-level Ansible process
that are normally visible to newly forked subprocesses, such as vars plug-ins
that modify the environment, will not be reflected when executing local
commands without additional effort.

During execution the extension presently mimics the working directory and
process environment inheritence of regular Ansible, however it is possible some
additional differences exist that may break existing playbooks.


.. _method-lxc:

LXC
~~~

Connect to classic LXC containers, like `lxc
<https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_ except
connection delegation is supported, and ``lxc-attach`` is always used rather
than the LXC Python bindings, as is usual with ``lxc``.

The ``lxc-attach`` command must be available on the host machine.

* ``ansible_python_interpreter``
* ``ansible_host``: Name of LXC container (default: inventory hostname).


.. _method-lxd:

LXD
~~~

Connect to modern LXD containers, like `lxd
<https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_ except
connection delegation is supported. The ``lxc`` command must be available on
the host machine.

* ``ansible_python_interpreter``
* ``ansible_host``: Name of LXC container (default: inventory hostname).


.. _machinectl:

Machinectl
~~~~~~~~~~

Like the `machinectl third party plugin
<https://github.com/BaxterStockman/ansible-connection-machinectl>`_ except
connection delegation is supported. This is a light wrapper around the
:ref:`setns <setns>` method.

* ``ansible_host``: Name of Docker container (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.
* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
  as ``/bin/machinectl``.


.. _setns:

Setns
~~~~~

The ``setns`` method connects to Linux containers via `setns(2)
<https://linux.die.net/man/2/setns>`_. Unlike :ref:`method-docker`,
:ref:`method-lxc`, and :ref:`method-lxd` the namespace transition is handled
internally, ensuring optimal throughput to the child. This is necessary for
:ref:`machinectl` where only PTY channels are supported.

A utility program must be installed to discover the PID of the container's root
process.

* ``mitogen_kind``: one of ``docker``, ``lxc``, ``lxd`` or ``machinectl``.
* ``ansible_host``: Name of container as it is known to the corresponding tool
  (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.
* ``mitogen_docker_path``: path to Docker if not available on the system path.
* ``mitogen_lxc_path``: path to LXD's ``lxc`` command if not available as
  ``lxc-info``.
* ``mitogen_lxc_info_path``: path to LXC classic's ``lxc-info`` command if not
  available as ``lxc-info``.
* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
  as ``/bin/machinectl``.


.. _su:

Su
~~

Su can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_su_exe``, ``ansible_become_exe``
* ``ansible_su_user``, ``ansible_become_user`` (default: ``root``)
* ``ansible_su_pass``, ``ansible_become_pass`` (default: assume passwordless)
* ``su_flags``, ``become_flags``
* ansible.cfg: ``timeout``

When used as the ``mitogen_su`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to su as.
* ``ansible_password``: password to su as.
* ``ansible_python_interpreter``


.. _sudo:

Sudo
~~~~

Sudo can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_sudo_exe``, ``ansible_become_exe``
* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
* ``sudo_flags``, ``become_flags``
* ansible.cfg: ``timeout``

When used as the ``mitogen_sudo`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to sudo as.
* ``ansible_password``: password to sudo as.
* ``sudo_flags``, ``become_flags``
* ``ansible_python_interpreter``


SSH
~~~

Like `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
except connection delegation is supported.

* ``ansible_ssh_timeout``
* ``ansible_host``, ``ansible_ssh_host``
* ``ansible_user``, ``ansible_ssh_user``
* ``ansible_port``, ``ssh_port``
* ``ansible_ssh_executable``, ``ssh_executable``
* ``ansible_ssh_private_key_file``
* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``
* ``mitogen_ssh_debug_level``: integer between `0..3` indicating the SSH client
  debug level. Ansible must also be run with '-vvv' to view the output.


Debugging
---------

Diagnostics and use of the :py:mod:`logging` package output on the target
machine are usually discarded. With Mitogen, all of this is captured and
returned to the controller, where it can be viewed as desired with ``-vvv``.
Basic high level logs are produced with ``-vvv``, with logging of all IO on the
controller with ``-vvvv`` or higher.

Although use of standard IO and the logging package on the target is forwarded
to the controller, it is not possible to receive IO activity logs, as the
process of receiving those logs would would itself generate IO activity. To
receive a complete trace of every process on every machine, file-based logging
is necessary. File-based logging can be enabled by setting
``MITOGEN_ROUTER_DEBUG=1`` in your environment.

When file-based logging is enabled, one file per context will be created on the
local machine and every target machine, as ``/tmp/mitogen.<pid>.log``.

If you are experiencing a hang, ``MITOGEN_DUMP_THREAD_STACKS=1`` causes every
process on every machine to dump every thread stack into the logging framework
every 5 seconds.


Getting Help
~~~~~~~~~~~~
Some users and developers hang out on the
`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
FreeNode IRC network.


Sample Profiles
---------------

Local VM connection
~~~~~~~~~~~~~~~~~~~

This demonstrates Mitogen vs. connection pipelining to a local VM, executing
the 100 simple repeated steps of ``run_hostname_100_times.yml`` from the
examples directory. Mitogen requires **43x less bandwidth and 4.25x less
time**.

.. image:: images/ansible/run_hostname_100_times.png


Kathmandu to Paris
~~~~~~~~~~~~~~~~~~

This is a full Django application playbook over a ~180ms link between Kathmandu
and Paris. Aside from large pauses where the host performs useful work, the
high latency of this link means Mitogen only manages a 1.7x speedup.

Many early roundtrips are due to inefficiencies in Mitogen's importer that will
be fixed over time, however the majority, comprising at least 10 seconds, are
due to idling while the host's previous result and next command are in-flight
on the network.

The initial extension lays groundwork for exciting structural changes to the
execution model: a future version will tackle latency head-on by delegating
some control flow to the target host, melding the performance and scalability
benefits of pull-based operation with the management simplicity of push-based
operation.

.. image:: images/ansible/costapp.png
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												examples: rename playbooks for clarity.

											
										
										
											7 years ago
+								.. image:: images/ansible/cell_division.png
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
+								    :align: right
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Mitogen for Ansible
 								===================
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								An extension to `Ansible`_ is included that implements connections over
 								Mitogen, replacing embedded shell invocations with pure-Python equivalents
 								invoked via highly efficient remote procedure calls to persistent interpreters
 								tunnelled over SSH. No changes are required to target hosts.
-												docs: remove another warning label.

											
										
										
											6 years ago
+								The extension is approaching stability and real-world usage is encouraged. `Bug
 								reports`_ are welcome: Ansible is huge, and only wide testing will ensure
 								soundness.
-												docs: extra ansible paragraph.

											
										
										
											7 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
+								.. _Ansible: https://www.ansible.com/
 								.. _Bug reports: https://goo.gl/yLKZiJ
-												docs: small reference fixes.

											
										
										
											6 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
+								Overview
 								--------
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								**Expect a 1.25x - 7x speedup** and a **CPU usage reduction of at least 2x**,
 								depending on network conditions, modules executed, and time already spent by
 								targets on useful work. Mitogen cannot improve a module once it is executing,
 								it can only ensure the module executes as quickly as possible.
 								* **One connection is used per target**, in addition to one sudo invocation per
 								  user account. This is much better than SSH multiplexing combined with
 								  pipelining, as significant state can be maintained in RAM between steps, and
 								  system logs aren't spammed with repeat authentication events.
 								* **A single network roundtrip is used** to execute a step whose code already
 								  exists in RAM on the target. Eliminating multiplexed SSH channel creation
-												docs: more ansible updates

											
										
										
											7 years ago
+								  saves 4 ms runtime per 1 ms of network latency for every playbook step.
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								* **Processes are aggressively reused**, avoiding the cost of invoking Python
 								  and recompiling imports, saving 300-800 ms for every playbook step.
 								* Code is ephemerally cached in RAM, **reducing bandwidth usage by an order
 								  of magnitude** compared to SSH pipelining, with around 5x fewer frames
 								  traversing the network in a typical run.
-												docs: more accurate bullet.

											
										
										
											7 years ago
+								* **Fewer writes to the target filesystem occur**. In typical configurations,
 								  Ansible repeatedly rewrites and extracts ZIP files to multiple temporary
-												docs: glaring ancient typo.

											
										
										
											6 years ago
+								  directories on the target. Security issues relating to temporary files in
-												docs: more accurate bullet.

											
										
										
											7 years ago
+								  cross-account scenarios are entirely avoided.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: add 'will it work' intuition.

											
										
										
											6 years ago
+								The effect is most potent on playbooks that execute many **short-lived
 								actions**, where Ansible's overhead dominates the cost of the operation, for
 								example when executing large ``with_items`` loops to run simple commands or
 								write files.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: move installation section above demo

											
										
										
											7 years ago
+								Installation
 								------------
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+. Thoroughly review :ref:`noteworthy_differences` and :ref:`changelog`.
 . Download and extract |mitogen_url|.
 . Modify ``ansible.cfg``:
-												docs: move installation section above demo

											
										
										
											7 years ago
-												docs: link to PyPI release, not GitHub archive URL.

Now download counts are visible via PSF BigQuery.

											
										
										
											6 years ago
+								   .. parsed-literal::
-												docs: move installation section above demo

											
										
										
											7 years ago
 								        [defaults]
-												docs: link to PyPI release, not GitHub archive URL.

Now download counts are visible via PSF BigQuery.

											
										
										
											6 years ago
+								        strategy_plugins = /path/to/mitogen-|mitogen_version|/ansible_mitogen/plugins/strategy
-												docs: move installation section above demo

											
										
										
											7 years ago
+								        strategy = mitogen_linear
 								   The ``strategy`` key is optional. If omitted, the
 								   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable can be set on a
 								   per-run basis. Like ``mitogen_linear``, the ``mitogen_free`` strategy exists
 								   to mimic the ``free`` strategy.
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+. If targets have a restrictive ``sudoers`` file, add a rule like:
-												docs: add example sudoers rule

hat tip @seuf :)

											
										
										
											7 years ago
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+								   ::
-												docs: add example sudoers rule

hat tip @seuf :)

											
										
										
											7 years ago
 								       deploy = (ALL) NOPASSWD:/usr/bin/python -c*
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+. Subscribe to the `mitogen-announce mailing list
-												docs: minor tweaks.

											
										
										
											6 years ago
+								   <https://www.freelists.org/list/mitogen-announce>`_ to stay updated with new
 								   releases and important bug fixes.
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
-												docs: move installation section above demo

											
										
										
											7 years ago
-												docs: link to Ansible video demo

											
										
										
											7 years ago
+								Demo
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~
-												docs: link to Ansible video demo

											
										
										
											7 years ago
 								This demonstrates Ansible running a subset of the Mitogen integration tests
 								concurrent to an equivalent run using the extension.
 								.. raw:: html
-												docs: host demo on Vimeo.

											
										
										
											6 years ago
+								    <iframe src="https://player.vimeo.com/video/283272293?title=0&byline=0&portrait=0" width="720" height="439" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>
-												docs: link to Ansible video demo

											
										
										
											7 years ago
-												docs: reorder sections

											
										
										
											7 years ago
+								Testimonials
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~~~~~~~~~
-												docs: reorder sections

											
										
										
											7 years ago
 								* "With mitogen **my playbook runtime went from 45 minutes to just under 3
 								  minutes**. Awesome work!"
 								* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
 								  minutes**. Thanks!"
 								* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
 								  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
 								  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
 								  anymore** :)"
 								* "**Works like a charm**, thank you for your quick response"
 								* "I tried it out. **He is not kidding about the speed increase**."
-												docs: slightly bikeshed last testimonial

											
										
										
											7 years ago
+								* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
 								  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
 								  can't quite believe it."
-												Add testimonial from Clojars
											
										
										
											7 years ago
-												docs: add funny testimonial

											
										
										
											6 years ago
+								* "Enabling the mitogen plugin in ansible feels like switching from floppy to SSD"
-												docs: reorder sections

											
										
										
											7 years ago
-												docs: link changelog into Ansible install procedure

											
										
										
											6 years ago
 								.. _noteworthy_differences:
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Noteworthy Differences
 								----------------------
-												docs: update supported versions.

											
										
										
											6 years ago
+								* Ansible 2.3-2.6 are supported along with Python 2.6, 2.7 or 3.6. Verify your
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+								  installation is running one of these versions by checking ``ansible
 								  --version`` output.
-												docs: add 'raw' to 0.2 in-scope

											
										
										
											6 years ago
+								* The Ansible ``raw`` action executes as a regular Mitogen connection,
 								  precluding its use for installing Python on a target. This will be addressed
 								  soon.
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								* The ``doas``, ``su`` and ``sudo`` become methods are available. File bugs to
 								  register interest in more.
-												ansible: doc updates

											
										
										
											7 years ago
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								* The `docker <https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_,
 								  `jail <https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_,
 								  `local <https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_,
 								  `lxc <https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_,
 								  `lxd <https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_,
 								  and `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								  built-in connection types are supported, along with Mitogen-specific
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								  :ref:`machinectl <machinectl>`, :ref:`mitogen_doas <doas>`,
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								  :ref:`mitogen_su <su>`, :ref:`mitogen_sudo <sudo>`, and :ref:`setns <setns>`
 								  types. File bugs to register interest in others.
-												ansible: doc updates

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* Local commands execute in a reuseable interpreter created identically to
 								  interpreters on targets. Presently one interpreter per ``become_user``
 								  exists, and so only one local action may execute simultaneously.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								  Ansible usually permits up to ``forks`` simultaneous local actions. Any
 								  long-running local actions that execute for every target will experience
 								  artificial serialization, causing slowdown equivalent to `task_duration *
 								  num_targets`. This will be fixed soon.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* "Module Replacer" style modules are not supported. These rarely appear in
 								  practice, and light web searches failed to reveal many examples of them.
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* Ansible permits up to ``forks`` connections to be setup in parallel, whereas
 								  in Mitogen this is handled by a fixed-size thread pool. Up to 16 connections
 								  may be established in parallel by default, this can be modified by setting
 								  the ``MITOGEN_POOL_SIZE`` environment variable.
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											7 years ago
-												docs: update Changelog.

											
										
										
											6 years ago
+								* The ``ansible_python_interpreter`` variable is parsed using a restrictive
 								  :mod:`shell-like <shlex>` syntax, permitting values such as ``/usr/bin/env
 								  FOO=bar python``, which occur in practice. Ansible `documents this
 								  <https://docs.ansible.com/ansible/latest/user_guide/intro_inventory.html#ansible-python-interpreter>`_
 								  as an absolute path, however the implementation passes it unquoted through
 								  the shell, permitting arbitrary code to be injected.
-												docs: remove more Ansible limitations

											
										
										
											7 years ago
+								* Performance does not scale linearly with target count. This will improve over
 								  time.
-												issue #144: ansible: increase default pool size to 16.

											
										
										
											7 years ago
-												docs: more updates.

- accurate description of Ansible timeouts
- rough detach() sketch

											
										
										
											7 years ago
+								* SSH and ``become`` are treated distinctly when applying timeouts, and
 								  timeouts apply up to the point when the new interpreter is ready to accept
 								  messages. Ansible has two timeouts: ``ConnectTimeout`` for SSH, applying up
 								  to when authentication completes, and a separate parallel timeout up to when
 								  ``become`` authentication completes.
 								  For busy targets, Ansible may successfully execute a module where Mitogen
 								  would fail without increasing the timeout. For sick targets, Ansible may hang
 								  indefinitely after authentication without executing a command, for example
 								  due to a stuck filesystem IO appearing in ``$HOME/.profile``.
-												docs: note the semantic difference in Mitogen vs. Ansible timeouts

Related to issue #141.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								New Features & Notes
 								--------------------
 								Connection Delegation
 								~~~~~~~~~~~~~~~~~~~~~
 								.. image:: images/jumpbox.png
 								    :align: right
 								Included is a preview of **Connection Delegation**, a Mitogen-specific
-												docs: more ansible updates

											
										
										
											7 years ago
+								implementation of `stackable connection plug-ins`_. This enables connections
 								via a bastion, or container connections delegated via their host machine, where
 								reaching the host may entail further delegation.
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								.. _Stackable connection plug-ins: https://github.com/ansible/proposals/issues/25
 								Unlike with SSH forwarding Ansible has complete visibility of the final
 								topology, declarative configuration via static/dynamic inventory is possible,
 								and data can be cached and re-served, and code executed on every intermediary.
 								For example when targeting Docker containers on a remote machine, each module
 								need only be uploaded once for the first task and container that requires it,
 								then cached and served from the SSH account for every future task in any
 								container.
 								.. raw:: html
 								    <div style="clear: both;"></div>
 								.. caution::
 								    Connection delegation is a work in progress, bug reports are welcome.
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								    * Delegated connection setup is single-threaded; only one connection can be
 								      constructed in parallel per intermediary.
 								    * Inferring the configuration of intermediaries may be buggy, manifesting
 								      as duplicate connections between hops, due to not perfectly replicating
 								      the configuration Ansible would normally use for the intermediary.
-												docs: mention synchronize/delegation issue.

											
										
										
											6 years ago
+								    * Automatic tunnelling of SSH-dependent actions, such as the
 								      ``synchronize`` module, is not yet supported. This will be added in the
 .3 series.
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								To enable connection delegation, set ``mitogen_via=<inventory name>`` on the
 								command line, or as host and group variables.
 								.. code-block:: ini
 								    # Docker container on web1.dc1 is reachable via web1.dc1.
 								    [app-containers.web1.dc1]
 								    app1.web1.dc1 ansible_host=app1 ansible_connection=docker mitogen_via=web1.dc1
 								    # Web servers in DC1 are reachable via bastion.dc1
 								    [dc1]
 								    web1.dc1
 								    web2.dc1
 								    web3.dc1
 								    [dc1:vars]
 								    mitogen_via = bastion.dc1
 								    # Web servers in DC2 are reachable via bastion.dc2
 								    [dc2]
 								    web1.dc2
 								    web2.dc2
 								    web3.dc2
 								    [dc2:vars]
 								    mitogen_via = bastion.dc2
-												docs: show become_user example for connection delegation.

											
										
										
											7 years ago
+								    # Prod bastions are reachable via a magic account on a
 								    # corporate network gateway.
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								    [bastions]
-												docs: show become_user example for connection delegation.

											
										
										
											7 years ago
+								    bastion.dc1 mitogen_via=prod-ssh-access@corp-gateway.internal
 								    bastion.dc2 mitogen_via=prod-ssh-access@corp-gateway.internal
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								    [corp-gateway]
 								    corp-gateway.internal
 								File Transfer
 								~~~~~~~~~~~~~
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								Normally `sftp(1) <https://linux.die.net/man/1/sftp>`_ or
 								`scp(1) <https://linux.die.net/man/1/scp>`_ are used to copy files by the
-												docs: add links.

											
										
										
											7 years ago
+								`assemble <http://docs.ansible.com/ansible/latest/modules/assemble_module.html>`_,
 								`copy <http://docs.ansible.com/ansible/latest/modules/copy_module.html>`_,
 								`patch <http://docs.ansible.com/ansible/latest/modules/patch_module.html>`_,
 								`script <http://docs.ansible.com/ansible/latest/modules/script_module.html>`_,
 								`template <http://docs.ansible.com/ansible/latest/modules/template_module.html>`_, and
 								`unarchive <http://docs.ansible.com/ansible/latest/modules/unarchive_module.html>`_
 								actions, or when uploading modules with pipelining disabled. With Mitogen
 								copies are implemented natively using the same interpreters, connection tree,
 								and routed message bus that carries RPCs.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								This permits direct streaming between endpoints regardless of execution
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								environment, without necessitating temporary copies in intermediary accounts or
 								machines, for example when ``become`` is active, or in the presence of
-												docs: more ansible updates

											
										
										
											7 years ago
+								connection delegation. It also avoids the need to securely share temporary
 								files between accounts and machines.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								As the implementation is self-contained, it is simple to make improvements like
 								prioritizing transfers, supporting resume, or displaying progress bars.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: add file transfer safety section.

											
										
										
											7 years ago
+								Safety
 								^^^^^^
-												docs: more ansible updates

											
										
										
											7 years ago
+								Transfers proceed to a hidden file in the destination directory, with content
 								and metadata synced using `fsync(2) <https://linux.die.net/man/2/fsync>`_ prior
 								to rename over any existing file. This ensures the file remains consistent at
 								all times, in the event of a crash, or when overlapping `ansible-playbook` runs
 								deploy differing file contents.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								The `sftp(1) <https://linux.die.net/man/1/sftp>`_ and `scp(1)
-												docs: more ansible updates

											
										
										
											7 years ago
+								<https://linux.die.net/man/1/sftp>`_ tools may cause undetected data corruption
 								in the form of truncated files, or files containing intermingled data segments
 								from overlapping runs. As part of normal operation, both tools expose a window
 								where readers may observe inconsistent file contents.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
 								Performance
 								^^^^^^^^^^^
-												ansible: avoid roundtrip for small file transfers.

Calls to connect.put_file() where the file is sufficiently small enough
to fit in a single RPC proceed without waiting for an RPC response. If
the write fails the target context will log an exception, and any
subsequent step depending on the written file will fail.

I verified every built-in action plugin for file transfer calls, and
they all depend on the transferred file in the following step, so this
should be safe.

Reduces template/copy actions to 2-RTT, loop-20-templates.yml runtime
reduced from 30 seconds to 10 seconds over a 250ms link compared to
v0.2.2, and from 123 seconds compared to vanilla with pipelining
enabled.

											
										
										
											6 years ago
+								One roundtrip initiates a transfer larger than 124 KiB, while smaller transfers
 								are embedded in a 0-roundtrip remote call. For tools operating via SSH
 								multiplexing, 4 roundtrips are required to configure the IO channel, in
 								addition to the time to start the local and remote processes.
-												docs: more ansible updates

											
										
										
											7 years ago
 								An invocation of ``scp`` with an empty ``.profile`` over a 30 ms link takes
 								~140 ms, wasting 110 ms per invocation, rising to ~2,000 ms over a 400 ms
 								UK-India link, wasting 1,600 ms per invocation.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Interpreter Reuse
 								~~~~~~~~~~~~~~~~~
 								Python interpreters are aggressively reused to execute modules. While this
 								works well, it violates an unwritten assumption, and so it is possible an
 								earlier module execution could cause a subsequent module to fail, or for
 								unrelated modules to interact poorly due to bad hygiene, such as
 								monkey-patching that becomes stacked over repeat invocations.
 								Before reporting a bug relating to a misbehaving module, please re-run with
 								``-e mitogen_task_isolation=fork`` to see if the problem abates. This may be
 								set per-task, paying attention to the possibility an earlier task may be the
 								true cause of a failure.
 								.. code-block:: yaml
 								    - name: My task.
 								      broken_module:
 								        some_option: true
 								      vars:
 								        mitogen_task_isolation: fork
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								If forking solves your problem, **please report a bug regardless**, as an
 								internal list can be updated to prevent others bumping into the same problem.
 								Interpreter Recycling
 								~~~~~~~~~~~~~~~~~~~~~
 								There is a per-target limit on the number of interpreters. Once 20 exist, the
 								youngest is terminated before starting any new interpreter, preventing
 								situations like below from triggering memory exhaustion.
 								.. code-block:: yaml
 								    - hosts: corp_boxes
 								      vars:
 								        user_directory: [
 								          # 10,000 corporate user accounts
 								        ]
 								      tasks:
 								        - name: Create user bashrc
 								          become: true
 								          vars:
 								            ansible_become_user: "{{item}}"
 								          copy:
 								            src: bashrc
 								            dest: "~{{item}}/.bashrc"
 								          with_items: "{{user_directory}}"
 								The youngest is chosen to preserve useful accounts like ``root`` and
 								``postgresql`` that often appear early in a run, however it is simple to
 								construct a playbook that defeats this strategy. A future version will key
 								interpreters on the identity of their creating task, avoiding useful account
 								recycling in every scenario.
 								To modify the limit, set the ``MITOGEN_MAX_INTERPRETERS`` environment variable.
 								Standard IO
 								~~~~~~~~~~~
 								Ansible uses pseudo TTYs for most invocations to allow it to type interactive
 								passwords, however pseudo TTYs are disabled where standard input is required or
 								``sudo`` is not in use. Additionally when SSH multiplexing is enabled, a string
 								like ``Shared connection to localhost closed\r\n`` appears in ``stderr`` of
 								every invocation.
 								Mitogen does not naturally require either of these, as command output is always
 								embedded within framed messages, and it can simply call :py:func:`pty.openpty`
 								in any location an interactive password must be typed.
 								A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
 								merged together into a single ``stdout`` variable, with carriage returns
 								inserted in the output by the TTY layer. However ugly, the extension emulates
 								this precisely, to avoid breaking playbooks that expect text to appear in
 								specific variables with a particular linefeed style.
-												docs: update ansible risks/differences.

											
										
										
											7 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								.. _ansible_tempfiles:
 								Temporary Files
 								~~~~~~~~~~~~~~~
 								Ansible creates a variety of temporary files and directories depending on its
 								operating mode.
 								In the best case when pipelining is enabled and no temporary uploads are
 								required, for each task Ansible will create one directory below a
 								system-supplied temporary directory returned by :func:`tempfile.mkdtemp`, owned
-												issue #321: docs fixes

											
										
										
											6 years ago
+								by the target account a new-style module will execute in.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
 								In other cases depending on the task type, whether become is active, whether
 								the target become user is privileged, whether the associated action plugin
 								needs to upload files, and whether the associated module needs to store files,
 								Ansible may:
 								* Create a directory owned by the SSH user either under ``remote_tmp``, or a
 								  system-default directory,
 								* Upload action dependencies such as non-new style modules or rendered
 								  templates to that directory via `sftp(1) <https://linux.die.net/man/1/sftp>`_
 								  or `scp(1) <https://linux.die.net/man/1/scp>`_.
 								* Attempt to modify the directory's access control list to grant access to the
 								  target user using `setfacl(1) <https://linux.die.net/man/1/setfacl>`_,
 								  requiring that tool to be installed and a supported filesystem to be in use,
 								  or for the ``allow_world_readable_tmpfiles`` setting to be  :data:`True`.
 								* Create a directory owned by the target user either under ``remote_tmp``, or
 								  a system-default directory, if a new-style module needs a temporary directory
 								  and one was not previously created for a supporting file earlier in the
 								  invocation.
 								In summary, for each task Ansible may create one or more of:
 								* ``~ssh_user/<remote_tmp>/...`` owned by the login user,
 								* ``$TMPDIR/ansible-tmp-...`` owned by the login user,
 								* ``$TMPDIR/ansible-tmp-...`` owned by the login user with ACLs permitting
 								  write access by the become user,
 								* ``~become_user/<remote_tmp>/...`` owned by the become user,
 								* ``$TMPDIR/ansible_<modname>_payload_.../`` owned by the become user,
 								* ``$TMPDIR/ansible-module-tmp-.../`` owned by the become user.
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
+								A directory must exist to maintain compatibility with Ansible, as many modules
 								introspect :data:`sys.argv` to find a directory where they may write files,
 								however only one directory exists for the lifetime of each interpreter, its
 								location is consistent for each target account, and it is always privately
 								owned by that account.
-												issue #321: docs fixes

											
										
										
											6 years ago
+								The paths below are tried until one is found that is writeable and lives on a
 								filesystem with ``noexec`` disabled:
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
 . ``$variable`` and tilde-expanded ``remote_tmp`` setting from
 								   ``ansible.cfg``
 . ``$variable`` and tilde-expanded ``system_tmpdirs`` setting from
 								   ``ansible.cfg``
 . ``TMPDIR`` environment variable
 . ``TEMP`` environment variable
 . ``TMP`` environment variable
-												issue #321: take remote_tmp and system_tmpdirs into account.

Can't simply ignore these settings as some users may have weird noexec
filesystems.

											
										
										
											6 years ago
+. ``/tmp``
 . ``/var/tmp``
 . ``/usr/tmp``
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
+. Current working directory
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
 								As the directory is created once at startup, and its content is managed by code
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
+								running remotely, no additional network roundtrips are required to manage it
 								for each task requiring temporary storage.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
+								.. _ansible_process_env:
 								Process Environment Emulation
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Since Ansible discards processes after each module invocation, follow-up tasks
 								often (but not always) receive a new environment that will usually include
 								changes made by previous tasks. As such modifications are common, for
 								compatibility the extension emulates the existing behaviour as closely as
 								possible.
 								Some scenarios exist where emulation is impossible, for example, applying
 								``nsswitch.conf`` changes when ``nscd`` is not in use. If future scenarios
 								appear that cannot be solved through emulation, the extension will be updated
 								to automatically restart affected interpreters instead.
 								DNS Resolution
 								^^^^^^^^^^^^^^
 								Modifications to ``/etc/resolv.conf`` cause the glibc resolver configuration to
 								be reloaded via `res_init(3) <https://linux.die.net/man/3/res_init>`_. This
 								isn't necessary on some Linux distributions carrying glibc patches to
 								automatically check ``/etc/resolv.conf`` periodically, however it is necessary
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								on at least Debian and BSD derivatives.
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
 								``/etc/environment``
 								^^^^^^^^^^^^^^^^^^^^
 								When ``become: true`` is active or SSH multiplexing is disabled, modifications
 								by previous tasks to ``/etc/environment`` and ``$HOME/.pam_environment`` are
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								normally reflected, since the content of those files is reapplied by `PAM
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
+								<https://en.wikipedia.org/wiki/Pluggable_authentication_module>`_ via `pam_env`
 								on each authentication of ``sudo`` or ``sshd``.
 								Both files are monitored for changes, and changes are applied where it appears
 								safe to do so:
 								* New keys are added if they did not otherwise exist in the inherited
 								  environment, or previously had the same value as found in the file before it
 								  changed.
 								* Given a key (such as ``http_proxy``) added to the file where no such key
 								  exists in the environment, the key will be added.
 								* Given a key (such as ``PATH``) where an existing environment key exists with
 								  a different value, the update or deletion will be ignored, as it is likely
 								  the key was overridden elsewhere after `pam_env` ran, such as by
 								  ``/etc/profile``.
 								* Given a key removed from the file that had the same value as the existing
 								  environment key, the key will be removed.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
+								How Modules Execute
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~~~~~~~~~~~~~~~~
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
-												issue #106: docs: tidyup.

											
										
										
											7 years ago
+								Ansible usually modifies, recompresses and reuploads modules every time they
 								run on a target, work that must be repeated by the controller for every
 								playbook step.
 								With the extension any modifications are done on the target, allowing pristine
 								copies of modules to be cached, reducing the necessity to re-transfer modules
 								for each invocation. Unmodified modules are uploaded once on first use and
 								cached in RAM for the remainder of the run.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Binary**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Native executables detected using a complex heuristic. Arguments are
 								    supplied as a JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Module Replacer**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Python scripts detected by the presence of
 								    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
 								    is not yet supported.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**New-Style**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Python scripts detected by the presence of ``from ansible.module_utils.``
 								    appearing in their source. Arguments are supplied as JSON written to
 								    ``sys.stdin`` of the target interpreter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**JSON_ARGS**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
 								    in the script source. The interpreter directive (``#!interpreter``) is
 								    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
 								    if one is set. Arguments are supplied as JSON mixed into the script as a
 								    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**WANT_JSON**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Detected by the presence of ``WANT_JSON`` appearing in the script source.
 								    The interpreter directive is adjusted as above. Arguments are supplied as a
 								    JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Old Style**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Files not matching any of the above tests. The interpreter directive is
 								    adjusted as above. Arguments are supplied as a file whose path is the sole
 								    script parameter. The format of the file is ``"key=repr(value)[
 								    key2=repr(value2)[ ..]] "``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Runtime Patches
 								~~~~~~~~~~~~~~~
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Three small runtime patches are employed in ``strategy.py`` to hook into
 								desirable locations, in order to override uses of shell, the module executor,
 								and the mechanism for selecting a connection plug-in. While it is hoped the
 								patches can be avoided in future, for interesting versions of Ansible deployed
 								today this simply is not possible, and so they continue to be required.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								The patches are concise and behave conservatively, including by disabling
 								themselves when non-Mitogen connections are in use. Additional third party
 								plug-ins are unlikely to attempt similar patches, so the risk to an established
 								configuration should be minimal.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Flag Emulation
 								~~~~~~~~~~~~~~
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
 								option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
 								equivalent semantics. This allows:
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* robust support for common ``ansible.cfg`` tricks without reconfiguration,
 								  such as forwarding SSH agents across ``sudo`` invocations,
 								* reporting on conflicting flag combinations,
 								* reporting on unsupported flag combinations,
 								* internally special-casing certain behaviour (like recursive agent forwarding)
 								  without boring the user with the details,
 								* avoiding opening the extension up to untestable scenarios where users can
 								  insert arbitrary garbage between Mitogen and the components it integrates
 								  with,
 								* precise emulation by an alternative implementation, for example if Mitogen
 								  grew support for Paramiko.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								Connection Types
 								----------------
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								Matching Ansible, connection variables are treated on a per-task basis, causing
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								establishment of additional reuseable interpreters as necessary to match the
 								configuration of each task.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								.. _doas:
 								Doas
 								~~~~
 								``doas`` can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_become_exe``: path to ``doas`` binary.
 								* ``ansible_become_user`` (default: ``root``)
 								* ``ansible_become_pass`` (default: assume passwordless)
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_doas`` connection method:
 								* The inventory hostname has no special meaning.
 								* ``ansible_user``: username to use.
 								* ``ansible_password``: password to use.
 								* ``ansible_python_interpreter``
-												docs: more ansible updates

											
										
										
											7 years ago
+								.. _method-docker:
-												docs: better connection type docs

											
										
										
											7 years ago
+								Docker
 								~~~~~~
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `docker
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_ except
-												docs: better connection type docs

											
										
										
											7 years ago
+								connection delegation is supported.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``ansible_host``: Name of Docker container (default: inventory hostname).
 								* ``ansible_user``: Name of user within the container to execute as.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								FreeBSD Jail
 								~~~~~~~~~~~~
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `jail
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_ except
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								connection delegation is supported.
 								* ``ansible_host``: Name of jail (default: inventory hostname).
 								* ``ansible_user``: Name of user within the jail to execute as.
 								Local
 								~~~~~
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `local
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_ except
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								connection delegation is supported.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											7 years ago
+								* ``ansible_python_interpreter``
-												ansible: migrate logging variables into utils.

											
										
										
											7 years ago
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								Process Model
 								^^^^^^^^^^^^^
 								Ansible usually executes local connection commands as a transient subprocess of
 								the forked worker executing a task. With the extension, the local connection
 								exists as a persistent subprocess of the connection multiplexer.
 								This means that global state mutations made to the top-level Ansible process
 								that are normally visible to newly forked subprocesses, such as vars plug-ins
 								that modify the environment, will not be reflected when executing local
 								commands without additional effort.
 								During execution the extension presently mimics the working directory and
 								process environment inheritence of regular Ansible, however it is possible some
 								additional differences exist that may break existing playbooks.
-												docs: more ansible updates

											
										
										
											7 years ago
+								.. _method-lxc:
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								LXC
 								~~~
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								Connect to classic LXC containers, like `lxc
 								<https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_ except
 								connection delegation is supported, and ``lxc-attach`` is always used rather
 								than the LXC Python bindings, as is usual with ``lxc``.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
 								The ``lxc-attach`` command must be available on the host machine.
 								* ``ansible_python_interpreter``
 								* ``ansible_host``: Name of LXC container (default: inventory hostname).
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								.. _method-lxd:
 								LXD
 								~~~
 								Connect to modern LXD containers, like `lxd
 								<https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_ except
 								connection delegation is supported. The ``lxc`` command must be available on
 								the host machine.
 								* ``ansible_python_interpreter``
 								* ``ansible_host``: Name of LXC container (default: inventory hostname).
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								.. _machinectl:
 								Machinectl
 								~~~~~~~~~~
 								Like the `machinectl third party plugin
 								<https://github.com/BaxterStockman/ansible-connection-machinectl>`_ except
 								connection delegation is supported. This is a light wrapper around the
 								:ref:`setns <setns>` method.
 								* ``ansible_host``: Name of Docker container (default: inventory hostname).
 								* ``ansible_user``: Name of user within the container to execute as.
 								* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
 								  as ``/bin/machinectl``.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								.. _setns:
-												docs: better connection type docs

											
										
										
											7 years ago
+								Setns
 								~~~~~
-												issue #150: ansible: add basic Docker support.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								The ``setns`` method connects to Linux containers via `setns(2)
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								<https://linux.die.net/man/2/setns>`_. Unlike :ref:`method-docker`,
 								:ref:`method-lxc`, and :ref:`method-lxd` the namespace transition is handled
 								internally, ensuring optimal throughput to the child. This is necessary for
 								:ref:`machinectl` where only PTY channels are supported.
-												docs: better connection type docs

											
										
										
											7 years ago
-												docs: more links

											
										
										
											7 years ago
+								A utility program must be installed to discover the PID of the container's root
 								process.
-												docs: better connection type docs

											
										
										
											7 years ago
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								* ``mitogen_kind``: one of ``docker``, ``lxc``, ``lxd`` or ``machinectl``.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``ansible_host``: Name of container as it is known to the corresponding tool
 								  (default: inventory hostname).
-												setns: support changing user.

To match existing third party plugin.

											
										
										
											7 years ago
+								* ``ansible_user``: Name of user within the container to execute as.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``mitogen_docker_path``: path to Docker if not available on the system path.
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								* ``mitogen_lxc_path``: path to LXD's ``lxc`` command if not available as
 								  ``lxc-info``.
 								* ``mitogen_lxc_info_path``: path to LXC classic's ``lxc-info`` command if not
 								  available as ``lxc-info``.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
 								  as ``/bin/machinectl``.
-												ansible: support su become method.

											
										
										
											7 years ago
+								.. _su:
 								Su
 								~~
 								Su can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_su_exe``, ``ansible_become_exe``
 								* ``ansible_su_user``, ``ansible_become_user`` (default: ``root``)
 								* ``ansible_su_pass``, ``ansible_become_pass`` (default: assume passwordless)
 								* ``su_flags``, ``become_flags``
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_su`` connection method:
 								* The inventory hostname has no special meaning.
 								* ``ansible_user``: username to su as.
 								* ``ansible_password``: password to su as.
 								* ``ansible_python_interpreter``
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								.. _sudo:
 								Sudo
 								~~~~
 								Sudo can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_sudo_exe``, ``ansible_become_exe``
 								* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
 								* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
 								* ``sudo_flags``, ``become_flags``
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_sudo`` connection method:
-												docs: more ansible updates

											
										
										
											7 years ago
+								* The inventory hostname has no special meaning.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								* ``ansible_user``: username to sudo as.
 								* ``ansible_password``: password to sudo as.
 								* ``sudo_flags``, ``become_flags``
 								* ``ansible_python_interpreter``
-												docs: better connection type docs

											
										
										
											7 years ago
+								SSH
 								~~~
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								Like `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
-												docs: more ansible updates

											
										
										
											7 years ago
+								except connection delegation is supported.
-												docs: better connection type docs

											
										
										
											7 years ago
 								* ``ansible_ssh_timeout``
 								* ``ansible_host``, ``ansible_ssh_host``
 								* ``ansible_user``, ``ansible_ssh_user``
 								* ``ansible_port``, ``ssh_port``
 								* ``ansible_ssh_executable``, ``ssh_executable``
 								* ``ansible_ssh_private_key_file``
 								* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
 								* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``
-												issue #278: ansible: support mitogen_ssh_debug_level variable.

											
										
										
											7 years ago
+								* ``mitogen_ssh_debug_level``: integer between `0..3` indicating the SSH client
 								  debug level. Ansible must also be run with '-vvv' to view the output.
-												Add link to IRC; closes #116

											
										
										
											7 years ago
-												ansible: migrate logging variables into utils.

											
										
										
											7 years ago
+								Debugging
 								---------
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Diagnostics and use of the :py:mod:`logging` package output on the target
 								machine are usually discarded. With Mitogen, all of this is captured and
 								returned to the controller, where it can be viewed as desired with ``-vvv``.
 								Basic high level logs are produced with ``-vvv``, with logging of all IO on the
 								controller with ``-vvvv`` or higher.
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
 								Although use of standard IO and the logging package on the target is forwarded
 								to the controller, it is not possible to receive IO activity logs, as the
-												docs: minor tweaks.

											
										
										
											6 years ago
+								process of receiving those logs would would itself generate IO activity. To
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
+								receive a complete trace of every process on every machine, file-based logging
 								is necessary. File-based logging can be enabled by setting
 								``MITOGEN_ROUTER_DEBUG=1`` in your environment.
-												docs: Ansible logging update (#111)

											
										
										
											7 years ago
 								When file-based logging is enabled, one file per context will be created on the
-												docs: So many typos

											
										
										
											7 years ago
+								local machine and every target machine, as ``/tmp/mitogen.<pid>.log``.
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												ansible: MITOGEN_DUMP_THREAD_STACKS for mux process too

											
										
										
											7 years ago
+								If you are experiencing a hang, ``MITOGEN_DUMP_THREAD_STACKS=1`` causes every
-												docs: minor tweaks.

											
										
										
											6 years ago
+								process on every machine to dump every thread stack into the logging framework
 								every 5 seconds.
-												ansible: MITOGEN_DUMP_THREAD_STACKS for mux process too

											
										
										
											7 years ago
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Getting Help
 								~~~~~~~~~~~~
 								Some users and developers hang out on the
 								`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
 								FreeNode IRC network.
-												docs: tidy ansible docs.

											
										
										
											7 years ago
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Sample Profiles
 								---------------
-												docs: tidy ansible docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Local VM connection
 								~~~~~~~~~~~~~~~~~~~
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								This demonstrates Mitogen vs. connection pipelining to a local VM, executing
 								the 100 simple repeated steps of ``run_hostname_100_times.yml`` from the
 								examples directory. Mitogen requires **43x less bandwidth and 4.25x less
 								time**.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								.. image:: images/ansible/run_hostname_100_times.png
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Kathmandu to Paris
 								~~~~~~~~~~~~~~~~~~
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								This is a full Django application playbook over a ~180ms link between Kathmandu
 								and Paris. Aside from large pauses where the host performs useful work, the
 								high latency of this link means Mitogen only manages a 1.7x speedup.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Many early roundtrips are due to inefficiencies in Mitogen's importer that will
 								be fixed over time, however the majority, comprising at least 10 seconds, are
 								due to idling while the host's previous result and next command are in-flight
 								on the network.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								The initial extension lays groundwork for exciting structural changes to the
 								execution model: a future version will tackle latency head-on by delegating
 								some control flow to the target host, melding the performance and scalability
 								benefits of pull-based operation with the management simplicity of push-based
 								operation.
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								.. image:: images/ansible/costapp.png
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago