mitogen/docs/ansible.rst


Mitogen for Ansible
===================

.. image:: images/ansible/ansible_mitogen.svg
    :class: mitogen-right-200 mitogen-logo-wrap

An extension to `Ansible`_ is included that implements connections over
Mitogen, replacing embedded shell invocations with pure-Python equivalents
invoked via highly efficient remote procedure calls to persistent interpreters
tunnelled over SSH. No changes are required to target hosts.

The extension is approaching stability and real-world usage is encouraged. `Bug
reports`_ are welcome: Ansible is huge, and only wide testing will ensure
soundness.

.. _Ansible: https://www.ansible.com/

.. _Bug reports: https://goo.gl/yLKZiJ


Overview
--------

**Expect a 1.25x - 7x speedup** and a **CPU usage reduction of at least 2x**,
depending on network conditions, modules executed, and time already spent by
targets on useful work. Mitogen cannot improve a module once it is executing,
it can only ensure the module executes as quickly as possible.

* **One connection is used per target**, in addition to one sudo invocation per
  user account. This is much better than SSH multiplexing combined with
  pipelining, as significant state can be maintained in RAM between steps, and
  system logs aren't spammed with repeat authentication events.

* **A single network roundtrip is used** to execute a step whose code already
  exists in RAM on the target. Eliminating multiplexed SSH channel creation
  saves 4 ms runtime per 1 ms of network latency for every playbook step.

* **Processes are aggressively reused**, avoiding the cost of invoking Python
  and recompiling imports, saving 300-800 ms for every playbook step.

* Code is ephemerally cached in RAM, **reducing bandwidth usage by an order
  of magnitude** compared to SSH pipelining, with around 5x fewer frames
  traversing the network in a typical run.

* **Fewer writes to the target filesystem occur**. In typical configurations,
  Ansible repeatedly rewrites and extracts ZIP files to multiple temporary
  directories on the target. Security issues relating to temporary files in
  cross-account scenarios are entirely avoided.

The effect is most potent on playbooks that execute many **short-lived
actions**, where Ansible's overhead dominates the cost of the operation, for
example when executing large ``with_items`` loops to run simple commands or
write files.


Installation
------------

1. Thoroughly review :ref:`noteworthy_differences` and :ref:`known_issues`.
2. Download and extract |mitogen_url|.
3. Modify ``ansible.cfg``:

   .. parsed-literal::

        [defaults]
        strategy_plugins = /path/to/mitogen-|mitogen_version|/ansible_mitogen/plugins/strategy
        strategy = mitogen_linear

   The ``strategy`` key is optional. If omitted, the
   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable can be set on a
   per-run basis. Like ``mitogen_linear``, the ``mitogen_free`` and
   ``mitogen_host_pinned`` strategies exists to mimic the ``free`` and
   ``host_pinned`` strategies.

4. If targets have a restrictive ``sudoers`` file, add a rule like:

   ::

       deploy = (ALL) NOPASSWD:/usr/bin/python -c*

5.

   .. raw:: html

    <form action="https://www.freelists.org/cgi-bin/subscription.cgi" method="post">
        Releases occur frequently and often include important fixes. Subscribe
        to the <a
        href="https://www.freelists.org/list/mitogen-announce">mitogen-announce
        mailing list</a> be notified of new releases.

        <p>
        <input type="email" placeholder="E-mail Address" name="email" style="font-size: 105%;">
        <input type=hidden name="list" value="mitogen-announce">
        <!-- <input type=hidden name="url_or_message" value="https://mitogen.readthedocs.io/en/stable/ansible.html#installation">-->
        <input type="hidden" name="action" value="subscribe">
        <button type="submit" style="font-size: 105%;">
            Subscribe
        </button>
        </p>
    </form>


Demo
~~~~

This demonstrates Ansible running a subset of the Mitogen integration tests
concurrent to an equivalent run using the extension.

.. raw:: html

    <iframe src="https://player.vimeo.com/video/283272293?title=0&byline=0&portrait=0" width="720" height="439" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>


Testimonials
~~~~~~~~~~~~

* "With mitogen **my playbook runtime went from 45 minutes to just under 3
  minutes**. Awesome work!"

* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
  minutes**. Thanks!"

* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
  anymore** :)"

* "**Works like a charm**, thank you for your quick response"

* "I tried it out. **He is not kidding about the speed increase**."

* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
  can't quite believe it."

* "Enabling the mitogen plugin in ansible feels like switching from floppy to SSD"


.. _noteworthy_differences:

Noteworthy Differences
----------------------

* Ansible 2.3-2.7 are supported along with Python 2.6, 2.7 or 3.6. Verify your
  installation is running one of these versions by checking ``ansible
  --version`` output.

* The Ansible ``raw`` action executes as a regular Mitogen connection,
  precluding its use for installing Python on a target. This will be addressed
  soon.

* The ``doas``, ``su`` and ``sudo`` become methods are available. File bugs to
  register interest in more.

* The `docker <https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_,
  `jail <https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_,
  `kubectl <https://docs.ansible.com/ansible/2.6/plugins/connection/kubectl.html>`_,
  `local <https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_,
  `lxc <https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_,
  `lxd <https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_,
  and `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
  built-in connection types are supported, along with Mitogen-specific
  :ref:`machinectl <machinectl>`, :ref:`mitogen_doas <doas>`,
  :ref:`mitogen_su <su>`, :ref:`mitogen_sudo <sudo>`, and :ref:`setns <setns>`
  types. File bugs to register interest in others.

* Local commands execute in a reuseable interpreter created identically to
  interpreters on targets. Presently one interpreter per ``become_user``
  exists, and so only one local action may execute simultaneously.

  Ansible usually permits up to ``forks`` simultaneous local actions. Any
  long-running local actions that execute for every target will experience
  artificial serialization, causing slowdown equivalent to `task_duration *
  num_targets`. This will be fixed soon.

* "Module Replacer" style modules are not supported. These rarely appear in
  practice, and light web searches failed to reveal many examples of them.

* Ansible permits up to ``forks`` connections to be setup in parallel, whereas
  in Mitogen this is handled by a fixed-size thread pool. Up to 16 connections
  may be established in parallel by default, this can be modified by setting
  the ``MITOGEN_POOL_SIZE`` environment variable.

* The ``ansible_python_interpreter`` variable is parsed using a restrictive
  :mod:`shell-like <shlex>` syntax, permitting values such as ``/usr/bin/env
  FOO=bar python``, which occur in practice. Ansible `documents this
  <https://docs.ansible.com/ansible/latest/user_guide/intro_inventory.html#ansible-python-interpreter>`_
  as an absolute path, however the implementation passes it unquoted through
  the shell, permitting arbitrary code to be injected.

* Performance does not scale linearly with target count. This will improve over
  time.

* SSH and ``become`` are treated distinctly when applying timeouts, and
  timeouts apply up to the point when the new interpreter is ready to accept
  messages. Ansible has two timeouts: ``ConnectTimeout`` for SSH, applying up
  to when authentication completes, and a separate parallel timeout up to when
  ``become`` authentication completes.

  For busy targets, Ansible may successfully execute a module where Mitogen
  would fail without increasing the timeout. For sick targets, Ansible may hang
  indefinitely after authentication without executing a command, for example
  due to a stuck filesystem IO appearing in ``$HOME/.profile``.


New Features & Notes
--------------------


Connection Delegation
~~~~~~~~~~~~~~~~~~~~~

.. image:: images/jumpbox.svg
    :class: mitogen-right-275

Included is a preview of **Connection Delegation**, a Mitogen-specific
implementation of `stackable connection plug-ins`_. This enables connections
via a bastion, or container connections delegated via their host machine, where
reaching the host may entail further delegation.

.. _Stackable connection plug-ins: https://github.com/ansible/proposals/issues/25

Unlike with SSH forwarding Ansible has complete visibility of the final
topology, declarative configuration via static/dynamic inventory is possible,
and data can be cached and re-served, and code executed on every intermediary.

For example when targeting Docker containers on a remote machine, each module
need only be uploaded once for the first task and container that requires it,
then cached and served from the SSH account for every future task in any
container.

.. raw:: html

    <div style="clear: both;"></div>


.. caution::

    Connection delegation is a work in progress, bug reports are welcome.

    * Delegated connection setup is single-threaded; only one connection can be
      constructed in parallel per intermediary.

    * Inferring the configuration of intermediaries may be buggy, manifesting
      as duplicate connections between hops, due to not perfectly replicating
      the configuration Ansible would normally use for the intermediary.

    * Automatic tunnelling of SSH-dependent actions, such as the
      ``synchronize`` module, is not yet supported. This will be added in the
      0.3 series.

To enable connection delegation, set ``mitogen_via=<inventory name>`` on the
command line, or as host and group variables.

.. code-block:: ini

    # Docker container on web1.dc1 is reachable via web1.dc1.
    [app-containers.web1.dc1]
    app1.web1.dc1 ansible_host=app1 ansible_connection=docker mitogen_via=web1.dc1

    # Web servers in DC1 are reachable via bastion.dc1
    [dc1]
    web1.dc1
    web2.dc1
    web3.dc1

    [dc1:vars]
    mitogen_via = bastion.dc1

    # Web servers in DC2 are reachable via bastion.dc2
    [dc2]
    web1.dc2
    web2.dc2
    web3.dc2

    [dc2:vars]
    mitogen_via = bastion.dc2

    # Prod bastions are reachable via a magic account on a
    # corporate network gateway.
    [bastions]
    bastion.dc1 mitogen_via=prod-ssh-access@corp-gateway.internal
    bastion.dc2 mitogen_via=prod-ssh-access@corp-gateway.internal

    [corp-gateway]
    corp-gateway.internal


File Transfer
~~~~~~~~~~~~~

Normally `sftp(1)`_ or `scp(1)`_ are used to copy files by the
`assemble <http://docs.ansible.com/ansible/latest/modules/assemble_module.html>`_,
`copy <http://docs.ansible.com/ansible/latest/modules/copy_module.html>`_,
`patch <http://docs.ansible.com/ansible/latest/modules/patch_module.html>`_,
`script <http://docs.ansible.com/ansible/latest/modules/script_module.html>`_,
`template <http://docs.ansible.com/ansible/latest/modules/template_module.html>`_, and
`unarchive <http://docs.ansible.com/ansible/latest/modules/unarchive_module.html>`_
actions, or when uploading modules with pipelining disabled. With Mitogen
copies are implemented natively using the same interpreters, connection tree,
and routed message bus that carries RPCs.

.. _scp(1): https://linux.die.net/man/1/scp
.. _sftp(1): https://linux.die.net/man/1/sftp

This permits direct streaming between endpoints regardless of execution
environment, without necessitating temporary copies in intermediary accounts or
machines, for example when ``become`` is active, or in the presence of
connection delegation. It also avoids the need to securely share temporary
files between accounts and machines.

As the implementation is self-contained, it is simple to make improvements like
prioritizing transfers, supporting resume, or displaying progress bars.


Safety
^^^^^^

Transfers proceed to a hidden file in the destination directory, with content
and metadata synced using `fsync(2) <https://linux.die.net/man/2/fsync>`_ prior
to rename over any existing file. This ensures the file remains consistent at
all times, in the event of a crash, or when overlapping `ansible-playbook` runs
deploy differing file contents.

The `sftp(1)`_ and `scp(1)`_ tools may cause undetected data corruption
in the form of truncated files, or files containing intermingled data segments
from overlapping runs. As part of normal operation, both tools expose a window
where readers may observe inconsistent file contents.


Performance
^^^^^^^^^^^

One roundtrip initiates a transfer larger than 124 KiB, while smaller transfers
are embedded in a 0-roundtrip pipelined call. For tools operating via SSH
multiplexing, 4 roundtrips are required to configure the IO channel, followed
by 6 roundtrips to transfer the file in the case of ``sftp``, in addition to
the time to start the local and remote processes.

An invocation of ``scp`` with an empty ``.profile`` over a 30 ms link takes
~140 ms, wasting 110 ms per invocation, rising to ~2,000 ms over a 400 ms
UK-India link, wasting 1,600 ms per invocation.


Interpreter Reuse
~~~~~~~~~~~~~~~~~

Python interpreters are aggressively reused to execute modules. While this
works well, it violates an unwritten assumption, and so it is possible an
earlier module execution could cause a subsequent module to fail, or for
unrelated modules to interact poorly due to bad hygiene, such as
monkey-patching that becomes stacked over repeat invocations.

Before reporting a bug relating to a misbehaving module, please re-run with
``-e mitogen_task_isolation=fork`` to see if the problem abates. This may be
set per-task, paying attention to the possibility an earlier task may be the
true cause of a failure.

.. code-block:: yaml

    - name: My task.
      broken_module:
        some_option: true
      vars:
        mitogen_task_isolation: fork

If forking solves your problem, **please report a bug regardless**, as an
internal list can be updated to prevent others bumping into the same problem.


Interpreter Recycling
~~~~~~~~~~~~~~~~~~~~~

There is a per-target limit on the number of interpreters. Once 20 exist, the
youngest is terminated before starting any new interpreter, preventing
situations like below from triggering memory exhaustion.

.. code-block:: yaml

    - hosts: corp_boxes
      vars:
        user_directory: [
          # 10,000 corporate user accounts
        ]
      tasks:
        - name: Create user bashrc
          become: true
          vars:
            ansible_become_user: "{{item}}"
          copy:
            src: bashrc
            dest: "~{{item}}/.bashrc"
          with_items: "{{user_directory}}"

The youngest is chosen to preserve useful accounts like ``root`` and
``postgresql`` that often appear early in a run, however it is simple to
construct a playbook that defeats this strategy. A future version will key
interpreters on the identity of their creating task, avoiding useful account
recycling in every scenario.

To modify the limit, set the ``MITOGEN_MAX_INTERPRETERS`` environment variable.


Standard IO
~~~~~~~~~~~

Ansible uses pseudo TTYs for most invocations to allow it to type interactive
passwords, however pseudo TTYs are disabled where standard input is required or
``sudo`` is not in use. Additionally when SSH multiplexing is enabled, a string
like ``Shared connection to localhost closed\r\n`` appears in ``stderr`` of
every invocation.

Mitogen does not naturally require either of these, as command output is always
embedded within framed messages, and it can simply call :py:func:`pty.openpty`
in any location an interactive password must be typed.

A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
merged together into a single ``stdout`` variable, with carriage returns
inserted in the output by the TTY layer. However ugly, the extension emulates
this precisely, to avoid breaking playbooks that expect text to appear in
specific variables with a particular linefeed style.


.. _ansible_tempfiles:

Temporary Files
~~~~~~~~~~~~~~~

Temporary file handling in Ansible is tricky, and the precise behaviour varies
across major versions. A variety of temporary files and directories are
created, depending on the operating mode.

In the best case when pipelining is enabled and no temporary uploads are
required, for each task Ansible will create one directory below a
system-supplied temporary directory returned by :func:`tempfile.mkdtemp`, owned
by the target account a new-style module will execute in.

In other cases depending on the task type, whether become is active, whether
the target become user is privileged, whether the associated action plugin
needs to upload files, and whether the associated module needs to store files,
Ansible may:

* Create a directory owned by the SSH user either under ``remote_tmp``, or a
  system-default directory,
* Upload action dependencies such as non-new style modules or rendered
  templates to that directory via `sftp(1)`_ or `scp(1)`_.
* Attempt to modify the directory's access control list to grant access to the
  target user using `setfacl(1) <https://linux.die.net/man/1/setfacl>`_,
  requiring that tool to be installed and a supported filesystem to be in use,
  or for the ``allow_world_readable_tmpfiles`` setting to be  :data:`True`.
* Create a directory owned by the target user either under ``remote_tmp``, or
  a system-default directory, if a new-style module needs a temporary directory
  and one was not previously created for a supporting file earlier in the
  invocation.

In summary, for each task Ansible may create one or more of:

* ``~ssh_user/<remote_tmp>/...`` owned by the login user,
* ``$TMPDIR/ansible-tmp-...`` owned by the login user,
* ``$TMPDIR/ansible-tmp-...`` owned by the login user with ACLs permitting
  write access by the become user,
* ``~become_user/<remote_tmp>/...`` owned by the become user,
* ``$TMPDIR/ansible_<modname>_payload_.../`` owned by the become user,
* ``$TMPDIR/ansible-module-tmp-.../`` owned by the become user.


Mitogen for Ansible
^^^^^^^^^^^^^^^^^^^

As Mitogen can execute new-style modules from RAM, and transfer files to target
user accounts without first writing an intermediary file in any separate login
account, handling is relatively simplified.

Temporary directories must exist to maintain compatibility with Ansible, as
many modules introspect :data:`sys.argv` to find a directory where they may
write files, however only one directory exists for the lifetime of each
interpreter, its location is consistent for each account, and it is always
privately owned by that account.

During startup, the persistent remote interpreter tries the paths below until
one is found that is writeable and lives on a filesystem with ``noexec``
disabled:

1. ``$variable`` and tilde-expanded ``remote_tmp`` setting from
   ``ansible.cfg``
2. ``$variable`` and tilde-expanded ``system_tmpdirs`` setting from
   ``ansible.cfg``
3. ``TMPDIR`` environment variable
4. ``TEMP`` environment variable
5. ``TMP`` environment variable
6. ``/tmp``
7. ``/var/tmp``
8. ``/usr/tmp``
9. Current working directory

The directory is created at startup and recursively destroyed during interpeter
shutdown. Subdirectories are automatically created and destroyed by the
controller for each task that requires them.


Round-trip Avoidance
^^^^^^^^^^^^^^^^^^^^

Mitogen avoids many round-trips due to temporary file handling that are present
in regular Ansible:

* During task startup, it is not necessary to wait until the target has
  succeeded in creating a temporary directory. Instead, any failed attempt to
  create the directory will cause any subsequent RPC belonging to the same task
  to fail with the error that occurred.

* As temporary directories are privately owned by the target user account,
  operations relating to modifying the directory to support cross-account
  access are avoided.

* An explicit work-around is included to avoid the `copy` and `template`
  actions needlessly triggering a round-trip to set their temporary file as
  executable.

* During task shutdown, it is not necessary to wait to learn if the target has
  succeeded in deleting a temporary directory, since any error that may occur
  can is logged asynchronously via the logging framework, and the persistent
  remote interpreter arranges for all subdirectories to be destroyed during
  interpreter shutdown.


.. _ansible_process_env:

Process Environment Emulation
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Since Ansible discards processes after each module invocation, follow-up tasks
often (but not always) receive a new environment that will usually include
changes made by previous tasks. As such modifications are common, for
compatibility the extension emulates the existing behaviour as closely as
possible.

Some scenarios exist where emulation is impossible, for example, applying
``nsswitch.conf`` changes when ``nscd`` is not in use. If future scenarios
appear that cannot be solved through emulation, the extension will be updated
to automatically restart affected interpreters instead.


DNS Resolution
^^^^^^^^^^^^^^

Modifications to ``/etc/resolv.conf`` cause the glibc resolver configuration to
be reloaded via `res_init(3) <https://linux.die.net/man/3/res_init>`_. This
isn't necessary on some Linux distributions carrying glibc patches to
automatically check ``/etc/resolv.conf`` periodically, however it is necessary
on at least Debian and BSD derivatives.


``/etc/environment``
^^^^^^^^^^^^^^^^^^^^

When ``become: true`` is active or SSH multiplexing is disabled, modifications
by previous tasks to ``/etc/environment`` and ``$HOME/.pam_environment`` are
normally reflected, since the content of those files is reapplied by `PAM
<https://en.wikipedia.org/wiki/Pluggable_authentication_module>`_ via `pam_env`
on each authentication of ``sudo`` or ``sshd``.

Both files are monitored for changes, and changes are applied where it appears
safe to do so:

* New keys are added if they did not otherwise exist in the inherited
  environment, or previously had the same value as found in the file before it
  changed.

* Given a key (such as ``http_proxy``) added to the file where no such key
  exists in the environment, the key will be added.

* Given a key (such as ``PATH``) where an existing environment key exists with
  a different value, the update or deletion will be ignored, as it is likely
  the key was overridden elsewhere after `pam_env` ran, such as by
  ``/etc/profile``.

* Given a key removed from the file that had the same value as the existing
  environment key, the key will be removed.


How Modules Execute
~~~~~~~~~~~~~~~~~~~

Ansible usually modifies, recompresses and reuploads modules every time they
run on a target, work that must be repeated by the controller for every
playbook step.

With the extension any modifications are done on the target, allowing pristine
copies of modules to be cached, reducing the necessity to re-transfer modules
for each invocation. Unmodified modules are uploaded once on first use and
cached in RAM for the remainder of the run.

**Binary**
    Native executables detected using a complex heuristic. Arguments are
    supplied as a JSON file whose path is the sole script parameter.

**Module Replacer**
    Python scripts detected by the presence of
    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
    is not yet supported.

**New-Style**
    Python scripts detected by the presence of ``from ansible.module_utils.``
    appearing in their source. Arguments are supplied as JSON written to
    ``sys.stdin`` of the target interpreter.

**JSON_ARGS**
    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
    in the script source. The interpreter directive (``#!interpreter``) is
    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
    if one is set. Arguments are supplied as JSON mixed into the script as a
    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.

**WANT_JSON**
    Detected by the presence of ``WANT_JSON`` appearing in the script source.
    The interpreter directive is adjusted as above. Arguments are supplied as a
    JSON file whose path is the sole script parameter.

**Old Style**
    Files not matching any of the above tests. The interpreter directive is
    adjusted as above. Arguments are supplied as a file whose path is the sole
    script parameter. The format of the file is ``"key=repr(value)[
    key2=repr(value2)[ ..]] "``.


Runtime Patches
~~~~~~~~~~~~~~~

Three small runtime patches are employed in ``strategy.py`` to hook into
desirable locations, in order to override uses of shell, the module executor,
and the mechanism for selecting a connection plug-in. While it is hoped the
patches can be avoided in future, for interesting versions of Ansible deployed
today this simply is not possible, and so they continue to be required.

The patches are concise and behave conservatively, including by disabling
themselves when non-Mitogen connections are in use. Additional third party
plug-ins are unlikely to attempt similar patches, so the risk to an established
configuration should be minimal.


Flag Emulation
~~~~~~~~~~~~~~

Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
equivalent semantics. This allows:

* robust support for common ``ansible.cfg`` tricks without reconfiguration,
  such as forwarding SSH agents across ``sudo`` invocations,
* reporting on conflicting flag combinations,
* reporting on unsupported flag combinations,
* internally special-casing certain behaviour (like recursive agent forwarding)
  without boring the user with the details,
* avoiding opening the extension up to untestable scenarios where users can
  insert arbitrary garbage between Mitogen and the components it integrates
  with,
* precise emulation by an alternative implementation, for example if Mitogen
  grew support for Paramiko.


Connection Types
----------------

Matching Ansible, connection variables are treated on a per-task basis, causing
establishment of additional reuseable interpreters as necessary to match the
configuration of each task.


.. _doas:

Doas
~~~~

``doas`` can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_become_exe``: path to ``doas`` binary.
* ``ansible_become_user`` (default: ``root``)
* ``ansible_become_pass`` (default: assume passwordless)
* ansible.cfg: ``timeout``

When used as the ``mitogen_doas`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to use.
* ``ansible_password``: password to use.
* ``ansible_python_interpreter``


.. _method-docker:

Docker
~~~~~~

Like `docker
<https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_ except
connection delegation is supported.

* ``ansible_host``: Name of Docker container (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.


.. _method-jail:

FreeBSD Jail
~~~~~~~~~~~~

Like `jail
<https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_ except
connection delegation is supported.

* ``ansible_host``: Name of jail (default: inventory hostname).
* ``ansible_user``: Name of user within the jail to execute as.


.. _method-kubectl:

Kubernetes Pod
~~~~~~~~~~~~~~

Like `kubectl
<https://docs.ansible.com/ansible/2.6/plugins/connection/kubectl.html>`_ except
connection delegation is supported.

* ``ansible_host``: Name of pod (default: inventory hostname).
* ``ansible_user``: Name of user to authenticate to API as.


Local
~~~~~

Like `local
<https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_ except
connection delegation is supported.

* ``ansible_python_interpreter``


Process Model
^^^^^^^^^^^^^

Ansible usually executes local connection commands as a transient subprocess of
the forked worker executing a task. With the extension, the local connection
exists as a persistent subprocess of the connection multiplexer.

This means that global state mutations made to the top-level Ansible process
that are normally visible to newly forked subprocesses, such as vars plug-ins
that modify the environment, will not be reflected when executing local
commands without additional effort.

During execution the extension presently mimics the working directory and
process environment inheritence of regular Ansible, however it is possible some
additional differences exist that may break existing playbooks.


.. _method-lxc:

LXC
~~~

Connect to classic LXC containers, like `lxc
<https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_ except
connection delegation is supported, and ``lxc-attach`` is always used rather
than the LXC Python bindings, as is usual with ``lxc``.

* ``ansible_python_interpreter``
* ``ansible_host``: Name of LXC container (default: inventory hostname).
* ``mitogen_lxc_attach_path``: path to ``lxc-attach`` command if not available
    on the system path.


.. _method-lxd:

LXD
~~~

Connect to modern LXD containers, like `lxd
<https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_ except
connection delegation is supported. The ``lxc`` command must be available on
the host machine.

* ``ansible_python_interpreter``
* ``ansible_host``: Name of LXC container (default: inventory hostname).
* ``mitogen_lxc_path``: path to ``lxc`` command if not available on the system
  path.


.. _machinectl:

Machinectl
~~~~~~~~~~

Like the `machinectl third party plugin
<https://github.com/BaxterStockman/ansible-connection-machinectl>`_ except
connection delegation is supported. This is a light wrapper around the
:ref:`setns <setns>` method.

* ``ansible_host``: Name of Docker container (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.
* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
  as ``/bin/machinectl``.


.. _setns:

Setns
~~~~~

The ``setns`` method connects to Linux containers via `setns(2)
<https://linux.die.net/man/2/setns>`_. Unlike :ref:`method-docker`,
:ref:`method-lxc`, and :ref:`method-lxd` the namespace transition is handled
internally, ensuring optimal throughput to the child. This is necessary for
:ref:`machinectl` where only PTY channels are supported.

A utility program must be installed to discover the PID of the container's root
process.

* ``mitogen_kind``: one of ``docker``, ``lxc``, ``lxd`` or ``machinectl``.
* ``ansible_host``: Name of container as it is known to the corresponding tool
  (default: inventory hostname).
* ``ansible_user``: Name of user within the container to execute as.
* ``mitogen_docker_path``: path to Docker if not available on the system path.
* ``mitogen_lxc_path``: path to LXD's ``lxc`` command if not available as
  ``lxc-info``.
* ``mitogen_lxc_info_path``: path to LXC classic's ``lxc-info`` command if not
  available as ``lxc-info``.
* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
  as ``/bin/machinectl``.


.. _su:

Su
~~

Su can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_su_exe``, ``ansible_become_exe``
* ``ansible_su_user``, ``ansible_become_user`` (default: ``root``)
* ``ansible_su_pass``, ``ansible_become_pass`` (default: assume passwordless)
* ``su_flags``, ``become_flags``
* ansible.cfg: ``timeout``

When used as the ``mitogen_su`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to su as.
* ``ansible_password``: password to su as.
* ``ansible_python_interpreter``


.. _sudo:

Sudo
~~~~

Sudo can be used as a connection method that supports connection delegation, or
as a become method.

When used as a become method:

* ``ansible_python_interpreter``
* ``ansible_sudo_exe``, ``ansible_become_exe``
* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
* ``sudo_flags``, ``become_flags``
* ansible.cfg: ``timeout``

When used as the ``mitogen_sudo`` connection method:

* The inventory hostname has no special meaning.
* ``ansible_user``: username to sudo as.
* ``ansible_password``: password to sudo as.
* ``sudo_flags``, ``become_flags``
* ``ansible_python_interpreter``


SSH
~~~

Like `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
except connection delegation is supported.

* ``ansible_ssh_timeout``
* ``ansible_host``, ``ansible_ssh_host``
* ``ansible_user``, ``ansible_ssh_user``
* ``ansible_port``, ``ssh_port``
* ``ansible_ssh_executable``, ``ssh_executable``
* ``ansible_ssh_private_key_file``
* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``
* ``mitogen_ssh_debug_level``: integer between `0..3` indicating the SSH client
  debug level. Ansible must also be run with '-vvv' to view the output.


Debugging
---------

Diagnostics and :py:mod:`logging` package output on targets are usually
discarded. With Mitogen, these are captured and forwarded to the controller
where they can be viewed with ``-vvv``. Basic high level logs are produced with
``-vvv``, with logging of all IO on the controller with ``-vvvv`` or higher.

While uncaptured standard IO and the logging package on targets is forwarded,
it is not possible to receive IO activity logs, as the forwarding process would
would itself generate additional IO.

To receive a complete trace of every process on every machine, file-based
logging is necessary. File-based logging can be enabled by setting
``MITOGEN_ROUTER_DEBUG=1`` in your environment. When file-based logging is
enabled, one file per context will be created on the local machine and every
target machine, as ``/tmp/mitogen.<pid>.log``.


Common Problems
~~~~~~~~~~~~~~~

The most common bug reports fall into the following categories, so it is worth
checking whether you can categorize a problem using the tools provided before
reporting it:

**Missed/Incorrect Configuration Variables**
    In some cases Ansible may support a configuration variable that Mitogen
    does not yet support, or Mitogen supports, but the support is broken. For
    example, Mitogen may pick the wrong username or SSH parameters.

    To detect this, use the special ``mitogen_get_stack`` action described
    below to verify the settings Mitogen has chosen for the connection make
    sense.

**Process Environment Differences**
    Mitogen's process model differs significantly to Ansible's in many places.
    In the past, bugs have been reported because Ansible plug-ins modify an
    environment variable after Mitogen processes are started.

    If your task's failure may relate to the process environment in some way,
    for example, ``SSH_AUTH_SOCK``, ``LC_ALL`` or ``PATH``, then an environment
    difference may explain it. Environment differences are always considered
    bugs in the extension, and are very easy to repair, so even if you find a
    workaround, please report them to avoid someone else encountering the same
    problem.

**Variable Expansion Differences**
    To avoid many classes of bugs, Mitogen avoids shell wherever possible.
    Ansible however is traditionally built on shell, and it is often difficult
    to tell just how many times a configuration parameter will pass through
    shell expansion and quoting, and in what context before it is used.

    Due to this, in some circumstances Mitogen may parse some expanded
    variables differently, for example, in the wrong user account. Careful
    review of ``-vvv`` and ``mitogen_ssh_debug_level`` logs can reveal this.
    For example in the past, Mitogen used a different method of expanding
    ``~/.ssh/id_rsa``, causing authentication to fail when ``ansible-playbook``
    was run via ``sudo -E``.

**External Tool Integration Differences**
    Mitogen reimplements any aspect of Ansible that involves integrating with
    SSH, sudo, Docker, or related tools. For this reason, sometimes its support
    for those tools differs or is less mature than in Ansible.

    In the past Mitogen has had bug reports due to failing to recognize a
    particular variation of a login or password prompt on an exotic or
    non-English operating system, or confusing a login banner for a password
    prompt. Careful review of ``-vvv`` logs help identify these cases, as
    Mitogen logs all strings it receives during connection, and how it
    interprets them.


.. _mitogen-get-stack:

The `mitogen_get_stack` Action
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

When a Mitogen strategy is loaded, a special ``mitogen_get_stack`` action is
available that returns a concise description of the connection configuration as
extracted from Ansible and passed to the core library. Using it, you can learn
whether a problem lies in the Ansible extension or deeper in library code.

The action may be used in a playbook as ``mitogen_get_stack:`` just like a
regular module, or directly from the command-line::

    $ ANSIBLE_STRATEGY=mitogen_linear ansible -m mitogen_get_stack -b -k k3
    SSH password:
    k3 | SUCCESS => {
        "changed": true,
        "result": [
            {
                "kwargs": {
                    "check_host_keys": "enforce",
                    "connect_timeout": 10,
                    "hostname": "k3",
                    "identities_only": false,
                    "identity_file": null,
                    "password": "mysecretpassword",
                    "port": null,
                    "python_path": null,
                    "ssh_args": [
                        "-C",
                        "-o",
                        "ControlMaster=auto",
                        "-o",
                        "ControlPersist=60s"
                    ],
                    "ssh_debug_level": null,
                    "ssh_path": "ssh",
                    "username": null
                },
                "method": "ssh"
            },
            {
                "enable_lru": true,
                "kwargs": {
                    "connect_timeout": 10,
                    "password": null,
                    "python_path": null,
                    "sudo_args": [
                        "-H",
                        "-S",
                        "-n"
                    ],
                    "sudo_path": null,
                    "username": "root"
                },
                "method": "sudo"
            }
        ]
    }

Each object in the list represents a single 'hop' in the connection, from
nearest to furthest. Unlike in Ansible, the core library treats ``become``
steps and SSH steps identically, so they are represented distinctly in the
output.

The presence of ``null`` means no explicit value was extracted from Ansible,
and either the Mitogen library or SSH will choose a value for the parameter. In
the example above, Mitogen will choose ``/usr/bin/python`` for ``python_path``,
and SSH will choose ``22`` for ``port``, or whatever ``Port`` it parses from
``~/.ssh/config``. Note the presence of ``null`` may indicate the extension
failed to extract the correct value.

When using ``mitogen_get_stack`` to diagnose a problem, pay special attention
to ensuring the invocation exactly matches the problematic task. For example,
if the failing task has ``delegate_to:`` or ``become:`` enabled, the
``mitogen_get_stack`` invocation must include those statements in order for the
output to be accurate.

If a playbook cannot start at all, you may need to temporarily use
``gather_facts: no`` to allow the first task to proceed. This action does not
create connections, so if it is the first task, it is still possible to review
its output.


The `mitogen_ssh_debug_level` Variable
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Mitogen has support for capturing SSH diagnostic logs, and integrating them
into the regular debug log output produced when ``-vvv`` is active. This
provides a single audit trail of every component active during SSH
authentication.

Particularly for authentication failures, setting this variable to 3, in
combination with ``-vvv``, allows review of every parameter passed to SSH, and
review of every action SSH attempted during authentication.

For example, this method can be used to ascertain whether SSH attempted agent
authentication, or what private key files it was able to access and which it tried.


Post-authentication Bootstrap Failure
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

If logging indicates Mitogen was able to authenticate, but some error occurred
after authentication preventing the Python bootstrap from completing, it can be
immensely useful to temporarily replace ``ansible_python_interpreter`` with a
wrapper that runs Python under ``strace``::

    $ ssh badbox

    badbox$ cat > strace-python.sh
    #!/bin/sh
    strace -o /tmp/strace-python.$$ -ff -s 100 python "$@"
    ^D

    badbox$ chmod +x strace-python.sh
    badbox$ logout

    $ ansible-playbook site.yml \
        -e ansible_python_interpreter=./strace-python.sh \
        -l badbox

This will produce a potentially large number of log files under ``/tmp/``. The
lowest-numbered traced PID is generally the main Python interpreter. The most
intricate bootstrap steps happen there, any error should be visible near the
end of the trace.

It is also possible the first stage bootstrap failed. That is usually the next
lowest-numbered PID and tends to be the smallest file. Even if you can't
ascertain the problem with your configuration from these logs, including them
in a bug report can save days of detective effort.


.. _diagnosing-hangs:

Diagnosing Hangs
~~~~~~~~~~~~~~~~

If you encounter a hang, the ``MITOGEN_DUMP_THREAD_STACKS=<secs>`` environment
variable arranges for each process on each machine to dump each thread stack
into the logging framework every `secs` seconds, which is visible when running
with ``-vvv``.

However, certain controller hangs may render ``MITOGEN_DUMP_THREAD_STACKS``
ineffective, or occur too infrequently for interactive reproduction. In these
cases `faulthandler <https://faulthandler.readthedocs.io/>`_ may be used:

1. For Python 2, ``pip install faulthandler``. This is unnecessary on Python 3.
2. Once the hang occurs, observe the process tree using ``pstree`` or ``ps
   --forest``.
3. The most likely process to be hung is the connection multiplexer, which can
   easily be identified as the parent of all SSH client processes.
4. Send ``kill -SEGV <pid>`` to the multiplexer PID, causing it to print all
   thread stacks.
5. `File a bug <https://github.com/dw/mitogen/issues/new/>`_ including a copy
   of the stacks, along with a description of the last task executing prior to
   the hang.

It is possible the hang occurred in a process on a target. If ``strace`` is
available, look for the host name not listed in Ansible output as reporting a
result for the most recent task, log into it, and use ``strace -ff -p <pid>``
on each process whose name begins with ``mitogen:``::

    $ strace -ff -p 29858
    strace: Process 29858 attached with 3 threads
    [pid 29864] futex(0x55ea9be52f60, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, 0xffffffff <unfinished ...>
    [pid 29860] restart_syscall(<... resuming interrupted poll ...> <unfinished ...>
    [pid 29858] futex(0x55ea9be52f60, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, 0xffffffff
    ^C

    $ 

This shows one thread waiting on IO (``poll``) and two more waiting on the same
lock. It is taken from a real example of a deadlock due to a forking bug.
Please include any such information for all processes that you are able to
collect in any bug report.


Getting Help
~~~~~~~~~~~~
Some users and developers hang out on the
`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
FreeNode IRC network.


Sample Profiles
---------------

Local VM connection
~~~~~~~~~~~~~~~~~~~

This demonstrates Mitogen vs. connection pipelining to a local VM executing
``bench/loop-100-items.yml``, which simply executes ``hostname`` 100 times.
Mitogen requires **43x less bandwidth and 6.5x less time**.

.. image:: images/ansible/run_hostname_100_times_mito.svg
.. image:: images/ansible/run_hostname_100_times_plain.svg


Kathmandu to Paris
~~~~~~~~~~~~~~~~~~

This is a full Django application playbook over a ~180ms link between Kathmandu
and Paris. Aside from large pauses where the host performs useful work, the
high latency of this link means Mitogen only manages a 1.7x speedup.

Many early roundtrips are due to inefficiencies in Mitogen's importer that will
be fixed over time, however the majority, comprising at least 10 seconds, are
due to idling while the host's previous result and next command are in-flight
on the network.

The initial extension lays groundwork for exciting structural changes to the
execution model: a future version will tackle latency head-on by delegating
some control flow to the target host, melding the performance and scalability
benefits of pull-based operation with the management simplicity of push-based
operation.

.. image:: images/ansible/costapp.png
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Mitogen for Ansible
 								===================
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: wrap text around logos

											
										
										
											6 years ago
+								.. image:: images/ansible/ansible_mitogen.svg
 								    :class: mitogen-right-200 mitogen-logo-wrap
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								An extension to `Ansible`_ is included that implements connections over
 								Mitogen, replacing embedded shell invocations with pure-Python equivalents
 								invoked via highly efficient remote procedure calls to persistent interpreters
 								tunnelled over SSH. No changes are required to target hosts.
-												docs: remove another warning label.

											
										
										
											6 years ago
+								The extension is approaching stability and real-world usage is encouraged. `Bug
 								reports`_ are welcome: Ansible is huge, and only wide testing will ensure
 								soundness.
-												docs: extra ansible paragraph.

											
										
										
											7 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
+								.. _Ansible: https://www.ansible.com/
 								.. _Bug reports: https://goo.gl/yLKZiJ
-												docs: small reference fixes.

											
										
										
											6 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
+								Overview
 								--------
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								**Expect a 1.25x - 7x speedup** and a **CPU usage reduction of at least 2x**,
 								depending on network conditions, modules executed, and time already spent by
 								targets on useful work. Mitogen cannot improve a module once it is executing,
 								it can only ensure the module executes as quickly as possible.
 								* **One connection is used per target**, in addition to one sudo invocation per
 								  user account. This is much better than SSH multiplexing combined with
 								  pipelining, as significant state can be maintained in RAM between steps, and
 								  system logs aren't spammed with repeat authentication events.
 								* **A single network roundtrip is used** to execute a step whose code already
 								  exists in RAM on the target. Eliminating multiplexed SSH channel creation
-												docs: more ansible updates

											
										
										
											7 years ago
+								  saves 4 ms runtime per 1 ms of network latency for every playbook step.
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								* **Processes are aggressively reused**, avoiding the cost of invoking Python
 								  and recompiling imports, saving 300-800 ms for every playbook step.
 								* Code is ephemerally cached in RAM, **reducing bandwidth usage by an order
 								  of magnitude** compared to SSH pipelining, with around 5x fewer frames
 								  traversing the network in a typical run.
-												docs: more accurate bullet.

											
										
										
											7 years ago
+								* **Fewer writes to the target filesystem occur**. In typical configurations,
 								  Ansible repeatedly rewrites and extracts ZIP files to multiple temporary
-												docs: glaring ancient typo.

											
										
										
											6 years ago
+								  directories on the target. Security issues relating to temporary files in
-												docs: more accurate bullet.

											
										
										
											7 years ago
+								  cross-account scenarios are entirely avoided.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: add 'will it work' intuition.

											
										
										
											6 years ago
+								The effect is most potent on playbooks that execute many **short-lived
 								actions**, where Ansible's overhead dominates the cost of the operation, for
 								example when executing large ``with_items`` loops to run simple commands or
 								write files.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: move installation section above demo

											
										
										
											7 years ago
+								Installation
 								------------
-												docs: update install steps to point directly at Known Issues.

											
										
										
											6 years ago
+. Thoroughly review :ref:`noteworthy_differences` and :ref:`known_issues`.
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+. Download and extract |mitogen_url|.
 . Modify ``ansible.cfg``:
-												docs: move installation section above demo

											
										
										
											7 years ago
-												docs: link to PyPI release, not GitHub archive URL.

Now download counts are visible via PSF BigQuery.

											
										
										
											6 years ago
+								   .. parsed-literal::
-												docs: move installation section above demo

											
										
										
											7 years ago
 								        [defaults]
-												docs: link to PyPI release, not GitHub archive URL.

Now download counts are visible via PSF BigQuery.

											
										
										
											6 years ago
+								        strategy_plugins = /path/to/mitogen-|mitogen_version|/ansible_mitogen/plugins/strategy
-												docs: move installation section above demo

											
										
										
											7 years ago
+								        strategy = mitogen_linear
 								   The ``strategy`` key is optional. If omitted, the
 								   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable can be set on a
-												ansible: initial mitogen_host_pinned strategy.

											
										
										
											6 years ago
+								   per-run basis. Like ``mitogen_linear``, the ``mitogen_free`` and
 								   ``mitogen_host_pinned`` strategies exists to mimic the ``free`` and
 								   ``host_pinned`` strategies.
-												docs: move installation section above demo

											
										
										
											7 years ago
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+. If targets have a restrictive ``sudoers`` file, add a rule like:
-												docs: add example sudoers rule

hat tip @seuf :)

											
										
										
											7 years ago
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+								   ::
-												docs: add example sudoers rule

hat tip @seuf :)

											
										
										
											7 years ago
 								       deploy = (ALL) NOPASSWD:/usr/bin/python -c*
-												docs: add inline subscribe form to installation instructions

											
										
										
											6 years ago
+.
 								   .. raw:: html
 								    <form action="https://www.freelists.org/cgi-bin/subscription.cgi" method="post">
 								        Releases occur frequently and often include important fixes. Subscribe
 								        to the <a
 								        href="https://www.freelists.org/list/mitogen-announce">mitogen-announce
 								        mailing list</a> be notified of new releases.
 								        <p>
 								        <input type="email" placeholder="E-mail Address" name="email" style="font-size: 105%;">
 								        <input type=hidden name="list" value="mitogen-announce">
 								        <!-- <input type=hidden name="url_or_message" value="https://mitogen.readthedocs.io/en/stable/ansible.html#installation">-->
 								        <input type="hidden" name="action" value="subscribe">
 								        <button type="submit" style="font-size: 105%;">
 								            Subscribe
 								        </button>
 								        </p>
 								    </form>
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
-												docs: move installation section above demo

											
										
										
											7 years ago
-												docs: link to Ansible video demo

											
										
										
											7 years ago
+								Demo
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~
-												docs: link to Ansible video demo

											
										
										
											7 years ago
 								This demonstrates Ansible running a subset of the Mitogen integration tests
 								concurrent to an equivalent run using the extension.
 								.. raw:: html
-												docs: host demo on Vimeo.

											
										
										
											6 years ago
+								    <iframe src="https://player.vimeo.com/video/283272293?title=0&byline=0&portrait=0" width="720" height="439" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>
-												docs: link to Ansible video demo

											
										
										
											7 years ago
-												docs: reorder sections

											
										
										
											7 years ago
+								Testimonials
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~~~~~~~~~
-												docs: reorder sections

											
										
										
											7 years ago
 								* "With mitogen **my playbook runtime went from 45 minutes to just under 3
 								  minutes**. Awesome work!"
 								* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
 								  minutes**. Thanks!"
 								* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
 								  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
 								  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
 								  anymore** :)"
 								* "**Works like a charm**, thank you for your quick response"
 								* "I tried it out. **He is not kidding about the speed increase**."
-												docs: slightly bikeshed last testimonial

											
										
										
											7 years ago
+								* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
 								  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
 								  can't quite believe it."
-												Add testimonial from Clojars
											
										
										
											7 years ago
-												docs: add funny testimonial

											
										
										
											6 years ago
+								* "Enabling the mitogen plugin in ansible feels like switching from floppy to SSD"
-												docs: reorder sections

											
										
										
											7 years ago
-												docs: link changelog into Ansible install procedure

											
										
										
											6 years ago
 								.. _noteworthy_differences:
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Noteworthy Differences
 								----------------------
-												docs: update changelog.

											
										
										
											6 years ago
+								* Ansible 2.3-2.7 are supported along with Python 2.6, 2.7 or 3.6. Verify your
-												docs: link mitogen-announce mailing list.

											
										
										
											6 years ago
+								  installation is running one of these versions by checking ``ansible
 								  --version`` output.
-												docs: add 'raw' to 0.2 in-scope

											
										
										
											6 years ago
+								* The Ansible ``raw`` action executes as a regular Mitogen connection,
 								  precluding its use for installing Python on a target. This will be addressed
 								  soon.
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								* The ``doas``, ``su`` and ``sudo`` become methods are available. File bugs to
 								  register interest in more.
-												ansible: doc updates

											
										
										
											7 years ago
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								* The `docker <https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_,
 								  `jail <https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_,
-												docs: updates for #376 and #371

											
										
										
											6 years ago
+								  `kubectl <https://docs.ansible.com/ansible/2.6/plugins/connection/kubectl.html>`_,
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								  `local <https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_,
 								  `lxc <https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_,
 								  `lxd <https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_,
 								  and `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								  built-in connection types are supported, along with Mitogen-specific
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								  :ref:`machinectl <machinectl>`, :ref:`mitogen_doas <doas>`,
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								  :ref:`mitogen_su <su>`, :ref:`mitogen_sudo <sudo>`, and :ref:`setns <setns>`
 								  types. File bugs to register interest in others.
-												ansible: doc updates

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* Local commands execute in a reuseable interpreter created identically to
 								  interpreters on targets. Presently one interpreter per ``become_user``
 								  exists, and so only one local action may execute simultaneously.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								  Ansible usually permits up to ``forks`` simultaneous local actions. Any
 								  long-running local actions that execute for every target will experience
 								  artificial serialization, causing slowdown equivalent to `task_duration *
 								  num_targets`. This will be fixed soon.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* "Module Replacer" style modules are not supported. These rarely appear in
 								  practice, and light web searches failed to reveal many examples of them.
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* Ansible permits up to ``forks`` connections to be setup in parallel, whereas
 								  in Mitogen this is handled by a fixed-size thread pool. Up to 16 connections
 								  may be established in parallel by default, this can be modified by setting
 								  the ``MITOGEN_POOL_SIZE`` environment variable.
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											7 years ago
-												docs: update Changelog.

											
										
										
											6 years ago
+								* The ``ansible_python_interpreter`` variable is parsed using a restrictive
 								  :mod:`shell-like <shlex>` syntax, permitting values such as ``/usr/bin/env
 								  FOO=bar python``, which occur in practice. Ansible `documents this
 								  <https://docs.ansible.com/ansible/latest/user_guide/intro_inventory.html#ansible-python-interpreter>`_
 								  as an absolute path, however the implementation passes it unquoted through
 								  the shell, permitting arbitrary code to be injected.
-												docs: remove more Ansible limitations

											
										
										
											7 years ago
+								* Performance does not scale linearly with target count. This will improve over
 								  time.
-												issue #144: ansible: increase default pool size to 16.

											
										
										
											7 years ago
-												docs: more updates.

- accurate description of Ansible timeouts
- rough detach() sketch

											
										
										
											7 years ago
+								* SSH and ``become`` are treated distinctly when applying timeouts, and
 								  timeouts apply up to the point when the new interpreter is ready to accept
 								  messages. Ansible has two timeouts: ``ConnectTimeout`` for SSH, applying up
 								  to when authentication completes, and a separate parallel timeout up to when
 								  ``become`` authentication completes.
 								  For busy targets, Ansible may successfully execute a module where Mitogen
 								  would fail without increasing the timeout. For sick targets, Ansible may hang
 								  indefinitely after authentication without executing a command, for example
 								  due to a stuck filesystem IO appearing in ``$HOME/.profile``.
-												docs: note the semantic difference in Mitogen vs. Ansible timeouts

Related to issue #141.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								New Features & Notes
 								--------------------
 								Connection Delegation
 								~~~~~~~~~~~~~~~~~~~~~
-												docs: 4kify images.

These commits were squashed to avoid repo size exploding.

											
										
										
											6 years ago
+								.. image:: images/jumpbox.svg
 								    :class: mitogen-right-275
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								Included is a preview of **Connection Delegation**, a Mitogen-specific
-												docs: more ansible updates

											
										
										
											7 years ago
+								implementation of `stackable connection plug-ins`_. This enables connections
 								via a bastion, or container connections delegated via their host machine, where
 								reaching the host may entail further delegation.
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								.. _Stackable connection plug-ins: https://github.com/ansible/proposals/issues/25
 								Unlike with SSH forwarding Ansible has complete visibility of the final
 								topology, declarative configuration via static/dynamic inventory is possible,
 								and data can be cached and re-served, and code executed on every intermediary.
 								For example when targeting Docker containers on a remote machine, each module
 								need only be uploaded once for the first task and container that requires it,
 								then cached and served from the SSH account for every future task in any
 								container.
 								.. raw:: html
 								    <div style="clear: both;"></div>
 								.. caution::
 								    Connection delegation is a work in progress, bug reports are welcome.
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								    * Delegated connection setup is single-threaded; only one connection can be
 								      constructed in parallel per intermediary.
 								    * Inferring the configuration of intermediaries may be buggy, manifesting
 								      as duplicate connections between hops, due to not perfectly replicating
 								      the configuration Ansible would normally use for the intermediary.
-												docs: mention synchronize/delegation issue.

											
										
										
											6 years ago
+								    * Automatic tunnelling of SSH-dependent actions, such as the
 								      ``synchronize`` module, is not yet supported. This will be added in the
 .3 series.
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								To enable connection delegation, set ``mitogen_via=<inventory name>`` on the
 								command line, or as host and group variables.
 								.. code-block:: ini
 								    # Docker container on web1.dc1 is reachable via web1.dc1.
 								    [app-containers.web1.dc1]
 								    app1.web1.dc1 ansible_host=app1 ansible_connection=docker mitogen_via=web1.dc1
 								    # Web servers in DC1 are reachable via bastion.dc1
 								    [dc1]
 								    web1.dc1
 								    web2.dc1
 								    web3.dc1
 								    [dc1:vars]
 								    mitogen_via = bastion.dc1
 								    # Web servers in DC2 are reachable via bastion.dc2
 								    [dc2]
 								    web1.dc2
 								    web2.dc2
 								    web3.dc2
 								    [dc2:vars]
 								    mitogen_via = bastion.dc2
-												docs: show become_user example for connection delegation.

											
										
										
											7 years ago
+								    # Prod bastions are reachable via a magic account on a
 								    # corporate network gateway.
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								    [bastions]
-												docs: show become_user example for connection delegation.

											
										
										
											7 years ago
+								    bastion.dc1 mitogen_via=prod-ssh-access@corp-gateway.internal
 								    bastion.dc2 mitogen_via=prod-ssh-access@corp-gateway.internal
-												docs: major Ansible page update.

											
										
										
											7 years ago
 								    [corp-gateway]
 								    corp-gateway.internal
 								File Transfer
 								~~~~~~~~~~~~~
-												docs: Don't redefine links to scp and sftp

This addresses an error found while running tox -edocs

Warning, treated as error:
./docs/ansible.rst:6:Duplicate explicit target name: "scp(1)".

											
										
										
											6 years ago
+								Normally `sftp(1)`_ or `scp(1)`_ are used to copy files by the
-												docs: add links.

											
										
										
											7 years ago
+								`assemble <http://docs.ansible.com/ansible/latest/modules/assemble_module.html>`_,
 								`copy <http://docs.ansible.com/ansible/latest/modules/copy_module.html>`_,
 								`patch <http://docs.ansible.com/ansible/latest/modules/patch_module.html>`_,
 								`script <http://docs.ansible.com/ansible/latest/modules/script_module.html>`_,
 								`template <http://docs.ansible.com/ansible/latest/modules/template_module.html>`_, and
 								`unarchive <http://docs.ansible.com/ansible/latest/modules/unarchive_module.html>`_
 								actions, or when uploading modules with pipelining disabled. With Mitogen
 								copies are implemented natively using the same interpreters, connection tree,
 								and routed message bus that carries RPCs.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: Don't redefine links to scp and sftp

This addresses an error found while running tox -edocs

Warning, treated as error:
./docs/ansible.rst:6:Duplicate explicit target name: "scp(1)".

											
										
										
											6 years ago
+								.. _scp(1): https://linux.die.net/man/1/scp
 								.. _sftp(1): https://linux.die.net/man/1/sftp
-												docs: more ansible updates

											
										
										
											7 years ago
+								This permits direct streaming between endpoints regardless of execution
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								environment, without necessitating temporary copies in intermediary accounts or
 								machines, for example when ``become`` is active, or in the presence of
-												docs: more ansible updates

											
										
										
											7 years ago
+								connection delegation. It also avoids the need to securely share temporary
 								files between accounts and machines.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								As the implementation is self-contained, it is simple to make improvements like
 								prioritizing transfers, supporting resume, or displaying progress bars.
-												docs: major Ansible page update.

											
										
										
											7 years ago
-												docs: add file transfer safety section.

											
										
										
											7 years ago
+								Safety
 								^^^^^^
-												docs: more ansible updates

											
										
										
											7 years ago
+								Transfers proceed to a hidden file in the destination directory, with content
 								and metadata synced using `fsync(2) <https://linux.die.net/man/2/fsync>`_ prior
 								to rename over any existing file. This ensures the file remains consistent at
 								all times, in the event of a crash, or when overlapping `ansible-playbook` runs
 								deploy differing file contents.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
-												docs: Don't redefine links to scp and sftp

This addresses an error found while running tox -edocs

Warning, treated as error:
./docs/ansible.rst:6:Duplicate explicit target name: "scp(1)".

											
										
										
											6 years ago
+								The `sftp(1)`_ and `scp(1)`_ tools may cause undetected data corruption
-												docs: more ansible updates

											
										
										
											7 years ago
+								in the form of truncated files, or files containing intermingled data segments
 								from overlapping runs. As part of normal operation, both tools expose a window
 								where readers may observe inconsistent file contents.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
 								Performance
 								^^^^^^^^^^^
-												ansible: avoid roundtrip for small file transfers.

Calls to connect.put_file() where the file is sufficiently small enough
to fit in a single RPC proceed without waiting for an RPC response. If
the write fails the target context will log an exception, and any
subsequent step depending on the written file will fail.

I verified every built-in action plugin for file transfer calls, and
they all depend on the transferred file in the following step, so this
should be safe.

Reduces template/copy actions to 2-RTT, loop-20-templates.yml runtime
reduced from 30 seconds to 10 seconds over a 250ms link compared to
v0.2.2, and from 123 seconds compared to vanilla with pipelining
enabled.

											
										
										
											6 years ago
+								One roundtrip initiates a transfer larger than 124 KiB, while smaller transfers
-												docs: some more cleanups

- add faulthandler/thread stacks to changelog.
- various api.rst cleanups.
- docs: explain chain_id in howitworks.

											
										
										
											6 years ago
+								are embedded in a 0-roundtrip pipelined call. For tools operating via SSH
 								multiplexing, 4 roundtrips are required to configure the IO channel, followed
 								by 6 roundtrips to transfer the file in the case of ``sftp``, in addition to
 								the time to start the local and remote processes.
-												docs: more ansible updates

											
										
										
											7 years ago
 								An invocation of ``scp`` with an empty ``.profile`` over a 30 ms link takes
 								~140 ms, wasting 110 ms per invocation, rising to ~2,000 ms over a 400 ms
 								UK-India link, wasting 1,600 ms per invocation.
-												docs: add file transfer safety section.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Interpreter Reuse
 								~~~~~~~~~~~~~~~~~
 								Python interpreters are aggressively reused to execute modules. While this
 								works well, it violates an unwritten assumption, and so it is possible an
 								earlier module execution could cause a subsequent module to fail, or for
 								unrelated modules to interact poorly due to bad hygiene, such as
 								monkey-patching that becomes stacked over repeat invocations.
 								Before reporting a bug relating to a misbehaving module, please re-run with
 								``-e mitogen_task_isolation=fork`` to see if the problem abates. This may be
 								set per-task, paying attention to the possibility an earlier task may be the
 								true cause of a failure.
 								.. code-block:: yaml
 								    - name: My task.
 								      broken_module:
 								        some_option: true
 								      vars:
 								        mitogen_task_isolation: fork
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								If forking solves your problem, **please report a bug regardless**, as an
 								internal list can be updated to prevent others bumping into the same problem.
 								Interpreter Recycling
 								~~~~~~~~~~~~~~~~~~~~~
 								There is a per-target limit on the number of interpreters. Once 20 exist, the
 								youngest is terminated before starting any new interpreter, preventing
 								situations like below from triggering memory exhaustion.
 								.. code-block:: yaml
 								    - hosts: corp_boxes
 								      vars:
 								        user_directory: [
 								          # 10,000 corporate user accounts
 								        ]
 								      tasks:
 								        - name: Create user bashrc
 								          become: true
 								          vars:
 								            ansible_become_user: "{{item}}"
 								          copy:
 								            src: bashrc
 								            dest: "~{{item}}/.bashrc"
 								          with_items: "{{user_directory}}"
 								The youngest is chosen to preserve useful accounts like ``root`` and
 								``postgresql`` that often appear early in a run, however it is simple to
 								construct a playbook that defeats this strategy. A future version will key
 								interpreters on the identity of their creating task, avoiding useful account
 								recycling in every scenario.
 								To modify the limit, set the ``MITOGEN_MAX_INTERPRETERS`` environment variable.
 								Standard IO
 								~~~~~~~~~~~
 								Ansible uses pseudo TTYs for most invocations to allow it to type interactive
 								passwords, however pseudo TTYs are disabled where standard input is required or
 								``sudo`` is not in use. Additionally when SSH multiplexing is enabled, a string
 								like ``Shared connection to localhost closed\r\n`` appears in ``stderr`` of
 								every invocation.
 								Mitogen does not naturally require either of these, as command output is always
 								embedded within framed messages, and it can simply call :py:func:`pty.openpty`
 								in any location an interactive password must be typed.
 								A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
 								merged together into a single ``stdout`` variable, with carriage returns
 								inserted in the output by the TTY layer. However ugly, the extension emulates
 								this precisely, to avoid breaking playbooks that expect text to appear in
 								specific variables with a particular linefeed style.
-												docs: update ansible risks/differences.

											
										
										
											7 years ago
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								.. _ansible_tempfiles:
 								Temporary Files
 								~~~~~~~~~~~~~~~
-												docs: fix up incomplete Temporary Files section.

											
										
										
											6 years ago
+								Temporary file handling in Ansible is tricky, and the precise behaviour varies
 								across major versions. A variety of temporary files and directories are
-												docs: Changelog concision

											
										
										
											6 years ago
+								created, depending on the operating mode.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
 								In the best case when pipelining is enabled and no temporary uploads are
 								required, for each task Ansible will create one directory below a
 								system-supplied temporary directory returned by :func:`tempfile.mkdtemp`, owned
-												issue #321: docs fixes

											
										
										
											6 years ago
+								by the target account a new-style module will execute in.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
 								In other cases depending on the task type, whether become is active, whether
 								the target become user is privileged, whether the associated action plugin
 								needs to upload files, and whether the associated module needs to store files,
 								Ansible may:
 								* Create a directory owned by the SSH user either under ``remote_tmp``, or a
 								  system-default directory,
 								* Upload action dependencies such as non-new style modules or rendered
-												docs: Don't redefine links to scp and sftp

This addresses an error found while running tox -edocs

Warning, treated as error:
./docs/ansible.rst:6:Duplicate explicit target name: "scp(1)".

											
										
										
											6 years ago
+								  templates to that directory via `sftp(1)`_ or `scp(1)`_.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								* Attempt to modify the directory's access control list to grant access to the
 								  target user using `setfacl(1) <https://linux.die.net/man/1/setfacl>`_,
 								  requiring that tool to be installed and a supported filesystem to be in use,
 								  or for the ``allow_world_readable_tmpfiles`` setting to be  :data:`True`.
 								* Create a directory owned by the target user either under ``remote_tmp``, or
 								  a system-default directory, if a new-style module needs a temporary directory
 								  and one was not previously created for a supporting file earlier in the
 								  invocation.
 								In summary, for each task Ansible may create one or more of:
 								* ``~ssh_user/<remote_tmp>/...`` owned by the login user,
 								* ``$TMPDIR/ansible-tmp-...`` owned by the login user,
 								* ``$TMPDIR/ansible-tmp-...`` owned by the login user with ACLs permitting
 								  write access by the become user,
 								* ``~become_user/<remote_tmp>/...`` owned by the become user,
 								* ``$TMPDIR/ansible_<modname>_payload_.../`` owned by the become user,
 								* ``$TMPDIR/ansible-module-tmp-.../`` owned by the become user.
-												ansible: temporary files take 5.

											
										
										
											6 years ago
 								Mitogen for Ansible
 								^^^^^^^^^^^^^^^^^^^
-												docs: fix up incomplete Temporary Files section.

											
										
										
											6 years ago
+								As Mitogen can execute new-style modules from RAM, and transfer files to target
 								user accounts without first writing an intermediary file in any separate login
 								account, handling is relatively simplified.
-												ansible: temporary files take 5.

											
										
										
											6 years ago
 								Temporary directories must exist to maintain compatibility with Ansible, as
 								many modules introspect :data:`sys.argv` to find a directory where they may
 								write files, however only one directory exists for the lifetime of each
-												docs: fix up incomplete Temporary Files section.

											
										
										
											6 years ago
+								interpreter, its location is consistent for each account, and it is always
 								privately owned by that account.
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
-												docs: fix up incomplete Temporary Files section.

											
										
										
											6 years ago
+								During startup, the persistent remote interpreter tries the paths below until
 								one is found that is writeable and lives on a filesystem with ``noexec``
 								disabled:
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
 . ``$variable`` and tilde-expanded ``remote_tmp`` setting from
 								   ``ansible.cfg``
 . ``$variable`` and tilde-expanded ``system_tmpdirs`` setting from
 								   ``ansible.cfg``
 . ``TMPDIR`` environment variable
 . ``TEMP`` environment variable
 . ``TMP`` environment variable
-												issue #321: take remote_tmp and system_tmpdirs into account.

Can't simply ignore these settings as some users may have weird noexec
filesystems.

											
										
										
											6 years ago
+. ``/tmp``
 . ``/var/tmp``
 . ``/usr/tmp``
-												issue #321: 2.4+ compatibility fixes, disable test on Vanilla.

											
										
										
											6 years ago
+. Current working directory
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
-												docs: fix up incomplete Temporary Files section.

											
										
										
											6 years ago
+								The directory is created at startup and recursively destroyed during interpeter
 								shutdown. Subdirectories are automatically created and destroyed by the
 								controller for each task that requires them.
 								Round-trip Avoidance
 								^^^^^^^^^^^^^^^^^^^^
 								Mitogen avoids many round-trips due to temporary file handling that are present
 								in regular Ansible:
 								* During task startup, it is not necessary to wait until the target has
 								  succeeded in creating a temporary directory. Instead, any failed attempt to
 								  create the directory will cause any subsequent RPC belonging to the same task
 								  to fail with the error that occurred.
 								* As temporary directories are privately owned by the target user account,
 								  operations relating to modifying the directory to support cross-account
 								  access are avoided.
 								* An explicit work-around is included to avoid the `copy` and `template`
 								  actions needlessly triggering a round-trip to set their temporary file as
 								  executable.
 								* During task shutdown, it is not necessary to wait to learn if the target has
 								  succeeded in deleting a temporary directory, since any error that may occur
 								  can is logged asynchronously via the logging framework, and the persistent
 								  remote interpreter arranges for all subdirectories to be destroyed during
 								  interpreter shutdown.
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
+								.. _ansible_process_env:
 								Process Environment Emulation
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Since Ansible discards processes after each module invocation, follow-up tasks
 								often (but not always) receive a new environment that will usually include
 								changes made by previous tasks. As such modifications are common, for
 								compatibility the extension emulates the existing behaviour as closely as
 								possible.
 								Some scenarios exist where emulation is impossible, for example, applying
 								``nsswitch.conf`` changes when ``nscd`` is not in use. If future scenarios
 								appear that cannot be solved through emulation, the extension will be updated
 								to automatically restart affected interpreters instead.
 								DNS Resolution
 								^^^^^^^^^^^^^^
 								Modifications to ``/etc/resolv.conf`` cause the glibc resolver configuration to
 								be reloaded via `res_init(3) <https://linux.die.net/man/3/res_init>`_. This
 								isn't necessary on some Linux distributions carrying glibc patches to
 								automatically check ``/etc/resolv.conf`` periodically, however it is necessary
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								on at least Debian and BSD derivatives.
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
 								``/etc/environment``
 								^^^^^^^^^^^^^^^^^^^^
 								When ``become: true`` is active or SSH multiplexing is disabled, modifications
 								by previous tasks to ``/etc/environment`` and ``$HOME/.pam_environment`` are
-												issue #321: simplify temp directory handling.

											
										
										
											6 years ago
+								normally reflected, since the content of those files is reapplied by `PAM
-												issue #338: refactor env handling into class and fix tests.

											
										
										
											6 years ago
+								<https://en.wikipedia.org/wiki/Pluggable_authentication_module>`_ via `pam_env`
 								on each authentication of ``sudo`` or ``sshd``.
 								Both files are monitored for changes, and changes are applied where it appears
 								safe to do so:
 								* New keys are added if they did not otherwise exist in the inherited
 								  environment, or previously had the same value as found in the file before it
 								  changed.
 								* Given a key (such as ``http_proxy``) added to the file where no such key
 								  exists in the environment, the key will be added.
 								* Given a key (such as ``PATH``) where an existing environment key exists with
 								  a different value, the update or deletion will be ignored, as it is likely
 								  the key was overridden elsewhere after `pam_env` ran, such as by
 								  ``/etc/profile``.
 								* Given a key removed from the file that had the same value as the existing
 								  environment key, the key will be removed.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
+								How Modules Execute
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								~~~~~~~~~~~~~~~~~~~
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
-												issue #106: docs: tidyup.

											
										
										
											7 years ago
+								Ansible usually modifies, recompresses and reuploads modules every time they
 								run on a target, work that must be repeated by the controller for every
 								playbook step.
 								With the extension any modifications are done on the target, allowing pristine
 								copies of modules to be cached, reducing the necessity to re-transfer modules
 								for each invocation. Unmodified modules are uploaded once on first use and
 								cached in RAM for the remainder of the run.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Binary**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Native executables detected using a complex heuristic. Arguments are
 								    supplied as a JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Module Replacer**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Python scripts detected by the presence of
 								    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
 								    is not yet supported.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**New-Style**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Python scripts detected by the presence of ``from ansible.module_utils.``
 								    appearing in their source. Arguments are supplied as JSON written to
 								    ``sys.stdin`` of the target interpreter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**JSON_ARGS**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
 								    in the script source. The interpreter directive (``#!interpreter``) is
 								    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
 								    if one is set. Arguments are supplied as JSON mixed into the script as a
 								    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**WANT_JSON**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Detected by the presence of ``WANT_JSON`` appearing in the script source.
 								    The interpreter directive is adjusted as above. Arguments are supplied as a
 								    JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
 								**Old Style**
-												docs: tidy up big list of bullets.

											
										
										
											7 years ago
+								    Files not matching any of the above tests. The interpreter directive is
 								    adjusted as above. Arguments are supplied as a file whose path is the sole
 								    script parameter. The format of the file is ``"key=repr(value)[
 								    key2=repr(value2)[ ..]] "``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Runtime Patches
 								~~~~~~~~~~~~~~~
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Three small runtime patches are employed in ``strategy.py`` to hook into
 								desirable locations, in order to override uses of shell, the module executor,
 								and the mechanism for selecting a connection plug-in. While it is hoped the
 								patches can be avoided in future, for interesting versions of Ansible deployed
 								today this simply is not possible, and so they continue to be required.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								The patches are concise and behave conservatively, including by disabling
 								themselves when non-Mitogen connections are in use. Additional third party
 								plug-ins are unlikely to attempt similar patches, so the risk to an established
 								configuration should be minimal.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Flag Emulation
 								~~~~~~~~~~~~~~
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
 								option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
 								equivalent semantics. This allows:
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								* robust support for common ``ansible.cfg`` tricks without reconfiguration,
 								  such as forwarding SSH agents across ``sudo`` invocations,
 								* reporting on conflicting flag combinations,
 								* reporting on unsupported flag combinations,
 								* internally special-casing certain behaviour (like recursive agent forwarding)
 								  without boring the user with the details,
 								* avoiding opening the extension up to untestable scenarios where users can
 								  insert arbitrary garbage between Mitogen and the components it integrates
 								  with,
 								* precise emulation by an alternative implementation, for example if Mitogen
 								  grew support for Paramiko.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								Connection Types
 								----------------
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								Matching Ansible, connection variables are treated on a per-task basis, causing
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								establishment of additional reuseable interpreters as necessary to match the
 								configuration of each task.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												issue #303: add doas to the docs

											
										
										
											6 years ago
+								.. _doas:
 								Doas
 								~~~~
 								``doas`` can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_become_exe``: path to ``doas`` binary.
 								* ``ansible_become_user`` (default: ``root``)
 								* ``ansible_become_pass`` (default: assume passwordless)
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_doas`` connection method:
 								* The inventory hostname has no special meaning.
 								* ``ansible_user``: username to use.
 								* ``ansible_password``: password to use.
 								* ``ansible_python_interpreter``
-												docs: more ansible updates

											
										
										
											7 years ago
+								.. _method-docker:
-												docs: better connection type docs

											
										
										
											7 years ago
+								Docker
 								~~~~~~
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `docker
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/docker.html>`_ except
-												docs: better connection type docs

											
										
										
											7 years ago
+								connection delegation is supported.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``ansible_host``: Name of Docker container (default: inventory hostname).
 								* ``ansible_user``: Name of user within the container to execute as.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												docs: updates for #376 and #371

											
										
										
											6 years ago
+								.. _method-jail:
-												docs: more ansible updates

											
										
										
											7 years ago
+								FreeBSD Jail
 								~~~~~~~~~~~~
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `jail
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/jail.html>`_ except
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								connection delegation is supported.
 								* ``ansible_host``: Name of jail (default: inventory hostname).
 								* ``ansible_user``: Name of user within the jail to execute as.
-												docs: updates for #376 and #371

											
										
										
											6 years ago
+								.. _method-kubectl:
 								Kubernetes Pod
 								~~~~~~~~~~~~~~
 								Like `kubectl
 								<https://docs.ansible.com/ansible/2.6/plugins/connection/kubectl.html>`_ except
 								connection delegation is supported.
 								* ``ansible_host``: Name of pod (default: inventory hostname).
 								* ``ansible_user``: Name of user to authenticate to API as.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								Local
 								~~~~~
-												docs: more ansible updates

											
										
										
											7 years ago
+								Like `local
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								<https://docs.ansible.com/ansible/2.6/plugins/connection/local.html>`_ except
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								connection delegation is supported.
-												docs: initial Ansible extension docs.

											
										
										
											7 years ago
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											7 years ago
+								* ``ansible_python_interpreter``
-												ansible: migrate logging variables into utils.

											
										
										
											7 years ago
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								Process Model
 								^^^^^^^^^^^^^
 								Ansible usually executes local connection commands as a transient subprocess of
 								the forked worker executing a task. With the extension, the local connection
 								exists as a persistent subprocess of the connection multiplexer.
 								This means that global state mutations made to the top-level Ansible process
 								that are normally visible to newly forked subprocesses, such as vars plug-ins
 								that modify the environment, will not be reflected when executing local
 								commands without additional effort.
 								During execution the extension presently mimics the working directory and
 								process environment inheritence of regular Ansible, however it is possible some
 								additional differences exist that may break existing playbooks.
-												docs: more ansible updates

											
										
										
											7 years ago
+								.. _method-lxc:
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								LXC
 								~~~
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								Connect to classic LXC containers, like `lxc
 								<https://docs.ansible.com/ansible/2.6/plugins/connection/lxc.html>`_ except
 								connection delegation is supported, and ``lxc-attach`` is always used rather
 								than the LXC Python bindings, as is usual with ``lxc``.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
 								* ``ansible_python_interpreter``
 								* ``ansible_host``: Name of LXC container (default: inventory hostname).
-												issue #409: add missing path config variables to severa plugins

So every method can be redirected to a stub implementation.

											
										
										
											6 years ago
+								* ``mitogen_lxc_attach_path``: path to ``lxc-attach`` command if not available
 								    on the system path.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								.. _method-lxd:
 								LXD
 								~~~
 								Connect to modern LXD containers, like `lxd
 								<https://docs.ansible.com/ansible/2.6/plugins/connection/lxd.html>`_ except
 								connection delegation is supported. The ``lxc`` command must be available on
 								the host machine.
 								* ``ansible_python_interpreter``
 								* ``ansible_host``: Name of LXC container (default: inventory hostname).
-												issue #409: add missing path config variables to severa plugins

So every method can be redirected to a stub implementation.

											
										
										
											6 years ago
+								* ``mitogen_lxc_path``: path to ``lxc`` command if not available on the system
 								  path.
-												Support LXD; closes #339.

											
										
										
											6 years ago
-												docs: document local connection process model difference.

											
										
										
											6 years ago
+								.. _machinectl:
 								Machinectl
 								~~~~~~~~~~
 								Like the `machinectl third party plugin
 								<https://github.com/BaxterStockman/ansible-connection-machinectl>`_ except
 								connection delegation is supported. This is a light wrapper around the
 								:ref:`setns <setns>` method.
 								* ``ansible_host``: Name of Docker container (default: inventory hostname).
 								* ``ansible_user``: Name of user within the container to execute as.
 								* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
 								  as ``/bin/machinectl``.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								.. _setns:
-												docs: better connection type docs

											
										
										
											7 years ago
+								Setns
 								~~~~~
-												issue #150: ansible: add basic Docker support.

											
										
										
											7 years ago
-												docs: better connection type docs

											
										
										
											7 years ago
+								The ``setns`` method connects to Linux containers via `setns(2)
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								<https://linux.die.net/man/2/setns>`_. Unlike :ref:`method-docker`,
 								:ref:`method-lxc`, and :ref:`method-lxd` the namespace transition is handled
 								internally, ensuring optimal throughput to the child. This is necessary for
 								:ref:`machinectl` where only PTY channels are supported.
-												docs: better connection type docs

											
										
										
											7 years ago
-												docs: more links

											
										
										
											7 years ago
+								A utility program must be installed to discover the PID of the container's root
 								process.
-												docs: better connection type docs

											
										
										
											7 years ago
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								* ``mitogen_kind``: one of ``docker``, ``lxc``, ``lxd`` or ``machinectl``.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``ansible_host``: Name of container as it is known to the corresponding tool
 								  (default: inventory hostname).
-												setns: support changing user.

To match existing third party plugin.

											
										
										
											7 years ago
+								* ``ansible_user``: Name of user within the container to execute as.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``mitogen_docker_path``: path to Docker if not available on the system path.
-												Support LXD; closes #339.

											
										
										
											6 years ago
+								* ``mitogen_lxc_path``: path to LXD's ``lxc`` command if not available as
 								  ``lxc-info``.
 								* ``mitogen_lxc_info_path``: path to LXC classic's ``lxc-info`` command if not
 								  available as ``lxc-info``.
-												docs: better connection type docs

											
										
										
											7 years ago
+								* ``mitogen_machinectl_path``: path to ``machinectl`` command if not available
 								  as ``/bin/machinectl``.
-												ansible: support su become method.

											
										
										
											7 years ago
+								.. _su:
 								Su
 								~~
 								Su can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_su_exe``, ``ansible_become_exe``
 								* ``ansible_su_user``, ``ansible_become_user`` (default: ``root``)
 								* ``ansible_su_pass``, ``ansible_become_pass`` (default: assume passwordless)
 								* ``su_flags``, ``become_flags``
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_su`` connection method:
 								* The inventory hostname has no special meaning.
 								* ``ansible_user``: username to su as.
 								* ``ansible_password``: password to su as.
 								* ``ansible_python_interpreter``
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								.. _sudo:
 								Sudo
 								~~~~
 								Sudo can be used as a connection method that supports connection delegation, or
 								as a become method.
 								When used as a become method:
 								* ``ansible_python_interpreter``
 								* ``ansible_sudo_exe``, ``ansible_become_exe``
 								* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
 								* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
 								* ``sudo_flags``, ``become_flags``
 								* ansible.cfg: ``timeout``
 								When used as the ``mitogen_sudo`` connection method:
-												docs: more ansible updates

											
										
										
											7 years ago
+								* The inventory hostname has no special meaning.
-												ansible: add mitogen_sudo method, split out connection subclasses.

Slowly moving towards real implementations in those files.

											
										
										
											7 years ago
+								* ``ansible_user``: username to sudo as.
 								* ``ansible_password``: password to sudo as.
 								* ``sudo_flags``, ``become_flags``
 								* ``ansible_python_interpreter``
-												docs: better connection type docs

											
										
										
											7 years ago
+								SSH
 								~~~
-												Updated readme with build status, updated docs

											
										
										
											6 years ago
+								Like `ssh <https://docs.ansible.com/ansible/2.6/plugins/connection/ssh.html>`_
-												docs: more ansible updates

											
										
										
											7 years ago
+								except connection delegation is supported.
-												docs: better connection type docs

											
										
										
											7 years ago
 								* ``ansible_ssh_timeout``
 								* ``ansible_host``, ``ansible_ssh_host``
 								* ``ansible_user``, ``ansible_ssh_user``
 								* ``ansible_port``, ``ssh_port``
 								* ``ansible_ssh_executable``, ``ssh_executable``
 								* ``ansible_ssh_private_key_file``
 								* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
 								* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``
-												issue #278: ansible: support mitogen_ssh_debug_level variable.

											
										
										
											6 years ago
+								* ``mitogen_ssh_debug_level``: integer between `0..3` indicating the SSH client
 								  debug level. Ansible must also be run with '-vvv' to view the output.
-												Add link to IRC; closes #116

											
										
										
											7 years ago
-												ansible: migrate logging variables into utils.

											
										
										
											7 years ago
+								Debugging
 								---------
-												ansible: better support for diagnosing hangs

* Always enable the faulthandler module in the top-level process if it
  is available.
* Make MITOGEN_DUMP_THREAD_STACKS interval configurable, to better
  handle larger runs.
* Add docs subsection on diagnosing hangs.

Conflicts:
	ansible_mitogen/process.py

											
										
										
											6 years ago
+								Diagnostics and :py:mod:`logging` package output on targets are usually
 								discarded. With Mitogen, these are captured and forwarded to the controller
 								where they can be viewed with ``-vvv``. Basic high level logs are produced with
 								``-vvv``, with logging of all IO on the controller with ``-vvvv`` or higher.
 								While uncaptured standard IO and the logging package on targets is forwarded,
 								it is not possible to receive IO activity logs, as the forwarding process would
 								would itself generate additional IO.
 								To receive a complete trace of every process on every machine, file-based
 								logging is necessary. File-based logging can be enabled by setting
 								``MITOGEN_ROUTER_DEBUG=1`` in your environment. When file-based logging is
 								enabled, one file per context will be created on the local machine and every
 								target machine, as ``/tmp/mitogen.<pid>.log``.
-												issue #412: pad out debugging docs, add get_stack to changelog.

											
										
										
											6 years ago
 								Common Problems
 								~~~~~~~~~~~~~~~
 								The most common bug reports fall into the following categories, so it is worth
 								checking whether you can categorize a problem using the tools provided before
 								reporting it:
 								**Missed/Incorrect Configuration Variables**
 								    In some cases Ansible may support a configuration variable that Mitogen
 								    does not yet support, or Mitogen supports, but the support is broken. For
 								    example, Mitogen may pick the wrong username or SSH parameters.
 								    To detect this, use the special ``mitogen_get_stack`` action described
-												docs: docs and docstrings.

											
										
										
											6 years ago
+								    below to verify the settings Mitogen has chosen for the connection make
 								    sense.
-												issue #412: pad out debugging docs, add get_stack to changelog.

											
										
										
											6 years ago
 								**Process Environment Differences**
-												docs: docs and docstrings.

											
										
										
											6 years ago
+								    Mitogen's process model differs significantly to Ansible's in many places.
 								    In the past, bugs have been reported because Ansible plug-ins modify an
 								    environment variable after Mitogen processes are started.
 								    If your task's failure may relate to the process environment in some way,
 								    for example, ``SSH_AUTH_SOCK``, ``LC_ALL`` or ``PATH``, then an environment
 								    difference may explain it. Environment differences are always considered
 								    bugs in the extension, and are very easy to repair, so even if you find a
 								    workaround, please report them to avoid someone else encountering the same
 								    problem.
-												issue #412: pad out debugging docs, add get_stack to changelog.

											
										
										
											6 years ago
 								**Variable Expansion Differences**
 								    To avoid many classes of bugs, Mitogen avoids shell wherever possible.
 								    Ansible however is traditionally built on shell, and it is often difficult
 								    to tell just how many times a configuration parameter will pass through
 								    shell expansion and quoting, and in what context before it is used.
 								    Due to this, in some circumstances Mitogen may parse some expanded
 								    variables differently, for example, in the wrong user account. Careful
 								    review of ``-vvv`` and ``mitogen_ssh_debug_level`` logs can reveal this.
 								    For example in the past, Mitogen used a different method of expanding
 								    ``~/.ssh/id_rsa``, causing authentication to fail when ``ansible-playbook``
 								    was run via ``sudo -E``.
 								**External Tool Integration Differences**
 								    Mitogen reimplements any aspect of Ansible that involves integrating with
 								    SSH, sudo, Docker, or related tools. For this reason, sometimes its support
-												docs: s/doffers/differs/

											
										
										
											6 years ago
+								    for those tools differs or is less mature than in Ansible.
-												issue #412: pad out debugging docs, add get_stack to changelog.

											
										
										
											6 years ago
 								    In the past Mitogen has had bug reports due to failing to recognize a
 								    particular variation of a login or password prompt on an exotic or
 								    non-English operating system, or confusing a login banner for a password
 								    prompt. Careful review of ``-vvv`` logs help identify these cases, as
 								    Mitogen logs all strings it receives during connection, and how it
 								    interprets them.
 								.. _mitogen-get-stack:
 								The `mitogen_get_stack` Action
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								When a Mitogen strategy is loaded, a special ``mitogen_get_stack`` action is
 								available that returns a concise description of the connection configuration as
 								extracted from Ansible and passed to the core library. Using it, you can learn
 								whether a problem lies in the Ansible extension or deeper in library code.
 								The action may be used in a playbook as ``mitogen_get_stack:`` just like a
 								regular module, or directly from the command-line::
 								    $ ANSIBLE_STRATEGY=mitogen_linear ansible -m mitogen_get_stack -b -k k3
 								    SSH password:
 								    k3 | SUCCESS => {
 								        "changed": true,
 								        "result": [
 								            {
 								                "kwargs": {
 								                    "check_host_keys": "enforce",
 								                    "connect_timeout": 10,
 								                    "hostname": "k3",
 								                    "identities_only": false,
 								                    "identity_file": null,
 								                    "password": "mysecretpassword",
 								                    "port": null,
 								                    "python_path": null,
 								                    "ssh_args": [
 								                        "-C",
 								                        "-o",
 								                        "ControlMaster=auto",
 								                        "-o",
 								                        "ControlPersist=60s"
 								                    ],
 								                    "ssh_debug_level": null,
 								                    "ssh_path": "ssh",
 								                    "username": null
 								                },
 								                "method": "ssh"
 								            },
 								            {
 								                "enable_lru": true,
 								                "kwargs": {
 								                    "connect_timeout": 10,
 								                    "password": null,
 								                    "python_path": null,
 								                    "sudo_args": [
 								                        "-H",
 								                        "-S",
 								                        "-n"
 								                    ],
 								                    "sudo_path": null,
 								                    "username": "root"
 								                },
 								                "method": "sudo"
 								            }
 								        ]
 								    }
 								Each object in the list represents a single 'hop' in the connection, from
 								nearest to furthest. Unlike in Ansible, the core library treats ``become``
 								steps and SSH steps identically, so they are represented distinctly in the
 								output.
 								The presence of ``null`` means no explicit value was extracted from Ansible,
 								and either the Mitogen library or SSH will choose a value for the parameter. In
 								the example above, Mitogen will choose ``/usr/bin/python`` for ``python_path``,
 								and SSH will choose ``22`` for ``port``, or whatever ``Port`` it parses from
 								``~/.ssh/config``. Note the presence of ``null`` may indicate the extension
 								failed to extract the correct value.
 								When using ``mitogen_get_stack`` to diagnose a problem, pay special attention
 								to ensuring the invocation exactly matches the problematic task. For example,
 								if the failing task has ``delegate_to:`` or ``become:`` enabled, the
 								``mitogen_get_stack`` invocation must include those statements in order for the
 								output to be accurate.
 								If a playbook cannot start at all, you may need to temporarily use
 								``gather_facts: no`` to allow the first task to proceed. This action does not
 								create connections, so if it is the first task, it is still possible to review
 								its output.
 								The `mitogen_ssh_debug_level` Variable
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Mitogen has support for capturing SSH diagnostic logs, and integrating them
 								into the regular debug log output produced when ``-vvv`` is active. This
 								provides a single audit trail of every component active during SSH
 								authentication.
 								Particularly for authentication failures, setting this variable to 3, in
 								combination with ``-vvv``, allows review of every parameter passed to SSH, and
 								review of every action SSH attempted during authentication.
 								For example, this method can be used to ascertain whether SSH attempted agent
 								authentication, or what private key files it was able to access and which it tried.
-												docs: include strace wrapper trick.

											
										
										
											6 years ago
+								Post-authentication Bootstrap Failure
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								If logging indicates Mitogen was able to authenticate, but some error occurred
 								after authentication preventing the Python bootstrap from completing, it can be
 								immensely useful to temporarily replace ``ansible_python_interpreter`` with a
 								wrapper that runs Python under ``strace``::
 								    $ ssh badbox
 								    badbox$ cat > strace-python.sh
 								    #!/bin/sh
 								    strace -o /tmp/strace-python.$$ -ff -s 100 python "$@"
 								    ^D
 								    badbox$ chmod +x strace-python.sh
 								    badbox$ logout
 								    $ ansible-playbook site.yml \
 								        -e ansible_python_interpreter=./strace-python.sh \
 								        -l badbox
 								This will produce a potentially large number of log files under ``/tmp/``. The
 								lowest-numbered traced PID is generally the main Python interpreter. The most
 								intricate bootstrap steps happen there, any error should be visible near the
 								end of the trace.
 								It is also possible the first stage bootstrap failed. That is usually the next
 								lowest-numbered PID and tends to be the smallest file. Even if you can't
 								ascertain the problem with your configuration from these logs, including them
 								in a bug report can save days of detective effort.
-												docs: some more cleanups

- add faulthandler/thread stacks to changelog.
- various api.rst cleanups.
- docs: explain chain_id in howitworks.

											
										
										
											6 years ago
+								.. _diagnosing-hangs:
-												ansible: better support for diagnosing hangs

* Always enable the faulthandler module in the top-level process if it
  is available.
* Make MITOGEN_DUMP_THREAD_STACKS interval configurable, to better
  handle larger runs.
* Add docs subsection on diagnosing hangs.

Conflicts:
	ansible_mitogen/process.py

											
										
										
											6 years ago
+								Diagnosing Hangs
 								~~~~~~~~~~~~~~~~
 								If you encounter a hang, the ``MITOGEN_DUMP_THREAD_STACKS=<secs>`` environment
 								variable arranges for each process on each machine to dump each thread stack
 								into the logging framework every `secs` seconds, which is visible when running
 								with ``-vvv``.
 								However, certain controller hangs may render ``MITOGEN_DUMP_THREAD_STACKS``
 								ineffective, or occur too infrequently for interactive reproduction. In these
 								cases `faulthandler <https://faulthandler.readthedocs.io/>`_ may be used:
 . For Python 2, ``pip install faulthandler``. This is unnecessary on Python 3.
 . Once the hang occurs, observe the process tree using ``pstree`` or ``ps
 								   --forest``.
 . The most likely process to be hung is the connection multiplexer, which can
 								   easily be identified as the parent of all SSH client processes.
 . Send ``kill -SEGV <pid>`` to the multiplexer PID, causing it to print all
 								   thread stacks.
 . `File a bug <https://github.com/dw/mitogen/issues/new/>`_ including a copy
 								   of the stacks, along with a description of the last task executing prior to
 								   the hang.
-												ansible: MITOGEN_DUMP_THREAD_STACKS for mux process too

											
										
										
											7 years ago
-												docs: add another strace recipe

											
										
										
											6 years ago
+								It is possible the hang occurred in a process on a target. If ``strace`` is
 								available, look for the host name not listed in Ansible output as reporting a
 								result for the most recent task, log into it, and use ``strace -ff -p <pid>``
 								on each process whose name begins with ``mitogen:``::
 								    $ strace -ff -p 29858
 								    strace: Process 29858 attached with 3 threads
 								    [pid 29864] futex(0x55ea9be52f60, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, 0xffffffff <unfinished ...>
 								    [pid 29860] restart_syscall(<... resuming interrupted poll ...> <unfinished ...>
 								    [pid 29858] futex(0x55ea9be52f60, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, 0xffffffff
 								    ^C
 								    $
 								This shows one thread waiting on IO (``poll``) and two more waiting on the same
 								lock. It is taken from a real example of a deadlock due to a forking bug.
 								Please include any such information for all processes that you are able to
 								collect in any bug report.
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Getting Help
 								~~~~~~~~~~~~
 								Some users and developers hang out on the
 								`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
 								FreeNode IRC network.
-												docs: tidy ansible docs.

											
										
										
											7 years ago
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Sample Profiles
 								---------------
-												docs: tidy ansible docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Local VM connection
 								~~~~~~~~~~~~~~~~~~~
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: 4kify images.

These commits were squashed to avoid repo size exploding.

											
										
										
											6 years ago
+								This demonstrates Mitogen vs. connection pipelining to a local VM executing
 								``bench/loop-100-items.yml``, which simply executes ``hostname`` 100 times.
 								Mitogen requires **43x less bandwidth and 6.5x less time**.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: 4kify images.

These commits were squashed to avoid repo size exploding.

											
										
										
											6 years ago
+								.. image:: images/ansible/run_hostname_100_times_mito.svg
 								.. image:: images/ansible/run_hostname_100_times_plain.svg
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Kathmandu to Paris
 								~~~~~~~~~~~~~~~~~~
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								This is a full Django application playbook over a ~180ms link between Kathmandu
 								and Paris. Aside from large pauses where the host performs useful work, the
 								high latency of this link means Mitogen only manages a 1.7x speedup.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								Many early roundtrips are due to inefficiencies in Mitogen's importer that will
 								be fixed over time, however the majority, comprising at least 10 seconds, are
 								due to idling while the host's previous result and next command are in-flight
 								on the network.
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								The initial extension lays groundwork for exciting structural changes to the
 								execution model: a future version will tackle latency head-on by delegating
 								some control flow to the target host, melding the performance and scalability
 								benefits of pull-based operation with the management simplicity of push-based
 								operation.
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago
-												docs: major Ansible page update.

											
										
										
											7 years ago
+								.. image:: images/ansible/costapp.png
-												ansible: limited support for become_flags, more docs.

											
										
										
											7 years ago