mitogen/docs/ansible.rst


Ansible Extension
=================

.. image:: images/ansible/cell_division.png
    :align: right

An extension to `Ansible`_ is included that implements host connections over
Mitogen, replacing embedded shell invocations with pure-Python equivalents
invoked via highly efficient remote procedure calls tunnelled over SSH. No
changes are required to the target hosts.

The extension is approaching a generally dependable state, and works well for
many real-world playbooks. `Bug reports`_ in this area are very welcome –
Ansible is a huge beast, and only significant testing will prove the
extension's soundness.

Divergence from Ansible's normal behaviour is considered a bug, so please
report anything you notice, regardless of how inconsequential it may seem.

.. _Ansible: https://www.ansible.com/

.. _Bug reports: https://goo.gl/yLKZiJ


Overview
--------

You should **expect a 1.25x - 7x speedup** and a **CPU usage reduction of at
least 2x**, depending on network conditions, the specific modules executed, and
time spent by the target host already doing useful work. Mitogen cannot speed
up a module once it is executing, it can only ensure the module executes as
quickly as possible.

* **A single SSH connection is used for each target host**, in addition to one
  sudo invocation per distinct user account. Subsequent playbook steps always
  reuse the same connection. This is much better than SSH multiplexing combined
  with pipelining, as significant state can be maintained in RAM between steps,
  and the system logs aren't filled with spam from repeat SSH and sudo
  invocations.

* **A single Python interpreter is used** per host and sudo account combination
  for the duration of the run, avoiding the repeat cost of invoking multiple
  interpreters and recompiling imports, saving 300-800 ms for every playbook
  step.

* Remote interpreters reuse Mitogen's module import mechanism, caching uploaded
  dependencies between steps at the host and user account level. As a
  consequence, **bandwidth usage is consistently an order of magnitude lower**
  compared to SSH pipelining, and around 5x fewer frames are required to
  traverse the wire for a run to complete successfully.

* **No writes to the target host's filesystem occur**, unless explicitly
  triggered by a playbook step. In all typical configurations, Ansible
  repeatedly rewrites and extracts ZIP files to multiple temporary directories
  on the target host. Since no temporary files are used, security issues
  relating to those files in cross-account scenarios are entirely avoided.


Demo
----

This demonstrates Ansible running a subset of the Mitogen integration tests
concurrent to an equivalent run using the extension.

.. raw:: html

    <video width="720" height="439" controls>
        <source src="http://k3.botanicus.net/tmp/ansible_mitogen.mp4" type="video/mp4">
    </video>


Testimonials
------------

* "With mitogen **my playbook runtime went from 45 minutes to just under 3
  minutes**. Awesome work!"

* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
  minutes**. Thanks!"

* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
  anymore** :)"

* "**Works like a charm**, thank you for your quick response"

* "I tried it out. **He is not kidding about the speed increase**."

* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
  can't quite believe it."


Installation
------------

.. caution::

    Please review the behavioural differences documented below prior to use.

1. Verify Ansible 2.4 and Python 2.7 are listed in the output of ``ansible
   --version``
2. Download and extract https://github.com/dw/mitogen/archive/master.zip
3. Modify ``ansible.cfg``:

   .. code-block:: dosini

        [defaults]
        strategy_plugins = /path/to/mitogen-master/ansible_mitogen/plugins/strategy
        strategy = mitogen_linear

   The ``strategy`` key is optional. If omitted, you can set the
   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable on a per-run basis.
   Like ``mitogen_linear``, the ``mitogen_free`` strategy also exists to mimic
   the built-in ``free`` strategy.

4. Cross your fingers and try it.


Limitations
-----------

* Only Ansible 2.4 is being used for development, with occasional tests under
  2.5, 2.3 and 2.2. It should be more than possible to fully support at least
  2.3, if not also 2.2.

* Only the ``sudo`` become method is available, however adding new methods is
  straightforward, and eventually at least ``su`` will be included.

* The extension's performance benefits do not scale perfectly linearly with the
  number of targets. This is a subject of ongoing investigation and
  improvements will appear in time.

* "Module Replacer" style modules are not yet supported. These rarely appear in
  practice, and light Github code searches failed to reveal many examples of
  them.


Behavioural Differences
-----------------------

* Ansible permits up to ``forks`` SSH connections to be setup simultaneously,
  whereas in Mitogen this is handled by a thread pool. Eventually this pool
  will become per-CPU, but meanwhile, a maximum of 16 SSH connections may be
  established simultaneously by default. This can be increased or decreased
  setting the ``MITOGEN_POOL_SIZE`` environment variable.

* Mitogen treats connection timeouts for the SSH and become steps of a task
  invocation separately, meaning that in some circumstances the configured
  timeout may appear to be doubled. This is since Mitogen internally treats the
  creation of an SSH account context separately to the creation of a sudo
  account context proxied via that SSH account.

  A future revision may detect a sudo account context created immediately
  following its parent SSH account, and try to emulate Ansible's existing
  timeout semantics.

* Local commands are executed in a reuseable Python interpreter created
  identically to interpreters used on remote hosts. At present only one such
  interpreter per ``become_user`` exists, and so only one local action may be
  executed simultaneously per local user account.

  Ansible usually permits up to ``ansible.cfg:forks`` simultaneous local
  actions. Any long-running local actions that execute for every target will
  experience artificial serialization, causing slowdown equivalent to
  `task_duration * num_targets`. This will be fixed soon.

* Asynchronous jobs exist only for the duration of a run, and cannot be
  queried by subsequent ansible-playbook invocations. Since the ability to
  query job IDs across runs relied on an implementation detail, it is not
  expected this will break any real-world playbooks.


How Modules Execute
-------------------

Ansible usually modifies, recompresses and reuploads modules every time they
run on a target, work that must be repeated by the controller for every
playbook step.

With the extension any modifications are done on the target, allowing pristine
copies of modules to be cached, reducing the necessity to re-transfer modules
for each invocation. Unmodified modules are uploaded once on first use and
cached in RAM for the remainder of the run.

**Binary**
    Native executables detected using a complex heuristic. Arguments are
    supplied as a JSON file whose path is the sole script parameter.

**Module Replacer**
    Python scripts detected by the presence of
    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
    is not yet supported.

**New-Style**
    Python scripts detected by the presence of ``from ansible.module_utils.``
    appearing in their source. Arguments are supplied as JSON written to
    ``sys.stdin`` of the target interpreter.

**JSON_ARGS**
    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
    in the script source. The interpreter directive (``#!interpreter``) is
    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
    if one is set. Arguments are supplied as JSON mixed into the script as a
    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.

**WANT_JSON**
    Detected by the presence of ``WANT_JSON`` appearing in the script source.
    The interpreter directive is adjusted as above. Arguments are supplied as a
    JSON file whose path is the sole script parameter.

**Old Style**
    Files not matching any of the above tests. The interpreter directive is
    adjusted as above. Arguments are supplied as a file whose path is the sole
    script parameter. The format of the file is ``"key=repr(value)[
    key2=repr(value2)[ ..]] "``.


Sample Profiles
---------------

Local VM connection
~~~~~~~~~~~~~~~~~~~

This demonstrates Mitogen vs. connection pipelining to a local VM, executing
the 100 simple repeated steps of ``run_hostname_100_times.yml`` from the
examples directory. Mitogen requires **43x less bandwidth and 4.25x less
time**.

.. image:: images/ansible/run_hostname_100_times.png


Kathmandu to Paris
~~~~~~~~~~~~~~~~~~

This is a full Django application playbook over a ~180ms link between Kathmandu
and Paris. Aside from large pauses where the host performs useful work, the
high latency of this link means Mitogen only manages a 1.7x speedup.

Many early roundtrips are due to inefficiencies in Mitogen's importer that will
be fixed over time, however the majority, comprising at least 10 seconds, are
due to idling while the host's previous result and next command are in-flight
on the network.

The initial extension lays groundwork for exciting structural changes to the
execution model: a future version will tackle latency head-on by delegating
some control flow to the target host, melding the performance and scalability
benefits of pull-based operation with the management simplicity of push-based
operation.

.. image:: images/ansible/costapp.png


SSH Variables
-------------

Matching Ansible's existing model, these variables are treated on a per-task
basis, causing establishment of additional reuseable interpreters as necessary
to match the configuration of each task.

This list will grow as more missing pieces are discovered.

* ``ansible_ssh_timeout``
* ``ansible_host``, ``ansible_ssh_host``
* ``ansible_user``, ``ansible_ssh_user``
* ``ansible_port``, ``ssh_port``
* ``ansible_ssh_executable``, ``ssh_executable``
* ``ansible_ssh_private_key_file``
* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``


Sudo Variables
--------------

* ``ansible_python_interpreter``
* ``ansible_sudo_exe``, ``ansible_become_exe``
* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
* ``sudo_flags``, ``become_flags``
* ansible.cfg: ``timeout``


Docker Variables
----------------

Note: Docker support is only intended for developer testing, it might disappear
entirely prior to a stable release.

* ansible_host


Chat on IRC
-----------

Some users and developers hang out on the
`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
FreeNode IRC network.


Debugging
---------

Normally with Ansible, diagnostics and use of the :py:mod:`logging` package
output on the target machine are discarded. With Mitogen, all of this is
captured and returned to the host machine, where it can be viewed as desired
with ``-vvv``. Basic high level logs are produced with ``-vvv``, with logging
of all IO on the controller with ``-vvvv`` or higher.

Although use of standard IO and the logging package on the target is forwarded
to the controller, it is not possible to receive IO activity logs, as the
processs of receiving those logs would would itself generate IO activity. To
receive a complete trace of every process on every machine, file-based logging
is necessary. File-based logging can be enabled by setting
``MITOGEN_ROUTER_DEBUG=1`` in your environment.

When file-based logging is enabled, one file per context will be created on the
local machine and every target machine, as ``/tmp/mitogen.<pid>.log``.


Implementation Notes
--------------------

Interpreter Reuse
~~~~~~~~~~~~~~~~~

The extension aggressively reuses the single target Python interpreter to
execute every module. While this generally works well, it violates an unwritten
assumption regarding Ansible modules, and so it is possible a buggy module
could cause a run to fail, or for unrelated modules to interact with each other
due to bad hygiene.

Before reporting a bug relating to a module behaving incorrectly, please re-run
your playbook with ``-e mitogen_task_isolation=fork`` to see if the problem
abates. This may also be set on a per-task basis:

::

    - name: My task.
      broken_module:
        some_option: true
      vars:
        mitogen_task_isolation: fork

If forking fixes your problem, **please report a bug regardless**, as an
internal list can be updated to prevent users bumping into the same problem in
future.


Interpreter Recycling
~~~~~~~~~~~~~~~~~~~~~

The extension limits the number of persistent interpreters in use. When the
limit is reached, the youngest interpreter is terminated before starting a new
interpreter, preventing situations like below from triggering memory
exhaustion.

.. code-block:: yaml

    - hosts: corp_boxes
      vars:
        user_directory: [
          # 10,000 corporate user accounts
        ]
      tasks:
        - name: Create user bashrc
          become: true
          vars:
            ansible_become_user: "{{item}}"
          copy:
            src: bashrc
            dest: "~{{item}}/.bashrc"
          with_items: "{{user_directory}}"

This recycling does not occur for direct connections from the controller, and
it is keyed on a per-target basis, i.e. up to 20 interpreters may exist for
each directly connected target.

The youngest interpreter is chosen to preserve useful accounts, like "root" or
"postgresql" that tend to appear early in a run, however it is simple to
construct a playbook that defeats this strategy. A future version will key
interpreters on the identity of their creating task, file and/or playbook,
avoiding useful account recycling in every scenario.

To raise or lower the limit from 20, set the ``MITOGEN_MAX_INTERPRETERS``
environment variable to a new value.


Runtime Patches
~~~~~~~~~~~~~~~

Three small runtime patches are employed in ``strategy.py`` to hook into
desirable locations, in order to override uses of shell, the module executor,
and the mechanism for selecting a connection plug-in. While it is hoped the
patches can be avoided in future, for interesting versions of Ansible deployed
today this simply is not possible, and so they continue to be required.

The patches are concise and behave conservatively, including by disabling
themselves when non-Mitogen connections are in use. Additional third party
plug-ins are unlikely to attempt similar patches, so the risk to an established
configuration should be minimal.


Standard IO
~~~~~~~~~~~

Ansible uses pseudo TTYs for most invocations, to allow it to handle typing
passwords interactively, however it disables pseudo TTYs for certain commands
where standard input is required or ``sudo`` is not in use. Additionally when
SSH multiplexing is enabled, a string like ``Shared connection to localhost
closed\r\n`` appears in ``stderr`` of every invocation.

Mitogen does not naturally require either of these, as command output is
embedded within the SSH stream, and it can simply call :py:func:`pty.openpty`
in every location an interactive password must be typed.

A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
merged together into a single ``stdout`` variable, with carriage returns
inserted in the output by the TTY layer. However ugly, the extension emulates
all of this behaviour precisely, to avoid breaking playbooks that expect
certain text to appear in certain variables with certain linefeed characters.

See `Ansible#14377`_ for related discussion.

.. _Ansible#14377: https://github.com/ansible/ansible/issues/14377


Flag Emulation
~~~~~~~~~~~~~~

Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
equivalent semantics. This allows:

* robust support for common ``ansible.cfg`` tricks without reconfiguration,
  such as forwarding SSH agents across ``sudo`` invocations,
* reporting on conflicting flag combinations,
* reporting on unsupported flag combinations,
* internally special-casing certain behaviour (like recursive agent forwarding)
  without boring the user with the details,
* avoiding opening the extension up to untestable scenarios where users can
  insert arbitrary garbage between Mitogen and the components it integrates
  with,
* precise emulation by an alternative implementation, for example if Mitogen
  grew support for Paramiko.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								Ansible Extension
 								=================
-												examples: rename playbooks for clarity.

											
										
										
											6 years ago
+								.. image:: images/ansible/cell_division.png
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								    :align: right
-												docs: remove warning labels.

											
										
										
											6 years ago
+								An extension to `Ansible`_ is included that implements host connections over
 								Mitogen, replacing embedded shell invocations with pure-Python equivalents
 								invoked via highly efficient remote procedure calls tunnelled over SSH. No
 								changes are required to the target hosts.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												docs: remove warning labels.

											
										
										
											6 years ago
+								The extension is approaching a generally dependable state, and works well for
 								many real-world playbooks. `Bug reports`_ in this area are very welcome –
 								Ansible is a huge beast, and only significant testing will prove the
 								extension's soundness.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												docs: extra ansible paragraph.

											
										
										
											6 years ago
+								Divergence from Ansible's normal behaviour is considered a bug, so please
 								report anything you notice, regardless of how inconsequential it may seem.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								.. _Ansible: https://www.ansible.com/
 								.. _Bug reports: https://goo.gl/yLKZiJ
 								Overview
 								--------
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											6 years ago
+								You should **expect a 1.25x - 7x speedup** and a **CPU usage reduction of at
-												docs: mention CPU usage reduction

											
										
										
											6 years ago
+								least 2x**, depending on network conditions, the specific modules executed, and
 								time spent by the target host already doing useful work. Mitogen cannot speed
 								up a module once it is executing, it can only ensure the module executes as
 								quickly as possible.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												docs: more marketing, add lots of drama bold.

											
										
										
											6 years ago
+								* **A single SSH connection is used for each target host**, in addition to one
 								  sudo invocation per distinct user account. Subsequent playbook steps always
 								  reuse the same connection. This is much better than SSH multiplexing combined
 								  with pipelining, as significant state can be maintained in RAM between steps,
 								  and the system logs aren't filled with spam from repeat SSH and sudo
 								  invocations.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												docs: more marketing, add lots of drama bold.

											
										
										
											6 years ago
+								* **A single Python interpreter is used** per host and sudo account combination
 								  for the duration of the run, avoiding the repeat cost of invoking multiple
-												docs: more modest and accurate numbers for Ansible

											
										
										
											6 years ago
+								  interpreters and recompiling imports, saving 300-800 ms for every playbook
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								  step.
 								* Remote interpreters reuse Mitogen's module import mechanism, caching uploaded
 								  dependencies between steps at the host and user account level. As a
-												docs: more marketing, add lots of drama bold.

											
										
										
											6 years ago
+								  consequence, **bandwidth usage is consistently an order of magnitude lower**
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								  compared to SSH pipelining, and around 5x fewer frames are required to
 								  traverse the wire for a run to complete successfully.
-												docs: more marketing, add lots of drama bold.

											
										
										
											6 years ago
+								* **No writes to the target host's filesystem occur**, unless explicitly
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								  triggered by a playbook step. In all typical configurations, Ansible
 								  repeatedly rewrites and extracts ZIP files to multiple temporary directories
 								  on the target host. Since no temporary files are used, security issues
 								  relating to those files in cross-account scenarios are entirely avoided.
-												docs: link to Ansible video demo

											
										
										
											6 years ago
+								Demo
 								----
 								This demonstrates Ansible running a subset of the Mitogen integration tests
 								concurrent to an equivalent run using the extension.
 								.. raw:: html
-												issue #106: docs: tidyup.

											
										
										
											6 years ago
+								    <video width="720" height="439" controls>
-												docs: link to Ansible video demo

											
										
										
											6 years ago
+								        <source src="http://k3.botanicus.net/tmp/ansible_mitogen.mp4" type="video/mp4">
 								    </video>
-												docs: reorder sections

											
										
										
											6 years ago
+								Testimonials
 								------------
 								* "With mitogen **my playbook runtime went from 45 minutes to just under 3
 								  minutes**. Awesome work!"
 								* "The runtime was reduced from **1.5 hours on 4 servers to just under 3
 								  minutes**. Thanks!"
 								* "Oh, performance improvement using Mitogen is *huge*. As mentioned before,
 								  running with Mitogen enables takes 7m36 (give or take a few seconds). Without
 								  Mitogen, the same run takes 19m49! **I'm not even deploying without Mitogen
 								  anymore** :)"
 								* "**Works like a charm**, thank you for your quick response"
 								* "I tried it out. **He is not kidding about the speed increase**."
-												docs: slightly bikeshed last testimonial

											
										
										
											6 years ago
+								* "I don't know what kind of dark magic @dmw_83 has done, but his Mitogen
 								  strategy took Clojars' Ansible runs from **14 minutes to 2 minutes**. I still
 								  can't quite believe it."
-												Add testimonial from Clojars
											
										
										
											6 years ago
-												docs: reorder sections

											
										
										
											6 years ago
-												ansible: doc updates

											
										
										
											6 years ago
+								Installation
 								------------
 								.. caution::
-												docs: remove warning labels.

											
										
										
											6 years ago
+								    Please review the behavioural differences documented below prior to use.
-												ansible: doc updates

											
										
										
											6 years ago
 . Verify Ansible 2.4 and Python 2.7 are listed in the output of ``ansible
 								   --version``
-												docs: Convert all URLs that support https://

Excluded: graphml XML namespaces, links to e.g. Fabric homepage

Fixes #128

											
										
										
											6 years ago
+. Download and extract https://github.com/dw/mitogen/archive/master.zip
-												ansible: doc updates

											
										
										
											6 years ago
+. Modify ``ansible.cfg``:
 								   .. code-block:: dosini
 								        [defaults]
 								        strategy_plugins = /path/to/mitogen-master/ansible_mitogen/plugins/strategy
-												ansible: Add support for free strategy.

											
										
										
											6 years ago
+								        strategy = mitogen_linear
-												ansible: doc updates

											
										
										
											6 years ago
 								   The ``strategy`` key is optional. If omitted, you can set the
-												ansible: Add support for free strategy.

											
										
										
											6 years ago
+								   ``ANSIBLE_STRATEGY=mitogen_linear`` environment variable on a per-run basis.
 								   Like ``mitogen_linear``, the ``mitogen_free`` strategy also exists to mimic
 								   the built-in ``free`` strategy.
-												ansible: doc updates

											
										
										
											6 years ago
 . Cross your fingers and try it.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								Limitations
 								-----------
-												docs: rearrange more ansible risks

											
										
										
											6 years ago
+								* Only Ansible 2.4 is being used for development, with occasional tests under
 .5, 2.3 and 2.2. It should be more than possible to fully support at least
 .3, if not also 2.2.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												docs: Split up limitations list, add warning

											
										
										
											6 years ago
+								* Only the ``sudo`` become method is available, however adding new methods is
 								  straightforward, and eventually at least ``su`` will be included.
-												docs: ansible.rst: note multi-host perf isn't great right now

											
										
										
											6 years ago
+								* The extension's performance benefits do not scale perfectly linearly with the
 								  number of targets. This is a subject of ongoing investigation and
 								  improvements will appear in time.
-												issue #106: docs: remove built-in only limitation :>

											
										
										
											6 years ago
+								* "Module Replacer" style modules are not yet supported. These rarely appear in
 								  practice, and light Github code searches failed to reveal many examples of
 								  them.
-												docs: new Ansible limitation, add new heading

Some differences are eventually likely to become permanent, because the
existing behaviour is unforgiveable.

											
										
										
											6 years ago
 								Behavioural Differences
 								-----------------------
-												issue #144: ansible: increase default pool size to 16.

											
										
										
											6 years ago
+								* Ansible permits up to ``forks`` SSH connections to be setup simultaneously,
 								  whereas in Mitogen this is handled by a thread pool. Eventually this pool
 								  will become per-CPU, but meanwhile, a maximum of 16 SSH connections may be
 								  established simultaneously by default. This can be increased or decreased
 								  setting the ``MITOGEN_POOL_SIZE`` environment variable.
-												docs: note the semantic difference in Mitogen vs. Ansible timeouts

Related to issue #141.

											
										
										
											6 years ago
+								* Mitogen treats connection timeouts for the SSH and become steps of a task
 								  invocation separately, meaning that in some circumstances the configured
 								  timeout may appear to be doubled. This is since Mitogen internally treats the
 								  creation of an SSH account context separately to the creation of a sudo
 								  account context proxied via that SSH account.
 								  A future revision may detect a sudo account context created immediately
 								  following its parent SSH account, and try to emulate Ansible's existing
 								  timeout semantics.
-												docs: update ansible risks/differences.

											
										
										
											6 years ago
+								* Local commands are executed in a reuseable Python interpreter created
 								  identically to interpreters used on remote hosts. At present only one such
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											6 years ago
+								  interpreter per ``become_user`` exists, and so only one local action may be
 								  executed simultaneously per local user account.
 								  Ansible usually permits up to ``ansible.cfg:forks`` simultaneous local
 								  actions. Any long-running local actions that execute for every target will
 								  experience artificial serialization, causing slowdown equivalent to
 								  `task_duration * num_targets`. This will be fixed soon.
-												docs: remove Ansible risk

											
										
										
											6 years ago
+								* Asynchronous jobs exist only for the duration of a run, and cannot be
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											6 years ago
+								  queried by subsequent ansible-playbook invocations. Since the ability to
 								  query job IDs across runs relied on an implementation detail, it is not
 								  expected this will break any real-world playbooks.
-												docs: update ansible risks/differences.

											
										
										
											6 years ago
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
+								How Modules Execute
 								-------------------
-												issue #106: docs: tidyup.

											
										
										
											6 years ago
+								Ansible usually modifies, recompresses and reuploads modules every time they
 								run on a target, work that must be repeated by the controller for every
 								playbook step.
 								With the extension any modifications are done on the target, allowing pristine
 								copies of modules to be cached, reducing the necessity to re-transfer modules
 								for each invocation. Unmodified modules are uploaded once on first use and
 								cached in RAM for the remainder of the run.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**Binary**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Native executables detected using a complex heuristic. Arguments are
 								    supplied as a JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**Module Replacer**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Python scripts detected by the presence of
 								    ``#<<INCLUDE_ANSIBLE_MODULE_COMMON>>`` appearing in their source. This type
 								    is not yet supported.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**New-Style**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Python scripts detected by the presence of ``from ansible.module_utils.``
 								    appearing in their source. Arguments are supplied as JSON written to
 								    ``sys.stdin`` of the target interpreter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**JSON_ARGS**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Detected by the presence of ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS`` appearing
 								    in the script source. The interpreter directive (``#!interpreter``) is
 								    adjusted to match the corresponding value of ``{{ansible_*_interpreter}}``
 								    if one is set. Arguments are supplied as JSON mixed into the script as a
 								    replacement for ``INCLUDE_ANSIBLE_MODULE_JSON_ARGS``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**WANT_JSON**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Detected by the presence of ``WANT_JSON`` appearing in the script source.
 								    The interpreter directive is adjusted as above. Arguments are supplied as a
 								    JSON file whose path is the sole script parameter.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
 								**Old Style**
-												docs: tidy up big list of bullets.

											
										
										
											6 years ago
+								    Files not matching any of the above tests. The interpreter directive is
 								    adjusted as above. Arguments are supplied as a file whose path is the sole
 								    script parameter. The format of the file is ``"key=repr(value)[
 								    key2=repr(value2)[ ..]] "``.
-												issue #106: docs: initial docs for how modules execute.

											
										
										
											6 years ago
-												docs: link to Ansible video demo

											
										
										
											6 years ago
+								Sample Profiles
 								---------------
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								Local VM connection
 								~~~~~~~~~~~~~~~~~~~
 								This demonstrates Mitogen vs. connection pipelining to a local VM, executing
-												examples: rename playbooks for clarity.

											
										
										
											6 years ago
+								the 100 simple repeated steps of ``run_hostname_100_times.yml`` from the
-												docs: more marketing, add lots of drama bold.

											
										
										
											6 years ago
+								examples directory. Mitogen requires **43x less bandwidth and 4.25x less
 								time**.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												examples: rename playbooks for clarity.

											
										
										
											6 years ago
+								.. image:: images/ansible/run_hostname_100_times.png
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								Kathmandu to Paris
 								~~~~~~~~~~~~~~~~~~
 								This is a full Django application playbook over a ~180ms link between Kathmandu
 								and Paris. Aside from large pauses where the host performs useful work, the
 								high latency of this link means Mitogen only manages a 1.7x speedup.
-												docs: tidy up ansible.rst

											
										
										
											6 years ago
+								Many early roundtrips are due to inefficiencies in Mitogen's importer that will
 								be fixed over time, however the majority, comprising at least 10 seconds, are
 								due to idling while the host's previous result and next command are in-flight
 								on the network.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								The initial extension lays groundwork for exciting structural changes to the
 								execution model: a future version will tackle latency head-on by delegating
-												docs: small ansible.rst updates

											
										
										
											6 years ago
+								some control flow to the target host, melding the performance and scalability
 								benefits of pull-based operation with the management simplicity of push-based
 								operation.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
-												examples: rename playbooks for clarity.

											
										
										
											6 years ago
+								.. image:: images/ansible/costapp.png
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								SSH Variables
 								-------------
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											6 years ago
+								Matching Ansible's existing model, these variables are treated on a per-task
 								basis, causing establishment of additional reuseable interpreters as necessary
 								to match the configuration of each task.
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
+								This list will grow as more missing pieces are discovered.
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											6 years ago
+								* ``ansible_ssh_timeout``
 								* ``ansible_host``, ``ansible_ssh_host``
 								* ``ansible_user``, ``ansible_ssh_user``
 								* ``ansible_port``, ``ssh_port``
 								* ``ansible_ssh_executable``, ``ssh_executable``
 								* ``ansible_ssh_private_key_file``
 								* ``ansible_ssh_pass``, ``ansible_password`` (default: assume passwordless)
 								* ``ssh_args``, ``ssh_common_args``, ``ssh_extra_args``
-												docs: initial Ansible extension docs.

											
										
										
											6 years ago
 								Sudo Variables
 								--------------
-												ansible: allow establishment of duplicate SSH connections

											
										
										
											6 years ago
+								* ``ansible_python_interpreter``
 								* ``ansible_sudo_exe``, ``ansible_become_exe``
 								* ``ansible_sudo_user``, ``ansible_become_user`` (default: ``root``)
 								* ``ansible_sudo_pass``, ``ansible_become_pass`` (default: assume passwordless)
 								* ``sudo_flags``, ``become_flags``
 								* ansible.cfg: ``timeout``
-												ansible: migrate logging variables into utils.

											
										
										
											6 years ago
-												issue #150: ansible: add basic Docker support.

											
										
										
											6 years ago
+								Docker Variables
 								----------------
 								Note: Docker support is only intended for developer testing, it might disappear
 								entirely prior to a stable release.
 								* ansible_host
-												Add link to IRC; closes #116

											
										
										
											6 years ago
+								Chat on IRC
 								-----------
-												docs: typo

											
										
										
											6 years ago
+								Some users and developers hang out on the
-												Add link to IRC; closes #116

											
										
										
											6 years ago
+								`#mitogen <https://webchat.freenode.net/?channels=mitogen>`_ channel on the
 								FreeNode IRC network.
-												ansible: migrate logging variables into utils.

											
										
										
											6 years ago
+								Debugging
 								---------
-												ansible: enable forking when requested and for async jobs.

Closes #105.
References #155.

mitogen/service.py:
    Refactor services to support individually exposed methods with
    different security policies for each method.

    - @mitogen.service.expose() to expose a method and set its policy
    - @mitogen.service.arg_spec() to validate input.
    - Require basic service message format to be a tuple of
      `(method, kwargs)`, where kwargs is always a dict.
    - Update DeduplicatingService to match the new scheme.

ansible_mitogen/connection.py:
    - Rename 'method' to 'method_name' to disambiguate it from the
      service.call()'s method= argument.

ansible_mitogen/planner.py:
    - Generate an ID for every job, sync or not, and fetch job results
      from JobResultService rather than via the initiating function
      call's return value.
    - Planner subclasses now get to select whether their Runner should
      run in a forked process. The base implementation requests this if
      the 'mitogen_isolation_mode=fork' task variable is present.

ansible_mitogen/runner.py:
    Teach runners to deliver their result via JobResultService executing
    in their indirect parent mux process.

ansible_mitogen/plugins/actions/mitogen_async_status.py:
    Split the implementation up into methods, and more compatibly
    emulate Ansible's existing output.

ansible_mitogen/process.py:
    Mux processes now host JobResultService.

ansible_mitogen/services.py:
    Update existing services to the new mitogen.service scheme, and
    implement JobResultService:

    * listen() method for synchronous jobs. planner.invoke() registers a
      Sender with the service prior to invoking the job, then sleeps
      waiting for the service to write the job result to the
      corresponding Receiver.

    * Non-blocking get() method for implementing mitogen_async_status
      action.

    * Child-accessible push() method for delivering task results.

ansible_mitogen/target.py:
    New helpers for spawning a virginal subprocess on startup, from
    which asynchronous and mitogen_task_isolation=fork jobs are forked.
    Necessary to avoid a task inheriting potentially
    polluted/monkey-patched parent environment, since remaining jobs
    continue to run in the original child process.

docs/ansible.rst:
    Add/merge/remove some behaviours/risks.

tests/ansible/integration:
    New tests for forking/async.

											
										
										
											6 years ago
+								Normally with Ansible, diagnostics and use of the :py:mod:`logging` package
 								output on the target machine are discarded. With Mitogen, all of this is
 								captured and returned to the host machine, where it can be viewed as desired
 								with ``-vvv``. Basic high level logs are produced with ``-vvv``, with logging
 								of all IO on the controller with ``-vvvv`` or higher.
 								Although use of standard IO and the logging package on the target is forwarded
 								to the controller, it is not possible to receive IO activity logs, as the
 								processs of receiving those logs would would itself generate IO activity. To
 								receive a complete trace of every process on every machine, file-based logging
 								is necessary. File-based logging can be enabled by setting
 								``MITOGEN_ROUTER_DEBUG=1`` in your environment.
-												docs: Ansible logging update (#111)

											
										
										
											6 years ago
 								When file-based logging is enabled, one file per context will be created on the
-												docs: So many typos

											
										
										
											6 years ago
+								local machine and every target machine, as ``/tmp/mitogen.<pid>.log``.
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
 								Implementation Notes
 								--------------------
 								Interpreter Reuse
 								~~~~~~~~~~~~~~~~~
 								The extension aggressively reuses the single target Python interpreter to
-												docs: document mitogen_task_isolation.

											
										
										
											6 years ago
+								execute every module. While this generally works well, it violates an unwritten
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
+								assumption regarding Ansible modules, and so it is possible a buggy module
 								could cause a run to fail, or for unrelated modules to interact with each other
-												docs: document mitogen_task_isolation.

											
										
										
											6 years ago
+								due to bad hygiene.
 								Before reporting a bug relating to a module behaving incorrectly, please re-run
 								your playbook with ``-e mitogen_task_isolation=fork`` to see if the problem
 								abates. This may also be set on a per-task basis:
 								::
 								    - name: My task.
 								      broken_module:
 								        some_option: true
 								      vars:
 								        mitogen_task_isolation: fork
 								If forking fixes your problem, **please report a bug regardless**, as an
 								internal list can be updated to prevent users bumping into the same problem in
 								future.
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
-												issue #159: initial context LRU implementation

Now Connection.close() *must* be called in the worker, to ensure the
reference count for a context drops correctly.

Remove 'discriminator' for now, I'm not using it for testing any more
and it complicated this code.

This code is a car crash, it needs rewritten again. Ideally some/most of
this behaviour could live on services.DeduplicatingService somehow, but
I couldn't come up with a sensible design.

											
										
										
											6 years ago
+								Interpreter Recycling
 								~~~~~~~~~~~~~~~~~~~~~
-												docs: typo

											
										
										
											6 years ago
+								The extension limits the number of persistent interpreters in use. When the
 								limit is reached, the youngest interpreter is terminated before starting a new
 								interpreter, preventing situations like below from triggering memory
-												issue #159: make LRU size configurable.

											
										
										
											6 years ago
+								exhaustion.
-												issue #159: initial context LRU implementation

Now Connection.close() *must* be called in the worker, to ensure the
reference count for a context drops correctly.

Remove 'discriminator' for now, I'm not using it for testing any more
and it complicated this code.

This code is a car crash, it needs rewritten again. Ideally some/most of
this behaviour could live on services.DeduplicatingService somehow, but
I couldn't come up with a sensible design.

											
										
										
											6 years ago
 								.. code-block:: yaml
 								    - hosts: corp_boxes
 								      vars:
 								        user_directory: [
 								          # 10,000 corporate user accounts
 								        ]
 								      tasks:
 								        - name: Create user bashrc
 								          become: true
 								          vars:
 								            ansible_become_user: "{{item}}"
 								          copy:
 								            src: bashrc
 								            dest: "~{{item}}/.bashrc"
 								          with_items: "{{user_directory}}"
-												issue #159: make LRU size configurable.

											
										
										
											6 years ago
+								This recycling does not occur for direct connections from the controller, and
 								it is keyed on a per-target basis, i.e. up to 20 interpreters may exist for
 								each directly connected target.
-												issue #159: initial context LRU implementation

Now Connection.close() *must* be called in the worker, to ensure the
reference count for a context drops correctly.

Remove 'discriminator' for now, I'm not using it for testing any more
and it complicated this code.

This code is a car crash, it needs rewritten again. Ideally some/most of
this behaviour could live on services.DeduplicatingService somehow, but
I couldn't come up with a sensible design.

											
										
										
											6 years ago
-												issue #159: make LRU size configurable.

											
										
										
											6 years ago
+								The youngest interpreter is chosen to preserve useful accounts, like "root" or
 								"postgresql" that tend to appear early in a run, however it is simple to
 								construct a playbook that defeats this strategy. A future version will key
 								interpreters on the identity of their creating task, file and/or playbook,
 								avoiding useful account recycling in every scenario.
 								To raise or lower the limit from 20, set the ``MITOGEN_MAX_INTERPRETERS``
 								environment variable to a new value.
-												issue #159: initial context LRU implementation

Now Connection.close() *must* be called in the worker, to ensure the
reference count for a context drops correctly.

Remove 'discriminator' for now, I'm not using it for testing any more
and it complicated this code.

This code is a car crash, it needs rewritten again. Ideally some/most of
this behaviour could live on services.DeduplicatingService somehow, but
I couldn't come up with a sensible design.

											
										
										
											6 years ago
-												docs: tidy ansible docs.

											
										
										
											6 years ago
+								Runtime Patches
 								~~~~~~~~~~~~~~~
 								Three small runtime patches are employed in ``strategy.py`` to hook into
 								desirable locations, in order to override uses of shell, the module executor,
 								and the mechanism for selecting a connection plug-in. While it is hoped the
 								patches can be avoided in future, for interesting versions of Ansible deployed
 								today this simply is not possible, and so they continue to be required.
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
-												ansible: doc updates

											
										
										
											6 years ago
+								The patches are concise and behave conservatively, including by disabling
 								themselves when non-Mitogen connections are in use. Additional third party
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
+								plug-ins are unlikely to attempt similar patches, so the risk to an established
 								configuration should be minimal.
-												docs: tidy ansible docs.

											
										
										
											6 years ago
-												issue #164: precisely emulate Ansible's stdio behaviour.

* Use identical logic to select when stdout/stderr are merged, so
  'stdout', 'stdout_lines', 'stderr', 'stderr_lines' contain the same
  output before/after the extension.

* When stdout/stderr are merged, synthesize carriage returns just like
  the TTY layer.

* Mimic the SSH connection multiplexing message on stderr. Not really
  for user code, but so compare_output_test.sh needs fewer fixups.

											
										
										
											6 years ago
+								Standard IO
 								~~~~~~~~~~~
 								Ansible uses pseudo TTYs for most invocations, to allow it to handle typing
 								passwords interactively, however it disables pseudo TTYs for certain commands
 								where standard input is required or ``sudo`` is not in use. Additionally when
 								SSH multiplexing is enabled, a string like ``Shared connection to localhost
 								closed\r\n`` appears in ``stderr`` of every invocation.
 								Mitogen does not naturally require either of these, as command output is
 								embedded within the SSH stream, and it can simply call :py:func:`pty.openpty`
 								in every location an interactive password must be typed.
 								A major downside to Ansible's behaviour is that ``stdout`` and ``stderr`` are
 								merged together into a single ``stdout`` variable, with carriage returns
 								inserted in the output by the TTY layer. However ugly, the extension emulates
 								all of this behaviour precisely, to avoid breaking playbooks that expect
 								certain text to appear in certain variables with certain linefeed characters.
 								See `Ansible#14377`_ for related discussion.
 								.. _Ansible#14377: https://github.com/ansible/ansible/issues/14377
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
+								Flag Emulation
 								~~~~~~~~~~~~~~
 								Mitogen re-parses ``sudo_flags``, ``become_flags``, and ``ssh_flags`` using
 								option parsers extracted from `sudo(1)` and `ssh(1)` in order to emulate their
 								equivalent semantics. This allows:
 								* robust support for common ``ansible.cfg`` tricks without reconfiguration,
 								  such as forwarding SSH agents across ``sudo`` invocations,
 								* reporting on conflicting flag combinations,
 								* reporting on unsupported flag combinations,
-												ansible: doc updates

											
										
										
											6 years ago
+								* internally special-casing certain behaviour (like recursive agent forwarding)
 								  without boring the user with the details,
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
+								* avoiding opening the extension up to untestable scenarios where users can
 								  insert arbitrary garbage between Mitogen and the components it integrates
-												docs: small fix

											
										
										
											6 years ago
+								  with,
-												ansible: limited support for become_flags, more docs.

											
										
										
											6 years ago
+								* precise emulation by an alternative implementation, for example if Mitogen
 								  grew support for Paramiko.