mitogen

Commit Graph

Author	SHA1	Message	Date
David Wilson	75d179e4b9	remove unused imports flagged by lgtm	5 years ago
David Wilson	eeb7150f24	issue #549 : increase open file limit automatically if possible While catching every possible case where "open file limit exceeded" is not possible, we can at least increase the soft limit to the available hard limit without any user effort. Do this in Ansible top-level process, even though we probably only need it in the MuxProcess. It seems there is no reason this could hurt	5 years ago
David Wilson	acab26d796	ansible: improve process.py docs	5 years ago
David Wilson	4dfbe82e76	tests: hide ugly error during Ansible tests	5 years ago
David Wilson	108015aa22	ansible: gracefully handle failure to connect to MuxProcess It's possible to hit an ugly exception during early CTRL+C	5 years ago
David Wilson	bf1f3682aa	ansible: pin per-CPU muxes to their corresponding CPU This slightly breaks the old scheme, in that CPU 1 may now end up with a mux and the top-level process pinned to it.	5 years ago
David Wilson	dc9f4e89e6	ansible: reap mux processes on shut down Previously we exited without calling waitpid(), which meant the top-level process struct rusage did not reflect the resource usage consumed by the multiplexer processes. Existing benchmarks are made using perf so this never created a problem, but it could be confusing to others using the "time" command, and also allows logging the final exit status of the process.	5 years ago
David Wilson	1fca0b7a94	[linear2] fix MuxProcess test fixture and some merge fallout	5 years ago
David Wilson	0f63ca4c68	Make setting affinity optional.	5 years ago
David Wilson	9035884c77	ansible: abstract worker process model. Move all details of broker/router setup out of connection.py, instead deferring it to a WorkerModel class exported by process.py via get_worker_model(). The running strategy can override the configured worker model via _get_worker_model(). ClassicWorkerModel is installed by default, which implements the extension's existing process model. Add optional support for the third party setproctitle module, so children have pretty names in ps output. Add optional support for per-CPU multiplexers to classic runs.	5 years ago
David Wilson	300f8b2ff9	ansible: fixturize creation of MuxProcess This relies on the previous commit resetting global variables. Update clean_shutdown() to handle duplicate calls, due to tests repeatedly installing it.	5 years ago
David Wilson	26b6333787	[stream-refactor] fix unix.Listener construction	5 years ago
David Wilson	7dacb68eeb	issue #552 : include process identity in log messages.	6 years ago
David Wilson	1f77d24bec	Update copyright year everywhere.	6 years ago
David Wilson	7ff4e6694c	issue #536 : rework how 2.3-compatible simplejson is served Regardless of the version of simplejson loaded in the master, load up the ModuleResponder cache with our 2.4-compatible version. To cope with simplejson being loaded due to modules like ec2_group that try to import it before importing 'json', also update target.py to remove it from the whitelist if a local 'json' module import succeeds.	6 years ago
David Wilson	05b1ccb658	ansible: stash PID files in CWD if requested for debugging.	6 years ago
David Wilson	eb67fbe9d2	ansible: double the default pool size. Tempted to push this up to 64, but let's do it incrementally just in case.	6 years ago
David Wilson	4531338b12	ansible: document and make affinity stuff portable to non-Linux Portable as in it does nothing, at least for now.	6 years ago
David Wilson	c6d5aa29ba	ansible: new multiplexer/workers configuration Following on from 152effc26c9a5918cb7ead7a97fe7fa7f81b6764, * Pin mux to CPU 0 * Pin top-level CPU 1 * Pin workers sequentially to CPU 2..n Nets 19.5% improvement on issue_140__thread_pileup.yml when targeting 64 Docker containers on the same 8 core/16 thread machine. Before (prior to last scheme, no affinity at all): 2294528.731458 task-clock (msec) # 6.443 CPUs utilized 10,429,745 context-switches # 0.005 M/sec 2,049,618 cpu-migrations # 0.893 K/sec 8,258,952 page-faults # 0.004 M/sec 5,532,719,253,824 cycles # 2.411 GHz (83.35%) 3,267,471,616,230 instructions # 0.59 insn per cycle # 1.22 stalled cycles per insn (83.35%) 662,006,455,943 branches # 288.515 M/sec (83.33%) 39,453,895,977 branch-misses # 5.96% of all branches (83.37%) 356.148064576 seconds time elapsed After: 2226463.958975 task-clock (msec) # 7.784 CPUs utilized 9,831,466 context-switches # 0.004 M/sec 180,065 cpu-migrations # 0.081 K/sec 5,082,278 page-faults # 0.002 M/sec 5,592,548,587,259 cycles # 2.512 GHz (83.35%) 3,135,038,855,414 instructions # 0.56 insn per cycle # 1.32 stalled cycles per insn (83.32%) 636,397,509,232 branches # 285.833 M/sec (83.30%) 39,135,441,790 branch-misses # 6.15% of all branches (83.35%) 286.036681644 seconds time elapsed	6 years ago
David Wilson	1b909e8697	ansible: pin connection multiplexer to a single core Nets a reliable 8% improvement in issue_140__thread_pileup.yml when targeting 64 Docker containers on the same 8 core/16 thread machine. Before: 2294528.731458 task-clock (msec) # 6.443 CPUs utilized 10,429,745 context-switches # 0.005 M/sec 2,049,618 cpu-migrations # 0.893 K/sec 8,258,952 page-faults # 0.004 M/sec 5,532,719,253,824 cycles # 2.411 GHz (83.35%) 4,001,276,805,120 stalled-cycles-frontend # 72.32% frontend cycles idle (83.30%) 2,024,159,442,463 stalled-cycles-backend # 36.59% backend cycles idle (66.65%) 3,267,471,616,230 instructions # 0.59 insn per cycle # 1.22 stalled cycles per insn (83.35%) 662,006,455,943 branches # 288.515 M/sec (83.33%) 39,453,895,977 branch-misses # 5.96% of all branches (83.37%) 356.148064576 seconds time elapsed After: 2208247.938562 task-clock (msec) # 6.735 CPUs utilized 8,489,840 context-switches # 0.004 M/sec 1,432,967 cpu-migrations # 0.649 K/sec 7,508,957 page-faults # 0.003 M/sec 5,477,293,750,357 cycles # 2.480 GHz (83.31%) 3,984,360,350,811 stalled-cycles-frontend # 72.74% frontend cycles idle (83.32%) 1,976,646,418,711 stalled-cycles-backend # 36.09% backend cycles idle (66.64%) 3,196,197,480,792 instructions # 0.58 insn per cycle # 1.25 stalled cycles per insn (83.36%) 648,247,332,967 branches # 293.557 M/sec (83.35%) 39,004,881,070 branch-misses # 6.02% of all branches (83.37%) 327.876903668 seconds time elapsed	6 years ago
David Wilson	84944a9a61	ansible: ensure MuxProcess MITOGEN_PROFILING results reach disk. This has been broken for quite some time.	6 years ago
David Wilson	be6ab52fe1	issue #488 : fix shutdown damage caused in `6ca2677de5` os._exit() subverted calm shutdown, meaning unix.Listener never had a chance to cleanup its socket. Move unix.Listener socket cleanup into its class so it is automatic during shutdown, rather than cutpasted for each consumer. Disable the watcher thread in the MuxProcess, it is useless. Add .sock extension to /tmp/mitogen_unix_*, so we can write a test.	6 years ago
David Wilson	dd30a907ce	issue #477 : promote setup_gil() to mitogen.utils This is since ansible_mitogen/process.py is 2.6-only, and I want to use setup_gil() in 2.4 code.	6 years ago
David Wilson	a48ee3a536	issue #477 : vendorize the last 2.4-compatible simplejson This is in part so image_prep can run against an ancient CentOS 5 image without any upfront help, and in part simply because it's very easy to support.	6 years ago
David Wilson	59dd0dc814	issue #477 : serve up junk ansible/__init__.py just like Ansible.	6 years ago
David Wilson	6ca2677de5	ansible: fix test failure during process exit. ====================================================================== ERROR: tests.connection_test (unittest2.loader._FailedTest) ---------------------------------------------------------------------- Traceback (most recent call last): ImportError: Failed to import test module: tests.connection_test Traceback (most recent call last): File "/home/dmw/src/mitogen/.venv/local/lib/python2.7/site-packages/unittest2/loader.py", line 456, in _find_test_path module = self._get_module_from_name(name) File "/home/dmw/src/mitogen/.venv/local/lib/python2.7/site-packages/unittest2/loader.py", line 395, in _get_module_from_name __import__(name) RuntimeError: not holding the import lock	6 years ago
David Wilson	4bdf60326c	issue #424 : ansible: make put_file() raise AnsibleFileNotFound	6 years ago
David Wilson	e647adc62e	ansible: copy GIL change from linear2 branch. Reduces runtime by 25% given 100 25ms SSH targets: ANSIBLE_STRATEGY=mitogen \ MITOGEN_POOL_SIZE=100 \ /usr/bin/time -l ansible k3-x100 -m shell -a hostname Before: 39.56 real 35.29 user 17.24 sys 59600896 maximum resident set size 1784252 page reclaims 9016 messages sent 10382 messages received 18774 voluntary context switches 770070 involuntary context switches After: 29.79 real 22.10 user 11.77 sys 59281408 maximum resident set size 1725268 page reclaims 8582 messages sent 9959 messages received 14582 voluntary context switches 75280 involuntary context switches	6 years ago
David Wilson	2647f73501	ansible: bump UNIX listener default backlog, and set it to match forks. The connection multiplexer can expect to not be scheduled at least until each of the $forks worker processes has attempted a connection, so the backlog must be able to hold every worker.	6 years ago
David Wilson	8ab11f415f	ansible: better support for diagnosing hangs * Always enable the faulthandler module in the top-level process if it is available. * Make MITOGEN_DUMP_THREAD_STACKS interval configurable, to better handle larger runs. * Add docs subsection on diagnosing hangs. Conflicts: ansible_mitogen/process.py	6 years ago
David Wilson	e18396d54d	ansible: enable profiling by default! Thankfully this never made it into a release	6 years ago
David Wilson	9e572a7939	ansible: fix duplicate MuxProcess socket write. The while: loop was necessary due to some cutpaste further on down the file.	6 years ago
David Wilson	053c594d65	ansible: prevent logs spamming user console on exit. Closes #331.	6 years ago
David Wilson	5c573f7fcb	ansible: insert short sleep when MITOGEN_PROFILING active. Hacky, but works fine.	6 years ago
David Wilson	d8e0c9e12c	issue #297 : local commands must execute with WorkerProcess environment.	6 years ago
David Wilson	410016ff47	Initial Python 3.x port work. * ansible: use unicode_literals everywhere since it only needs to be compatible back to 2.6. * compat/collections.py: delete this entirely and rip out the parts of functools that require it. * Introduce serializable Kwargs dict subclass that translates keys to Unicode on instantiation. * enable_debug_logging() must set _v/_vv globals. * cStringIO does not exist in 3.x. * Treat IOLogger and LogForwarder input as latin-1. * Avoid ResourceWarnings in first stage by explicitly closing fps. * Fix preamble_size.py syntax errors.	6 years ago
David Wilson	6377f2d69c	issue #257 : split pool shutdown and join.	7 years ago
David Wilson	d33ef1866e	ansible: wrap socket calls in io_op() Breaks under signal stress test.	7 years ago
David Wilson	e35694acd5	ansible: flake8 fixes.	7 years ago
David Wilson	caffaa79f7	issue #186 : rework async/forked tasks again. The controller must know the ID of the forked child in order to propagate dependencies to it, so forking+starting the module run cannot happen entirely on the target, without some additional mechanism to wait-and-repropagate the deps as they arrive on the target. Rework things so that init_child() also handles starting the fork parent, and returns it along with the context's home directory in a single round trip. Now master knows the identity of the fork parent, it can directly create fork children and call run_module_async() in them. This necessitates 2 roundtrips to start an asynchronous task. This whole thing sucks and entirely needs simplified, but for now things almost work, so keeping it. connection.py: * Expect ContextService to return the entire dict return value of init_child(). Store the fork_contxt from the return value. planner.py: * Rework Planner to store the invocation as an instance attribute, to simplify method calls. * Add Planner.get_push_files() and Planner.get_module_deps(). * Add _propagate_deps() which takes a Planner and ensures the deps it describes are sent to a (non forked or forked) context. * Move async task logic out of target.py and into invoke() / _invoke_(). process.py: Services no longer need references to each other. planner.py handles sending module deps with one extra RPC. services.py: * Return "init_child_result" key instead of simple "home_dir" key. * Get rid of dep propagation from ModuleDepService, it lives in planner.py now. target.py: * Get rid of async task start logic, lives in planner.py now.	7 years ago
David Wilson	569c12a2d6	ansible: use PushFileService for module deps. planner.py: * Rather than grant FileService access to a file for children, use PushFileService to trigger deduplicating send of the file through the hierarchy immediately. * Send the complete list of Ansible module imports to the target so runner.py knows which files and scripts must be loaded via PushFileService prior to detaching. runner.py: * Teach NewStyleRunner to use the full module map to block until everything is loaded prior to detach(). target.py: * Delete old _get_file(), replace get_file() with get_small_file() which uses PushFileService instead. Closes #186	7 years ago
David Wilson	daa9cfd0a8	ansible: MITOGEN_DUMP_THREAD_STACKS for mux process too	7 years ago
David Wilson	d9087c510b	ansible: move FileService into mitogen.service.	7 years ago
David Wilson	30034877a5	issue #217 : ansible: working, if extremely inefficient implementation	7 years ago
David Wilson	81b62d9a1a	issue #217 : ansible: beginnings of ModuleDepService.	7 years ago
David Wilson	6edb3f165d	ansible: avoid a race during shutdown.	7 years ago
David Wilson	85e1f5f515	ansible: remove JobResultService, more compatible async jobs; closes #191 . And by "compatible" I mean "terrible". This does not implement async job timeouts, but I'm not going to bother, upstream async implementation is so buggy and inconsistent it resists even having its behaviour captured in tests.	7 years ago
David Wilson	3613162bc0	ansible: enable forking when requested and for async jobs. Closes #105. References #155. mitogen/service.py: Refactor services to support individually exposed methods with different security policies for each method. - @mitogen.service.expose() to expose a method and set its policy - @mitogen.service.arg_spec() to validate input. - Require basic service message format to be a tuple of `(method, kwargs)`, where kwargs is always a dict. - Update DeduplicatingService to match the new scheme. ansible_mitogen/connection.py: - Rename 'method' to 'method_name' to disambiguate it from the service.call()'s method= argument. ansible_mitogen/planner.py: - Generate an ID for every job, sync or not, and fetch job results from JobResultService rather than via the initiating function call's return value. - Planner subclasses now get to select whether their Runner should run in a forked process. The base implementation requests this if the 'mitogen_isolation_mode=fork' task variable is present. ansible_mitogen/runner.py: Teach runners to deliver their result via JobResultService executing in their indirect parent mux process. ansible_mitogen/plugins/actions/mitogen_async_status.py: Split the implementation up into methods, and more compatibly emulate Ansible's existing output. ansible_mitogen/process.py: Mux processes now host JobResultService. ansible_mitogen/services.py: Update existing services to the new mitogen.service scheme, and implement JobResultService: * listen() method for synchronous jobs. planner.invoke() registers a Sender with the service prior to invoking the job, then sleeps waiting for the service to write the job result to the corresponding Receiver. * Non-blocking get() method for implementing mitogen_async_status action. * Child-accessible push() method for delivering task results. ansible_mitogen/target.py: New helpers for spawning a virginal subprocess on startup, from which asynchronous and mitogen_task_isolation=fork jobs are forked. Necessary to avoid a task inheriting potentially polluted/monkey-patched parent environment, since remaining jobs continue to run in the original child process. docs/ansible.rst: Add/merge/remove some behaviours/risks. tests/ansible/integration: New tests for forking/async.	7 years ago
David Wilson	0dd5e04eae	issue #106 : partially working BinaryRunner/Planner. Refactor planner.py to look a lot more like runner.py. This 'structural cutpaste' looks messy -- probably we can simplify this code, even though it's pretty simple already.	7 years ago
David Wilson	1ff27ada49	Add maximum message size checks. Closes #151 .	7 years ago


56 Commits (cd2689af0a2a58f748c6f1ac4606fac292063428)
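
Commit eeb7150f24 (issue #549) describes raising the soft open-file limit to the available hard limit in the top-level process. A minimal sketch of that idea follows, assuming Python's standard `resource` module on a Unix-like system. The function name `raise_open_file_limit` is hypothetical and this is not the code from the commit itself.

```python
import resource


def raise_open_file_limit():
    """Try to raise the soft RLIMIT_NOFILE to the hard limit.

    Hypothetical helper for illustration only. Returns the (soft, hard)
    pair in effect after the attempt.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    if soft == hard:
        # Nothing to do: the soft limit already matches the hard limit.
        return soft, hard
    try:
        # An unprivileged process may raise its soft limit up to the hard limit.
        resource.setrlimit(resource.RLIMIT_NOFILE, (hard, hard))
    except (ValueError, OSError):
        # Some platforms reject the requested value (for example when the
        # hard limit is reported as unlimited); keep the existing limit.
        pass
    return resource.getrlimit(resource.RLIMIT_NOFILE)


if __name__ == "__main__":
    print(raise_open_file_limit())
```

Failures are swallowed here because, as the commit message notes, the adjustment is a best-effort convenience rather than a guarantee.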