Implication: the entire message remains buffered until its last byte is
transmitted. Not spending more time on it, since pending work like
issue #6 may eliminate these problems on the transmit path entirely.
Rather than slowly building up a Python string over time, we store a
deque of chunks (which, as of a later commit, are around 128KB each)
and track the total buffer size in a separate integer.
The tricky loop is there to ensure the header does not need to be sliced
off the full message (which may be huge, causing yet another spike and
copy), but rather only off the much smaller first 128KB-sized chunk
received.
There is one more problem with this code: the ''.join() causes RAM usage
to temporarily double, but that was true of the old solution too. I'll
wait for bug reports before fixing this, as the fix gets very ugly very fast.
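A minimal sketch (assumed names, not the project's actual code) of the
receive buffering described above: chunks accumulate in a deque alongside a
separate running total, and extracting the header only ever slices the small
first chunk rather than the whole message.

    import collections

    class ReceiveBuffer(object):
        def __init__(self):
            self._chunks = collections.deque()  # received chunks, ~128KB each
            self._len = 0                       # total bytes buffered

        def feed(self, chunk):
            self._chunks.append(chunk)
            self._len += len(chunk)

        def take(self, n):
            # Remove and return the first n bytes. Only the first chunk is
            # ever sliced, so pulling a small header off a huge buffered
            # message never copies the whole thing.
            assert n <= self._len
            out = []
            while n:
                chunk = self._chunks.popleft()
                if len(chunk) > n:
                    out.append(chunk[:n])
                    self._chunks.appendleft(chunk[n:])
                    self._len -= n
                    n = 0
                else:
                    out.append(chunk)
                    self._len -= len(chunk)
                    n -= len(chunk)
            return b''.join(out)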
There is no penalty for passing as much data to the OS as possible: it
is not copied, and for a non-blocking socket the OS will simply buffer
as much as it can and tell us how much that was.
Also avoids a rather pointless string slice.
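A minimal sketch (assumed names) of that transmit idea: hand the entire
outgoing buffer to a non-blocking fd and let the return value say how much
the kernel accepted, instead of pre-slicing a fixed amount first.

    import os

    def write_some(fd, buf):
        # On a non-blocking fd, os.write() accepts as much as the kernel can
        # buffer and returns that count; the caller keeps the remainder for
        # the next IO loop iteration.
        written = os.write(fd, buf)
        return buf[written:]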
Reduces the number of IO loop iterations required to receive large
messages at a small cost to RAM usage.
Note that when calling read() with a large buffer value like this,
Python must zero-allocate that much RAM. In other words, for even a
single byte received, 128KB of RAM might need to be written.
Consequently CHUNK_SIZE is quite a sensitive value and this might need
further tuning.
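A sketch of the receive call in question, with CHUNK_SIZE as an assumed name
for the constant being tuned:

    import os

    CHUNK_SIZE = 131072  # ~128KB: fewer loop iterations, but a buffer this
                         # large is allocated even if only one byte arrived

    def read_chunk(fd):
        return os.read(fd, CHUNK_SIZE)  # returns b'' at EOF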
There is some insane unidentifiable Mitogen context (the local context?)
that instantly crashes with a higher forks setting. It appears to be
harmless, but it obviously shouldn't be happening.
This actually addresses multiple problems:
* Single-file programs were broken, since the fix introduced in
6931cc10c4 caused builtin_find_module()
to start indicating __main__ can always be loaded locally. That's
broken, and there might be more cases where the same problem will crop
up.
Since __main__ was indicated as loadable locally, the built-in import
machinery was allowed to attempt the load (we remove __main__ from
sys.modules during bootstrap), which caused a safety check to fire in
the bowels of Python:
"Cannot re-init internal module %.200s"
* The check for presence of the whitelist was totally broken, since the
whitelist is never an empty list. Therefore 'self' was being returned
for every module, including extension modules like 'termios'; see the
sketch below.
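A hypothetical illustration (names invented, not Mitogen's actual code) of
the pitfall described in that bullet: a truthiness test on a list that is
never empty cannot distinguish "no whitelist configured" from "whitelist
configured", so the finder claims every module.

    class BrokenFinder(object):
        def __init__(self, whitelist=('ansible',)):
            # The whitelist always contains at least one entry.
            self.whitelist = list(whitelist)

        def find_module(self, fullname, path=None):
            if self.whitelist:   # always true, so the intended check never fires
                return self      # claims 'termios' and every other module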
I have hand-verified this does not break the fix for issue #113. I
looked at writing a test for that, but it requires a Docker container
(or similar) with an ancient version of Ansible installed. Will open a
separate ticket tracking this.
Might want to de-overload the meaning of whitelist in future, but in
the meantime it works fine for Ansible and I can't think of a
whitelisting use case that would break because of it.
Closes #114.
Amazed this one managed to scrape through for so long. Calling
__import__ from within find_module() was causing the target module, in
this case cookielib, to be loaded *then overwritten* by a subsequent
duplicate load higher in the stack.
The result is that cookielib was loaded twice, and, per usual Python
import semantics, a reference to the partially initialized first
cookielib was installed in sys.modules while its code executed.
At the end of cookielib on 2.x, it imports _LWPCookieJar, which in turn
imports the partially built cookielib from sys.modules, then subclasses
the CookieJar from /that/ module.
Everything is wonderful. Then the call returns back up into the import
mechanism which restarts the entire process -- only this time,
_LWPCookieJar is /not/ reinitialized, so the copy in sys.modules is
still left with types pointing at the old module!
So the duplicate import creates a new CookieJar which is not the base
class of LWPCookieJar. Tada! 3 hours debugging.
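A self-contained sketch (not the affected code) of why the duplicate load has
this effect: executing the same module body twice yields two distinct class
objects, so a subclass bound against the first copy no longer matches the one
now visible in sys.modules.

    import types

    SOURCE = "class CookieJar(object):\n    pass\n"

    # First load: the helper module subclasses CookieJar from this copy,
    # just as _LWPCookieJar did with the partially initialized cookielib.
    first = types.ModuleType('fakecookielib')
    exec(SOURCE, first.__dict__)

    class LWPCookieJar(first.CookieJar):
        pass

    # Duplicate load: the module body runs again, producing a new CookieJar.
    second = types.ModuleType('fakecookielib')
    exec(SOURCE, second.__dict__)

    print(issubclass(LWPCookieJar, first.CookieJar))   # True
    print(issubclass(LWPCookieJar, second.CookieJar))  # False: the observed bug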
This is probably a performance fix in disguise; I didn't realize things
were so broken. It may also be a regression elsewhere. I urgently need
to finish the tests.
For the 52 submodules of ansible.modules.system, this produced a 1602
byte pkg_present list. After stripping it becomes 406 bytes, and the
entire LOAD_MODULE size drops from 1988 bytes to 792 bytes (-60%).
For the 68 submodules of ansible.module_utils, 1902 bytes pkg_present
becomes 474 bytes (-75%), and LOAD_MODULE size drops from 2867 bytes to
1439 bytes (-49%).
In a simple test running Ansible's "setup" module followed by its "apt"
module, wire bytes sent drops from 140,357 to 135,531 (-3.4%).
It looks ugly as sin, but this nets about a 20% drop in user CPU time,
and close to 15% increase in throughput.
The average log call is around 10 opcodes; prefixing it with '_v and'
costs an extra 2, but both are simple operations, and the remaining 10
are skipped entirely when _v or _vv is False.
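A minimal sketch of the gating pattern being described, assuming module-level
booleans named _v and _vv: the short-circuiting 'and' means argument
formatting and logger dispatch are skipped entirely unless verbose logging is
enabled.

    import logging

    LOG = logging.getLogger('app')
    _v = False    # verbose logging enabled?
    _vv = False   # very verbose (IO-level) logging enabled?

    def on_message(msg_id, data):
        _v and LOG.debug('processing message %d', msg_id)
        _vv and LOG.debug('payload: %r', data)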
Turns out it is far too easy to burn through available file descriptors,
so try something else: self-pipes are per thread, and only temporarily
associated with a Latch that wishes to sleep.
Reduce pointless locking by giving Latch its own queue, and removing
Queue.Queue() use in some places.
Temporarily undo the merging of Waker and Latch; let's do this one step
at a time.
On Python 2.x, operations on pthread objects with a timeout set actually
cause internal polling. When polling fails to yield a positive result,
it quickly backs off to a 50ms loop, which results in a huge amount of
latency throughout.
Instead, give up using Queue.Queue.get(timeout=...) and replace it with
the UNIX self-pipe trick. Knocks another 45% off my.yml in the Ansible
examples directory against a local VM.
This has the potential to burn a *lot* of file descriptors, but hell,
it's not the 1940s any more, RAM is all but infinite. I can live with
that.
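A minimal sketch (not Mitogen's Latch) of the self-pipe trick mentioned
above: the sleeper blocks in select() on the read end of a pipe instead of in
Queue.get(timeout=...), so a waker can interrupt it immediately by writing a
single byte.

    import os
    import select

    class PipeLatch(object):
        def __init__(self):
            self._rfd, self._wfd = os.pipe()

        def wake(self):
            # One byte on the pipe makes any pending select() return at once.
            os.write(self._wfd, b'\x00')

        def wait(self, timeout=None):
            # Blocks in the kernel rather than in a 50ms userspace poll loop.
            rlist, _, _ = select.select([self._rfd], [], [], timeout)
            if not rlist:
                return False   # timed out
            os.read(self._rfd, 1)
            return True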
This gets things down to around 75ms per playbook step, still hunting
for additional sources of latency.
Fix a MyPy warning by only passing lists to select.select(). At least on
Python 2.x, select.select() was internally converting the sets to lists
anyway.
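A minimal sketch of the call as described, with readers/writers standing in
for whatever fd collections the IO loop tracks:

    import select

    def wait_for_io(readers, writers, timeout):
        # Pass plain lists: satisfies the type stubs, and on 2.x the C
        # implementation converted other iterables to lists anyway.
        return select.select(list(readers), list(writers), [], timeout)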
By the time lists become inefficient here, it is likely that
select.select() itself will also be inefficient and will need to be
replaced with .poll() or similar.
No discernible performance difference when transferring django.db.models
to a local VM.
* SIGTERM safety net prevents profiler from writing results, so disable
it when profiling is active.
* Fix a warning corrupting the stream when profiling=True.
Previously we'd send just None in the GET_MODULE reply, but now that
there is no single request-reply structure, we must include the fullname
in the LOAD_MODULE response and set all of its data fields to None to
indicate the same.
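A hypothetical illustration of such a negative reply; the field names and
tuple layout here are assumed for the example, not the actual wire format:

    def load_module_absent(fullname):
        # fullname identifies which module the reply concerns; every data
        # field is None to signal that the module cannot be supplied.
        pkg_present = None
        path = None
        source = None
        related = None
        return (fullname, pkg_present, path, source, related)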
* Don't implement the rules for when preloading occurs yet
* Don't attempt to streamily preload modules downstream while this
context hasn't yet received the final module. There is quite
significant latency buried in here, but for now it's a lot of work to
fix.
This works well enough to handle at least the mitogen package, but it's
likely broken for anything bigger.
It seems gevent automatically sets non-blocking behaviour on fds produced by
the socket module, which causes the Python process we fork to fail
horribly. So in the child, always reset the blocking flag.
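A minimal sketch (assumed helper, not the exact code) of restoring blocking
behaviour on an inherited fd in the forked child:

    import fcntl
    import os

    def set_block(fd):
        # Clear O_NONBLOCK so the child sees ordinary blocking semantics.
        flags = fcntl.fcntl(fd, fcntl.F_GETFL)
        fcntl.fcntl(fd, fcntl.F_SETFL, flags & ~os.O_NONBLOCK)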