[zeromq-dev] zmq-signaler-port-sync
Pau
pau at teleopsia.com
Mon Jan 21 18:23:58 CET 2013
Hi,
I am using (not yet in production) ZMQ on Windows and I have found what
I think is a big problem for Windows users.
We use WXP and W7 and Visual C++ different versions. ZMQ version 3.2.0
(as far as I see the same problem happens in 3.2.2)
I do not fully understand ZMQ internals but I've seen that every time a
socket is created the function make_fdpair(..) is called and in
signaler.cpp(line244) a system event "zmq-signaler-port-sync" is created.
This event is used as a system-wide critical section and, so all
applications that try to create an event will WaitForSingleObject (sync,
INFINITE) until SetEvent (...) is called.
The problem is that the code between:
HANDLE sync = CreateEvent (NULL, FALSE, TRUE, TEXT
("zmq-signaler-port-sync"));
and
SetEvent (sync);
is full of wsa_asserts(..) that will terminate the application if
something goes wrong.
It is clear that terminating the application not leaving the system-wide
critical section is a bad idea because all applications in the system
will hang and you will have to stop all them to start again.
I understand that no errors should happen but anyway to escape from the
error is not a good idea in this case.
I do not know all possible reasons to generate a fatal wsa_assert(..)
but there is at least one:
I have seen that in XP it is possible that line 301 rc = connect (*w_,
(sockaddr *) &addr, sizeof (addr)); returns an error when a socket tries
to connect to 5905 and this has happened many times.
Windows uses port numbers starting near 1400 and XP has a limit at 5000.
In W7 this does not look as a problem because maximum is 65000
It sounds as if the number was big enough but apart from the fact that
ZMQ uses a big number of connections (at least in my tests) I have
experienced that Windows jumps port numbers by 7, so 5000 happens
sometimes with catastrophic consequences.
best,
Pau Ceano
More information about the zeromq-dev
mailing list