15:00:06 #startmeeting mistral 15:00:07 Meeting started Mon Oct 1 15:00:06 2018 UTC and is due to finish in 60 minutes. The chair is d0ugal. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:08 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:11 The meeting name has been set to 'mistral' 15:00:22 Hey everyone. It's the Monday office hour 15:00:25 Who is around? 15:00:40 I am back and rested after 2 weeks vaction - so send me your review requests and other questions :) 15:03:08 d0ugal: That RPC bug was interesting 15:03:17 Also, welcome back :) 15:03:21 therve: Thanks! 15:03:33 o/ 15:03:37 therve: Yeah, I gotta admit I didn't fully understand the fix 15:03:44 therve: must have been hard to track down? 15:03:47 d0ugal: I think there is an issue with running in a WSGI container and crons 15:03:52 ah 15:04:02 d0ugal: Yeah a bit, easier once we got a proper trace 15:04:12 Still took a good day to reproduce it properly 15:04:41 d0ugal: https://github.com/openstack/mistral/blob/master/mistral/api/app.py#L58 15:04:46 That is quite quick by openstack standards :) 15:05:08 I think that todo is critical if you don't use mistral-server 15:05:37 Right, makes sense 15:05:46 Running it in the API process always felt like a massive hack 15:06:50 d0ugal: Which brings the related remark: crons aren't probably tested by the devstack jobs 15:07:02 And I'm not sure by the tripleo jobs either 15:08:07 We have discussed re-writing the cron trigger subsytem a few times. There is some hope it should be easy to do on the new scheduler work rakhmerov has been working on 15:08:37 PING rakhmerov, apetrich, bobh, mcdoker181818, akovi, hardikjasani (Sorry, forgot to do the pings at the start of the office hour) 15:09:42 Dougal Matthews proposed openstack/mistral stable/rocky: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606969 15:09:55 Dougal Matthews proposed openstack/mistral stable/queens: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606970 15:10:04 Dougal Matthews proposed openstack/mistral stable/pike: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606971 15:16:17 Once these merge I am going to propose rocky, queens and pike releases. 15:17:23 We got a few new bugs while I was away, so I'm going to do some triage: https://bugs.launchpad.net/mistral/+bugs?search=Search&field.status=New&orderby=id&start=0 15:19:21 https://bugs.launchpad.net/mistral/+bug/1795068 15:19:21 Launchpad bug 1795068 in Mistral "screen-mistral-engine.txt size is causing logstash index OOM" [Undecided,New] 15:19:27 Damn, our logs are too verbose :) 17:20:59 Merged openstack/mistral stable/queens: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606970 17:52:43 Merged openstack/mistral stable/rocky: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606969 17:52:43 Merged openstack/mistral stable/pike: Add a release note for Ic98e2db02abd8483591756d73e06784cc2e9cbe3 https://review.openstack.org/606971 06:32:30 Merged openstack/mistral-lib master: Removed older version of python added 3.5 https://review.openstack.org/606340 11:13:17 Dougal Matthews proposed openstack/mistral master: Use eventlet-aware threading events https://review.openstack.org/557487 12:17:32 d0ugal: Here to chat about that event patch? 12:17:50 therve: Yeah? What do you make of it? 12:19:34 d0ugal: I have the symptoms, but I don't think I have the conditions of the fix 12:19:46 Right 12:19:47 d0ugal: It should only happen when running kombu right? 12:19:57 I'm still working my way through the bugzilla discussion 12:20:53 I have the issue right now on a deployed machine. 12:21:01 It's 100% cpu, doing nothing but epoll_wait 12:21:27 Currently it's the executor, but engine and event-engine had the same issue earlier on 12:23:08 ouch 12:23:35 therve: What do you mean by "I don't think I have the conditions of the fix" 12:23:50 d0ugal: The fix seems to be specific to the kombu server? 12:23:59 Right 12:24:10 tripleo uses the regular oslo backend 12:24:34 Right, so wouldn't that be fixed in oslo-messaging? 12:25:04 If it was the culprit, yes, but I don't think it is 12:25:17 Otherwise other services would be affected 12:26:47 right 12:26:49 Good point 12:28:09 Arf 12:28:12 Easy to reproduce 12:28:21 d0ugal: Another SIGHUP issue 12:29:16 therve: Fun. 12:29:21 How do you reproduce it? 12:29:39 Just kill -HUP the engine or the executor 12:30:56 Hum doesn't work the second time obviously :D 12:31:12 Oh it does 12:35:57 lol 16:23:17 Thomas Herve proposed openstack/mistral master: Wait for rpc server on shutdown https://review.openstack.org/607306 16:23:29 d0ugal: ^^ if you're still around 16:23:46 therve: just leaving, I'll check it out tomorrow. 16:23:55 np 16:24:44 thrash|bbl maybe when you come back 16:24:47 * therve away 18:59:42 therve: wassup? 19:20:32 thrash: Was looking at another bug where mistral services were using 100% cpus 19:20:43 thrash: I think https://review.openstack.org/607306 fixes it 19:21:12 therve: awesome! 19:22:20 Still related to SIGHUP handling... 08:12:40 d0ugal: hi, is it a problem to backport a change if it adds a new config option (say with a default value that keeps it backward compatible)? 10:25:31 d0ugal: :) Please let me know when you can 10:25:54 maybe we can ask for an exception like we did before 10:26:10 I may potentially need to backport such a change 10:28:22 rakhmerov: I think it is okay if there is a default and means no user action is required 10:28:46 yeah 10:28:47 ok 10:29:02 anyway, I'll ask to support me, if needed ;) 10:52:01 rakhmerov: sure 20:35:27 Oleg Ovcharuk proposed openstack/mistral master: Add started_at and finished_at to task execution. https://review.openstack.org/607703 04:51:07 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 05:39:45 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 07:40:53 Oleg Ovcharuk proposed openstack/mistral master: Add started_at and finished_at to task execution. https://review.openstack.org/607703 07:58:40 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 08:04:38 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 08:06:39 therve: Hey - should we also backport this? https://review.openstack.org/#/c/607306/ 08:07:01 d0ugal: Depends what happens around log rotation... 08:07:10 If you want/need SIGHUP to work, yes :) 08:07:42 I guess I do :) 08:07:48 therve: would you mind adding a release note? 08:08:19 Nope! 08:08:25 Thanks 08:10:15 Thomas Herve proposed openstack/mistral master: Wait for rpc server on shutdown https://review.openstack.org/607306 08:10:36 d0ugal: We have various issues around SIGHUP handing in Heat as well 08:10:53 oh, interesting. I'll take a look 08:10:55 I wonder if we should consider changing the behavior imposed by oslo.service 08:11:15 Restarting all the services to reload log/conf files seem dumb to me 08:11:23 Agreed 08:11:52 but I can see why it was done - it was probably the easiest fix. 08:11:56 I know long-running heat stacks are broken, I wonder about workflows 08:12:09 How long is long-running? 08:12:21 In case of heat, hours 08:12:46 I don't think we have any workflows that run that long (because of keystone token issues) 08:12:46 There is just no way that you're going to "wait" for completion in this case 08:12:53 but rakhmerov may have some in a non-keystone setup 08:13:10 Even 20 minutes 08:13:28 oh, we have that for sure with introspection 08:14:00 If you have a SIGHUP at say 10 minutes into 20, what do you do? 08:14:41 ¯\_(ツ)_/¯ 08:15:13 d0ugal: hey 08:15:17 what's the question? 08:16:01 yes, we have workflows running for days 08:16:13 in a non-keystone env, right 08:16:31 rakhmerov: Not sure we have a specific question, but therve is investigating SIGHUP issues. Do you use the oslo or kombu rpc backend? 08:16:47 oslo 08:17:01 I would not recommend to use Kombu RPC, at least under load 08:17:13 it has a number of issues actually 08:17:14 Good to know. 08:17:17 yeah 08:17:17 rakhmerov: https://review.openstack.org/#/c/607306/ 08:17:24 We use oslo 08:20:44 haah.. 08:20:48 tricky thing 08:48:04 hello 08:48:34 I have a small question related to the plugin system 08:49:04 this is more related to setuptools actually but since it used in mistral to extend the actions possible I figured I'd ask you :D 08:50:08 can I have mistral installed in the normal sys.path via my distribution's packages and register an action plugin deployed somewhere else in a venv? 08:50:39 mistral would be run without a venv and resides in /usr/lib/python.... 08:51:11 whereas the code of the custom action would be totally outside the sys.path, in a venv somewhere 08:57:20 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 09:21:31 pgaxatte: Well mistral needs to be able to import your code, right? 09:21:42 If it's in a different venv, that doesn't sound doable 09:22:05 therve: that's what I thought so it raises a problem to me 09:22:31 because maybe I want to use a different version of some lib in a specific action 09:22:54 Yeah that won't work 09:23:24 and I don't want to have to know in advance the dependencies of all my custom actions so that I can build a mistral in a venv with everything it needs 09:25:05 if I know in advance WHERE my plugins venv will be, can't I extend mistral's sys.path before running it? 09:26:09 Right, but you'll use the same version of the lib everywhere then? 09:27:44 the different version of the lib was an hypothetical question, I'd probably need additionnal packages rather than different versions of them 09:33:40 OK that's a totally different issue though 09:54:12 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 10:23:18 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 11:24:13 98k proposed openstack/python-mistralclient master: build universal wheels https://review.openstack.org/607915 11:59:52 Oleg Ovcharuk proposed openstack/mistral master: Add started_at and finished_at to task execution. https://review.openstack.org/607703 13:35:05 Andras Kovi proposed openstack/mistral master: Fix state change propagation in workflows https://review.openstack.org/607960 14:35:13 Brad P. Crochet proposed openstack/mistral master: Use SessionClient for Ironic actions https://review.openstack.org/607974 20:51:00 Oleg Ovcharuk proposed openstack/mistral master: Add started_at and finished_at to task execution. https://review.openstack.org/607703 05:20:55 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 06:22:30 Renat Akhmerov proposed openstack/mistral master: WIP: change workflow completion logic https://review.openstack.org/607807 07:30:12 Renat Akhmerov proposed openstack/mistral master: Add sqlalchemy.exc.OperationalError to the retry decorator https://review.openstack.org/608171 07:45:44 hi, can anybody look at http://logs.openstack.org/71/608171/1/check/openstack-tox-py27/2cd5c83/job-output.txt.gz#_2018-10-05_07_41_46_803612 07:45:54 it happens only on py27 07:46:24 seems like some problem with dependencies but I can't understand what's wrong 07:46:44 this module exists 07:47:24 d0ugal: ^ 07:47:55 d0ugal: btw, I haven't seen apetrich for a while. Is he on vacation? 08:02:06 rakhmerov: Relative import. In mistral.db.utils you import "sqlalchemy", it thinks you try "mistral.db.sqlalchemy" 08:02:15 d0ugal: Error: Can't start another meeting, one is in progress. Use #endmeeting first. 08:02:42 d0ugal: Error: Can't start another meeting, one is in progress. Use #endmeeting first. 08:03:35 oops! 08:03:37 #endmeeting