15:00:24 #startmeeting openstack-cyborg 15:00:25 Meeting started Wed Jan 3 15:00:24 2018 UTC and is due to finish in 60 minutes. The chair is zhipeng. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:26 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:28 The meeting name has been set to 'openstack_cyborg' 15:00:51 meetbot still on vacation ? 15:01:11 Happy New Year guys! 15:01:17 Hi 15:01:23 happy new year 15:02:05 zhipeng: Error: Can't start another meeting, one is in progress. Use #endmeeting first. 15:03:25 zhipeng: Error: Can't start another meeting, one is in progress. Use #endmeeting first. 15:03:33 meetbot seemed to have worked the first time? 15:03:49 ah looks like my network's problem 15:04:27 who do we have here? 15:04:32 #topic Roll Call 15:04:40 #info Howard 15:04:49 #info Justin 15:05:30 #info Michele 15:06:51 and also #info liliu and yumeng 15:07:07 #topic Queens Release Prep 15:07:26 folks entering Jan we are officially in the mode for release prep 15:07:28 here is Vipparthy 15:07:42 so we need to accelerate our progress 15:08:08 let's run through the open patches, see what is still missing and what could be closed 15:08:18 #link https://review.openstack.org/#/q/status:open+project:openstack/cyborg 15:08:34 #info code freeze is Feb 5 15:08:57 #info 1. Internal API spec 15:09:01 #link https://review.openstack.org/525598 15:09:12 jkilpatr any update on this one ? 15:09:50 zhipeng, waiting on feedback, I'm pretty happy with it. 15:10:11 i think we should keep it open as we flash out the api/conductor/agent extension update ? 15:10:29 sounds good to me, talk about what you want in the review. 15:11:15 no problem, I think we should go back to the spec after we land zhuli's API patch 15:11:36 ok next up 15:11:56 #info 2. CRUD api extension 15:12:07 #link https://review.openstack.org/527396 15:13:03 #info Li 15:13:22 TL; DR for this patch is that 15:13:52 for Queens we will need a working API/DB that implements the resource provider data model 15:14:06 this patch is the attempt to achieve that 15:14:16 I'll review today 15:14:27 zhipeng, The generic driver depends on this patch. Once this patch merges, I can add the implementation for the CRUD to the generic driver too 15:14:38 And it doesn't have unit tests yet 15:14:45 so I think as I mentioned before, the implementation should align with jkilpatr's internal API spec 15:15:15 crushil ya I saw your comment 15:15:36 I will work with zhuli and let's land this patch this week 15:15:55 I have already added create functionality to the generic driver 15:16:00 I need to look at my own commit again, it's been a while. 15:16:09 i know :P 15:16:34 okey without further a due 15:16:42 #info 3. generic driver 15:16:47 #link https://review.openstack.org/525057 15:16:56 crushil take away 15:17:51 So, as I already mentioned this patch depends on https://review.openstack.org/527396 and once that patch merges, I can add functionality for the other use cases for Cyborg. 15:19:20 we also need unit test for this one right ? I mean later 15:19:29 yup 15:20:27 how generic should we expect for the driver when we release it ? Can we have very simple demo on NVIDIA GPU ? 15:21:07 That's the goal 15:21:23 We can do a simple POC at the OS summit 15:21:43 cool 15:22:06 ok now let's move on two the relative two new specs 15:22:19 #info 4. FPGA data modeling spec 15:22:29 can you please plan for Demo online, so we can join and view functionality , Thank you 15:22:29 #link https://review.openstack.org/526559 15:22:50 this is actually the spec that zhuli is implementing in his patch 15:23:12 vipparthy we could discuss that at Dublin PTG for sure 15:23:27 Li Liu you there ? 15:23:33 yup 15:23:36 Thank you Zhipeng 15:23:37 Li_Liu 15:24:02 could you briefly introduce the spec ? 15:24:07 sure 15:24:13 so that everyone gets up to speed 15:25:57 Basically, the spec introduces a way to describe hierarchy accelerator structure 15:26:08 One of the most typical usage is FPGA 15:26:28 We plan to add a new table called Deployables 15:27:16 It represent any accelerators that can be discovered/deployed 15:27:54 The key is it has 2 pointers, parent_id and root_ir 15:28:02 root_id* 15:28:27 parent_id points to the parent Deployable row 15:28:40 root_id points to the root Deployable row 15:29:16 This way, we can form nested tree structures 15:29:28 For instance, in the case of FPGA 15:30:02 Each FPGA can have multiple Physical Functions 15:30:29 Each Physical Functions can have multiple Virtual Functions 15:32:13 Also, we have a Attribute table which stores k-v pairs referencing back to the Deployables table 15:32:45 is this aligned with the nested resource provider spec ? 15:32:50 this way, we can track arbitrary type of attributes for generic types 15:32:55 (which expected to be done in Queens) 15:33:20 It does, actually 15:33:40 cool 15:34:00 thx for the intro, everyone plz help review the spec 15:34:08 the Deployable is counterpart to resource providers in NOVA 15:34:13 zhuli will also reflect the changes in the spec on the fly 15:34:19 Thanks a lot guys 15:35:30 moving on to the next 15:35:43 #info 5. Intel's FPGA device driver 15:35:56 #link https://review.openstack.org/530720 15:36:23 shaohe_feng_ could you help introduce the spec ? 15:38:45 zhipeng: yes 15:39:43 zhipeng: this spec is going to provide the fpga information to the Li_Liu's spec 15:40:27 zhipeng: and support a program interface for FPGA 15:41:06 When cyborg agent starts or does resource check periodically, the cyborg FPGA driver should enumerate the list of the FPGA devices, and report the details of all available FPGA accelerator on the host. Such as BDF, PID, VID, IMAGE_ID, PF/VF type. 15:41:33 this has any impact on your work @crushil ? 15:41:37 When user uses empty FPGA regions as its accelerator, cyborg agent should call driver's program() interface. Cyborg agent should provide BDF of VF, and local image path to the driver. More details ref 15:44:05 I'm not sure 15:44:17 I need to look at the spec 15:44:55 zhipeng: I'm not familiar with other vendor's FPGA. Not sure we need a virtual FPGA driver interface. 15:45:58 crushil okey 15:46:22 shaohe_feng_ I think Li_Liu could help you with the insight of other vendor implementation 15:46:23 o, sorry. the driver should can program any FPGA regions not only empty region 15:46:51 zhipeng: that's greate 15:47:10 will update the spec. 15:47:26 cool 15:47:39 okey then we have the last non-priority patch 15:47:49 #info 6. SPDK driver code 15:48:01 #link https://review.openstack.org/513704 15:48:02 the agent decide which region need to be re-program. 15:49:03 the SPDK code is developed by people in my team, so if we could squeeze it in Queens then it would be great, if not it is also not an issue 15:49:13 will ping you guys when it is ready 15:49:23 #topic Queens Priorities 15:49:40 proof of concept? 15:49:40 I would suggest the following priorities for this month 15:49:53 jkilpatr ya like that 15:50:10 we have patches need to be merged in spdk in the first place to make it work 15:50:27 but it is non-priority so no hurry on that one 15:50:55 i will handle that quietly on my side :P 15:51:05 okey back to priority suggestion 15:51:30 #info suggestion 1: land LiLiu and zhuli's patch this week 15:52:00 i know there are a lot of reviews to be done, but let's try to crunch it this week 15:52:17 so that crushil and justin's work could move ahead 15:52:58 #info suggestion 2: we land justin's and shaohe's spec by the end of next week 15:53:38 meaning that once we agreed upon the API layer implementation this week, we could crunch out the conductor/agent/driver interface pretty much next Wed 15:54:14 and we should be okey to have a full picture for the internal API spec, as well as shalhe's driver spec to align with it 15:54:31 sounds great 15:54:42 #info suggestion 3: at the moment we keep the vendor driver code in-tree since there are not a lot 15:54:43 I'll give it a go. 15:54:51 agreed 15:55:00 no reason to make things more complicated until it becomes an issue. 15:55:07 yep 15:55:29 anyone else comment on my three suggestions ? 16:02:52 crushil ? 16:03:18 3 16:03:43 We should keep it in-tree 16:03:53 We are not in that position yet to move it out of tree 16:04:03 zhipeng, who can help me to identify driver specs to be written for different FPGA's in our team ? 16:04:06 crushil exactly 16:04:14 And apologies for delays. I'm in a conference right now 16:04:27 no problem :) 16:05:09 vipparthy you could go over the log, there are info points that could help you identify it 16:06:28 so if there are no other issues, I will close the meeting on time-ish :P 16:06:43 let's go to work and try to deliver an awesome first release 16:07:05 \o/ 16:09:23 #endmeeting