19:04:59 #startmeeting 19:05:00 Meeting started Tue Aug 23 19:04:59 2011 UTC. The chair is mtaylor. Information about MeetBot at http://wiki.debian.org/MeetBot. 19:05:01 Useful Commands: #action #agreed #help #info #idea #link #topic. 19:05:08 bengrue, anotherjesse: no worries. 19:05:20 bengrue: feeling better today I hope? 19:05:29 Yeah. In the office. 19:05:55 OK, so mtaylor, before you start... lemme get something in. 19:06:36 * mtaylor punches jaypipes 19:06:43 #topic jaypipes talking to people about things 19:06:48 All: I'm putting together a project plan that has action items for folks on my team (mtaylor, soren, and jeblair) as well as some folks on other teams like anotherjesse's. 19:07:07 Hoped to get it done by this meeting time, but failed unfortunately. 19:07:58 The basic gist of the plan is that mtaylor and jeblair will have Jenkins firing off an automated deployment of OpenStack onto 5 lab servers by Friday, and this deployment cluster will have a set of functional tests run against it. 19:08:18 The tests will be just the nova smoketests and possibly some stuff from smokestack or kong (for this Friday) 19:08:54 * soren is back 19:09:07 mtaylor is at Burning Man all next week so jeblair will be working on documenting a bunch of stuff next week around the CI infrastructure and soren will be documenting and working on QA/QE stuff (the actual test cases and suites running on a deployed cluster) 19:09:38 Burning man? Will he be alright? 19:09:44 soren: always am 19:09:50 termie_ and sleepsonthefloor got Djeep installed on one of the lab servers late yesterday and jeblair now has access to that (thank you termie_ and sleepsonthefloor!_ 19:09:54 soren: I will, however, be COMPLETELY unreachable 19:10:35 what anotherjesse wants most is to have all the code (puppet modules, glue code, etc) in a public repo that is tested and code reviewed like any othe ropenstack project. We are all in agreement on that! 19:11:03 jaypipes: I'm cool with the CI team who is doing the draft not having to get approval at the begining ;) 19:11:15 anotherjesse: ++ 19:11:16 but putting it in a repo we can watch and help would be muy bueno 19:11:28 anotherjesse: sure 19:11:37 mtaylor: that doesn't mean 1 huge drop - if you don't commit at least daily, no burning man for you 19:11:48 anotherjesse: so far, depending on what it is, we put everything we do in either openstack/openstack-ci or openstack/openstack-ci-puppet 19:11:54 anotherjesse: hehe 19:12:10 mtaylor: will that be added to ci.openstack.org - which repo is for what 19:12:14 anotherjesse: and we actually already do gerrit reviews and some testing on each other ... so we're set 19:12:19 I think that there are a number of GitHub repos already existing that contain puppet modules and chef recipes. These need to be identified and anotherjesse and others need to let mtaylor and jeblair know which puppet/chef modules we need to fire from Jenkins on each build to get continuos deployment testing better 19:12:42 anotherjesse: openstack-ci-puppet is for the puppet modules we use to manage the jenkins slaves and other servers 19:12:55 anotherjesse: openstack-ci is for supporting scripts/glue code/docs/whatnot 19:12:56 mtaylor: for the website - not for me ;) 19:13:02 one sec guys, can I just finish up? 19:13:03 anotherjesse: yup. 19:13:08 jaypipes: NO 19:13:09 jaypipes: yes 19:14:11 OK, so the Djeep installation on the deployment test cluster is temporary. Over the next week, jeblair is going to be looking into the puppet modules (site.pp IIRC) that Djeep generates/builds and converting the automation code to run cobbler instead of having the Djeep stuff on the deployment cluster. 19:14:28 This is going to be done with an eye to adaptability 19:14:43 so that in the future, things like Crowbar can also be used to deploy 19:14:54 anotherjesse: does above ^^ ok with you? 19:14:57 jaypipes: cooleo - fyi djeep builds yaml files - and so puppet can run without it 19:15:27 anotherjesse: ok, well that is something that jeblair is going to document and analyze next week 19:15:35 er, i'm at rookie-o next week 19:15:42 but the bottom line for all is that by Friday, we WILL have a deployment cluster that is at least: 19:15:44 i'm not sure i'll have much time to work on this 19:15:50 (next week) 19:15:50 jeblair: take your laptop 19:15:53 jeblair: hold on, one sec :) 19:16:03 but the bottom line for all is that by Friday, we WILL have a deployment cluster that is at least: 19:16:13 a) testing *some* deployment packaging 19:16:23 b) firing *some* tests against the deployed cluster 19:16:27 OK.. .now 19:17:13 jeblair: documenting/analyzing stuff you can do while on the plane, etc.. Rookie-O isn't an all-consuming thing the entire week, and I'm trying to get us out of it a bit early so we can actually get some stuff done... 19:18:01 jeblair: if you don't get the analyze/document stuff done next week, it's ok... as long as we're moving forward and have a plan, I'm happy. 19:18:02 jaypipes: fyi we took our laptops and sugarbear told us when we had to make sure to pay 100% attention (lawyers, hr, ...) 19:18:29 anotherjesse: yeah, but we're there the entire week... 19:18:33 i just want to set expectations about how much time i'm really going to have to work on this stuff next week. 19:18:34 anyway, back on track... 19:18:46 jeblair: expectation is that you'll do the best you can, nothing more. 19:18:56 jeblair: same expectation for anyone else.. :) 19:19:15 the point is that we have a) a plan and b) we're moving forward and not spinning wheels 19:19:27 bengrue, anotherjesse, jeblair, mtaylor: agreed? ^^ 19:19:39 lgtm 19:19:40 works for me 19:19:57 jaypipes: agreed - mtaylor question about a 19:20:06 #agreed we have a) a plan and b) we're moving forward and not spinning wheels 19:20:12 anotherjesse: talk to me 19:20:19 sounds clearer to me; where will the CI server be on friday? 19:20:20 if we need to change the preseed or puppet rules - is that stuff that will be in the bzr repo? 19:20:24 as for soren's part in this, I will work with soren over the next few days to identify some low-hanging fruit re: funtional test suite and identifying/agreeing on OUTPUTs, etc 19:20:26 (what it is pointing to) 19:20:38 anotherjesse: git repo, but yes 19:20:45 jaypipes: cooleo - either way 19:20:48 anotherjesse: what jaypipes said 19:20:51 mtaylor: answer for bengrue ? 19:20:56 (ie, what should I be paying attention to / eagerly awaiting with baited breath?) 19:21:01 grue have seen your email from jay outlining a&b? 19:21:07 bengrue: I'm not sure I understand the question? 19:21:07 (and cdefgh) 19:21:17 mtaylor: asking where he can see the deploy cluster 19:21:24 bengrue: as in - where will it physically be? 19:21:25 ah 19:21:35 well - jenkins.openstack.org will be the thing driving this puppy 19:21:37 I have the email, scanned it so far, haven't responded yet. 19:21:43 mtaylor: and the code that is running/deploying the cluster, I assume 19:21:55 Oh, the link is there. 19:22:10 Well, the link to the documentation? http://ci.openstack.org 19:22:43 bengrue: that is mtaylor and jeblair are dumping docs on this stuff (and dumping more docs in the next days and weeks) 19:23:02 are all of the ci jobs going to be prefixed with ci- ? 19:23:23 Will there be more added in the next week? Or will everything be added into ci-puppet? 19:23:40 bengrue: more of what? sorry, docs or code? 19:23:42 sorry man - I'm still just not following the questions 19:24:05 yes, I'm not quite following you either, bengrue 19:24:22 mtaylor: request for the ci.openstack.org site - can you put a "how to contribute" section? With links to repos and whatnot? 19:24:23 there will be a job, something like say "nova-baremetal" (it hasn't been named yet) that will launch the deploy/test stuff 19:24:30 anotherjesse: yes 19:24:34 anotherjesse: ++ 19:24:42 #action mtaylor Add how to contribute section to ci.openstack.org 19:24:53 mtaylor: openstack-deploy? :) nova is only one piece :) 19:24:56 mtaylor: and perhaps a link to the ci site on the jenkins site? 19:25:02 You're saying that by friday, there will be a deployment cluster that is testing deployment packaging and firing tests against it. You're saying that the ci server is jenkins.openstack.org. I assume that jenkins will be triggering the cluster deployment via new jobs, yes? 19:25:12 jaypipes: sure. like I said - I have not gotten there yet 19:25:14 bengrue: yep 19:25:19 bengrue: yes 19:25:20 And then coordinating the testing against the cluster? 19:25:30 bengrue: yes 19:25:32 bengrue: well, firing a command that runs the tests, yes 19:25:38 Will the new jenkins jobs be prefixed with ci- ? 19:25:41 no 19:25:42 no 19:25:50 All jenkins jobs are CI... 19:25:55 where should I be looking on Jenkins for the new stuff? 19:26:04 I do not know what they will be prefixed with ... I'll send out a mailing list message 19:26:07 Oh 19:26:07 https://jenkins.openstack.org/job/ci/ 19:26:16 Is this what I should be stalking? 19:26:21 no 19:26:36 co 19:26:38 no 19:26:39 so tbd then, I'll wait for the msg. 19:26:47 that is merely a job that gerrit uses to check commits we make to openstack-ci - which are a collection of helper scripts 19:26:59 yeah - msg will make it clearer 19:27:10 also- I'd like to be clear on something ... 19:27:12 mtaylor: let's just determine the name now... can we call the master job openstack-deploy? 19:27:21 CI != QA :) 19:27:37 I'm not sure I follow. 19:27:44 openstack-deploy is a bit encompasing - as I expect to have one of these jobs for msft/novel, one for citrix, etc. 19:27:52 so, how about openstack-deploy-rax 19:27:58 mtaylor: sure 19:28:00 since it's the one deploying to the rackspace hardware 19:28:03 jaypipes: CI != QA - can you elaborate? 19:28:08 In my experience, the whole point of CI/CD _is_ to play a large amount of the traditional QA role. 19:28:25 jaypipes: this might be the disconnect? 19:28:26 ;) 19:28:32 since I agree with ben 19:28:51 CI is a tool/method used for QA 19:28:53 anotherjesse, bengrue: CI is the automation of continuous builds triggered by commits to trunk. QA is the tests and test suites that get executed against deployed clusteres 19:29:18 anotherjesse, bengrue: I explained this disconnect in terminology in my email to you both :) 19:29:33 anotherjesse, bengrue: sooo, in other words... 19:30:25 CI == Gerrit/Jenkins/Tarmac, CD == CI + deployment scripts and modules, QA == functional, unit, and integration test suites running on the clusters deployed by CD. 19:30:38 that make more sense? 19:30:56 mtaylor: would you agree with the above simplification? 19:31:01 sure.. but the point of caring about CI is so you can do CD/QA 19:31:08 not CI just for the sake of it 19:31:15 I saw that, and I'm unsure that I agree with the premise. Technically the infrastructure to run tests are seperate from the tests, but both are necessary parts of the whole. 19:31:16 jaypipes++ 19:31:22 This is the CI/Testing team, right? 19:31:28 so CI requirements are driven by CD/QA ? 19:31:37 anotherjesse: sure, but our team has expertise in CI and a bit in CD, and we need your team's expertise in D to get good CD and QA ;) 19:31:53 OMG acronym BBQ 19:32:05 jaypipes: so - perhaps this is silly but imho most devs can't have a dev environment that even simulates a deployment 19:32:15 anotherjesse: the point being that mtaylor and jeblair own the glue code to fire off deployment modules that your team creates. 19:32:18 since it is rather complicated to have a network toplogy 19:32:22 NTT can also provide staff for QA in D ;) 19:32:30 so - CI is needed to do development if you do TDD 19:32:38 anotherjesse: mtaylor and jeblair can't be responsible for creating working deployment/puppet modules... 19:33:02 How can you have a stable testbed without working deployment scripts? 19:33:03 not asking for that - asking for ability to not restrict the modules / scripts / … (which we have said we aren't!) 19:33:04 so... this is great ... I have the feeling we might not solve the acronym discussion right here 19:33:07 anotherjesse: but they *can* be responsible for making sure that the puppet modules your team and others create get automatically built on hardware linked in the CI environemt 19:33:08 (they have said they aren't 19:33:21 agree 19:33:29 anotherjesse: k. 19:33:39 anotherjesse: just trying to delineate who is owning what :) 19:33:43 bengrue: I think he is saying that they will make the scripts that we collaborate on 19:33:45 work 19:33:52 but we (stackers) own the scripts 19:33:56 anotherjesse: yes! 19:33:59 :) 19:34:25 just the individual stacker teams? 19:34:25 anotherjesse: and we are relying on your team to create those puppet modules and make sure they actually work :) 19:34:27 I've not thought otherwise - I've just not seen what are the assumptions about the environment our scripts need to run within ;) 19:34:32 What about us sattelite companies? ; ) 19:34:39 bengrue: you are a stacker 19:34:48 anotherjesse: and you are relying on us to make sure that jenkins continuously executes those puppet modules to test deployment and packaging. 19:34:50 oh, openstack, derp. 19:35:04 racker / stacker conflation 19:35:10 anotherjesse: we good on that distinction? 19:35:13 bengrue: and so the CI environment needs to be chainable / …so you can kick off things (this is where monty/jim/jay can speak up since I don't even know the words) 19:35:21 jaypipes: I've agreed with that and continue to agree with that 19:35:33 * soren needs to run for 3 minutes. 19:35:44 anotherjesse: I think what you said may be part of a conversation we should have at some point re: "assumptions about the environment" 19:35:52 bengrue: outside partners and individuals such as yourself are welcome to contribute to any and all aspects of this, but IMHO, the most pressing needs are in the development of additional test cases. 19:35:58 anotherjesse: I think we may each be expecting to see those assumptions from each other :) 19:36:43 anotherjesse: re: chainable stuff, that's a task for mtaylor and jeblair to document: How to stand up a Jenkins builder in your own lab... 19:37:14 anotherjesse, bengrue: and yes. the intent is to allow all of the satelite companies to provide resources to ensure that specific hardware combinations/configurations that are important to them are tested 19:37:17 * soren returns 19:37:32 mtaylor: ok, now that we've agreed to agree on the agreement about responsibilities, can we get an update on where we stand on the deployment cluster? 19:38:01 jaypipes: jeblair is working with the djeep system that termie and sleepsonthefloor just handed off to him 19:38:12 mtaylor: by "working with", what do we mean? 19:39:01 evaluating that it works as expected, and we have access. it does, and i do 19:39:18 jeblair: awesome. next steps? 19:39:21 So, just for my general edification, when will the "assumptions about the environment" cease to be assumptions and start to be an agreed upon, explicit set of facts? Just looking for a general timeframe here, nothing binding. 19:39:40 hang on guys - one topic at a time please 19:39:40 Is that a conversation for this coming week, etc? 19:40:16 currently it's based on a puppetmaster server. we generally don't use that approach, and run puppet out of a local git repo. we may want to adjust the system to that approach 19:40:29 bengrue: I think mtaylor as he deploys the first environment might have a heavily constrained first environment --- and then they are going to work on loosening it 19:40:35 and get the puppet config in a git repo where we can all hack on it 19:40:40 anotherjesse: ++ 19:40:42 bengrue: making it so there are multiple - and it evolves to multiple architectures 19:40:48 Okay, that makes sense. 19:41:00 i'm hoping mtaylor will be guilted into lots of docs in ci.openstack.org ;) or beered into it 19:41:01 we also need to start working on the jenkins job that fires off rebuilding the cluster 19:41:13 anotherjesse: definitely. 19:41:19 I am a fan of spiking. 19:41:24 beer works best. 19:41:35 make sure autocorrect is on. 19:41:43 ; ) 19:42:16 so i think that's the main thrust of work on the cluster, there are dependencies: 19:42:21 jeblair: the puppetmaster/git puppet thing can wait? or is that something you want to get done before Friday? 19:42:31 although just to set expectations ... priority #1 for the week is getting this working by friday. priority #2 is piles of docs. I want both, but I think 1 without 2 will be met with way more approval than 2 without 1 ... so if something has to slip to next week, it's going to be #2 19:42:50 mtaylor: yes 19:42:57 jeblair: the assumption is that we are working towards multiple architectures eventually (like VLAN mode vs LXC vs ...) 19:42:58 right? 19:43:14 so - tests will run against 1 or more architectures 19:43:30 yes 19:43:45 anotherjesse: yes, absolutely. 19:44:00 anotherjesse: there would be little point to this if that were not the case ;) 19:44:01 yes -- the djeep setup presupposes one architecture, so that's probably a post-djeep step 19:44:51 jeblair, mtaylor: OK, so could you outline in steps exactly what you need to do before Friday to get this all going? Any blockers that anotherjesse can assist with? 19:45:13 djeep pre-supposes nothing - but the preseeds/recipes it uses do … hence the roles / rolemapper - but for sake of arguement it doesn't change anything 19:45:57 anotherjesse mentioned his team was continuing work on puppet recipies (and removing dependence on puppetmaster) -- any progress there we can use? 19:46:22 jeblair: not yet - we've been focusing on d4 the last week 19:47:27 jeblair: is that a blocker for Friday or something post Rookie-O? 19:48:48 jeblair: imho if you have it kicking off with puppet solo but it blows up 19:48:53 the community can fix it 19:49:47 getting it so a button causes it to: spin up servers, kick off a script (that runs puppet or ?), then runs tests … if each of the steps is in CI, we can all pitch in if we can look at how it blows up 19:50:08 we'll need the puppet config in git for that to work 19:50:14 anotherjesse: a Big Red Button? 19:50:26 jeblair: isn't that what we are doing? 19:50:28 so that's definitely the goal 19:50:56 jeblair: where is the puppet config that is used to build the cluster now? 19:51:14 it's on the cluster -- anotherjesse's team built it somewhat as a one-off 19:51:26 so i'm not sure it's ready to go right into a git repo as-is 19:51:44 jeblair: our puppet config is at github.com/cloudbuilders/openstack-puppet 19:51:55 we don't even have a commit key on the server 19:52:25 jeblair: so - the "missing" site.pp 19:52:42 is because the whole point of djeep is 2 components 19:52:54 1) something that maps a server to a network / netboot config 19:53:04 these configs get written to yaml/json files 19:53:20 2) puppet server that uses the node classification configuration 19:53:27 that load those json/yaml 19:53:39 so you can just use the puppet recipes with json & a classifier 19:53:43 or you can use a site.pp 19:54:05 or you can use everything if you are doing dev and switch between xen and kvm and different net models and blow away servers daily 19:54:15 jeblair: make more sense? 19:54:31 jeblair: I can show you how to manually run the node classifier on the djeep server 19:54:47 djeep doesn't even have to be running after it writes the yaml files - it is pure puppet 19:55:24 anotherjesse: they are formulating their response :) 19:55:49 * jaypipes happy we are finally getting to the nitty gritty :) 19:55:57 jaypipes: sorry for the sploid 19:56:11 hey, no worries, this is exactly what I was hoping for. 19:56:23 for the moment, i don't think we need to change the configuration, right? 19:56:29 these are precisely the conversations we need to have. 19:56:39 in our master we set: 19:56:39 [master] 19:56:40 node_terminus = exec 19:56:40 external_nodes = /root/djeep/etc/puppet/node_classes.py 19:56:41 so we don't need to re-run the node classifier 19:56:57 jeblair: node classifier is a way for puppet master to not have a sitepp 19:58:20 i guess what i was trying to say is that we can leave the djeep configuration static for the time being 19:59:40 jeblair: here is an example of output: 19:59:40 http://paste.openstack.org/show/2259/ 19:59:45 sanitized 19:59:52 meeting time is almost up. = ) 20:00:12 if site.pp wants to hardcode 356602-staging-cpu2 to have various global variables and classes you can 20:00:18 it just is VERY verbose 20:00:27 having a simple classifier is much easier 20:00:29 Were there non-open-discussion topics that needed to be discussed? 20:00:34 out of time 20:00:35 :) 20:00:44 we can continue in #dev 20:00:50 Let's. 20:00:51 anotherjesse, jeblair: please do continue in #dev... 20:00:53 i'll dig more into puppet and ask anotherjesse with questions 20:01:29 thanks guys! easily the densest ci meeting we've had in a while! 20:01:31 #endmeeting