19:01:21 #startmeeting infra 19:01:21 Meeting started Tue May 17 19:01:21 2016 UTC and is due to finish in 60 minutes. The chair is fungi. Information about MeetBot at http://wiki.debian.org/MeetBot. 19:01:22 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 19:01:24 The meeting name has been set to 'infra' 19:01:28 #link https://wiki.openstack.org/wiki/Meetings/InfraTeamMeeting#Agenda_for_next_meeting 19:01:38 #topic Announcements 19:01:45 none for this week 19:01:51 #topic Actions from last meeting 19:02:02 not yet, sorry 19:02:21 #link http://eavesdrop.openstack.org/meetings/infra/2016/infra.2016-05-10-19.03.html 19:02:23 heh 19:02:32 #action jeblair investigate mail filtering by recipient options for infra-root@o.o inbox 19:02:40 #topic Specs approval 19:02:50 none new this week 19:03:01 we still need to flesh out the wip task tracking spec 19:03:16 i was going to pick it up, but got busy with other stuff 19:03:37 o/ 19:03:51 i did move the translation improvements spec to implemented last week 19:04:00 ++ 19:04:05 great 19:04:16 #topic Gerrit 2.11 disk utilization and Revisit Gerrit git gc (fungi, zaro) 19:04:25 #link http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=2839&rra_id=all 19:04:50 a couple weeks ago i ended up growing the backing volume for the gerrit homedir on review.o.o 19:05:15 since right around the 2.11 upgrade, disk utilization inside the git tree there has been experiencing unbounded growth 19:05:32 and anteaya noticed that we were almost out of disk 19:05:38 that is... substantial. 19:05:54 so i doubled it and we've gained some breathing room for the other half of this topic 19:05:56 we've also been adding about one project a day, do we know whether it's gerrit or project growth? or both? 19:06:15 no, like the nova repo was some 10x larger than in january 19:06:23 wow 19:06:27 #link https://www.mail-archive.com/openstack-infra@lists.openstack.org/msg04264.html 19:06:47 zaro: has done some more experimentation with the gerrit background git gc task 19:07:02 and it looks like he wanted to discuss it in the meeting 19:07:29 o/ 19:07:36 yup. so i think there was some lingering concern since last testing i did 19:08:06 i've now tested it on review-dev.o.o running gc on the nova repo and nothing bad happend to it. 19:08:28 also put it up there in case anybody else wanted to test other stuff on it 19:09:04 wanted to see if poeple still have further questions? 19:09:09 are we sure we are running the existing repack cron? 19:09:32 * fungi checks syslog on review.o.o 19:10:00 it is at least starting... 19:10:08 May 15 04:07:01 review CRON[42940]: (gerrit2) CMD (find /home/gerrit2/review_site/git/ -type d -name "*.git" -print -exec git --git-dir="{}" repack -afd \;) 19:10:36 yeah 19:10:50 no idea if it's finishing 19:10:53 i suppose i could try manually running the same as the gerrit2 user and seeing what happens 19:11:09 any objection to me starting it now? 19:11:24 not here 19:11:27 no idea how long it runs 19:11:55 zaro: the only thing i haven't seen is how this impacts the git servers (which i think was one of the things i mentioned in my original email?) 19:12:40 i guess i wasn't aware of that. 19:13:06 zaro: http://paste.openstack.org/show/497409/ 19:13:11 zaro: that paragraph from the email 19:14:00 i've got a screen session as gerrit2 on review.o.o with that command copied from crontab -l running, if another infra-root needs to attach: sudo su - gerrit2; script /dev/null; screen -x 19:15:09 jeblair: are you referring to the cgit servers? 19:15:18 o/ (only sorta) 19:15:20 git.o.o? 19:15:45 zaro: yes, those 19:15:47 zaro: yes. that is the principal thing i'm worried about. 19:15:56 i'm sure that gerrit gc will make gerrit faster 19:16:03 i'm not sure what will happen to the git servers 19:16:14 so doesn't gerrit replicate to those? 19:16:18 I thought that was tested and hashar? provided independent confirmation 19:16:35 zaro: it does 19:16:39 clarkb: i was unaware of that 19:17:05 ya regular git used less memory for them after they started GCing iirc 19:17:30 would it be possible to test with some other repo on review.o.o? 19:17:31 and what impact did it have on network traffic and the resulting size of repos that were cloned from a git server? 19:18:02 i think this is easy to test. just get a repo, gc it, and serve it with cgit and the apache smart git protocol 19:18:05 I don't recall specifics on network traffic and size of cloned repos 19:18:16 this is a one way trip for us 19:18:50 so before we embark on it, i think it would be good to know how it's going to alter one of the more performance-critical pieces of our infrastructure 19:19:04 however, if other folks don't share that concern, we can yolo it. 19:19:50 zaro: jeblair: we could test what happens with replication to the local mirror on review-dev? 19:19:51 jeblair: are you suggesting we setup a seperate cgit server for this test? 19:19:54 i assume we have one configured 19:20:06 fungi: we do 19:20:31 zaro: the testing can be done locally 19:20:33 i thought review-dev was only replicating to github? 19:20:40 zaro: it has a local replica too 19:20:47 the stuff served out of /p/foo/bar 19:21:19 git.o.o has lots of protocols 19:21:37 git smart http via apache, git native, cgit 19:21:57 (also, even occasionally git dumb via http) 19:21:58 the first two are trivial to test on review-dev, the last would require a cgit install 19:22:06 all of them may behave differently 19:22:25 yeah, and cgit install means probably separate centos server 19:22:47 i'm not trying to slow this down 19:22:49 so nova-gc repo is already on review-dev.o.o so we can just look at the local replication? 19:23:00 i seriously think this does not require any special access 19:23:03 just do it locally 19:23:12 not sure why we need a cgit install now. thought you said we only need local test? 19:23:24 zaro: because cgit serves git repos 19:23:35 yes, install cgit on a local server 19:23:39 or a cloud server somewhere 19:23:46 so the test is to ensure we don't negative impact the memory/cpu/network/disk of cgit when we switch 19:23:49 don't worry about review-dev or setting up some kind of new public cgit server 19:25:31 \o_ 19:25:59 private gerrit vm replicating to private cgit vm, test performance of cloning and fetching from the cgit vm via http/https/git protocols before and after turning on gerrit background git gc? 19:26:19 fungi: sure. actual gerrit optional. :) 19:26:40 (since we could forego using gerrit gc and just use git gc to give us more control) 19:27:22 just, whichever of those zaro wants to actually do :) 19:27:26 btw, faux-cron repack still going on review.o.o in my screen session 19:27:33 hasn't bombed yet anyway 19:28:09 i guess it depends whether there's value in having a cgit test server somewhere? 19:28:21 and having gerrit replicate to it? 19:28:38 or is this something we only do one time? 19:29:07 this is a one time thing 19:29:43 ok. then i'll try the less work route 19:29:45 that's the reason i put the tarball of the git repo up -- because none of this requires access to production servers 19:30:19 o/ 19:31:48 ok, so i guess we prefer to test cgit before doing? is that the decision? 19:32:44 hashar: clarkb mentioned your gc'd git replica performance testing back around 19:16 utc in the meeting log. do you have a link to that analysis handy? 19:32:54 i don't mind either way (though if there are surprises, i won't be fixing them) 19:33:32 clarkb: fungi: I might have wrote it in the infra mailing list directly 19:33:45 hashar: do you recall roughly when that was? 19:33:58 nop but can look it up 19:34:09 if you can narrow it down to a particular month, i can dig up a ml archive url for it easily 19:34:10 I was replying to some email from Zaro for sure 19:34:46 hashar: sorry i don't remember :( 19:36:54 maybe we should continue and round back? 19:37:31 yeah, let's proceed to teh next (also gerrit-related) topic 19:37:44 yup I will post to the mailling list if I find something 19:37:45 it's "gerrit week" here in infra 19:37:52 thanks hashar! thanks zaro! 19:38:04 #topic Gerrit project renames using the Gerrit import/delete plugin (zaro) 19:38:14 #link https://gerrit.googlesource.com/plugins/importer/+/stable-2.11/src/main/resources/Documentation/about.md 19:38:36 so i've tested what's described there to rename projects. it's real slick and easy 19:38:44 zaro: so if i understand the proposal, it's to use that plugin to "import" a gerrit project to a new name and then delete the old one? 19:38:49 and no restart or rindex required 19:39:05 fungi: yup, that's really all there is too it. 19:39:12 zaro: reading that, it seems to import from a second gerrit. does importing from itself also work? 19:39:15 neato! 19:39:20 https://gerrit.googlesource.com/plugins/importer/+/stable-2.11/src/main/resources/Documentation/about.md#Project-Rename is the section for renames 19:39:21 yes. 19:39:26 aha 19:39:34 i skimmed poorly! 19:39:49 you can copy, and even move it to a subproject 19:40:08 or i mean across subprojects 19:40:09 and it keeps open changes, subscriptions, et cetera? 19:40:16 yes 19:40:25 starred changes carry over too? 19:40:37 hmm, i didn't check that. but can later 19:40:47 do they end up with new change urls? new change-ids? 19:41:07 just thinking through the things we normally copy when updating the db 19:41:17 fungi: ya I would worry about that, not sure how you can preserve the change numbers between two servers 19:41:25 seems like collisions would be a big issue 19:41:45 for the changes. no, it keeps the same url, change id and commit id 19:41:55 i have a feeling the long change-ids remain the same at least 19:42:05 but i'm surprised it could have the same change number 19:42:30 sorry, actually i think change ids do change. 19:42:35 i thought that was a unique incremented integer column in the table 19:42:37 especially since, at some point during the process, the project exists in two places... 19:43:04 so this would break all of our urls 19:43:17 opps, i mean change number changes, but change id stays the same 19:44:08 ti definitely sounds worthy of further investigation. i don't mind there being some caveats to a rename since that may help discourage them further, but the compromises will need to be weighed against the benefits (no downtime and no offline reindex are biggies) 19:44:16 here's a project copy in progress: https://review-dev.openstack.org/#/x/importer/list 19:45:20 i wonder if the plugin could be extended to have a proper "rename" mode which encapsulates the copy and delete but preserves id numbers and anything else which would otherwise be awkward for an import/copy 19:46:54 so anyway, it's an interesting proposal for sure. there's some serious merit if it gets us free of having maintenance windows to rename projects 19:47:12 more research needed for now though 19:47:24 #topic Project rename scheduling (fungi) 19:47:28 and on that note... 19:47:56 when do we want to rename openstack/openstack-ansible-ironic to openstack/openstack-ansible-os_ironic, and possibly also openstack-infra/ansible-puppet to openstack-infra/ansible-role-puppet 19:48:16 mordred added the second one, but hasn't written the change to actually implement it 19:48:45 i can tell everyone is excited at the prospect of yet another gerrit maintenance window 19:49:18 woooo 19:49:45 i guess let's not shoot for "May 23-27" since that's our server upgrades sprint week 19:50:15 memorial day in the states is the following week, but maybe later in that week? 19:50:45 clarkb and i are also not around for much of the week of june 5 19:51:15 how about the 3rd? 19:51:34 i could see doing it on friday the 3rd of may 19:51:41 I won't be around the 3rd, but you don't need me 19:51:45 june ;) 19:51:49 er 19:51:51 right 19:51:53 months 19:51:58 i could see doing it on friday the 3rd of JUNE 19:52:06 I'll be around 19:52:18 yeah, i don't have any plans that day 19:52:35 if there's at least two infra-root volunteers, it's probably fine? 19:52:46 #link http://releases.openstack.org/newton/schedule.html release schedule 19:52:47 I'll be on east coast time, so earlier is better 19:53:04 that's the end of n1 milestone week 19:53:11 ah, hm 19:53:25 which is probably fine? earlier that week less so of course 19:53:33 yeah 19:54:17 I have family arriving the econd and traveling the 4th so the 3rd is likely family day 19:54:33 dhellmann: any concerns with an extended gerrit maintenance window friday the 3rd of june, the last day of n1 milestone week? 19:54:52 fungi : when would you want to start? 19:55:21 it probably won't be an issue, as long as we get the milestone tags done before you start 19:55:38 and it should be possible to do that early in the day EST 19:55:40 pleia2 and i are volunteering to work on it, and we'll both be in edt that week, so probably not super late 19:55:54 2000? 19:56:03 20:00 utc works fine for me 19:56:10 * dhellmann converts to local time 19:56:14 now :) 19:56:19 that's 4pm augusta time 19:56:44 also athens but yeah, that should work fine 19:56:58 #agreed Gerrit rename maintenance tentatively scheduled 20:00-24:00 UTC Friday, June 3 19:57:11 dhellmann: ahh, one of those a towns. at least i didn't say atlanta ;) 19:57:28 all the "A" towns in georgia are conveniently in the same timezone 19:57:37 so desu nee 19:57:38 can I bring up something real quick? have a meeting to run to 19:57:48 still one topic to go 19:57:51 jeblair : augusta is actually spelled disgusta, but yeah ;-) 19:57:55 #topic Privileged or sanitized log access for openstackid.org (fungi) 19:58:13 ok, guess I'll be late 19:58:26 we temporarily granted smarcet root access to openstackid.org shortly before the summit so he could review logs 19:58:58 just wanted to see if anyone is interested in helping me work out a solution for puppeting privileged log access or alternatively a log sanitization and publishing solution for that server 19:59:25 come find me in #openstack-infra to discuss since it's almost tc meeting time now 19:59:33 #topic Open discussion 19:59:38 thanks 19:59:38 prometheanfire: what's the question? 19:59:47 i think priv access is better; i'm not super keen on exposing (even sanitized) authn logs 19:59:49 I've been working on getting gentoo into nodepool 20:00:11 it was brought up that maybe I should make a spec for it, do I need to? 20:00:15 jeblair: agreed, particularly since they carry potentially personally identifiable infos 20:00:20 jeblair: mapping names and users to IPs 20:00:35 prometheanfire: i don't personally see a need to have a spec for that unless there's a lot of review confusion around it 20:00:50 I don't think there is 20:00:52 also we're out of time. don't want to anger the lovely tc 20:00:59 prometheanfire: then i'd just go forth and code 20:01:01 just have to touch a lot of projects 20:01:01 haha fungi 20:01:05 thanks everyone! 20:01:05 yep, sgtm 20:01:08 cya 20:01:09 #endmeeting