Monday, 2014-11-24

*** mmaglana has quit IRC00:00
*** mmaglana has joined #openstack-infra00:01
*** tnovacik has quit IRC00:08
*** rkukura has quit IRC00:12
*** mmaglana has quit IRC00:18
*** Ryan_Lane has joined #openstack-infra00:20
*** banix has joined #openstack-infra00:23
*** armax has joined #openstack-infra00:24
*** sarob has quit IRC00:28
*** salv-orlando has quit IRC00:29
*** emagana has joined #openstack-infra00:29
*** ZZelle_ has quit IRC00:34
*** emagana has quit IRC00:34
*** dimsum__ has joined #openstack-infra00:35
*** harlowja_at_home has joined #openstack-infra00:37
*** banix has quit IRC00:38
*** armax has quit IRC00:42
*** banix has joined #openstack-infra00:44
*** Ryan_Lane has quit IRC00:50
openstackgerritJoshua Harlow proposed openstack-infra/elastic-recheck: Add stable/icehouse query for bug 1395368  https://review.openstack.org/13665700:52
uvirtbotLaunchpad bug 1395368 in tempest "ExternalNetworksTestJSON.test_delete_external_networks_with_floating_ip (icehouse) failures" [Undecided,New] https://launchpad.net/bugs/139536800:52
*** Ryan_Lane has joined #openstack-infra00:52
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659701:05
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add debootstrap and rinse to nodepool  https://review.openstack.org/13659801:05
mordredclarkb, fungi, jhesketh: ^^ those at least build something. next step, try booting the to see if they are operable01:06
mordredoh - and I should add the root ssh key :)01:06
clarkbthat is an important step01:06
mordredyah01:07
mordredooh. new bug found on the ubuntu side ... no ssl certs ... anybody remember what the ubuntu package is to get the certs? ca-certificates right?01:07
*** koolhead17 has quit IRC01:08
*** banix has quit IRC01:09
zxiiroDoes anyone know if "git review" and submit drafts? I can't see any docs that provide the arguments for that01:09
clarkbit can git review -d01:10
clarkbshould be in the man page01:10
zxiiroah ok thanks (I was googling, should have thought to check the man page...)01:10
mordredclarkb: speaking of manpages - I made sure that the centos elements above install both man-pages and lsof01:11
mordred:)01:11
openstackgerritJoshua Harlow proposed openstack-infra/elastic-recheck: Add stable/icehouse query for bug 1395368  https://review.openstack.org/13665701:12
uvirtbotLaunchpad bug 1395368 in tempest "ExternalNetworksTestJSON.test_delete_external_networks_with_floating_ip (icehouse) failures" [Undecided,New] https://launchpad.net/bugs/139536801:12
*** Ark has joined #openstack-infra01:15
*** Ark is now known as Guest8123601:15
openstackgerritZhidong Yu proposed openstack/requirements: Add cm-api to global requirements.  https://review.openstack.org/13015301:15
*** yaguang has joined #openstack-infra01:17
*** superdan is now known as dansmith01:23
*** emagana has joined #openstack-infra01:23
*** bhunter71 has joined #openstack-infra01:25
*** emagana has quit IRC01:28
*** dimsum__ has quit IRC01:31
*** dimsum__ has joined #openstack-infra01:31
*** dimsum__ has quit IRC01:36
*** adalbas has quit IRC01:38
*** harlowja_at_home has quit IRC01:42
*** stevemar has joined #openstack-infra01:44
*** Ryan_Lane has quit IRC01:44
*** MaxV has joined #openstack-infra01:46
*** wuhg has joined #openstack-infra01:47
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659701:51
*** MaxV has quit IRC01:51
mordredok. that fixes the ubuntu build01:52
*** dimsum__ has joined #openstack-infra01:54
*** banix has joined #openstack-infra01:56
*** lttrl has joined #openstack-infra01:57
*** achuprin_ has quit IRC02:01
*** fandi has joined #openstack-infra02:02
*** xchu has joined #openstack-infra02:05
*** Guest81236 has quit IRC02:05
*** rkukura has joined #openstack-infra02:06
*** xchu has quit IRC02:06
*** stevemar has quit IRC02:10
*** emagana has joined #openstack-infra02:17
*** achuprin_ has joined #openstack-infra02:21
*** emagana has quit IRC02:22
*** pcrews has joined #openstack-infra02:28
*** yongli has quit IRC02:29
*** camunoz is now known as camunoz_away02:30
jogoclarkb: any ideas why this didn't work? https://review.openstack.org/#/c/136596/ I am stumped02:32
*** weshay has quit IRC02:33
*** armax has joined #openstack-infra02:40
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659702:44
*** unicell has joined #openstack-infra02:46
*** Ryan_Lane has joined #openstack-infra02:47
anteayabeen lurking to see if Yongli He and Shane Wang make an appearance02:55
anteayathinking about heading offline02:55
*** harlowja_at_home has joined #openstack-infra02:57
anteayayep, cat is sleeping in my lap and I can't stay awake02:58
*** mase_x200 has joined #openstack-infra02:58
*** MaxV has joined #openstack-infra02:58
*** shashankhegde has joined #openstack-infra03:00
*** MaxV has quit IRC03:02
*** KanagarajM has joined #openstack-infra03:04
*** pcrews has quit IRC03:07
*** emagana has joined #openstack-infra03:12
*** harlowja_at_home has quit IRC03:15
*** Ark has joined #openstack-infra03:16
*** Ark is now known as Guest4798303:16
*** emagana has quit IRC03:16
*** fifieldt has joined #openstack-infra03:19
*** Guest47983 has quit IRC03:20
*** annegentle has quit IRC03:21
*** davideagnello has joined #openstack-infra03:22
*** dimsum__ has quit IRC03:27
*** davideagnello has quit IRC03:27
*** sarob has joined #openstack-infra03:29
*** sarob has quit IRC03:33
*** ddieterly has quit IRC03:35
*** baoli has quit IRC03:37
*** baoli has joined #openstack-infra03:38
*** banix has quit IRC03:39
*** hdd has joined #openstack-infra03:45
*** banix has joined #openstack-infra03:46
*** camunoz_away is now known as camunoz03:48
*** Ryan_Lane has quit IRC03:57
*** boris-42 has quit IRC03:57
*** armax has quit IRC03:58
*** bhunter71 has quit IRC03:59
*** armax has joined #openstack-infra03:59
*** armax has quit IRC04:00
*** hdd has quit IRC04:00
*** Ryan_Lane has joined #openstack-infra04:00
*** mase_x200 has quit IRC04:03
*** emagana has joined #openstack-infra04:06
*** emagana has quit IRC04:10
*** Ryan_Lane has quit IRC04:16
*** ryanpetrello has joined #openstack-infra04:20
*** armax has joined #openstack-infra04:20
*** banix has quit IRC04:20
*** zz_dimtruck is now known as dimtruck04:21
*** Hal_ has joined #openstack-infra04:24
*** dimsum__ has joined #openstack-infra04:27
*** ddieterly has joined #openstack-infra04:29
*** ddieterly has quit IRC04:31
*** ddieterly has joined #openstack-infra04:31
*** dimsum__ has quit IRC04:32
*** shashankhegde has quit IRC04:34
*** ddieterly has quit IRC04:36
*** armax has quit IRC04:36
*** Hal_ has quit IRC04:36
*** chandankumar has joined #openstack-infra04:40
*** boris-42 has joined #openstack-infra04:42
*** koolhead17 has joined #openstack-infra04:47
*** yfried_ has joined #openstack-infra04:52
*** chandankumar has quit IRC04:55
*** baoli has quit IRC05:06
*** dimtruck is now known as zz_dimtruck05:07
*** otter768 has quit IRC05:14
*** shashankhegde has joined #openstack-infra05:14
*** armax has joined #openstack-infra05:16
*** Longgeek has joined #openstack-infra05:21
*** Longgeek has quit IRC05:27
*** armax has quit IRC05:28
*** ddieterly has joined #openstack-infra05:30
*** teran has quit IRC05:31
*** viglesias has quit IRC05:35
*** ddieterly has quit IRC05:35
*** viglesias has joined #openstack-infra05:41
*** stevemar has joined #openstack-infra05:42
*** rushiagr_away is now known as rushiagr05:45
*** yongli has joined #openstack-infra05:46
*** yfried_ has quit IRC05:46
*** ryanpetrello has quit IRC05:49
*** k4n0 has joined #openstack-infra06:00
*** yongli has quit IRC06:03
*** BharatK has joined #openstack-infra06:12
*** chandankumar has joined #openstack-infra06:14
*** Hal_ has joined #openstack-infra06:17
*** Hal_ has quit IRC06:18
*** shashankhegde has quit IRC06:21
*** ddieterly has joined #openstack-infra06:30
*** teran has joined #openstack-infra06:31
*** ddieterly has quit IRC06:35
*** teran has quit IRC06:36
*** ildikov has quit IRC06:42
*** camunoz is now known as camunoz_gone06:44
*** boris-42 has quit IRC06:47
*** patrickeast has joined #openstack-infra06:48
*** koolhead17 has quit IRC06:48
*** yfried_ has joined #openstack-infra06:48
*** michchap_ has quit IRC06:51
*** patrickeast has quit IRC06:51
*** michchap has joined #openstack-infra06:52
*** talluri has joined #openstack-infra06:53
*** talluri_ has joined #openstack-infra06:56
*** talluri has quit IRC06:58
*** davideagnello has joined #openstack-infra07:00
*** viglesias has quit IRC07:01
*** talluri has joined #openstack-infra07:01
*** afazekas has joined #openstack-infra07:01
*** talluri_ has quit IRC07:03
*** AlexF has joined #openstack-infra07:04
*** davideagnello has quit IRC07:04
*** Longgeek has joined #openstack-infra07:05
*** Hefeweizen has quit IRC07:09
*** viglesias has joined #openstack-infra07:12
*** otter768 has joined #openstack-infra07:15
*** AlexF has quit IRC07:15
*** stevemar has quit IRC07:16
*** achanda has joined #openstack-infra07:17
*** belmoreira has joined #openstack-infra07:19
*** Murad has joined #openstack-infra07:19
*** otter768 has quit IRC07:20
*** achanda has quit IRC07:20
*** achanda has joined #openstack-infra07:20
*** ildikov has joined #openstack-infra07:20
*** koolhead17 has joined #openstack-infra07:20
*** koolhead17 has joined #openstack-infra07:20
*** yfried_ is now known as yfried|afk07:21
*** koolhead17 has quit IRC07:23
*** viglesias has quit IRC07:24
*** viglesias has joined #openstack-infra07:30
*** ddieterly has joined #openstack-infra07:30
*** yfried|afk is now known as yfried_07:32
*** teran has joined #openstack-infra07:32
*** talluri has quit IRC07:33
*** ddieterly has quit IRC07:34
*** ddieterly has joined #openstack-infra07:36
*** teran has quit IRC07:37
*** mrmartin has joined #openstack-infra07:38
*** jgallard_ has joined #openstack-infra07:40
*** ddieterly has quit IRC07:40
*** belmoreira has quit IRC07:42
*** salv-orlando has joined #openstack-infra07:42
*** emagana has joined #openstack-infra07:44
*** belmoreira has joined #openstack-infra07:45
*** ivar-lazzaro has quit IRC07:46
*** koolhead17 has joined #openstack-infra07:47
openstackgerritMate Lakat proposed openstack-infra/project-config: XenServer: Use nodepool to inject XVA and ISO url  https://review.openstack.org/13670007:48
*** emagana has quit IRC07:49
openstackgerritMate Lakat proposed openstack-infra/nodepool: Support install phase with nodepool  https://review.openstack.org/9778707:51
openstackgerritMate Lakat proposed openstack-infra/nodepool: Support nodes with launch condition  https://review.openstack.org/9779807:51
*** Daisy has joined #openstack-infra07:57
*** jyuso has joined #openstack-infra07:57
*** achanda has quit IRC07:59
*** amuller has joined #openstack-infra08:02
*** ZZelle has quit IRC08:02
*** ZZelle has joined #openstack-infra08:02
*** miqui_ has quit IRC08:05
*** rcarrillocruz has quit IRC08:09
*** rcarrillocruz has joined #openstack-infra08:09
*** teran has joined #openstack-infra08:12
*** talluri has joined #openstack-infra08:13
*** skolekonov has joined #openstack-infra08:15
*** HeOS has quit IRC08:18
*** e0ne has joined #openstack-infra08:18
*** KanagarajM has quit IRC08:19
*** talluri has quit IRC08:22
*** doude has quit IRC08:25
*** achuprin_ has quit IRC08:28
*** jcoufal has joined #openstack-infra08:31
openstackgerritMate Lakat proposed openstack-infra/nodepool: Support nodes with launch condition  https://review.openstack.org/9779808:36
*** jerryz has joined #openstack-infra08:38
*** jlibosva has joined #openstack-infra08:38
*** emagana has joined #openstack-infra08:38
*** arxcruz has joined #openstack-infra08:39
*** achuprin_ has joined #openstack-infra08:40
*** MaxV has joined #openstack-infra08:41
*** emagana has quit IRC08:43
*** bo_sh has joined #openstack-infra08:45
*** nadya has joined #openstack-infra08:47
*** nadya is now known as Guest3664508:48
*** berendt has joined #openstack-infra08:49
*** Guest36645 has quit IRC08:49
*** jistr has joined #openstack-infra08:55
*** jlibosva has quit IRC08:56
*** jpich has joined #openstack-infra08:59
*** ala_ has joined #openstack-infra09:00
*** nfedotov has joined #openstack-infra09:02
*** jlibosva has joined #openstack-infra09:03
*** teran has quit IRC09:06
*** derekh has joined #openstack-infra09:11
*** jedimike has joined #openstack-infra09:13
*** Murad has quit IRC09:15
*** otter768 has joined #openstack-infra09:16
*** andreykurilin_ has joined #openstack-infra09:18
*** tnovacik has joined #openstack-infra09:18
*** jlibosva has quit IRC09:20
*** otter768 has quit IRC09:20
openstackgerritClaudiu Popa proposed openstack-dev/pbr: Support platform-specific requirements files  https://review.openstack.org/13670709:21
*** jlibosva has joined #openstack-infra09:21
*** IvanBerezovskiy has joined #openstack-infra09:22
*** HeOS has joined #openstack-infra09:23
*** bo_sh has left #openstack-infra09:25
*** teran has joined #openstack-infra09:26
*** andreykurilin_ has quit IRC09:26
*** amuller_ has joined #openstack-infra09:26
*** amuller__ has joined #openstack-infra09:28
*** nadya has joined #openstack-infra09:29
*** nadya is now known as Guest2489009:29
*** maishsk has joined #openstack-infra09:29
* maishsk says hi and good morning - anyone awake? 09:30
*** amuller has quit IRC09:30
*** mpaolino has joined #openstack-infra09:31
*** hashar has joined #openstack-infra09:31
*** amuller_ has quit IRC09:31
*** emagana has joined #openstack-infra09:33
*** talluri has joined #openstack-infra09:33
*** Longgeek has quit IRC09:35
*** emagana has quit IRC09:37
*** talluri has quit IRC09:38
*** yamamoto has joined #openstack-infra09:39
*** zz_johnthetubagu is now known as johnthetubaguy09:42
*** bo_sh has joined #openstack-infra09:42
*** hashar has quit IRC09:43
*** teran has quit IRC09:44
*** maishsk has quit IRC09:46
*** jp_at_hp has joined #openstack-infra09:47
*** Longgeek has joined #openstack-infra09:51
*** jcoufal has quit IRC09:51
*** hashar has joined #openstack-infra09:52
*** bo_sh has left #openstack-infra09:53
*** deepakcs has joined #openstack-infra09:53
*** hashar has quit IRC09:53
*** maishsk has joined #openstack-infra09:53
maishskAnyone awake yet?09:53
*** hashar has joined #openstack-infra09:53
*** dimsum__ has joined #openstack-infra09:54
*** belmoreira has quit IRC09:55
*** Guest24890 has quit IRC09:58
*** dimsum__ has quit IRC09:59
*** cnesa has joined #openstack-infra10:00
*** boris-42 has joined #openstack-infra10:01
*** yaguang has quit IRC10:02
*** johnthetubaguy is now known as zz_johnthetubagu10:03
*** rlandy has joined #openstack-infra10:04
*** cnesa has quit IRC10:04
*** cnesa has joined #openstack-infra10:05
*** marcusvrn has joined #openstack-infra10:07
*** amuller__ has quit IRC10:10
*** dmelladol is now known as dmellado|afk10:12
*** Daisy has quit IRC10:13
*** Daisy has joined #openstack-infra10:13
*** mase_x200 has joined #openstack-infra10:20
*** zz_johnthetubagu is now known as johnthetubaguy10:21
*** johnthetubaguy is now known as zz_johnthetubagu10:24
*** zz_johnthetubagu is now known as johnthetubaguy10:24
*** dmellado|afk has quit IRC10:25
*** belmoreira has joined #openstack-infra10:26
*** dmellado has joined #openstack-infra10:27
*** emagana has joined #openstack-infra10:27
*** fandi has quit IRC10:28
*** mase_x200 has quit IRC10:29
*** Daisy has quit IRC10:30
*** emagana has quit IRC10:31
*** BharatK has quit IRC10:32
*** mase_x200 has joined #openstack-infra10:32
*** vdo has joined #openstack-infra10:33
*** hdd has joined #openstack-infra10:34
*** mpaolino has quit IRC10:35
*** yamamoto has quit IRC10:36
*** davideagnello has joined #openstack-infra10:37
*** mase_x200 has quit IRC10:38
*** ldnunes has joined #openstack-infra10:39
*** davideagnello has quit IRC10:42
*** BharatK has joined #openstack-infra10:43
*** akuznetsova has joined #openstack-infra10:46
*** mpaolino has joined #openstack-infra10:48
*** yamamoto has joined #openstack-infra10:49
*** jcoufal has joined #openstack-infra10:53
*** yfried_ is now known as yfried|afk10:56
*** maishsk has quit IRC11:05
*** ldnunes has quit IRC11:07
*** yamamoto has quit IRC11:07
*** MaxV has quit IRC11:08
*** teran has joined #openstack-infra11:08
*** ldnunes has joined #openstack-infra11:09
*** yamamoto has joined #openstack-infra11:10
*** MaxV has joined #openstack-infra11:12
*** sergsh has joined #openstack-infra11:13
*** hdd has quit IRC11:15
*** jgallard_ has quit IRC11:15
*** yfried|afk is now known as yfried_11:16
*** otter768 has joined #openstack-infra11:17
*** nadya has joined #openstack-infra11:17
*** nadya is now known as Guest8691911:18
*** emagana has joined #openstack-infra11:21
*** hdd has joined #openstack-infra11:21
*** otter768 has quit IRC11:21
*** amuller__ has joined #openstack-infra11:23
*** rfolco has joined #openstack-infra11:24
*** emagana has quit IRC11:25
*** yfried_ is now known as yfried|afk11:26
*** MaxV has quit IRC11:27
*** maishsk has joined #openstack-infra11:28
*** rfolco has quit IRC11:29
*** yfried|afk is now known as yfried_11:31
*** groknix has quit IRC11:31
*** teran has quit IRC11:31
*** teran has joined #openstack-infra11:31
*** groknix has joined #openstack-infra11:31
*** ldnunes_ has joined #openstack-infra11:32
*** ldnunes has quit IRC11:32
*** hdd has quit IRC11:33
*** groknix has quit IRC11:34
maishskAnyone around?11:34
*** groknix has joined #openstack-infra11:34
*** pblaho has joined #openstack-infra11:35
*** ashaeron has joined #openstack-infra11:35
*** isaacb has joined #openstack-infra11:36
*** koolhead17 has quit IRC11:36
*** koolhead17 has joined #openstack-infra11:36
*** amuller__ is now known as amuller11:36
*** koolhead17 has quit IRC11:37
*** aysyd has joined #openstack-infra11:37
*** marcusvrn has quit IRC11:37
*** adalbas has joined #openstack-infra11:40
*** yfried_ is now known as yfried|afk11:41
*** yfried|afk is now known as yfried_11:42
*** pblaho has quit IRC11:45
*** rcarrillocruz has quit IRC11:55
*** rcarrillocruz has joined #openstack-infra11:55
*** marcusvrn has joined #openstack-infra11:56
*** dkehn has quit IRC11:56
*** chandankumar has quit IRC11:56
*** dkehn has joined #openstack-infra11:57
*** yfried_ is now known as yfried|afk12:01
*** chandankumar has joined #openstack-infra12:01
*** dimsum__ has joined #openstack-infra12:03
*** unicell has quit IRC12:10
*** hashar has quit IRC12:14
*** BharatK has quit IRC12:16
*** pblaho has joined #openstack-infra12:21
*** koolhead17 has joined #openstack-infra12:21
*** yfried|afk is now known as yfried_12:21
*** mase_x200 has joined #openstack-infra12:22
*** MaxV has joined #openstack-infra12:27
*** mase_x200 has quit IRC12:29
*** mase_x200 has joined #openstack-infra12:30
*** weshay has joined #openstack-infra12:32
*** amuller_ has joined #openstack-infra12:37
*** amuller has quit IRC12:37
*** vdo has quit IRC12:39
*** amuller__ has joined #openstack-infra12:41
openstackgerritMerged openstack-infra/storyboard: setup for running as a stand alone application.  https://review.openstack.org/13187012:43
*** amuller_ has quit IRC12:44
*** amuller has joined #openstack-infra12:45
*** amuller__ has quit IRC12:49
openstackgerritMerged openstack-infra/storyboard: Split Token DB API into separate file  https://review.openstack.org/13440812:50
*** jcoufal_ has joined #openstack-infra12:51
openstackgerritMerged openstack-infra/storyboard: User token API  https://review.openstack.org/13440912:53
*** jcoufal has quit IRC12:54
*** marcusvrn1 has joined #openstack-infra12:56
*** marcusvrn has quit IRC12:57
*** deepakcs has quit IRC12:59
*** k4n0 has quit IRC13:00
*** ddieterly has joined #openstack-infra13:00
openstackgerritMerged openstack-infra/storyboard-webclient: Add timeout to the blur event of tag-complete  https://review.openstack.org/13533413:02
*** maishsk has quit IRC13:04
*** yolanda has joined #openstack-infra13:04
*** sandywalsh has joined #openstack-infra13:04
openstackgerritMerged openstack-infra/storyboard-webclient: Switched use of "Resource.read()" to "Resource.get()"  https://review.openstack.org/13614813:04
*** jaypipes has joined #openstack-infra13:05
*** emagana has joined #openstack-infra13:09
*** mbacchi has joined #openstack-infra13:09
*** emagana has quit IRC13:13
*** maishsk has joined #openstack-infra13:16
openstackgerritMerged openstack-infra/storyboard: Added project group title to loader.  https://review.openstack.org/13324813:17
*** otter768 has joined #openstack-infra13:18
*** dprince has joined #openstack-infra13:21
*** bswartz has quit IRC13:22
*** baoli has joined #openstack-infra13:22
*** baoli has quit IRC13:22
*** otter768 has quit IRC13:22
*** eharney has joined #openstack-infra13:23
*** baoli has joined #openstack-infra13:23
*** Ng has quit IRC13:24
*** jraim has quit IRC13:24
*** tchaypo has quit IRC13:24
*** serverascode___ has quit IRC13:24
*** simonmcc has quit IRC13:24
*** Ng has joined #openstack-infra13:24
*** zhiyan has quit IRC13:24
*** jraim has joined #openstack-infra13:25
*** boris-42 has quit IRC13:25
*** sweston_ has quit IRC13:25
*** sweston_ has joined #openstack-infra13:25
*** rainya has quit IRC13:26
*** boris-42 has joined #openstack-infra13:26
*** tchaypo has joined #openstack-infra13:26
*** serverascode___ has joined #openstack-infra13:26
*** rainya has joined #openstack-infra13:27
*** zhiyan has joined #openstack-infra13:27
*** simonmcc_ has joined #openstack-infra13:28
nibalizergood morning13:29
*** koolhead17 has quit IRC13:30
*** koolhead17 has joined #openstack-infra13:30
maishskhi13:32
*** julim has joined #openstack-infra13:33
*** jerryz1 has joined #openstack-infra13:33
*** che-arne has joined #openstack-infra13:34
*** pc_m has joined #openstack-infra13:35
*** pc_m has quit IRC13:35
*** jerryz has quit IRC13:35
*** koolhead17 has quit IRC13:35
*** pc_m has joined #openstack-infra13:35
*** NithyaG has joined #openstack-infra13:35
*** mase_x200 has quit IRC13:36
*** ayoung has joined #openstack-infra13:41
*** alexpilotti has joined #openstack-infra13:43
*** koolhead17 has joined #openstack-infra13:44
*** e0ne has quit IRC13:50
*** jgallard_ has joined #openstack-infra13:50
*** jamespage_ has joined #openstack-infra13:51
*** e0ne has joined #openstack-infra13:51
*** alexpilotti has quit IRC13:52
*** jamespage_ has quit IRC13:52
*** jgallard_ has quit IRC13:53
*** ddieterly has quit IRC13:53
*** jgallard_ has joined #openstack-infra13:53
*** jgallard_ has quit IRC13:53
*** koolhead17 has quit IRC13:56
*** koolhead17 has joined #openstack-infra13:57
*** dustins has joined #openstack-infra13:58
*** groknix has quit IRC13:58
*** groknix has joined #openstack-infra13:59
*** bswartz has joined #openstack-infra13:59
*** cpowell has joined #openstack-infra14:00
*** dimsum__ has quit IRC14:01
*** koolhead17 has quit IRC14:02
*** dimsum__ has joined #openstack-infra14:02
*** otherwiseguy has joined #openstack-infra14:03
*** dkliban_afk is now known as dkliban14:03
*** emagana has joined #openstack-infra14:04
*** emagana has quit IRC14:09
*** chandankumar has quit IRC14:12
*** ryanpetrello has joined #openstack-infra14:12
*** esker has joined #openstack-infra14:12
*** esker has quit IRC14:13
*** esker has joined #openstack-infra14:13
*** davideagnello has joined #openstack-infra14:15
*** esker has quit IRC14:16
*** davideagnello has quit IRC14:20
*** dkranz has joined #openstack-infra14:26
*** ddieterly has joined #openstack-infra14:26
*** Sincler has joined #openstack-infra14:30
*** belmoreira has quit IRC14:31
*** jerryz has joined #openstack-infra14:32
*** jungleboyj has quit IRC14:33
*** BharatK has joined #openstack-infra14:34
*** jerryz1 has quit IRC14:35
*** amitgandhinz has joined #openstack-infra14:35
*** koolhead17 has joined #openstack-infra14:36
maishsknibalizer:14:36
maishsk?14:36
nibalizermaishsk: yes?14:37
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659714:37
maishskI am trying to understand how the CI works and uses Openstack compute for resource provisioning14:38
maishskIs it done through Jenkins?14:38
maishskIs there a Jenkins plugin that plugs into OpenStack? like the Vmware plugin?14:38
jktmaishsk: http://ci.openstack.org/nodepool.html14:39
maishskThis is what I was looking for.. - http://ci.openstack.org/nodepool/configuration.html#providers14:41
maishskthanks jkt !14:42
*** mjturek has joined #openstack-infra14:42
*** jklare_ is now known as jklare14:44
fungimaishsk: also you can see the template of our current production nodepool configuration file at http://git.openstack.org/cgit/openstack-infra/system-config/tree/modules/openstack_project/templates/nodepool/nodepool.yaml.erb14:44
*** Hal_ has joined #openstack-infra14:45
maishskthanks fungi14:45
*** Hal_ has quit IRC14:45
*** maishsk has quit IRC14:45
*** mriedem has joined #openstack-infra14:46
*** signed8bit has joined #openstack-infra14:48
*** thedodd has joined #openstack-infra14:50
*** esker has joined #openstack-infra14:51
*** emagana has joined #openstack-infra14:53
*** kgiusti has joined #openstack-infra14:53
*** wuhg has quit IRC14:53
*** dangers_away is now known as dangers14:56
*** bhunter71 has joined #openstack-infra15:00
mtreinishjogo: I am now15:00
*** unicell has joined #openstack-infra15:05
*** esker has quit IRC15:10
*** esker has joined #openstack-infra15:10
*** rushiagr is now known as rushiagr_away15:11
*** xyang0 has joined #openstack-infra15:12
*** doug-fish has joined #openstack-infra15:12
*** prad has joined #openstack-infra15:13
*** erikwilson has joined #openstack-infra15:15
*** zz_jgrimm is now known as jgrimm15:15
*** esker has quit IRC15:18
*** otter768 has joined #openstack-infra15:18
*** koolhead17 has quit IRC15:19
*** koolhead17 has joined #openstack-infra15:19
*** beekneemech is now known as bnemec15:20
*** AlexF has joined #openstack-infra15:21
*** ayoung is now known as ayoung-afk15:22
*** r-daneel has joined #openstack-infra15:22
*** thedodd has quit IRC15:22
openstackgerritSean Dague proposed openstack-infra/project-config: Revert "move nova-tox-functional to experimental until there is content"  https://review.openstack.org/13679515:22
mtreinishfungi: so for subunit2sql bugs I go through storyboard now?15:23
fungimtreinish: yep15:23
mtreinishok, time to give this a try...15:24
*** otter768 has quit IRC15:24
*** koolhead17 has quit IRC15:24
*** jerryz has quit IRC15:25
*** stevemar has joined #openstack-infra15:25
fungisdague: what's the story behind 136795? the commit that reverts is from yesterday, right?15:25
*** funzo_ is now known as funzo15:26
*** jungleboyj has joined #openstack-infra15:28
sdaguefungi: yeh, basically, I actually got working on it today15:28
fungisdague: wow, you're speedy15:29
*** teran has quit IRC15:31
*** teran has joined #openstack-infra15:31
*** pcrews has joined #openstack-infra15:32
*** dimsum__ is now known as dims15:32
*** atiwari has joined #openstack-infra15:35
sdaguefungi: so what's the overall nodepool time for both the build up and tear down of a server?15:35
tchaypo ./toci_devtest.sh: line 173: cd: /opt/stack/new//os-net-config: No such file or directory15:35
tchaypowheee15:36
tchaypooh, here we go15:36
tchaypohttps://www.irccloud.com/pastebin/sak6KepY15:36
fungisdague: it varies extremely depending on provider and load15:37
sdaguefungi: ranges?15:37
fungisdague: for example, in rax dfw it can take up to an hour for nova to assign an ip address because they're severely constrained on address turn-over15:37
fungiwhereas i've seen some nodes booted in ~5 minutes15:38
sdaguegotcha15:38
sdagueman... that's kind of special :(15:38
funginova instance reuse might help us there. i know mordred was looking at it at one point15:38
sdagueso, as I was looking at the job definitions, we seem to be exploding on project tests that run in a couple of minutes. For instance, the docs team run 4 tests that individually take < 2m each15:39
sdagueon every manuals change15:39
mordredfungi: yes, it's next on my list15:39
sdagueand if we have substantial setup / teardown time, it seems like it would behoove us to be a little smarter there15:39
*** rushiagr_away is now known as rushiagr15:40
mordredsdague: ++15:40
fungisdague: also dibification may help. we've seen some providers exhibit much worse boot time from snapshots than from glance images, due to the storage backends in use15:40
fungior more likely, due to the network characteristics between nova compute hosts and wherever snapshots are being stashed15:41
zulare you guys planning to do a release of pbr soon?15:41
fungidhellmann: ^ ?15:41
*** zz_dimtruck is now known as dimtruck15:41
clarkband dibification can continue now. I want to remove f21 because its noisy15:41
mrmartinre15:41
clarkbthen get my nodepool change in for vpc images15:42
clarkbat that point hopefully we can focus on images15:42
fungimrmartin: i have the groups.openstack.org cert purchased now. where's the change you had to reference it?15:42
mrmartinfungi, thats great :) we need some cert for groups-dev.openstack.org first15:43
mrmartinhttps://review.openstack.org/#/c/135708/15:43
fungimrmartin: looking15:43
mrmartinbut this patch enabling ssl for groups-dev first, for groups.openstack.org we need another one15:44
mrmartinbased on this.15:44
fungik15:44
*** bhunter71 has quit IRC15:44
mrmartinso I suggest, to do the groups-dev cert deployment first, and if it works, go with the prod site.15:44
*** matel has joined #openstack-infra15:44
*** ildikov has quit IRC15:45
mrmartinand something else. :) I'm working on this askbot migration, and it is using postgresql as a db backend. do we have pgsql in rackspace dbaas ?15:45
fungimrmartin: yep, i'll review in a bit to compare it against how we're doing self-signed certs for other dev sites15:45
*** bhunter71 has joined #openstack-infra15:45
*** banix has joined #openstack-infra15:47
matelHi guys, I'm looking for some nodepool expertise, anyone around?15:47
*** krtaylor has quit IRC15:47
*** emagana has quit IRC15:47
*** koolhead17 has joined #openstack-infra15:47
*** emagana has joined #openstack-infra15:47
*** jerryz has joined #openstack-infra15:47
clarkboh there is also a change to allow mixed dib and snapshot images against the same label15:48
clarkbthat one is important for migrating to dib15:48
clarkbI will look them over and rebase as necessary today15:48
sdaguefungi / clarkb: +A? - https://review.openstack.org/#/c/134620/15:50
clarkbmatel its usually best to just ask your question15:51
matelI have two changes to generate snapshot images15:51
matelhttps://review.openstack.org/9778715:51
matelAnd https://review.openstack.org/9779815:52
*** mattfarina has joined #openstack-infra15:52
*** esker has joined #openstack-infra15:52
matelThese changes are required for XenServer CI.15:52
fungiskimming those, i wonder if we couldn't just build those via dib now rather than trying to shoehorn something like that into the snapshot style image generation15:53
clarkbfungi ++15:53
*** ayoung-afk is now known as ayoung15:53
matelfungi: Can you run a "VM" inside DIB?15:54
clarkbthough would not be surprised if xenserver needs more than achroot to build15:54
matelIt does need more15:54
matelThat's why I rebased these changes.15:54
fungimatel: oh, you can't just assemble that by downloading/installing things into a loopback-mounted filesystem in a chroot?15:54
*** krtaylor has joined #openstack-infra15:54
matelfungi: You can't do that. You have to run the XenServer installer.15:54
jeblairmatel: what does the installer do?15:55
mateljeblair: It's a custom installer script, inside an initrd.15:55
* fungi notes that's an answer to a different question15:57
mateljeblair: I don't know exactly what it does.15:57
mateljeblair: I can dig out the sources, and reverse engineer that, but that would be a much bigger project15:57
*** BharatK has quit IRC15:58
openstackgerritBradley Klein proposed openstack-infra/project-config: Add puppet-monasca acls and review group.  https://review.openstack.org/13643215:58
jeblairmatel: the current way of doing xenserver installs is very complicated and fragile -- the fact that we needed to do that kind of work to get the images we wanted was a big part of why we wanted to move nodepool to dib15:58
*** kgiusti has quit IRC15:58
*** kgiusti has joined #openstack-infra15:59
fungijust worth noting that debian/ubuntu, rhel/centos, et cetera installers similarly run from an in-memory virtual filesystem, so that's not unusual, but people have also written tools to bootstrap them from running operating systems as well15:59
krotscheckStoryboard meeting in #openstack-meeting-316:00
matelfungi: we don't have such installer for XenServer16:00
jeblairmatel: i believe in the long run, we don't want to have nodepool using running vms to create images, and only use dib in the future; i think it would be worth spending some time thinking about what it would take to get a xenserver image with dib16:00
*** mpavlase has joined #openstack-infra16:01
mateljeblair: I completely agree with that, and I think that's the correct way of doing it, however, I doubt that we'll have the resources to mimic the behavior of the installer inside a chroot any time soon.16:01
fungiso, can't we do this with a downloaded xenserver base image and a very thin dib element set to customize it?16:02
jeblairfungi: that seems reasonable -- matel: that's another advantage of dib -- we don't have to start with what our cloud providers give us, we can start with something that already exists16:03
matelfungi: looks like "sysprep" ing a XenServer, right?16:03
jkthmm, anyone got success with using gertty to connect to gerrit-review.googlesource.com?16:03
fungimatel: more like retrieve a xenserver image, mount it on a loopback, modify files present in it, then repack the image and use that16:04
jkttheir web UI doesn't show me my HTTP password, just some, er, crap straight for git16:04
*** davideagnello has joined #openstack-infra16:04
*** enikanorov has quit IRC16:04
*** juice has quit IRC16:04
matelfungi: That still leaves us with one problem: launching that instance. XenServer itself does not know about the cloud - it can't communicate with the agent.16:05
jkteh, auth-type: basic16:05
jktanother PEBKAC on my side today :(16:05
*** bhunter71 has quit IRC16:05
fungimatel: that modified xenserver image gets uploaded to glance and we nova-boot it. what else does it need to know?16:05
sdaguehmmmm.... why are we doing a puppet apply on centos6 on zuul layout changes?16:06
openstackgerritMerged openstack-infra/project-config: Remove python26 jobs from various projects  https://review.openstack.org/12943516:06
matelfungi: first, the image boots an Ubuntu, that has an agent inside. This agent will know the instance's address, and sets these parameters to XenServer, and re-boots the VM to XenServer.16:06
openstackgerritMerged openstack-infra/project-config: Update ironic pxe job names to reflect voting status  https://review.openstack.org/13462016:06
*** juice has joined #openstack-infra16:07
mateljohnthetubaguy: ping16:07
jeblairfungi, clarkb, pleia2: any storyboard feedback?16:08
*** davideagnello has quit IRC16:08
fungimatel: why not just make an image that boots directly into xenserver?16:08
*** garyh has joined #openstack-infra16:09
matelfungi: You don't know your IP address - XenServer is not cloud aware, it's HVM16:09
matelfungi: the alternate route would be to use config drive.16:09
fungimatel: you're saying the ip address of the instance has to be embedded somewhere in teh xenserver filesystem before the instance boots xenserver?16:11
matelfungi: exactly16:11
*** enikanorov has joined #openstack-infra16:11
matelfungi: that's why we first boot the ubuntu, inject the IP, and reboot the box, and on next reboot XenServer picks that up.16:12
fungimatel: that suggests that if you want to change the ip address of a xenserver in production you have to rerun the installer on your server? sounds unbelievably painful16:12
matelYou are not running the full installer at that point, you just re-configure the IP address.16:12
fungimatel: but it needs a reboot to change its ip address?16:13
*** yfried_ has quit IRC16:14
*** tonytan4ever has joined #openstack-infra16:14
matelfungi: No, it doesn't. I need to re-boot it, because In my image I have two operating systems: A cloud aware ubuntu and a XenServer. First the partition with Ubuntu is active. That has the agent, gets the IP, modifies the boot loader, and reboots16:14
fungimatel: so this is a multi-partition block device? or a virtual block device inside another block device?16:15
*** amitgandhinz has quit IRC16:16
*** achanda has joined #openstack-infra16:16
fungimatel: i'm mostly asking why we can't just boot the xenserver and change its ip address configuration once it's up, and skip the ubuntu partition, the agent therein, and the extra reboot16:16
matelThe layout of the image is a multi-partition block device with 3 partitions. One for Ubuntu, one for XenServer, and one for XenServer's storage, where the VMs live (that is an embedded block device, this is where devstack bits live)16:17
sdaguehmmm... there are a relatively small number of nodes actually running tests right now, which seems quite odd.16:17
matelfungi: the image runs xen and that prevents us from communicating with the xen under that one.16:18
matelfungi: And xenstore is used for communication16:19
*** nfedotov has quit IRC16:19
fungimatel: it sounds like you're describing the current design while i'm asking why we can't change the design. why can't you use a statically-assigned loopback address on the xenserver for communicating with the xen instance it's managing? why does it have to be the ever-changing nova instance i address from the service provider instead?16:19
mordredmatel, fungi I'm hacking in dib ... maybe I take a look this afternoon16:19
*** amitgandhinz has joined #openstack-infra16:20
fungimatel: if you can design it so that it's not dependent on the nova-assigned interface ip address then it sounds like all this other bootstrapping complexity goes away?16:21
matelfungi: Rackspace agent uses xenstore to communicate the IP, how can you work around that?16:21
matelfungi: How can I reach the instance from the outside?16:22
*** dannywilson has joined #openstack-infra16:22
fungimatel: oh, so the problem is not that you need that interface for internal communication between the cubcomponents you're testing, but rather that you need some mechanism to set a static ip address in the system so that it can configure the interface?16:23
*** Longgeek has quit IRC16:23
fungis/cubcomponents/subcomponents/16:24
matelYes, I need to know that IP to configure the interface - so that the system is accessible.16:24
fungiso this takes us back to rackspace's file injection or dhcp in hpcloud (hpcloud is dhcp right?)16:24
matelfungi: DHCP or config drive would be nice.16:25
clarkbyes hpcloud is dhcp16:25
*** timrc-afk is now known as timrc16:26
mordredisn't rax also dhcp now?16:27
mordredif you set it properly?16:27
mordredlike, if you set the image to be a non-agent instance16:27
mordredlike we are doing for our other dib instances16:27
clarkbyou can do that?16:28
mordredthose glance meta params16:28
matelThat would be excellent.16:28
clarkbwe havent dibed anything in rax yet16:28
mordredone of them informs rackspace that you are not going to be running the rackspace agent on this image16:28
clarkbmordred I think theory is yes maybe16:28
clarkbreality is who knows :)16:28
mordredso - at that point, the choices would be config-drive or dhcp16:28
matelAnyone from rax here?16:28
mordredboth of which are available to us16:28
mordredso - I think this is simply going to be a matter of trying some things16:29
mordredwhich I'm willing to do16:29
johnthetubaguymatel: you called?16:29
*** achanda has quit IRC16:29
*** armax has joined #openstack-infra16:29
mateljohnthetubaguy: does rax support DHCP / config drive?16:29
clarkbmordred but we are close to rax dib so should know soon16:29
mordredclarkb: ++16:29
johnthetubaguymatel: you can't inject networking via config drive right now, certainly no DCHP support16:29
mateljohnthetubaguy: thanks16:29
mordredjohnthetubaguy: so the _only_ way to inject networking is via nova-agent?16:29
fungimatel: it does support config drive... i just mounted /dev/xvdd on one of my rax instances and am poking around inside it out of curiosity16:29
openstackgerritJames Polley proposed openstack-infra/devstack-gate: Add os-net-config to the list of packages we clone  https://review.openstack.org/13681116:30
johnthetubaguymatel: the plan is to do config drive network injection soon ish, but I can't promise anything, its used by on metal, but not VMs right now16:30
*** mudassirlatif has joined #openstack-infra16:30
mordredjohnthetubaguy: but what about for custom images?16:30
johnthetubaguyfungi: it does config drive, but we don't but the correct network config in there right now16:30
fungimatel: any way we could install nova-agent into the xenserver image?16:30
johnthetubaguymordred: nope, its to do with how we set the OVS rules, sadly16:31
mordredzomg16:31
johnthetubaguyquite16:31
mordredso we're going to have to install nova-agent into our dib instances?16:31
clarkbapparently16:31
* mordred smashes head against wall16:31
* mordred screams16:31
* mordred throws things16:31
* mordred cries16:31
matelfungi: I can install something that read config drive, yes16:32
johnthetubaguyyeah, I mean you can try without the agent, using the image property use_xenapi_agent = False16:32
* mordred resigns himself to a world where he can't have nice things16:32
johnthetubaguyand we do attempt to inject the networking16:32
johnthetubaguybut I am told it doesn't work, but I have not tested it myself16:32
mordredjohnthetubaguy: so you do file injection when we do use_xenai_agent = False16:32
fungiclarkb: so it's working for a variety of our images now... do we explicitly install nova-agent into them, or do ubuntu et al include it preinstalled and running in their official base images?16:32
johnthetubaguymordred: no file injection, its just injecting network data into config drive16:32
matelmordred: config drive has all the details, why would we go for file inject?16:32
clarkbfungi it would be explicut likely a dib element16:33
mordredfungi: it's installed into the rax base images by default16:33
johnthetubaguymordred: I have a feeling those compute nodes don't have the template file in place16:33
*** emagana has quit IRC16:33
johnthetubaguythe one to generate the interfaces file16:33
fungimordred: though we're not necessarily using the rax images when we dib16:33
mordredfungi: we arent' dib-ing on rax yet16:33
clarkbfungi we dont dib rax yet16:33
fungiohhhhh16:33
clarkbbecause there are about 100 things broken about it16:33
fungifor some reason i thought we were booting from dib on rax already16:33
*** emagana has joined #openstack-infra16:33
mordrednot yet16:33
fungibut right, there's also the glance issue16:33
mordredso - this problem is quite literally the next one on my plate16:34
mordredso I'll poke at options16:34
mordredand let everyone know16:34
clarkbmordred can you review my nodepool changes then?16:34
mordredyup16:34
openstackgerrityolanda.robla proposed openstack-infra/storyboard: Add API call to return task statuses  https://review.openstack.org/13522116:34
johnthetubaguyfungi: whats the glance issue?16:35
fungimatel: so, yes, this is something we basically need to solve for anything we dib on rax, and perhaps as a result a much simpler and less fragile xenserver image build process could leverage the same solution16:35
*** mrmartin has quit IRC16:35
*** mrmartin has joined #openstack-infra16:35
*** ashaeron has quit IRC16:35
matelfungi: We can avoid the reboot, if I use the config drive to configure the xenserver image. Question is: can I build / re-shape a xenserver image with DIB?16:36
matelfungi: The image layout of a typical installation looks like: first partition with dom0's filesystem, second partition is an ext3, and the disk images are on that.16:36
johnthetubaguymatel: ah, I forgot about XenServer not having access to xenstore, I remember now, yuck16:37
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659716:37
clarkbglance endpoint is wrong iirc16:37
fungijohnthetubaguy: i think it had to do with the16:37
mordredclarkb: I'll review yours if you review mine :)16:37
fungiyeah what clarkb said16:37
clarkband vpv images are bad16:37
mordredclarkb: it's not glance endpoint16:37
mordredclarkb: it's version16:37
clarkb*vpc16:37
fungiv1 vs v2 discrepancy. it reports as one but uses the other, right?16:37
mateljohnthetubaguy: that's it16:37
mordredfungi: it reports NEITHER16:37
*** emagana has quit IRC16:37
mordredand glanceclient defaults to the other value16:37
mordredthis is because glance does not report versions16:38
fungioh, and fallback is to the wrong api version16:38
mordredand expects the user to explicitly tell it16:38
mordredyah16:38
johnthetubaguyfungi: hmm, that was news to me...16:38
johnthetubaguyfungi: sounds like an upstream glance bug? or am I missing something?16:38
mordredI've got the overrides in shade for reference if we need it16:38
mordredjohnthetubaguy: it's that upstream glance does an evil thing - in this case, it is not rax fault16:38
sdaguemordred: flaper87 is fixing some of those glance client terribles16:38
*** tsg_ has joined #openstack-infra16:38
mordredsdague: yes, thank god16:38
flaper87o/16:39
mordredbut it still won't fully help if glance doesn't put a version in the service catalog16:39
johnthetubaguymordred: OK, cool, needs fixing either way, but OK16:39
flaper87mordred: sdague please, send them all my way16:39
* mordred hands flaper87 a beer16:39
* mordred stands on mountain top and shouts "I should not have to know the API version the server is running in advance"16:39
flaper87and btw, in name of all the people that haven't done this (and probably won't ever do it), I want to say I'M SO FUCKING SORRY FOR SUCH A TERRIBLE AND INCONSISTENT API16:39
mordredflaper87: thank you for working on it16:40
fungiout of curiosity how does hpcloud work around it?16:40
mordredfungi: what do you mean?16:40
clarkbthey accpet the fallback version likely16:40
*** vryzhenkin has quit IRC16:40
mordredfungi: hpcloud is running the version that glance defaults to16:40
fungiaha16:40
mordredso it's the same problem16:40
fungiright, that16:40
mordredit just happens to fail open16:40
clarkbso rax could fix this16:40
clarkbbut its still a bug16:41
mordredyah16:41
fungi"fix" it by using v1 i guess?16:41
clarkbya but as a client I dont care :)16:41
clarkbI just was images16:41
*** emagana has joined #openstack-infra16:41
johnthetubaguyfungi: oh, right we only expose v2 as we need protected properties16:41
mordredwhat clarkb said16:41
*** david-lyle_afk is now known as david-lyle16:41
mordredjohnthetubaguy: which is fine - it's that this version should be in the service catalog - which for bonghits reasons it's not16:42
clarkbbut in addition to that we need vpc images that work16:42
mordredclarkb: what's the vpc issue?16:42
clarkbmordred we have to support all that in nodepool and I had to patch dib16:43
fungioh, right, so in my personal scripts i shadow openstackclient's image calls and pass them off to glanceclient with OS_IMAGE_API_VERSION and OS_IMAGE_URL overridden to work around it16:43
clarkband we may not get images that resize properly16:43
matelfungi: The image layout of a typical installation looks like: first partition with dom0's filesystem, second partition is an ext3, and the disk images are on that in vhd format - can we build such images with DIB?16:43
clarkbmordred it isnt clear to me if qemu-img does the right thing there16:44
clarkbbut one step at a time16:44
clarkbalso those images are massive...16:44
clarkb~5GB16:44
fungimatel: i'm wondering if that would need multiple dib runs (one to manage the dom0 filesystems and one for modifying the inner vhd images)16:45
mordredclarkb: questions ... a) do we care about resize b) does it help if we go AMI/AKI/ARI format instead of all-in-one?16:45
fungimatel: the vhd images are the ones which need altering before dom0 boots?16:46
clarkbmordred a-yes because we use / b-no idea16:46
clarkbmordred rax docs say use vpc16:46
fungimatel: or are they something which could be provided as-is?16:46
clarkbunsure if we have any other options16:46
ddieterlynot sure if this is the right place to ask this... is zuul sick?16:46
mordredddieterly: it is ALWAYS the right place to ask that question16:46
ddieterlyjobs seem to be stacking up16:46
ddieterlymordred: oh, good16:46
matelfungi: no, dom0 (the first partition), but I guess we want to have a basic ubuntu/centos for devstack, and that goes to the vhd.16:47
mordredddieterly: we may just be busy - looking16:47
mordredddieterly: we seem to just be busy16:47
*** alexpilotti has joined #openstack-infra16:47
ddieterlymordred: ok, thanks for checking16:47
fungimatel: but we wouldn't need to create/modify the vhd images during the dom0 image creation? in which case we can just treat them as normal files and reuse them if they're already present on the base image we're starting from or download them and add them if necessary16:48
*** mudassirlatif has quit IRC16:48
mordredclarkb: ^^ we do have a rather small number of nodes in nodepool - known problem?16:48
jogoclarkb: if you have a few minutes I can use some help debugging https://review.openstack.org/#/c/136596/416:48
* tchaypo wonders what gymnastics we have to do in order to check things like https://review.openstack.org/#/c/136811/16:48
jogonot sure what went wrong16:48
clarkbmordred not known to me16:48
fungimordred: clarkb: ddieterly: zuul says it's aware of around 250-300 jobs currently running in progress16:49
matelfungi: yes, you can do that.16:49
fungimordred: clarkb: ddieterly: with ~1000 pending16:49
ddieterlyfungi: ok16:50
fungialso nodepool seems to have decided it should boot a bunch of additional nodes to help with the demand, but they're not done being added yet16:50
*** davideagnello has joined #openstack-infra16:51
openstackgerritThierry Carrez proposed openstack-infra/release-tools: Add autokick.py  https://review.openstack.org/13682016:51
mtreinishjogo: what exactly did you think you broke there? Because it looks like a couple of weird things happened, like largeops was running all the tests16:51
jogomtreinish: oh that is intentional16:52
jogomtreinish: https://review.openstack.org/#/c/136596/4/devstack-vm-gate.sh,cm16:52
fungialso the pending changes in the post pipeline (as well as the sparkline for the merge-check pipeline) show quite a number of changes getting merged out of the gate as well16:52
mtreinishand stable failures, although i'm not sure why from a quick glance16:52
*** arxcruz has quit IRC16:52
mtreinishjogo: oh, yeah it's because this is on top of the test patch16:52
jogoso the patch should be adding ssh-hostkeys by hostname16:53
sdagueclarkb: can I get a +A on this - https://review.openstack.org/#/c/136795/ ?16:53
jogomtreinish: but in http://logs.openstack.org/96/136596/4/experimental/check-tempest-dsvm-aiopcpu/7def73d/console.html#_2014-11-23_02_37_26_672 it isn't working16:53
*** AlexF has quit IRC16:54
*** afazekas has quit IRC16:54
*** JayJ has joined #openstack-infra16:54
*** e0ne has quit IRC16:55
jogomtreinish: looks like its not able to resolve the name of the second node (slave)16:55
jogoeven though /etc/hosts contains that information16:56
mordredclarkb: it' nodepool patches from you I need to look like?16:56
mordredlook at16:56
*** otherwiseguy has quit IRC16:57
*** bhunter71 has joined #openstack-infra16:58
clarkbmordred: ya let me dig them up16:58
clarkbthey are older and may need rebasing or other love16:59
*** isaacb has quit IRC16:59
clarkbmordred: https://review.openstack.org/#/c/130878/ definitely needs a new commit message. it tests and fixes that behavior not just tests it. https://review.openstack.org/#/c/126747/ is the change to allow us to use both qcow2 and vpc images16:59
*** nikhil_k is now known as nikhil_k|vacay17:00
mordredclarkb:126747 lgtm - rebase and I'll get the +2 on there17:00
clarkbcool doing that now17:00
* clarkb notes the error rates as reported by nodepool are non trivial right now17:01
clarkbmay explain the lack of test nodes17:01
clarkb*cloud error rates17:01
*** sarob has joined #openstack-infra17:01
mordredclarkb: has_snapshot and has_image to me read like they should return boolean17:02
pleia2good morning17:02
*** MaxV has quit IRC17:02
mordredjust, fwiw17:03
mordredmorning pleia2 !17:03
openstackgerritClark Boylan proposed openstack-infra/nodepool: Support multiple image formats in a diskimage  https://review.openstack.org/12674717:03
clarkbmordred: ^17:03
clarkbmordred: ya I think renaming those vars was suggestined17:03
clarkb*suggested17:03
clarkbmordred: something like snapshot_list and image_list?17:03
clarkbI will refresh my memory on that code and can update that17:03
*** chandankumar has joined #openstack-infra17:04
mordredclarkb: yeah17:04
mordredor just snapshots and images17:04
mordredbut I dont' _really_ care strongly17:04
clarkbwell I need a new patchset regardless so can rename to something better17:04
*** MaxV has joined #openstack-infra17:05
*** teran has quit IRC17:05
clarkbbut with these two changes on top of nodepool trusty server we should be able to start rolling on dib again17:05
clarkbthere was also ianw's change to support glance meta vars but I think that merged17:05
clarkbyup 34335b5fbff40c0129ac641aac4179a2275ee33817:06
mordredclarkb: did the dib changes land then?17:06
clarkbmordred: yes latest dib has my change and ghe's change in it17:06
clarkband that is what is installed on new nodepool17:06
mordredcool17:06
clarkbI might've helped greghaynes do that release in a bar >_>17:06
greghaynes:)17:06
mordredperfect. what could possibly go wrong17:07
*** davideagnello has quit IRC17:08
mordredclarkb, fungi: btw - if you want to laugh - look at the section starting at line 90 here: https://review.openstack.org/#/c/136597/7/elements/centos-minimal/root.d/08-rinse17:10
mordreddtroyer: fwiw, this ^^ fixed my problem with centos yesterday17:10
* dtroyer is again glad OpenWRT exists…17:12
mordredclarkb: dib_image.filename + image_type17:12
mordredclarkb: that seems like it's going to make a strangely named file17:12
fungidtroyer: because ddwrt is so awfully assembled from entirely non-free blobs?17:13
*** radez_g0n3 is now known as radez17:13
dtroyerfungi: because it doesn't have crowd-following Insanity as a Service17:13
mordredclarkb: also, I'm not sure I see where you compose a filename from filename and image_type17:14
fungidtroyer: that, definitely17:14
mordreddtroyer: maybe we shoudl start using openwrt as our base os17:14
clarkbmordred: on the has_diskimage front I think renaming it has helped me find a bug \o/17:14
dtroyermordred: don't think I haven't started that already17:14
*** emagana has quit IRC17:15
clarkbmordred: dib will take a non suffixed file name then add the file suffix for each file type it writes out17:15
sdaguemordred: so... there is a reason why the project I wrote years ago to do the same thing as dib is something that got abandoned :)17:15
clarkbmordred: so when we call dib we use it without a suffix17:15
*** emagana has joined #openstack-infra17:15
clarkbso filename there is dibs concept of a filename17:16
mordredclarkb: ah, ok. cool. thanks17:17
mordredsdague: :)17:17
dhellmannit looks like the oslo-messaging-release group is empty in gerrit, could I get someone to add me, please? https://review.openstack.org/#/admin/groups/463,members17:17
*** amuller has quit IRC17:18
sdague... if zuul remains with only 150 active nodes all day... it's going to be a long day. I wonder why it can't seem to get beyond that17:18
openstackgerritMichael Krotscheck proposed openstack-infra/storyboard-webclient: Enable HTTP Caching on resources.  https://review.openstack.org/13614917:18
sdaguedid we lose a cloud?17:19
fungiFailed to fetch http://security.ubuntu.com/ubuntu/dists/trusty-security/main/i18n/Translation-en  Hash Sum mismatch17:19
mordredfungi: that's the saddest thing I've ever heard17:19
jogomtreinish: any ideas?17:19
*** tonytan4ever has quit IRC17:19
fungithis is not shaping up to be a high-throughput day17:19
*** otter768 has joined #openstack-infra17:20
mordredfungi, jeblair: oh - btw - I found a cantrip for telling apt to not grab translations files17:20
*** emagana has quit IRC17:20
clarkbsdague: cloud error rates are high17:20
mordredmaybe we should put it on our stuff, since those are always goign to just be overhead17:20
sdagueit's like there are no bare nodes in play17:20
*** ala_ has quit IRC17:20
clarkbsdague: across the board17:20
*** winston-d_ has joined #openstack-infra17:20
sdagueclarkb: ok, fun17:20
*** alexpilotti has quit IRC17:20
funginodepool seems to be aware of 8 bare nodes in use17:22
fungiwith 4 building17:22
jeblairclarkb: both clouds?17:22
fungioh, and 8 ready17:22
clarkbjeblair: ya the graphs mordred had made seem to imply that17:22
clarkbI haven't looked at nodepool logs yet17:22
winston-d_jeblair: hi, a quick question about gertty search & local check-out/cherry-pick functions. these don't seem to work on my Mac.17:23
sdaguefungi: are there any bare-trusty?17:23
fungi205 devstack nodes in use, 93 ready, 61 being deleted and 1 building17:23
sdaguethe only ones I can see look like bare-precise on chef jobs17:23
*** koolhead17 has quit IRC17:24
fungisdague: for bare-trusty we have 2 building, 4 ready and 5 in use17:24
mtreinishjogo: hmm, not sure I do. I see where it adds the hostname to /etc/hosts right above ssh-keyscan failure17:24
*** otter768 has quit IRC17:24
*** koolhead17 has joined #openstack-infra17:24
clarkbfungi: hrm we should have bare-trusty images available but maybe we don't and are hitting quota issues for that type?17:24
fungiclarkb: i'm hunting in logs17:25
mtreinishjogo: I guess I would suggest catting things (like /etc/hosts) and adding status checks around the failed call to make sure the system state is what you think it is17:25
fungiclarkb: but usually quota issues cause huge numbers to show up in building17:25
jeblairwinston-d_: that's strange.  i don't have a mac to test with.  i know there was recently a new release of gitpython, if you installed it within the last week or so, perhaps it is behaving differently.  (i haven't tested that)17:25
*** ivar-lazzaro has joined #openstack-infra17:25
*** mudassirlatif has joined #openstack-infra17:25
fungiclarkb: we've got tracebacks from the DiskImageBuilderThread by the way17:26
*** jpich has quit IRC17:26
fungilikely not related to our node starvation issues however17:26
*** ivar-lazzaro has quit IRC17:26
*** koolhead_ has joined #openstack-infra17:26
*** koolhead17 has quit IRC17:27
winston-d_jeblair: hmm, this version of gertty has been installed for more than one month, works pretty well by the way. :) let me grab a Linux VM and test those functions.17:27
*** ivar-lazzaro has joined #openstack-infra17:27
openstackgerritClark Boylan proposed openstack-infra/nodepool: Allow labels to have snapshot and dib images  https://review.openstack.org/13087817:27
clarkbmordred: ^ renamed the bug I thought was a bug wasn't really and just needed a better comment so it has that now17:27
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659717:28
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add debootstrap and rinse to nodepool  https://review.openstack.org/13659817:28
openstackgerritMonty Taylor proposed openstack-infra/system-config: Make apt skip grabbing translations  https://review.openstack.org/13683717:28
clarkbfungi: yes building f21 images17:28
mordredclarkb: cool17:28
clarkbfungi: which is why I just want to rip out f21. https://review.openstack.org/#/c/136534/ does that match up with what you are seeing?17:28
mordredclarkb, fungi, jeblair: may or may not help, but https://review.openstack.org/136837 may cut down on the number of files we try to grab from the internets17:28
fungiclarkb: oh, fun! looks like maybe we have issues with at least one image in ord... it tried to nova boot with an image there that returned http 400 (image is not active)17:28
mordredfungi: whee!17:28
jeblairmordred: i think you forgot a file in the commit17:29
fungiclarkb: ahh, yes, f21 is the cause for the diskimage tracebacks17:30
fungiclarkb: too bad i'm the only +2 on that change so far17:31
*** isaacb has joined #openstack-infra17:31
jeblairi'm reviewing17:31
fungiclarkb: i'm going to delete image ba4c3302-1e18-460b-841a-c7cdbd5ea8d3 in rax-ord (it's the only one throwing this http 400)17:32
clarkbfungi: ok17:32
clarkbfungi: is that a bare-trusty image?17:32
fungiseems to be a devstack-trusty node, so probably not responsible for the bare images shortage17:32
jeblairclarkb: 136534 looks like a nodepool bug in the traceback; do you understand it?17:32
clarkbjeblair: sort of. exec is complaining that the data being passed in the env var is not of a byte type17:33
clarkbor string in this case because python217:33
mordredjeblair: BAH17:33
clarkbjeblair: I do not know why the vars used for f21 are different than the vars used to override tmpdir and cachedir locations17:33
jeblairclarkb: it looks like yeah.... that :)17:33
clarkbjeblair: likely has to do with yaml and its string types17:33
openstackgerritMonty Taylor proposed openstack-infra/system-config: Make apt skip grabbing translations  https://review.openstack.org/13683717:34
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659717:34
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add debootstrap and rinse to nodepool  https://review.openstack.org/13659817:34
mordredjeblair: sorry. there you go17:34
*** dims has quit IRC17:35
mordredclarkb: btw - my ubuntu-minimal images are 245M17:35
*** dims has joined #openstack-infra17:35
*** rushiagr is now known as rushiagr_away17:36
mordredclarkb: I have not yet tried running the nodepool elements on top of it - I imagine most of the 5G is actually the data cache17:36
fungiclarkb: jeblair: also we seem to have maybe broken the nodepool cli's image-delete subcommand17:36
fungitaking a closer look17:37
*** AlexF has joined #openstack-infra17:37
*** AlexF has quit IRC17:37
jeblairclarkb:  repr(d['diskimages'][-1]['env-vars'])17:37
jeblair"{'BASE_IMAGE_FILE': 'Fedora-Cloud-Base-20141029-21_Beta.x86_64.qcow2', 'DIB_IMAGE_CACHE': '/opt/dib_cache', 'DIB_CLOUD_IMAGES': 'http://download.fedoraproject.org/pub/fedora/linux/releases/test/21-Beta/Cloud/Images/x86_64/', 'TMPDIR': '/opt/dib_tmp'}"17:37
jeblairthat looks sane to me17:37
clarkbmordred: it is, but the big annoyance is qcow2 is compressed and comes in under 3GB. vpc is not and is just bad17:37
clarkbjeblair: ya17:37
*** emagana has joined #openstack-infra17:38
jeblairclarkb: i agree this needs further offline debugging.  +3ing.17:38
*** dims has quit IRC17:40
jogomtreinish: hmm good idea, I tried things locally.  but adding status checks is a good idea17:40
fungiahh, nope, image-delete still works. for some reason it's just this image uuid being a pain17:40
clarkbfungi: as a temporary measure maybe start building a new image of that type in ord17:41
clarkbnodepool should seamlessly switch to using it once it is done building17:41
fungifor some reason snap_image in _deleteImage ends up being None for this one17:41
fungiso maybe nodepool got a success result for the snapshot action but then rax disappeared it out from under us and the nova image list is now lacking it17:42
fungichecking that theory now17:42
*** shashankhegde has joined #openstack-infra17:43
fungiand as suggested i've started an image-update of the same label+provider now in case this takes longer to clean up17:43
*** otherwiseguy has joined #openstack-infra17:43
*** mikedillion has joined #openstack-infra17:44
winston-d_jeblair: well, tried on Linux, search/check out worked. And I did a uninstall/install gertty on Mac, still the same.17:45
fungifwiw, looks like it retried a few dozen times before finally getting a response from the api endpoint17:45
jeblairwinston-d_: can you file a bug about that on storyboard.openstack.org ?17:46
*** andreykurilin_ has joined #openstack-infra17:46
jeblairwinston-d_: and any help you can provide toward diagnosing it or fixing it would be great :)17:47
winston-d_jeblair: sure, but let me confirm with someone else is using gertty on Mac first.17:47
clarkbok I think https://review.openstack.org/#/c/130878 and https://review.openstack.org/#/c/126747 are ready for review now (though still waiting on zuul to +1 each of them)17:48
clarkbbut those are next step in dibification17:48
*** mpaolino has quit IRC17:48
*** dims has joined #openstack-infra17:48
*** rushiagr_away is now known as rushiagr17:50
openstackgerritSean Dague proposed openstack-infra/project-config: remove nova pylint  https://review.openstack.org/13684617:51
jeblairclarkb: ack will review17:51
*** AlexF has joined #openstack-infra17:51
*** harlowja_away is now known as harlowja17:52
dhellmannfungi, jeblair : it looked like you were in fire-fighting mode when I asked before. When things calm down, could one of you add me to the oslo-messaging-release group in gerrit, please? It's completely empty somehow, which will prevent us from releasing next week.17:53
ddieterlyi'm still unable to see any progress with jobs in zuul17:53
fungiokay, after confirming nova image-list did not know about the offending snapshot, i removed its row from the snapshot_images table17:53
*** mikedillion has quit IRC17:54
jeblairdhellmann: on it17:54
*** shashankhegde has quit IRC17:54
fungidhellmann: done17:54
dhellmannjeblair: thanks, no rush if you guys still have an issue17:54
jeblairdrat :)17:54
dhellmannfungi: thanks!17:54
dhellmannheh17:54
jeblairdhellmann: confirmed that fungi did it! :P17:55
dhellmannjeblair, fungi : thanks! I've added the rest of the folks we need in that group so we're all set now.17:55
*** mpavlase has quit IRC17:55
ddieterlylooks like jobs just keep piling up and jobs launched per hour is down17:55
fungiwe're getting lots of node launches in error state in rax-dfw17:55
fungithough https://status.rackspace.com/ suggests it should be smooth sailing17:56
fungiooh, here's a new one... ERROR nodepool.GearmanClient: Exception while listing functions [...] TimeoutError17:57
fungimaybe it's having trouble talking to zuul to determine demand?17:57
*** tsg_ has quit IRC17:57
*** bhunter71 has quit IRC17:59
jeblairoh18:00
jeblairis the old nodepool server still around?18:01
*** sarob has quit IRC18:01
*** mpaolino has joined #openstack-infra18:01
openstackgerritJoe Gordon proposed openstack-infra/devstack-gate: Set up ssh_known_host based on hostname  https://review.openstack.org/13659618:01
clarkbjeblair: yes but nodepool shouldn't be running on it18:01
jeblairanyone have an ip handy?18:01
clarkbya one sec18:01
fungigetting it18:01
*** emagana has quit IRC18:01
clarkb192.237.211.9118:01
mordred192.237.211.9118:02
mordredblast18:02
mordredclarkb beat me18:02
fungithat matches what i looked up too18:02
openstackgerritMerged openstack-infra/system-config: Revert "Initial Fedora 21 nodepool disk-image creation"  https://review.openstack.org/13653418:02
mordred2001:4800:7813:0516:3bc3:d7f6:ff04:b86318:02
mordredif you want ipv618:02
*** sweston_ is now known as sweston18:02
*** bhunter71 has joined #openstack-infra18:02
*** emagana has joined #openstack-infra18:02
funginodepoold is definitely not active on iot18:02
fungiit18:02
clarkboh hrm did iptables not apply like I thought they would on zuul.o.o?18:02
*** tonytan4ever has joined #openstack-infra18:02
*** prad has quit IRC18:03
clarkbthey must've otherwise nodepoold would've hung at startup like it was doing previously18:03
clarkbI can telnet from new nodepool to zuul over 4730 so that isn't the issue18:03
*** e0ne has joined #openstack-infra18:04
*** derekh has quit IRC18:05
*** davideagnello has joined #openstack-infra18:05
jeblairfungi: does that timeout happen a lot?18:05
*** jlibosva has quit IRC18:06
*** emagana has quit IRC18:06
*** tsg_ has joined #openstack-infra18:07
fungijeblair: 3 times today since the log was rotated18:07
*** ci-testing_ has quit IRC18:07
*** cpowell has quit IRC18:07
fungioh, the log was rotated at 17:3918:07
fungiso that's 3 times in half an hour18:08
fungior about every 10 minutes18:08
*** isaacb has quit IRC18:08
*** mpaolino has quit IRC18:08
fungialso i think we need to roll back the change that increased rotation frequency18:09
*** cpowell has joined #openstack-infra18:09
fungioh, nevermind18:09
fungithe 3 days of history is because this server is 3 days old ;)18:10
ddieterlythe check queue on zuul is continuing to grow. is anyone looking into that?18:10
fungiddieterly: it's the entirety of what we're discussing in here18:10
ddieterlygreat, thanks18:11
jeblairi'm trying to figure out if this is related to the async io problems in geard (we raised the timeout on the zuul server, and there's a patch up to improve geard)18:11
jeblairfungi: however, as long as it isn't failing all the time, it should work well enough18:11
matelfungi: So with DIB in nodepool, do you expect DIB to build a node, that has repos cached, etc, or just a base operating system?18:12
fungimatel: build a node with cached repos et cetera, starting from a base operating system image of whatever origin we want18:13
jeblairhuh18:13
matelfungi: so the starting point could be a xenserver image, which is presented in a qcow2 image format?18:13
jeblairthe old nodepool server has an older version of gear, though i doubt the differences would account for that error18:14
fungimatel: that's what i was suggesting, yes18:14
*** groknix has quit IRC18:14
*** groknix has joined #openstack-infra18:15
*** bhunter71 has quit IRC18:15
fungithe node graph is showing an interestingly regular building hysteresis18:15
matelfungi: I can provide such an image, but the node installation has to happen "inside" the vhd file, which is sitting in the image's filesystem.18:16
*** bhunter71 has joined #openstack-infra18:16
*** Bobba is now known as BobBall_AWOL18:16
fungimatel: what is "node installation" in this sense?18:16
jogoboth aiopcpu tests for a patch (nova-network and neutron) just failed with the same error: https://jenkins02.openstack.org/job/check-tempest-dsvm-neutron-aiopcpu/32/console18:16
matelfungi: all the openstacky stuff, cached repos, etc18:17
winston-d_jeblair: ok, jgriffith helped me confirm that 'search' doesn't work for him on Mac neither. I'll create a bug on storyboard.18:17
jogohttps://jenkins06.openstack.org/job/check-tempest-dsvm-aiopcpu/26/console18:17
matelfungi: so what could be possible is to have DIB build a VM - as if it was not inside xenserver, and produce say a vhd. This image, let's call it domU image would have all the openstacky bits18:17
fungimatel: ahh, any way that could be presented from the dom0? e.g. via hostfs or something?18:17
matelfungi: so what you are looking for is a way to mount domU's filesystem, given a full xenserver image, right?18:18
jogoclarkb fungi:^18:19
fungimatel: as a possible workaround, yes18:19
fungiokay, the image rebuild in rax-ord finally completed18:19
sdaguematel: hey, xenserver logs, where do you put your console.html when things fail, because I don't see that in the reports it's posting18:19
matelsdague: could you give me a log dir url?18:20
sdaguehttp://dd6b71949550285df7dc-dda4e480e005aaa13ec303551d2d8155.r49.cf1.rackcdn.com/22/136822/1/32840/results.html18:20
jeblairfungi: i think we're seeing the periodicity of the nodepool main loop18:20
fungijeblair: that's what i wondered18:20
*** patrickeast has joined #openstack-infra18:20
jeblairfungi: https://etherpad.openstack.org/p/magzsGkQrX18:20
*** ci-testing_ has joined #openstack-infra18:21
fungijeblair: that looks like about the same frequency, yes18:21
matelfungi: In theory, you can mount the xenserver image's second partition as an ext3 to your system, and mount the vhd file from that. - I need to look at how to properly mount vhd files though - would that work?18:21
winston-d_jeblair: not sure if this is the right format, but here's the story for gertty 'search' bug: https://storyboard.openstack.org/#!/story/200002418:21
fungimatel: yeah, that would be another possible solution18:21
matelsdague:looking at it18:22
jeblairfungi: updated etherpad18:22
*** berendt has quit IRC18:23
fungijeblair: Demand from gearman: bare-trusty: 54218:23
fungiet cetera18:23
fungilooking in the debug log18:23
clarkbjeblair: so its hanging in the mail loop somewhere?18:23
matelsdague: I guess this job did not run any tests, and you're interested why is that?18:23
fungiso it does seem to be finding the demand with numbers which match what we're seeing in the zuul status18:24
jeblairwinston-d_: updated18:24
fungidevstack-trusty demand seems to be roughly equal to bare-trusty demand at this point18:25
*** AJaeger has joined #openstack-infra18:25
fungiaccording to the log18:25
jeblairfungi: yep.  though we went from 17:44 to 18:00 with no demand info18:25
jeblairbecause of the timeouts18:25
matelsdague: I would expect run_tests.log to contain those bits, see this: http://dd6b71949550285df7dc-dda4e480e005aaa13ec303551d2d8155.r49.cf1.rackcdn.com/98/134598/1/31652/run_tests.log18:26
AJaegerHi!one more strange thing in case you haven't noticed: we have a periodic-stable job since 36 hours in the queue ;(18:26
*** andreykurilin_ has quit IRC18:26
fungibut also the number of devstack-trusty nodes reported in existence is ~20x the number of bare-trusty nodes18:26
jeblairfungi, clarkb: should we try downgrading gear to the old version?18:26
*** andreykurilin_ has joined #openstack-infra18:26
matelfungi: Let me find a way to mount that partition - will get back to you tomorrow.18:26
jeblairfungi: i think that running without demand for a cycle will mess up the round robin allocator18:26
*** MaxV has quit IRC18:27
fungiahh18:27
fungithis is likely the case18:27
jeblairthe old allocator did not have that problem, but the new one maintains state18:27
clarkbjeblair: its a reasonably simple thing to try so +2 from me18:27
*** chandankumar has quit IRC18:27
jeblairit was 0.5.218:27
fungiyeah, downgrade gear and restart nodepool i guess18:27
fungiworth a shot18:27
jeblairclarkb: can you do that?  i'm trying to dig into it further from another angle18:27
*** melwitt has joined #openstack-infra18:28
clarkbya I can do that18:28
* clarkb starts now18:28
jeblairianw: ping18:28
matelsdague: did that help?18:28
jeblairin the long run, we can't depend on things always working, so we need the allocator to not behave that way18:28
clarkbgear is downgraded. restarting nodepool now18:29
viscioushas anyone been looking at why the postgres tests are failing in stable jobs?18:29
*** viscious is now known as vishy18:29
vishydatabase "openstack_citest" is being accessed by other users18:29
clarkbdone18:29
*** teran has joined #openstack-infra18:29
fungijeblair: clarkb: i suppose in the long term a failure to query for demand should just no-op for that cycle rather than assuming all zero?"18:30
jeblairfungi: then we'll never build anything, even the min-ready18:30
*** mpavlase has joined #openstack-infra18:30
jeblair(if the network connection breaks)18:30
clarkbso we probably want to build to min ready then until we get data18:30
jeblairclarkb: that's what we do18:31
sdaguematel: well run_tests.log is completely missing anything useful here18:31
jeblairclarkb: the problem is that the current allocator says everything is satisfied for that round, and so the next round, with actual demand data, doesn't build as much as what you would expect18:31
*** shashankhegde has joined #openstack-infra18:31
jeblairat least, the proportion is wrong18:31
matelsdague: yes, it's strange.18:31
clarkbjeblair: oh because of state18:31
clarkbgotcha18:31
matelsdague: Do you have a change ref?18:32
openstackgerritKyle Mestery proposed openstack-infra/project-config: Add networking-odl project to StackForge  https://review.openstack.org/13685418:32
matelsdague: refs/changes/22/136822/1 i guess18:32
fungifollowing the nodepool restart, we've got more than 100 bare-trusty nodes building now18:33
*** pc_m has quit IRC18:34
*** mriedem has quit IRC18:34
*** erikwilson has quit IRC18:34
fungii guess if we stop seeing "Exception while listing functions" in the log after the 18:29 restart, then it's probably new gear18:34
*** erikwilson has joined #openstack-infra18:35
*** wenlock has joined #openstack-infra18:35
jeblairyeah.  i'm running parallel tests to see if i can reproduce that behavior in each version18:35
*** zaro has joined #openstack-infra18:36
*** Ryan_Lane has joined #openstack-infra18:37
*** unicell has quit IRC18:37
*** signed8bit is now known as signed8bit_ZZZzz18:39
*** tgohad has joined #openstack-infra18:40
*** signed8bit_ZZZzz has quit IRC18:40
*** tsg_ has quit IRC18:40
*** marcusvrn1 has quit IRC18:40
fungino dice18:41
fungi2014-11-24 18:37:53,732 ERROR nodepool.GearmanClient: Exception while listing functions18:41
jeblairhuh18:41
jeblairso right around that time, both of my tests took 29.x seconds to return from a listing18:42
fungiwith the same traceback as before18:42
jeblairMon Nov 24 18:36:54 2014 29.199444055618:42
*** pblaho has quit IRC18:42
jeblairthough a timeout for an admin request should be 90 seconds18:42
fungidoes it time out at 30s?18:42
fungioh18:42
jeblairand they did not actually timeout18:42
*** erlon has joined #openstack-infra18:42
winston-d_jeblair: I did tried some other keybindings for gertty search, but no luck.18:43
*** emagana has joined #openstack-infra18:43
matelsdague: It looks as if gate_hook terminated the whole run - although I don't really understand why we don't have any output at all.18:43
clarkbjeblair: are you running your tests on a different host too?18:43
jeblairwinston-d_: what did you do?18:43
openstackgerritMerged openstack-infra/infra-manual: Added some initial content in Peer Review before ReviewChecklist link  https://review.openstack.org/10758818:43
jeblairclarkb: no, on the new nodepool.o.o18:43
*** signed8bit has joined #openstack-infra18:44
jeblairi can run it on the old nodepool as well18:44
clarkbjeblair: ya it may be worthwhile just to see if its isolated to new nodepool.o.o18:44
*** amcrn has joined #openstack-infra18:44
jeblairclarkb: we'll need to open the firewall on zuul i think18:44
clarkbjeblair: ya18:44
jeblairi'll do that manually real quick18:44
*** AlexF has quit IRC18:45
winston-d_jeblair: i changed '~/.gertty.yaml' and added a entry under 'keymap' with "change-search: 'ctrl i'"18:45
ekarlso-https://review.openstack.org/#/c/136624/ < anyone wanna sign off on that ?18:45
*** e0ne has quit IRC18:46
*** achanda has joined #openstack-infra18:46
fungior test from zm0X or jenkins0X?18:47
*** otherwiseguy has quit IRC18:47
sdaguematel: my guess is that your system isn't being careful with output buffering. We used to lose stuff like that in devstack18:47
*** mudassirlatif has quit IRC18:48
jeblairfungi: did the firewall thing18:48
jeblairthere's a bit of a heisenberg thing going on here; polling from a 2nd host slows it down considerably (probably because of the lack of async io handling)18:50
*** bhunter71 has quit IRC18:51
matelsdague: will look into that. Let's see if it does the same with the second patchset as well.18:51
*** bhunter71 has joined #openstack-infra18:51
clarkbthe gearman server log on zuul is empty18:51
jeblairfungi: i think we may be about to hit a timeout18:51
*** signed8bit has quit IRC18:52
jeblairthere it is18:52
*** tgohad has quit IRC18:53
*** marun has joined #openstack-infra18:53
*** HeOS has quit IRC18:53
*** achanda has quit IRC18:53
matelsdague: Failed at the same place...18:53
*** tsg_ has joined #openstack-infra18:53
*** achanda has joined #openstack-infra18:54
jeblairclarkb, fungi: my test script on both the old and new servers saw that18:54
fungiclarkb: we only temporarily enable that when we're trying to track down gearman-related issues because of verbosity, right?18:54
*** andreykurilin_ has quit IRC18:54
clarkbfungi: iirc its >DEBUG most of the time18:54
*** yfried_ has joined #openstack-infra18:54
fungiahh18:54
*** andreykurilin_ has joined #openstack-infra18:55
clarkbjeblair: I guess thats good news.18:55
clarkbat least doesn't point to new server being the only thing at play her18:55
jeblairi ran tcpdump on zuul for the last part of that; there seemed to be gearman related traffic in both directions18:55
*** winston-d_ has quit IRC18:58
jeblairthe server is currently logging at warning level19:00
openstackgerritDevananda van der Veen proposed openstack-infra/project-config: Update Ironic jobs post-graduation  https://review.openstack.org/12662719:00
*** mudassirlatif has joined #openstack-infra19:01
jeblairright now if i telnet to 4730, i can't run anything :/19:01
*** davideagnello has quit IRC19:01
*** davideagnello has joined #openstack-infra19:02
*** koolhead_ has quit IRC19:02
*** koolhead17 has joined #openstack-infra19:02
jeblairand zuul just dropped its gearman connection too19:03
*** emagana has quit IRC19:03
*** afazekas has joined #openstack-infra19:05
fungithe cacti graphs for zuul don't (or at least didn't a few minutes ago) look too bad19:05
openstackgerritSurojit Pathak proposed openstack-dev/hacking: Fixing broken while loop in imports.py  https://review.openstack.org/13651719:06
*** emagana_ has joined #openstack-infra19:06
jeblair[pid 23955] sendto(124, "build:gate-oslo.config-requireme"..., 55, 0, NULL, 019:06
jeblairzuul-serv 23954 zuul  124u  IPv4 58579682      0t0      TCP zuul.openstack.org:4730->nodepool.openstack.org:44312 (CLOSE_WAIT)19:06
*** mudassirlatif has quit IRC19:06
jeblairthat doesn't look great19:06
*** davideagnello has quit IRC19:07
*** mriedem has joined #openstack-infra19:07
*** koolhead_ has joined #openstack-infra19:07
*** koolhead17 has quit IRC19:07
*** bhunter71 has quit IRC19:07
*** jp_at_hp has quit IRC19:07
jeblairfungi: i'm wondering if one of the closing packets for that connection was dropped19:07
*** smoser has quit IRC19:07
*** rkukura has quit IRC19:07
*** rfolco has joined #openstack-infra19:08
*** davideagnello has joined #openstack-infra19:08
fungijeblair: entirely possible we've got some sort of packet loss, but fin should retry if no fin/ack comes back19:08
fungier, should retransmit19:08
* fungi is starting to lose his network engineering vocabulary19:09
*** emagana_ has quit IRC19:09
*** emagana has joined #openstack-infra19:09
jeblairfungi: there's no corresponding connection on the nodepool.o.o side19:09
clarkbya I just checked that19:09
*** bhunter71 has joined #openstack-infra19:10
clarkb104.130.155.213:41955 is the port nodepool claims to be connected from19:10
*** afazekas has quit IRC19:10
jeblairthat's established on both sides19:10
fungiideally if the fin is received but the fin/ack never arrives, then when the fin is retransmitted the receiving end should send an rst19:10
jeblairthere are 4 connections on zuul.o.o that are in close_wait19:10
*** signed8bit has joined #openstack-infra19:12
*** packet has joined #openstack-infra19:12
*** rfolco has quit IRC19:12
fungihrm. if the kernel on the zuul end is reporting close_wait then i believe that means it thinks the nodepool end may already be closed but the process at the zuul end is still holding the fd of the associated socket open19:12
*** andreykurilin_ has quit IRC19:13
*** sarob has joined #openstack-infra19:14
*** smcginnis has joined #openstack-infra19:14
jeblair19:11:26.735845 IP 162.242.150.96.4730 > 104.130.155.213.43882: Flags [P.], seq 4222912087:4222912144, ack 3780423436, win 114, options [nop,nop,TS val 1228423088 ecr 62825158], length 5719:14
jeblair19:12:21.071842 IP 162.242.150.96.4730 > 104.130.155.213.43882: Flags [P.], seq 0:57, ack 1, win 114, options [nop,nop,TS val 1228436672 ecr 62825158], length 5719:14
jeblair19:14:09.871860 IP 162.242.150.96.4730 > 104.130.155.213.43882: Flags [P.], seq 0:57, ack 1, win 114, options [nop,nop,TS val 1228463872 ecr 62825158], length 5719:14
jeblairthat's on the nodepool side19:14
jeblairthat's one of the close_wait connections on the zuul side19:14
fungioh, it's old nodepool19:15
fungier, nevermind, that is the new one19:15
*** gyee_ has joined #openstack-infra19:15
*** jcoufal_ has quit IRC19:15
clarkb104.130.155.213 is new nodepool19:15
jeblairwhew19:15
*** alexpilotti has joined #openstack-infra19:15
*** MaxV has joined #openstack-infra19:16
fungiso it's receiving packets from zuul for a connection zuul lists as being in close_wait except those don't look like they're associated with trying to close the socket19:17
jeblairyep, and they are continuing to trickle in19:18
*** ci-testing_ has quit IRC19:18
fungialso i had to try four times to load https://status.rackspace.com/19:18
clarkbhrm, could that be a bug in newer gear so restarting nodepool with older gear was a step in the right direction but not sufficient19:18
fungibut it's still showing all green19:18
clarkbexcept it is showing as close wait on zuul19:19
clarkbwhich implies it knows that connection went away?19:19
*** gothicmindfood has quit IRC19:19
*** rushiagr is now known as rushiagr_away19:19
*** koolhead_ has quit IRC19:19
*** prad has joined #openstack-infra19:20
*** koolhead17 has joined #openstack-infra19:20
*** otter768 has joined #openstack-infra19:20
*** gothicmindfood has joined #openstack-infra19:21
jeblair19:22:11.149857 IP zuul.openstack.org.4730 > nodepool.openstack.org.43882: Flags [P.], seq 4222912087:4222912144, ack 3780423436, win 114, options [nop,nop,TS val 1228584192 ecr 62825158], length 5719:22
jeblair19:22:11.150553 IP nodepool.openstack.org > zuul.openstack.org: ICMP host nodepool.openstack.org unreachable - admin prohibited, length 11719:22
jeblairthat's from the zuul side19:22
jeblairso zuul keeps sending that packet, and nodepool rejects it due to the iptables rules19:22
*** teran has quit IRC19:23
*** ci-testing has joined #openstack-infra19:23
*** amitgandhinz has quit IRC19:23
clarkbbecause it knows of no such connection19:23
jeblairyep19:23
*** amitgandhinz has joined #openstack-infra19:24
*** koolhead17 has quit IRC19:24
*** otter768 has quit IRC19:25
jeblairoh, is the problem that geard is not closing those sockets?19:25
fungithat's one way that you can end up with hung close_wait, yes19:25
*** timrc is now known as timrc-afk19:26
fungiso maybe geard is continuing to try to send on that socket even though the kernel thinks the other end has closed it19:26
jeblairoh19:26
jeblairone of the close_waits just went away19:26
fungiand there's the gearman log entry now19:27
*** alexpilotti_ has joined #openstack-infra19:27
fungithe file is no longer empty. plenty of tracebacks19:27
*** alexpilotti has quit IRC19:27
*** Ryan_Lane1 has joined #openstack-infra19:27
*** alexpilotti_ is now known as alexpilotti19:27
fungilots of broken pipes hit while sending19:27
*** Ryan_Lane has quit IRC19:27
clarkbzaro: the webui for editing acls is really broken on review-dev19:27
clarkbzaro: not a huge issue but worth pointing out19:28
fungiahh, that was from the incident where zuul had trouble communicating with its gearman service19:28
fungiall from around 19:1019:28
jeblairyep19:28
fungiso nothing reflecting the more recent sockets going away19:29
*** amitgandhinz has quit IRC19:29
*** amitgandhinz has joined #openstack-infra19:30
*** cpowell has quit IRC19:31
*** jistr has quit IRC19:31
jeblairso i think it recovered from that19:32
jeblairseems to be picking up demand again19:32
*** MarkAtwood has joined #openstack-infra19:33
*** johnthetubaguy is now known as zz_johnthetubagu19:34
jeblairnodepool seems to have a lot in 'ready' according to the graph19:34
jeblairi think maybe now we're waiting on zuul to catch up?19:35
clarkbya zuul has a ton of results to get through19:35
*** iax7 has joined #openstack-infra19:36
*** iax7 has quit IRC19:36
*** bo_sh has joined #openstack-infra19:37
*** bo_sh has left #openstack-infra19:38
openstackgerritElizabeth K. Joseph proposed openstack-infra/publications: Update tools and review purposes.  https://review.openstack.org/12872219:38
*** koolhead17 has joined #openstack-infra19:38
jeblairokay, now geard has decided to try to send a bunch of data to another close_wait19:39
jeblair(it's still one of the 4 from earlier)19:39
jeblairi suspect everything will stop again until it works through that19:39
*** SumitNaiksatam has quit IRC19:40
*** emagana has quit IRC19:40
*** SumitNaiksatam has joined #openstack-infra19:40
*** emagana has joined #openstack-infra19:41
jeblairclarkb, fungi: i think we should restart zuul to reset the state19:41
jeblairand see if we accumulate more close_wait sockets19:41
clarkbjeblair: ok19:42
fungisounds fair19:42
clarkbsounds reasonable to me. anything I can do to help with that?19:42
*** tkelsey has joined #openstack-infra19:42
jeblairi'll just save the queues and restart/re-enqueue19:42
*** cpowell has joined #openstack-infra19:42
fungiit's taken >2mos to get into this state since its last restart19:43
clarkbthough that may have partially been triggered by the nodepool move?19:43
clarkbbtw https://etherpad.openstack.org/p/third-party-openid-accounts is a thing now19:43
fungiyeah, i'm trying to think of what things have gone on in that span of time19:43
clarkbI think it would be relatively simple to get third party accounts as lp openid things going. just need to create some groups19:43
openstackgerritElizabeth K. Joseph proposed openstack-infra/publications: Update tools and review purposes.  https://review.openstack.org/12872219:44
clarkbfungi: though looking at the graphs it wasn't until today that it went sideways19:44
fungiright, which is the first real load we've had on it since the nodepool replacement19:44
dvorakis there a way to tie matrix jobs parent job without using the deprecated tie matrix job parent plugin?19:44
clarkbanteaya: if you are around https://etherpad.openstack.org/p/third-party-openid-accounts probabl interests you19:44
fungiespecially since i had stuff offline for much of the weekend dealing with the log vg19:45
clarkbdvorak: I don't know that many of us would know. we avoid matrix jobs and rely on zuul + jjb to provide that sort of job explosion feature19:45
harlowjadid zuul just restart19:45
*** e0ne has joined #openstack-infra19:45
dvorakfair enough19:45
*** emagana has quit IRC19:45
clarkbharlowja: yes19:45
fungiharlowja: yes, jeblair's restarting it to see if we clear a misbehavior we've been observing19:45
harlowjakk19:45
jedimikepleia2, maybe I'm missing something, on  https://review.openstack.org/128722 the commit messages says we note our use of review for translations and our use of Storyboard, but I can't see us saying that in the diff anywhere. Is that info just for the commit message?19:46
dvorakclarkb: I could do that, but I actually use the jenkins UI, so that'd be a lot of extra jobs :)19:46
jeblairclarkb, fungi: if you feel like reviewing the non-blocking io patch for gear: 12875419:47
* clarkb pulls that up now19:47
dvorakoddly, the JJB docs for the tie matrix job support explicitly mention that it's deprecated, and that comment was added as part of the initial implementation.19:47
*** mrmartin has quit IRC19:47
clarkbdvorak: ya it definitely doesn't fit everyones needs, but the alternatives are not something we are very familiar with19:48
jeblairi think that if we decide we need to dig deeper into this, we should use that patch if we think it's the way we want to go (otherwise we may waste some effort)19:48
dvorakyeap, understood :)19:48
clarkbjeblair: iirc that patch existed as a different change at one point? or am I misremembering?19:49
jeblairclarkb: there is also 96294, which is 'use non-blocking io' (everywhere)19:49
jeblairclarkb: the new one is only use it in the server19:49
clarkbgotcha19:49
*** luqas has joined #openstack-infra19:50
jeblairclarkb: i'm leaning toward that in order to keep the client simple and more predictable19:50
*** otherwiseguy has joined #openstack-infra19:50
clarkb++19:50
pleia2jedimike: it's in there...19:50
pleia2jedimike: Storyboard is line 64, translations & specs lines 94-9519:51
dvorakoh, I see.  it just uses the normal node restriction field.  that wasn't clear at all.19:51
adam_ganyone know why installation of oslo.vmware here wouldn't be pulling up the system's eventlet depedency to match oslo.vmware's requirements.txt? http://logs.openstack.org/73/135673/2/check/check-grenade-dsvm/23ba48c/logs/old/devstacklog.txt.gz#_2014-11-24_10_27_46_14419:51
jedimikepleia2, that's a sign i need to eat. Of course it's there :)19:52
*** mmaglana has joined #openstack-infra19:52
*** rcarrillocruz has quit IRC19:52
*** Ryan_Lane1 is now known as Ryan_Lane19:52
*** Ryan_Lane has joined #openstack-infra19:52
pleia2jedimike: eating is good! (also, I hope you're feeling better)19:52
mtreinishadam_g: maurosr was looking at this morning, IIRC it's because transitive deps don't really work19:53
jedimikepleia2, yeah, feeling much more alert this week :) was so jealous of your vacation photos :p19:53
*** rcarrillocruz has joined #openstack-infra19:53
pleia2jedimike: glad to hear it, and it was a wonderful vacation, much needed :)19:54
adam_gmtreinish, curious as to whats changed? other than the devstack backports that went in last week around libs from git/release19:54
fungimtreinish: adam_g: that could be the order-dependent issue dstufft was describing to clarkb over the weekend19:54
fungior was that friday19:54
* adam_g has some backscroll to read19:54
mtreinishadam_g: that would do it, because when oslo.vmware was installed from git the eventlet dep was at the higher version19:55
*** smcginnis has left #openstack-infra19:55
jeblairfungi: the nodepool main loop interval seems to be about 30 seconds now, which seems more normal19:55
clarkbit was friday19:56
fungibasically if you have package A already installed at version 1.2.3 and then feed pip a list of packages to install (directly or transitively) if the first dependency it sees listed on package A doesn't require newer than 1.2.3 then a later listed dependency on A>=2.3.4 will basically be ignored and won't trigger an upgrade19:56
clarkbtl;dr is "highest" req wins19:56
mtreinishadam_g: I expect that being installed from pypi on icehouse doesn't pull in the newer dep initially it doesn't work now19:56
clarkbso if you have a top level req that is >=1.0 and a lower level req that is >=1.5 but you already have 1.3 installed then pip does nothing19:56
clarkbbecause 1.0 wins and is satisfied19:56
adam_gfungi, oh, i seem to have overestimated pip19:56
fungiadam_g: yes, i do that all the time :/19:56
clarkbjeblair: that is good news19:56
fungijeblair: so whatever had it gummed up is no longer present since the zuul restart i guess19:57
*** dprince has quit IRC19:57
jeblairseems like it19:57
*** Guest86919 has quit IRC19:58
*** koolhead17 has quit IRC19:58
*** sarob has quit IRC19:59
*** koolhead17 has joined #openstack-infra19:59
*** MaxV has quit IRC19:59
*** sarob has joined #openstack-infra20:00
*** davideagnello has quit IRC20:01
ekarlso-https://review.openstack.org/#/c/136624/ < can we get a +A there ?20:02
jeblairclarkb, fungi: geard (and therefore nodepoold) seems to be sluggish again20:02
fungithat was quick20:02
*** nadya has joined #openstack-infra20:03
*** nadya is now known as Guest8219020:03
ekarlso-pretty please ? :) it's a simple change :D20:03
*** AJaeger has quit IRC20:03
*** koolhead17 has quit IRC20:03
fungiand indeed we got a gearmanclient timeout in nodepool as recently as 4 minutes ago20:04
*** mjturek has quit IRC20:05
*** bhunter71 has quit IRC20:05
*** luqas has quit IRC20:06
*** Hal_ has joined #openstack-infra20:06
*** luqas has joined #openstack-infra20:06
*** bhunter71 has joined #openstack-infra20:06
*** banix has quit IRC20:07
jeblairhuh, so the connection nodepool was using is still in use20:07
jeblairthat is, even after the timeout, it did not close/reopen the connection20:07
jeblairit just started working again20:07
*** Sincler has quit IRC20:07
*** MaxV has joined #openstack-infra20:07
*** HeOS has joined #openstack-infra20:09
*** davideagnello has joined #openstack-infra20:09
jeblairfungi, clarkb: oh, that would be because there's nothing in either gear or nodepool to drop/reset a connection when an admin request times out20:10
*** luqas has quit IRC20:10
jeblairso it seems like it just resumed using the connection and that worked :/20:10
*** gyee_ has quit IRC20:12
*** luqas has joined #openstack-infra20:12
*** ldnunes_ has quit IRC20:16
fungiprofiling the packet rates from gearman clients connected to zuul has yielded little of interest other than zuul's receiving a lot of gearman packets from the jenkins masters, an order of magnitude less from the zuul mergers and basically none from nodepool20:16
*** thedodd has joined #openstack-infra20:16
* mordred back from lunch ... from what I can tell, some stuff started working but some stuff didn't and we still dont' absolutely know the issue, yeah?20:16
fungimordred: that's every day20:16
mordredfungi: yup. just thought I'd express it out loud20:17
adam_gfungi, curious if this is the proper fix for that dependency issue or if i've oversimplified the problem, https://review.openstack.org/#/c/136879/20:17
jeblairfungi, mordred: i suspect that not dropping the connection after the admin request timeout is a problem20:17
*** smcginnis has joined #openstack-infra20:17
*** sarob has quit IRC20:17
jeblairit's sent another request, some data went over the network, but it's still sitting waiting for a response20:18
jeblairso i think it's gotten out of sync20:18
*** mjturek has joined #openstack-infra20:18
jeblair(part of why we're generally very agressive about dropping connections is because it's pretty easy to get into that state with the gearman protocol(s))20:18
jeblairso i think the first time geard gets stuck, nodepool goes into a state where it is difficult to recover20:19
*** mjturek has left #openstack-infra20:19
jeblair(and subsequent demand checks simply may or may not work)20:19
clarkbjeblair: ok commented on gear change20:19
clarkbhrm it started again :/20:19
fungithough that's part of a failure pattern which starts with something causing admin requests to timeout, and we don't know why that's the case either (but suspect nonblocking i/o in the server will help)?20:20
*** AlexF has joined #openstack-infra20:20
jeblairfungi: yes20:20
*** shashankhegde has quit IRC20:20
*** ilyashakhat has quit IRC20:21
*** banix has joined #openstack-infra20:21
*** mmaglana has quit IRC20:21
*** bhunter71 has quit IRC20:21
*** ilyashakhat has joined #openstack-infra20:22
mordredmakes sense to me20:22
clarkbjeblair: let me know if my comments are just wrong. I will sit down wth more caffeine :)20:23
*** otherwiseguy has quit IRC20:23
*** bhunter71 has joined #openstack-infra20:24
*** luqas has quit IRC20:24
*** otherwiseguy has joined #openstack-infra20:24
*** baoli has quit IRC20:24
*** alexpilotti has quit IRC20:24
*** amitgandhinz has quit IRC20:25
*** emagana has joined #openstack-infra20:25
*** gyee_ has joined #openstack-infra20:25
mordredjust in case anyone was curious, there is no /etc/apt/sources.list file on centos20:26
mtreinishmordred: heh, you should add it then :)20:26
mordredmtreinish: I really should ...20:27
*** amitgandhinz has joined #openstack-infra20:28
*** yfried_ has quit IRC20:28
*** amitgandhinz has quit IRC20:29
fungimordred: https://admin.fedoraproject.org/pkgdb/package/fedora-package-config-apt/20:30
*** amitgandhinz has joined #openstack-infra20:30
mordredthe other choice is that i can put sources.list manipulation somewhere that's not run on centos20:30
fungiadam_g: you could reorder the requirements.txt for ceilo and glance to list oslo.vmware before eventlet20:32
fungiadam_g: i think pinning oslo libs in stable reqs doesn't yet have the desired effect20:32
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/nodepool: Add REST API to Nodepool  https://review.openstack.org/13688420:33
adam_gfungi, ah, good idea20:33
*** Ryan_Lane has quit IRC20:33
dstaneki'm working on some changes to add functional testing into Keystone. is there a good way for me to test everything out before i submit changesets?20:33
*** AlexF has quit IRC20:35
clarkbdstanek: the best thing is to run the tests as they will be run. this depends on how you are setting them up. might be with tox or with devstack-gate20:35
clarkbdstanek: either way, executing it locally against master is a good idea20:35
clarkbdstanek: if using devstack gate there are directions in that repo on how to run it locally20:35
clarkbin the readme iirc20:35
dstanekclarkb: ok, i'll check that out20:35
dstanekclarkb: i also heard there may be a way to tell devstack to install software without actually making changes to devstack itself. i have to potentially install apache modules and my own configs for those.20:37
*** achanda has quit IRC20:38
openstackgerritClayton O'Neill proposed openstack-infra/jenkins-job-builder: Document node parameter usage with matrix projects  https://review.openstack.org/13688620:38
*** AlexF has joined #openstack-infra20:39
*** achanda has joined #openstack-infra20:40
*** aysyd has quit IRC20:40
*** teran has joined #openstack-infra20:40
*** jedimike has quit IRC20:41
jeblairclarkb: i can not answer your second question yet.  i'm doing some testing.  thanks.  :)20:42
clarkbdstanek ya devstack has plugin points where you drop files in and they are run20:42
dstanekclarkb: thanks, i'll look into that too20:42
openstackgerritAllison Randal proposed stackforge/gertty: Sample color palette for inline review comments  https://review.openstack.org/13579920:43
dvorakCould I get someone to take a look at this? https://review.openstack.org/#/c/116704/  I think it's pretty close to final form and it has one +2 already20:43
*** baoli has joined #openstack-infra20:45
*** baoli has quit IRC20:46
openstackgerritKyle Mestery proposed openstack-infra/project-config: Add networking-odl project to StackForge  https://review.openstack.org/13685420:47
*** baoli has joined #openstack-infra20:47
*** lttrl has quit IRC20:48
*** esker has quit IRC20:49
*** ddieterly has quit IRC20:49
*** dolphm has joined #openstack-infra20:50
*** shashankhegde has joined #openstack-infra20:50
*** smcginnis has left #openstack-infra20:51
openstackgerritMerged openstack-infra/project-config: Revert "move nova-tox-functional to experimental until there is content"  https://review.openstack.org/13679520:51
dolphmwhere can i find notify_impact config as referred to here?: https://bugs.launchpad.net/keystonemiddleware/+bug/139392020:51
uvirtbotLaunchpad bug 1393920 in keystonemiddleware "I18n" [Medium,Confirmed]20:51
mordreddolphm: ./gerrit/notify_impact.yaml in openstack-infra/project-config20:52
dolphmmordred: thanks!20:52
mordreddolphm: I think20:52
mordreddolphm: nope. I think I'm wrong20:53
*** Guest82190 has quit IRC20:53
*** lttrl has joined #openstack-infra20:53
dolphmmordred: dev/gerrit/notify_impact.yaml ?20:54
mordreddolphm: gerrit/projects.yaml20:54
mordreddolphm: you'll see docimpact-group listings20:54
dolphmah that makes more sense20:54
mordredI believe that the notify_impact verbage there is misleading20:54
dolphmi was planning to update the verbiage once i understood it20:55
*** AlexF has quit IRC20:55
*** achanda has quit IRC20:55
*** rkukura has joined #openstack-infra20:57
*** weshay has quit IRC20:58
*** matel has quit IRC21:00
*** amotoki has joined #openstack-infra21:00
*** MaxV has quit IRC21:00
*** tkelsey has quit IRC21:01
*** melwitt has quit IRC21:01
mtreinishfungi: are things calm enough now to give the mysql_proxy stuff a shot?21:01
*** melwitt has joined #openstack-infra21:02
*** cpowell has quit IRC21:02
*** Hal_ has quit IRC21:05
*** shashankhegde has quit IRC21:05
fungimtreinish: not really. my to do list from this morning is basically now a mostly untouched to do clog as evening approaches21:06
mtreinishfungi: ok sure, anything I can do to help lighten the load?21:07
fungimtreinish: not really. need to ping devs on some security bugs, finish going over the groups portal ssl cert stuff, evaluate impact of big-tent and options for retuning on our free summit pass numbers21:10
*** aysyd has joined #openstack-infra21:11
*** melwitt has quit IRC21:11
*** melwitt has joined #openstack-infra21:11
*** Ryan_Lane has joined #openstack-infra21:11
fungitrying to get through a review of the gear non-blocking i/o change but i think my mind is not fresh enough to do so quickly21:11
*** sarob has joined #openstack-infra21:12
*** weshay has joined #openstack-infra21:13
*** nadya has joined #openstack-infra21:14
*** nadya is now known as Guest3309521:14
*** kgiusti has quit IRC21:18
*** shashankhegde has joined #openstack-infra21:20
*** otter768 has joined #openstack-infra21:21
fungispending too much time rereading the docs on select21:24
*** baoli has quit IRC21:24
adam_ghmm. is the  'check experimental' trigger expected to work against stable branches, or only master?21:26
*** otter768 has quit IRC21:26
*** ildikov has joined #openstack-infra21:27
*** bhunter71 has quit IRC21:27
*** bhunter71 has joined #openstack-infra21:27
*** tsg_ has quit IRC21:29
*** mudassirlatif has joined #openstack-infra21:29
*** erlon has quit IRC21:29
clarkbpredominantly master. we dont backport jobs usually21:29
*** Ryan_Lane has quit IRC21:30
*** Sukhdev has joined #openstack-infra21:31
*** mmaglana has joined #openstack-infra21:32
*** bhunter71 has quit IRC21:33
*** tsg has joined #openstack-infra21:33
*** bhunter71 has joined #openstack-infra21:34
adam_gclarkb, trying to trigger the grenade forward jobs that are listed in most project's experimental, was hoping to test an I -> J forward on stable/icehouse21:34
adam_gclarkb, nvm, they're running now21:34
*** jerryz has quit IRC21:34
clarkbya I wouldn't expect those to be branch restricted but I know many experimental jobns are21:35
*** tsg_ has joined #openstack-infra21:36
*** mmaglana has quit IRC21:36
*** Ryan_Lane1 has joined #openstack-infra21:36
*** achanda has joined #openstack-infra21:37
*** smoser has joined #openstack-infra21:37
*** Ryan_Lane1 is now known as Ryan_Lane21:38
*** Ryan_Lane has joined #openstack-infra21:38
*** tsg has quit IRC21:39
clarkbok lunch consumed /me dives back into stuff21:39
clarkbpuppet doesn't appear to be running on nodepool.o.o for some reason. going to look into that21:41
fungiclarkb: sudo ssh nodepool.openstack.org from puppetmaster.o.o21:42
fungii'm betting you need to replace its known_hosts entry there21:42
fungii always forget that until i notice my replaced server isn't puppeting at all21:43
clarkbdanke21:43
*** Sincler has joined #openstack-infra21:43
clarkbthats it21:43
*** dizquierdo has joined #openstack-infra21:44
* mordred is trying to make the new node launching stuff DTRT WRT known_hosts, fwiw21:44
mordredalso, I poked sdague about the idea that perhaps nova should be able to tell you a servers host cert21:45
greghaynesmordred: are you going to take a swing at preserving ssh host keys across boots of dib images?21:45
clarkbalso I note that the current puppetmaster key is not restricted to running puppet. Ithink that is intentional so that we can run other ansible things?21:45
mordredclarkb: yes21:45
greghaynesmordred: because I think no less than 3 different people have taken a swing at that :)21:45
mordredclarkb: ansible needs to be able to do all the things21:45
clarkbmordred: rgr21:45
mordredgreghaynes: I was not intending on it - but I may not even understand the problem space you're referring to21:46
*** Guest33095 has quit IRC21:46
mordredthe main thing I want to add to nova is to have nova poke vms locally to find out their ssh host key21:46
mordredbecause I have a trusted relationship with nova, and nova has an under-the-covers relationship with the vm21:47
greghaynesoh, well we have been wanting to preserve the ssh host key gen'd on first boot to solve a similar problem, but your problem might be a bit different21:47
mordredso I should be able to ask the API for the host key of a host, so that I can verify that I'm talking to the right thing21:47
greghaynesah, we could use that21:47
mordredgreghaynes: yeah - mine is a bit different, I want to be able to register the host key on the host I use to run automation when I spin up a new host21:47
greghaynesas is the only solution involves ansible putting the host key on the persistent partition but that would be a much cleaner solution21:48
*** ddieterly has joined #openstack-infra21:48
mordredbut - honestly I don't have a good way right now to know that I haven't been MITM'd between the time I created the vm and the first time I talk to it21:48
fungimordred: that would be dependent on nova agent though, right? i mean, some implementations just start with no ssh keys and then the initscript/whatever for sshd creates them the first time it's started21:48
JayFmordred: someone here was talking about that exact same problem on Friday21:48
mordredfungi: nope21:48
mordredfungi: nova could totally hit the ssh port of the vm - just do it on the local bridge21:48
*** emagana has quit IRC21:48
fungimordred: ahh, got it21:49
mordredfungi: it would also let nova report "ssh is running"21:49
clarkbok puppet is puppeting nodepool21:49
*** emagana has joined #openstack-infra21:49
fungimordred: right, thus not relevant for systems with no ssh at all21:49
clarkbI will not clean up old nodepool + images quite yet just in case we feel we need to rollback due to zuul, gearman nodepool weirdness21:49
nibalizerclarkb: play the power rangers theme while puppet runs21:49
JayFfungi: mordred: You gotta be careful not to assume state on the machine that's being deployed; relying or wnating to rely on SSH blocks out windows use21:50
asselinkrtaylor, mmedvede, sweston, nibalizer I have a script running that is sub-treeing all the puppet modules. https://github.com/rasselin?tab=activity21:50
fungimordred: implementation would probably also depend on sshd being configured to run on the standard port and firewall rules not blocking ssh connections from nova21:50
JayFfungi: mordred: Not to mention in the bare metal world there's not always a way to get that level of assuredness that you're connecting to  *that machine*21:50
clarkbJayF: windows should just run an sshd21:50
fungiclarkb: and a gnu userspace and a linux/bsd kernel21:51
swestonasselin: in the network meeting right now, just a moment21:51
mordredJayF: yah21:51
clarkbfungi: thats the spirit21:51
JayFclarkb: for clouds that allow customer images; that's not a solution at all. I'm highly skeptical any inside-instance thing could interact reliably with the nova control plane21:51
JayFregardless of the direction the chat is going21:51
mordredfungi: well, if you boot an image that doesn't have ssh - then you probably won't run "nova check-ssh"21:51
nibalizerasselin: okay21:51
clarkbJayF: its mostly me being mean to windows21:51
*** dimtruck is now known as zz_dimtruck21:52
fungimordred: this is true. though if you run it on a nonstandard port or get really restrictive with your iptables rules, you might be confused as to why it thinks no ssh is running21:52
ekarlso-https://review.openstack.org/#/c/136624/ < can we get a +A there ?21:52
*** SumitNaiksatam has quit IRC21:52
ekarlso-fungi:  ? :D21:52
nibalizerasselin: i worry that you won't get reviews fast enough to land those changes all at once21:52
nibalizerwhereas one at a time you can keep that in your head21:53
nibalizerbut okay, work with the cores21:53
*** SumitNaiksatam has joined #openstack-infra21:53
asselinnibalizer, no need to merge all at once. I will remain up-to-date, so ready to go when you are.21:53
*** emagana has quit IRC21:53
mordredfungi: I think that's a strange enough use case that nova can be forgiven for not handling it21:53
nibalizerasselin: oh you have it set to track all changes?21:53
mordredfungi: although "nova check-ssh --port=2222" should be easy enough to deal with21:54
asselinI'm putting it in a post-merge-job now21:54
JayFclarkb: I know; we've just put  a lot of thought in it for Rackspace OnMetal, because it's crappy UX for the instance to go active before POST is completed, but it doesn't seem like there's a way to know when it's UP without making assumptions you shouldn't make about network configuration or image configuration21:54
clarkbmordred: right I Think an important aspect of this is its best effort and it fails gracefully21:54
*** emagana has joined #openstack-infra21:54
clarkbeg it won't give you the wrong key it will only give you no ke21:54
mordredfungi: how could you firewall off connections from the nova compute host?21:54
mordredclarkb: ++21:54
*** tonytan4ever has quit IRC21:54
mordredJayF: so - when I launch things with ansible, there are two different statuses I care about21:54
asselinnibalizer, no, it retree's on every merge. not very smart, but that can be added next.21:55
mordredJayF: "active" - which means that no more cloud API things are needed, and when does ssh port become active21:55
mordredI think the biggest problem is the term "Active"21:55
JayFmordred: that's basically the pattern we tell our customers to follow; but it's still not super friendly compared to telling the customer what they actually care about: when the compute resource is usable21:55
mordredbecause what has finished is the nova internal communication needed to allocate this thing fully21:55
*** MarkAtwood has quit IRC21:55
mordredyup21:55
*** packet has quit IRC21:57
*** bhunter71 has quit IRC21:57
*** atiwari has quit IRC21:58
*** MarkAtwood has joined #openstack-infra21:58
*** erikwilson has quit IRC21:58
*** e0ne has quit IRC22:00
*** dkranz has quit IRC22:01
*** tonytan4ever has joined #openstack-infra22:02
jeblairclarkb: i believe your comment #2 is correct22:03
*** dustins has quit IRC22:03
clarkbwoot22:03
jeblairclarkb: i responded to comments now; let me know if my justification for not changing the thing from your #1 comment is insufficient :)22:04
*** e0ne has joined #openstack-infra22:04
clarkblooking22:04
*** mriedem has quit IRC22:05
*** harlowja is now known as harlowja_away22:05
*** mikedillion has joined #openstack-infra22:06
*** emagana has quit IRC22:06
clarkbjeblair: so the problem with that first thing is that we modify self.conn but we don't have a connection at that point aiui22:06
clarkbso I think you can do what ou have there if you add a connect method?22:06
*** emagana has joined #openstack-infra22:07
*** dkliban is now known as dkliban_afk22:07
*** ddieterly has quit IRC22:07
clarkbactually you don't need to add a connect method you just need to not try to set nonblocking in server connection init22:07
*** emagana has quit IRC22:09
jeblairclarkb: i mean, my real answer was that i didn't want to change the existing code more than necessary22:09
clarkbjeblair: oh wait22:09
clarkbserver connection expects something else to connect for it?22:09
*** emagana has joined #openstack-infra22:09
jeblairclarkb: yes22:09
clarkbnevermind I think my concerns are not valid (particularly if you want to avoid a lot of changes)22:10
clarkbso I am good with what you have there22:10
jeblairok.  yeah, the connection process is _very_ different, it's just they share a lot in common once things get going22:10
openstackgerritMichael Krotscheck proposed openstack-infra/storyboard-webclient: Enable HTTP Caching on resources.  https://review.openstack.org/13614922:10
jeblairthe object model is probably all wrong for that :)22:10
openstackgerritJames E. Blair proposed openstack-infra/gear: Use non-blocking IO in server  https://review.openstack.org/12875422:11
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add elements for Infra servers  https://review.openstack.org/13659722:11
*** imcsk8 has quit IRC22:11
openstackgerritJoe Gordon proposed openstack-infra/devstack-gate: Set up ssh_known_host based on hostname  https://review.openstack.org/13659622:11
*** SumitNaiksatam has quit IRC22:11
*** imcsk8 has joined #openstack-infra22:11
jeblairclarkb, fungi: ^ there's clarkb's point addressed, and also a bonus bug fix i noticed when testing that (it could truncate data it was sending if it blocked)22:11
clarkbso one of the things I said I would do is coming up, deprecating py26 across a larger set of projects. I didn't hear any complaining but will send a reminder this week22:12
*** SumitNaiksatam has joined #openstack-infra22:12
clarkbplease do complain if appropriate :)22:12
*** aysyd has quit IRC22:12
clarkbmordred: we have been putting elements in project-config for nodepool. any reason to not continue doing that?22:12
openstackgerritMichael Krotscheck proposed openstack-infra/storyboard-webclient: Enable HTTP Caching on resources.  https://review.openstack.org/13614922:12
*** mattfarina has quit IRC22:12
mordredclarkb: yeah. these are not nodepool elements22:12
mordredclarkb: these are elements for the servers in project_config22:13
mordredgah22:13
mordredclarkb: these are elements for the servers in system-config22:13
clarkbmordred: right but then we have to duplicate documentation and helper scripts?22:13
*** jungleboyj has quit IRC22:13
mordredclarkb: I believe the helper scripts for these are going to be slightly different22:14
clarkbok22:14
fungijeblair: clarkb: i think maybe i'm just turned around after staring at it for too long, but in the readPacket() method why append to input_buffer when testing for a byte present? shouldn't that byte be prepended back instead?22:14
fungijeblair: clarkb: nevermind22:14
fungii was looking at it wrong. grrr22:14
mordredclarkb: also - actually, I've submitted all of this other than the infra element to dib upstream22:14
mordredclarkb: so the patch should get much smaller before it lands22:15
mordredand that way if we want to use ubuntu-minimal for nodepool, we can without it being weird22:15
clarkbjeblair: one question about the bugfix you added. when the ssl exceptions are thrown we set r to 0. is that correct? we could have written >0 bytes then hit the exception ya?22:15
clarkbjeblair: might be better to have your outer r=0 then then r update as appropriate and fall through?22:15
*** harlowja_away is now known as harlowja22:16
clarkbI am leaving that as a ocmment on the review22:17
*** shashankhegde has quit IRC22:17
clarkbjeblair: left22:17
*** andreykurilin_ has joined #openstack-infra22:17
jeblairclarkb: oh sorry, back now (was looking at the other problem -- closing the connection after timeout)22:18
jeblairSpamapS: can you look at that?22:19
*** sputnik13 has joined #openstack-infra22:19
jeblairSpamapS: clarkb's comment on https://review.openstack.org/#/c/128754/22:19
swestonasselin: separate repositories for each subtree?22:20
*** bswartz has quit IRC22:20
asselinsweston, yes22:20
jeblairclarkb: i'm not certain i understand all the implications, but i think you are right22:21
asselinsweston, steps 2, 3, 4 here: http://specs.openstack.org/openstack-infra/infra-specs/specs/puppet-modules.html22:21
swestonasselin: nice.  looks like you can also merge from upstream without losing history22:21
clarkbjeblair: ya I don't either which is why I didn't -1, but I think my suggestion is a bit more defensive22:21
jeblairclarkb: (in particular, i do not know how to generate that error for testing :/)22:21
asselinsweston, how?22:21
clarkbmy suggsetion shouldn't be wrong but may be more correct :)22:21
jeblairyp22:22
openstackgerritJames E. Blair proposed openstack-infra/gear: Use non-blocking IO in server  https://review.openstack.org/12875422:22
swestonasselin: switch to the upstream, checkout, pull, switch to subtree master, then merge22:23
*** emagana has quit IRC22:23
*** banix has quit IRC22:23
*** rlandy has quit IRC22:24
*** emagana has joined #openstack-infra22:24
*** emagana has quit IRC22:24
*** emagana has joined #openstack-infra22:25
*** baoli has joined #openstack-infra22:25
*** tonytan4ever has quit IRC22:25
asselinsweston, so when you do the merge on the subtree, it will only take what's new?22:25
clarkbjeblair: +222:25
*** dims has quit IRC22:25
*** dims has joined #openstack-infra22:26
swestonasselin: it's supposed to work with git merge -s subtree22:26
openstackgerritJames E. Blair proposed openstack-infra/nodepool: Reconnect to gearman on error  https://review.openstack.org/13691022:26
jeblairclarkb: ^ there's the other part of this22:26
*** dims has quit IRC22:26
swestonasselin: I have not tested this though, admittedly .. never this particular use case, hehe22:27
jeblairclarkb: what do you think about cowboy running those in production now, since things are still in flames?22:27
clarkbjeblair: I am cool with it22:27
*** dims_ has joined #openstack-infra22:27
clarkblet me review that second change22:27
asselinsweston, I can play around with it. It would certainly be faster than re-sub-treeing22:27
*** dims_ has quit IRC22:28
swestonasselin: cool, let me know how it goes ;-)22:28
fungijeblair: yeah, no obvious bugs jumped out at me in 128754 anyway22:29
clarkbjeblair: lgtm. Did you want me to help cowboy that in?22:29
jeblairclarkb: i think i can do it real quick22:29
fungiand 136910 lgtm22:29
jeblairi've installed that gear on zuul.o.o.  note that this is replacing the previously cowboy'd local patch to set the submit job timeout to 300 seconds22:31
clarkbnoted22:31
jeblairi'm not happy about how long that stayed there :(22:31
jeblairi will also use that gear on nodepool; shouldn't make a difference, but just to be consistent22:32
*** camunoz_gone is now known as camunoz22:32
*** achanda has quit IRC22:32
clarkbsounds good22:33
jeblairi'm setting the geard log to INFO22:34
fungilong enough that i don't even remember why we'd overridden the submit job timeout22:34
jeblairfungi: because of this problem :)22:34
jeblairfungi: big response packets to nodepool with blocking io could freeze geard for > 30 seconds, especially if there was a network problem between zuul and nodepool22:35
fungid'oh22:35
fungiright, that22:35
jeblairfungi: so zuul would drop/reconnect22:35
fungiso when we saw zuul timeout on it today, that was a full five minutes with no response22:36
openstackgerritMichael Krotscheck proposed openstack-infra/storyboard: Plugins may now register cron workers.  https://review.openstack.org/12960922:36
fungipretty bad22:36
jeblairyep22:37
*** xyang0 has quit IRC22:37
jeblairi stopped zuul and restarted nodepool22:37
jeblairthe nodepool restart does not look good22:37
jeblairi don't understand the traceback in the debug log there22:38
*** dims has joined #openstack-infra22:39
jeblairoh!22:39
jeblairthat's what happens when zuul is not running22:39
jeblairdoh22:39
fungiright, clarkb noticed friday that it blocks waiting to connect to a gearman server22:40
clarkbya took a while to figure out22:40
clarkbdove in with pdb too22:40
*** e0ne has quit IRC22:40
jeblairwe should fix that22:40
openstackgerritMichael Krotscheck proposed openstack-infra/storyboard: Plugins may now register cron workers.  https://review.openstack.org/12960922:40
jeblairit's just on the initial startup22:40
*** ZZelle_ has joined #openstack-infra22:41
*** tsg_ has quit IRC22:42
jeblairoh, still can't use info logging; too verbose.  set it back to warning.22:43
*** amitgandhinz has quit IRC22:47
*** ChuckC has quit IRC22:47
*** JayJ has quit IRC22:47
*** JayJ has joined #openstack-infra22:48
openstackgerritEduardo Costa proposed openstack-infra/elastic-recheck: Add e-r query for bug 1372670  https://review.openstack.org/13691522:49
uvirtbotLaunchpad bug 1372670 in nova "libvirtError: operation failed: cannot read cputime for domain" [High,Confirmed] https://launchpad.net/bugs/137267022:49
asselinsweston, seems like it's going to work. will reconfirm when I know for sure.22:52
swestonasselin: sweet!22:53
jeblairRuntimeError: Set changed size during iteration22:54
jeblairthat's showing up a lot in the log, but i believe it's not harmful22:55
*** shakamunyi_ has joined #openstack-infra22:55
clarkbjeblair: should probably copy the reader and writer sets before iterating on them?22:55
jeblair(i mean, we should fix it, but the exception handlers will be retrying it, so we're not losing data or anything)22:55
mmedvedeasselin: be aware some history can be lost, e.g. compare your elasticsearch module with this https://github.com/mmedvede/puppet-elasticsearch22:56
mmedvedeasselin: otherwise, this is rather good :)22:56
jeblairclarkb: yeah, i think so; not sure what we can do there atomically22:56
* mordred knows some excellent atomic operators in C++22:56
asselinmmedvede, yea, I know. testing on the automation. We can redo then with a smarter script.22:56
clarkbjeblair: ya I would just be worried about starvation if sort order is stable during iteration22:57
*** banix has joined #openstack-infra22:57
*** weshay has quit IRC22:57
jeblairokay, so that started spewing out NOT_REGISTERED results22:58
jeblairi stopped zuul22:59
*** ChuckC has joined #openstack-infra23:00
jeblairi'm guessing because the nodepool is in such bad shape23:00
*** camunoz has quit IRC23:00
*** sarob has quit IRC23:01
jeblairi'm doing a mass delete in nodepool23:01
clarkbok23:02
*** shakamunyi_ has quit IRC23:04
*** dizquierdo has quit IRC23:04
*** otherwiseguy has quit IRC23:05
*** ecosta has joined #openstack-infra23:06
ci-testingHi, I am setting up my CI testbed, but am running  into problem with running tempest test from the dvsm-tempest-full jenkins job.   Running "sudo -H -u tempest tox -esmoke -- --concurrency=2" would produce this error: "InvocationError: '/bin/bash tools/pretty_tox.sh ...".23:08
clarkbci-testing: without more of that error its hard to understand what went wrong. However that sounds like a tempest issue. you may have more luck debugging in #openstack-qa23:10
*** banix has quit IRC23:11
*** achanda has joined #openstack-infra23:11
*** shashankhegde has joined #openstack-infra23:12
ci-testingclarkb: ok will try that forum. I am following jaypipes guide on joinfu.com for setting up a openstack CI testing system, it 's sort of outdated.  Do you know of any other good reference site,messages other then openstack-qa and openstack-infra?23:12
*** camunoz has joined #openstack-infra23:13
clarkbci-testing: http://ci.openstack.org is a good place23:13
ci-testingClarkb: ok thanks!23:14
openstackgerritJeremy Stanley proposed openstack-infra/system-config: Loosen name restrictions for dedicated 3rd-parties  https://review.openstack.org/13505023:14
asselinci-testing, you can try my fork of jaypipes's repo: https://github.com/rasselin/os-ext-testing23:15
ci-testingasselin: thanks will go over and retry/setup if needed.23:17
*** emagana has quit IRC23:19
*** jgrimm is now known as zz_jgrimm23:20
*** emagana has joined #openstack-infra23:20
jeblairmordred: how do i disable puppet on a host?23:20
*** emagana has quit IRC23:21
*** AlexF has joined #openstack-infra23:21
fungi'puppet agent --disable' as root is what i've been doing23:21
jeblairfungi: thx23:21
*** emagana has joined #openstack-infra23:21
mordredyah23:21
mordredansible does the right thing23:21
fungialternatively, remove it temporarily from /etc/ansible/hostlist on puppetmaster23:21
openstackgerritJeremy Stanley proposed openstack-infra/project-config: Precreate temp holding dir in static publish job  https://review.openstack.org/13692123:22
*** otter768 has joined #openstack-infra23:22
*** dkranz has joined #openstack-infra23:23
clarkbfungi: curious of what you think about https://etherpad.openstack.org/p/third-party-openid-accounts considering ^23:23
*** MaxV has joined #openstack-infra23:24
fungiclarkb: i'm in favor, though i wonder what implications that has on account naming23:25
clarkbfungi: I think it would require individuals to edit that stuff themselves23:25
clarkbyou can set it under contact info23:26
fungiclarkb: yep23:26
fungiclarkb: so i suppose those managing the allowed voting groups would only add accounts with conforming name patterns, and would remove them on report of name changes which brought them out of conformance23:26
clarkbya23:27
fungithat'd be workable i think23:27
clarkband I think we would still end up disabling some accounts, but a bulk of the work would be pushed onto account owners23:27
*** otter768 has quit IRC23:27
clarkbssh key and email updates23:27
clarkband account creation itself23:27
fungihow easy is it to juggle multiple lp accounts?23:28
fungii've never tried23:28
clarkbfungi: its not too bad (I have had to juggle to gerrig before, worst thing is when you forget to switch and file a bug with wrong account >_>)23:28
fungioh, wait, i have23:28
clarkbyou basically log out then back in with other credentials23:28
clarkbin the future I will use browser privacy modes to avoid the bug thing23:29
fungiyeah, logout of lp, log into lp with other account, then relogin to gerrit via openid23:29
fungidid we also want to have a way to prevent these accounts from leaving code review votes?23:30
jeblairfungi: i left a -1 on 135050.23:31
clarkbfungi: I don't think we do that with existing accounts23:31
*** Sukhdev has quit IRC23:31
jeblairi don't feel that what we're doing to zuul and nodepool is going very well23:31
*** dkranz has quit IRC23:31
*** thedodd has quit IRC23:31
clarkbfungi: so I am not worried about it. but we could add DENY rules for those groups with +/-1 code review label23:31
clarkbjeblair: :/23:31
jeblairif anyone wants to help me figure out what's going on, that'd be swell23:31
clarkbjeblair: can do. where should I look?23:31
jeblairi'm getting really close to the point of saying we should roll everything back to where we were last thurdays23:32
jeblairclarkb: i'm not sure what is or is not working at this point23:32
fungijeblair: as in switch back to the old nodepool server?23:32
jeblairfungi: yeah23:32
jeblairclarkb: if you look at the status page, there are some NOT_REGISTERED entries there23:33
jeblairi think the zuul mergers are idle23:33
*** banix has joined #openstack-infra23:34
fungioh, weird yeah... for multiple job names on 136798,1 where the same jobs are still pending or running on other changes ahead of it23:35
clarkbit looks like nodepools allocations are still off23:35
jeblairclarkb: is it because it's the icehouse branch?23:35
clarkbjeblair: oh ya that could do it since those are precise nodes23:35
jeblairi mean, do we need to shut the whole thing down for like an hour until we can finally build a precise node or something?23:35
clarkband will register independently of the trusty nodes23:35
fungijeblair: it's possible nodepool hasn't built any new devstack-precise nodes for that yet23:35
jeblairclarkb: can you verify that?23:36
fungitwo building and a bunch delete23:36
fungino ready/used23:36
clarkbjeblair: yes, will look in zuul logs to confirm23:36
clarkbnodepool did just start launching a devstack-precise node too23:36
fungione has been building for about 5 minutes23:37
*** marun has quit IRC23:37
*** MaxV has quit IRC23:39
clarkb'ZUUL_NODE': 'devstack-precise' thats in the job parameter list23:39
clarkb2014-11-24 23:34:02,871 ERROR zuul.Gearman: Job <gear.Job 0x7f70f572f310 handle: None name: build:gate-devstack-dsvm-cells:devstack-precise unique: 87b12bcea2e94fd3b52e7c9c6b930203> is not registered with Gearman23:40
clarkbjeblair: I think that confirms it23:40
jeblairokay, so that's just a matter of waiting...23:40
*** rcarrillocruz has quit IRC23:40
jeblairwhat could be the problem with the mergers?23:41
fungiand definitely still seeing "GearmanClient: Exception while listing functions" in the nodepool logs since the restart23:41
*** rcarrillocruz has joined #openstack-infra23:41
jeblairfungi: i've restarted it a lot23:41
clarkbjeblair: maybe they have CLOSE WAIT connections too?23:41
*** banix has quit IRC23:41
* clarkb hops on a merger23:41
fungimost recent log entry for that was from 1 minute ago23:41
jeblairfungi: what's the exception?23:41
fungijeblair: TimeoutError23:42
jeblairclarkb: they seem to each run one merge operation after the restart23:42
fungi2014-11-24 23:40:0423:42
openstackgerritDavanum Srinivas (dims) proposed openstack/requirements: Add glance_store, kite, python-kiteclient to projects.txt  https://review.openstack.org/13560323:42
openstackgerritDavanum Srinivas (dims) proposed openstack-infra/devstack-gate: Add oslo.context to devstack-vm-gate-wrap.sh  https://review.openstack.org/13509323:42
clarkbya nodepool's allocation numbers don't look right to me (seem to be min ready)23:43
jeblairclarkb: there's very little load23:43
jeblairclarkb: because zuul is stuck waiting on merges23:43
clarkboh I see23:43
*** rhe00 has joined #openstack-infra23:44
*** ddieterly has joined #openstack-infra23:44
mordredevery time I think I have an idea someone else says something which tells me I was wrong23:44
clarkbzm01 definitely isn't doing much but its got a connection that looks good from both sides23:44
clarkbjeblair: is it possible that the set() modifications are starving merger connections?23:44
clarkbjeblair: we don't service them because the set is modified quickly enough to short circuit iteration23:44
jeblairclarkb: i restarted that with .copy()23:44
fungijeblair: though gearman status reports merger:update and merger:merge with 8 workers and no waiting tasks23:44
clarkbzuul merger is sitting on a pause() call according to strace23:45
jeblairsomething just changed23:45
*** armax has quit IRC23:46
jeblairi think some mergers ran some jobs23:46
clarkbya at least zm01 did23:46
clarkball I did was strace the merger process which shouldn't interfer23:46
jeblairthey all did23:46
clarkbhttp://paste.openstack.org/show/137770/ is from the log23:47
fungiyeah, another stable/icehouse change popped not_registered for stuff and then the gate queue changes after it got recalculated23:47
fungiwhich would explain sudden activity for merge workers23:47
jeblairthe enqueue calls haven't finished yet23:48
fungioh23:48
*** oomichi has joined #openstack-infra23:48
jeblairi assume they are waiting on mergers23:48
jeblairi think there's far too much going wrong right now to debug and am burning out23:50
*** MaxV has joined #openstack-infra23:50
jeblairif we want to roll back to the old nodepool server, i think we should do so soon23:50
clarkbjeblair: looking at the zuul process list I don't see what I expect23:50
fungiof the two devstack-precise nodes which were booting, the one in rax seems to have errored and the other has been building in hpcloud for about 20 minutes now23:50
clarkbusually you get zuuld and a child which is geard23:50
jeblairclarkb: i'm running geard separately23:50
clarkbok23:50
jeblairclarkb: so that i can restart it and zuul separately, so that it might accumulate function registrations to avoid spurious not_registered23:51
mordredjeblair: I think rolling back to old nodepool is at least a thing that's worth trying23:51
mordredjeblair: no idea why that would affect zuul mergers - but its at least _one_ variable that could be removed23:51
clarkbstracing that geard its sitting on a select23:52
clarkbso I think the epoll edges are not being tripped?23:52
jeblairmordred: i strongly suspect the async io is broken23:52
clarkbya23:52
jeblairmordred: which is what's wrong with the zuul mergers23:52
*** AlexF has quit IRC23:52
mordrednod23:52
jeblairbut the reason we're doing anything with async io today is because something was broken between zuul and nodepool this morning23:52
jeblairwe thought it might make things better, but it seems to only be adding problems23:53
mordredyah. so you're saying "rollback nodepool, and also rollback async io"23:53
jeblairyes23:53
*** JayJ has quit IRC23:54
mordredI think that sounds like the sane thing23:54
*** banix has joined #openstack-infra23:54
* clarkb notes that the read fds list is really small23:54
jeblairclarkb: how can you see it?23:54
clarkbjeblair: strace23:54
jeblairi think i don't understand what you're referring to23:55
clarkbselect(6, [3 5], [], NULL, NULL23:55
jeblairoh23:55
clarkbjeblair: I am stracing geard23:55
jeblairclarkb: edge triggering23:55
asselinsweston, actually it didn't work. I'm trying this instead: --rejoin to 'save state' git subtree -q split --prefix=modules/$module --branch=$module --rejoin23:55
jeblairclarkb: only shows up there when one of them changes23:55
openstackgerritMatthew Treinish proposed openstack-infra/devstack-gate: Set up ssh_known_host based on hostname  https://review.openstack.org/13659623:55
jeblairclarkb, mordred, fungi: i have a hard stop in 1 hour.  how would you like to proceed?23:56
fungii'm trying to confirm the images in old nodepool are still around23:56
jeblairi'm happy to continue debugging, but i don't know if we'll be where we want in an hour23:56
clarkbfungi: they should be, I didn't delete any of them and nodepool was turned off.23:56
clarkbjeblair: but that is a select call23:57
clarkbjeblair: or is that coming from some unrelated portion of geard?23:58
*** amitgandhinz has joined #openstack-infra23:58
*** camunoz has quit IRC23:58
*** andreykurilin_ has quit IRC23:58
clarkbanyways I am ok with rolling back23:58
*** sarob has joined #openstack-infra23:58
mordredjeblair: steps should be "turn off new nodepool, delete a bunch of nodes, turn on old nodepool" - yah?23:58
clarkbit is why I was conservative23:59
clarkbmordred: also update dns and iptables23:59
mordred++23:59
jeblairand re-install old version of gear (with 300 second patch) to zuul23:59
fungiyep, images look like they're still around23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!