Friday, 2019-01-18

*** itlinux has joined #openstack-swift00:11
*** zufar_ has joined #openstack-swift00:57
zufar_hello all00:57
*** threestrands has quit IRC00:59
*** Jeffrey4l has left #openstack-swift01:26
*** mrjk_ has joined #openstack-swift01:51
*** lifeless_ has joined #openstack-swift01:55
*** irclogbot_2 has quit IRC02:00
*** lifeless has quit IRC02:00
*** zufar_ has quit IRC02:00
*** mrjk has quit IRC02:00
*** irclogbot_2 has joined #openstack-swift02:04
kota_hello02:04
kota_notmyname: FYI, I'll introduce recent Swift upstream and ProxyFS at an OpenStack Event in Japan, https://openstack-jp.connpass.com/event/113590/02:06
kota_today, in my time.02:06
*** baojg has joined #openstack-swift02:10
*** psachin has joined #openstack-swift02:46
notmynamekota_: that's great to hear. good luck!03:32
*** psachin has quit IRC05:23
*** pcaruana has joined #openstack-swift07:25
*** pcaruana has quit IRC07:55
*** pcaruana has joined #openstack-swift07:55
*** ccamacho has joined #openstack-swift08:13
*** dosaboy has quit IRC08:34
*** DHE has quit IRC08:35
*** DHE has joined #openstack-swift08:47
*** e0ne has joined #openstack-swift08:55
*** dosaboy has joined #openstack-swift09:10
*** mikecmpbll has joined #openstack-swift09:13
*** dosaboy has quit IRC09:15
*** dosaboy has joined #openstack-swift09:17
*** mvkr has joined #openstack-swift09:41
*** mikecmpbll has quit IRC09:41
*** mikecmpbll has joined #openstack-swift09:43
*** admin6 has joined #openstack-swift10:16
*** admin6 has quit IRC11:29
*** e0ne has quit IRC12:33
*** e0ne has joined #openstack-swift12:39
*** e0ne has quit IRC12:40
*** e0ne has joined #openstack-swift12:40
*** admin6 has joined #openstack-swift12:58
*** admin6 has quit IRC15:48
*** itlinux has quit IRC15:58
*** pcaruana has quit IRC16:00
cwrightHi everyone, I'm struggling with a strange issue we have been seeing on one of our swift proxy nodes16:01
cwrightWe deployed swift into 2 datacenters, Reston and London, each with a pair of swift proxies (sw-proxy01 and sw-proxy02)16:01
cwrightThese servers are deployed/configured via configuration management, and are all identical except for IP address16:01
cwrightEach proxy runs memcached, and swift-proxy is configured to use both memcached servers in its datacenter16:01
cwrightThe issue is that in our Reston environment the sw-proxy02 node constantlys spits out errors about connecting to memcached16:01
cwright"Timeout getting a connection to memcached" and "Error limiting server 10.10.10.180:11211"16:01
cwrightNone of the other 3 proxy servers ever displays these issues. There are no networking rules/filters in place16:01
cwrightIs this something anyone here has seen before?16:01
cwrightHere's some sample log output: https://gist.github.com/corywright/eead89b552026c19cabcf71f906aa17c16:02
*** e0ne has quit IRC16:22
*** itlinux has joined #openstack-swift17:02
*** NM has joined #openstack-swift17:08
*** mikecmpbll has quit IRC17:10
*** itlinux has quit IRC17:28
*** itlinux has joined #openstack-swift17:28
*** itlinux has quit IRC17:29
*** itlinux has joined #openstack-swift17:34
*** mikecmpbll has joined #openstack-swift17:51
zaitcevcwright: You may think that there are no filtering in place, but I would start by testing with telnet, from the proxy of course, and maybe "su - swift", in case it's SElinux.18:03
timburkegood morning18:04
zaitcevcwright: If it's not filtering, then I can divulge that I did have to change the default timeouts for real networks.18:04
notmynamecwright: I would recommend using separate memcache pools for each region. yes, you'll not have quite as good cache hit rate, but on the other hand you'll only do region-local lookups for cache18:05
notmynamezaitcev: adjusting the timeouts for "real" networks is a good point. should we adjust the defaults and/or the docs upstream?18:05
zaitcev[filter:cache]18:07
zaitcevconnect_timeout = 1.218:07
zaitcevI think default is 1 and bumping it juuuust this much was enough to resolve my timeouts.18:08
notmynamedefault seems to be 0.318:08
zaitcevoh18:08
zaitcevWell, I basically just experimented until it worked.18:08
*** openstackgerrit has joined #openstack-swift18:10
openstackgerritTim Burke proposed openstack/swift master: misc test cleanup  https://review.openstack.org/63107718:10
cwrighthi, thanks zaitcev and notmyname. maybe I didnt describe it properly, but we are using separate memcache pools for each region18:12
cwrightzaitcev: of course its possible that something is interfering, but i've tried all the telnet/network checks, and our networking team has looked too.18:13
cwrightit even is reporting these errors when trying to connect to its own public ip18:13
cwrighti spent some time digging through the source code and searching old bugs and it almost looks like a memcache connection pool issue to me18:13
*** ccamacho has quit IRC18:14
cwrighti will try the connect_timeout adjustment now and see if that helps, thanks zaitcev18:14
notmynameI'm going to reset the counter on the wall if this is true (seriously, we have one), but have you checked MTU settings? `ping -M do -s 8972 [destinationIP]`18:16
zaitcevcwright: I'm still unclear on the situation on your end: does every connection fail, or do only some? All you said was: "constantlys spits out".18:17
notmynameuse 1472 to check for the standard 1500 MTU. 8972 checks for jumbo frames (9000 MTU)18:18
cwrightzaitcev: i'm not certain, these servers are still getting relatively little traffic, but after a recent restart of swift-proxy the errors began immediately.18:18
cwrightnotmyname: let me check18:19
cwrightnotmyname: i think you've clued me in on something...18:21
notmynameoh no! do I have to reset the counter?18:22
cwrightmaybe :)  I will confirm for you in a bit18:22
notmynameso far the record is 15 days. it's normally no more than 3. (the 15 day record was because of the winter holidays!) http://d.not.mn/Image.jpeg18:24
cwrightnotmyname: thanks a ton. i can't begin to say how much i appreciate the help and wisdom of everyone here.18:28
notmynamewas it MTUs?18:28
cwrightyea its going to be.  we haven't adjusted yet but the pings are telling18:29
cwrightgo ahead and reset your counter :)18:30
notmynamehttp://d.not.mn/Image_2.jpeg18:31
notmynamewhomp whomp18:31
notmynamethe swiftstack support team (that I sit right next to) are sad now18:32
*** e0ne has joined #openstack-swift19:29
*** NM has quit IRC20:01
clayglo20:02
claygl20:02
cwrightnotmyname: well, sorry to say but that didn't fix it.20:18
cwrightupdated the mtu, rebooted, a few mins later the same errors are showing up20:20
cwrighti will try the connect_timeout now20:21
*** NM has joined #openstack-swift20:32
*** NM has quit IRC20:33
*** e0ne has quit IRC20:55
*** itlinux has quit IRC21:49
*** jistr has quit IRC22:32
*** jistr has joined #openstack-swift22:33
*** jistr has quit IRC22:49
*** jistr has joined #openstack-swift22:50

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!