Tuesday, 2025-04-29

*** haleyb|out is now known as haleyb13:00
sean-k-mooneyo/ am not sure if anyone else has had issues with paste.opendev.org but when i try to paste basically anything i get 502 proxy errors back from it16:19
sean-k-mooneythat's not just a me thing right?16:19
clarkbhttps://paste.opendev.org/show/bvBWt51BbyRxrzkA0yFW/ just worked for me. I suspect that the content of your paste is at least part of the problem16:19
clarkblike maybe it is trying to use a captcha and failing? I can look to see if logs have any info shortly16:20
sean-k-mooneyhum its just my shell output16:20
sean-k-mooneylet me put it somewhere else for you to see16:20
sean-k-mooneyhttps://termbin.com/afim16:21
sean-k-mooneyok ya... 16:22
sean-k-mooneyhttps://paste.opendev.org/show/b4aSijWMYkvrRjnESSoW/ worked16:22
clarkbit doesn't like your ➜16:22
sean-k-mooneyoh ok16:22
clarkbthat appears to be a 4 byte utf8 char. I'm guessing the db is 3 byte only16:22
sean-k-mooneyi dont think that is new but i have been using kde for the last 3 weeks with a different terminal16:23
sean-k-mooneyso maybe that has changed16:23
clarkbwe haven't touched the db for that in years which also likely explains why it is 3 byte utf816:23
fungiwe ought to be able to switch it, but i thought we had changed all our systems to 4-byte encoding default for mariadb. maybe we missed paste16:23
clarkbfungi: ya modern mariadb does utf8mb4 by default I think. But the old dbs when exported and imported keep the old utf8mb3 or whatever it is16:24
clarkbwe probably need to update lodgeit and then do a db migration16:24
clarkbhrm actually may be a different char. I'm grabbing the hex code from the log and will see what python translates that to16:25
sean-k-mooneyi can just update my starship config to not put that in ps116:25
sean-k-mooneyor whatever16:25
sean-k-mooneyi think i updated the version i have installed recently 16:25
clarkb🐍16:26
sean-k-mooneyfeel free to fix it on the db side too but since i know the cause i can work around it. i was wondering if one of the emojis was the issue 16:26
clarkbI think it was ^ not the arrow16:26
fungimy terminal doesn't have that glyph16:26
fungiwhat does unicodedata.name() say it is?16:26
clarkbfungi: b'\xF0\x9F\x90\x8D'16:26
clarkb'SNAKE'16:27
fungi>>> unicodedata.name(b'\xF0\x9F\x90\x8D'.decode('utf-8'))16:27
fungi'SNAKE'16:27
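
The check above can be repeated for any suspect character. This is a minimal sketch, assuming the prompt arrow is U+279C (the log does not confirm the exact code point); `describe` is a hypothetical helper name, not part of lodgeit or any other tool mentioned here.

```python
import unicodedata


def describe(ch: str) -> tuple[str, str, int]:
    """Return (code point, Unicode name, UTF-8 byte length) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), len(ch.encode("utf-8")))


# Bytes copied from the log line above.
snake = b"\xF0\x9F\x90\x8D".decode("utf-8")
# Assumption: the prompt arrow is U+279C.
arrow = "\u279c"

print(describe(snake))  # ('U+1F40D', 'SNAKE', 4)
print(describe(arrow))
```

Under that assumption the arrow encodes to three bytes in UTF-8 and the snake to four, which would fit the later conclusion that the snake, not the arrow, triggered the error.
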
clarkb"Incorrect string value: '\\xF0\\x9F\\x90\\x8D v...' for column `lodgeit`.`pastes`.`code` at row 1"16:27
fungiindeed16:27
sean-k-mooneyoh16:27
sean-k-mooneyits not happy with the python symbol16:27
clarkbthat error comes from pymysql16:27
clarkband is mariadb error 126616:28
clarkb*136616:28
fungiright, so the db connection in lodgeit is probably still using utf8mb316:28
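
A utf8mb3 column stores at most three bytes per character, which limits it to code points up to U+FFFF. A rough pre-flight check for text headed to such a column might look like the sketch below. The function name and the sample prompt string are hypothetical, made up for illustration; nothing here is taken from lodgeit.

```python
def chars_outside_bmp(text: str) -> list[str]:
    """Return the characters that need 4 bytes in UTF-8 (code points above U+FFFF)."""
    return [ch for ch in text if ord(ch) > 0xFFFF]


# Hypothetical prompt line: a snake emoji, a version string, and an arrow.
prompt = "\U0001F40D v3.12 \u279c ls"

offenders = chars_outside_bmp(prompt)
print([f"U+{ord(ch):04X}" for ch in offenders])  # ['U+1F40D']
```

Stripping or replacing the reported characters before pasting is the client-side workaround; converting the column and connection to utf8mb4 is the server-side fix discussed above.
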
sean-k-mooneyso that has been in my modeline for a number of years but they may have changed the encoding or unicode character at some point16:28
clarkbit is possible older versions simply truncated the string. I seem to recall etherpad did something similar16:31
clarkbbut as we've updated things to keep up with python releases the error is less forgiving16:31
sean-k-mooneyya thats possible i can just avoid copying the prompt line for now16:32
clarkbits probably the sysadmin in me but I avoid all non ascii characters as much as possible for things I rely on16:32
sean-k-mooneyits nice locally to have the extra info but if im sharing command output its not really needed16:32
clarkbblowing up on utf8 is a common issue16:32
fungithere are a bunch of snake codepoints, so might have been using a different one previously?16:33
clarkbthe matrix gerritbot container broke because haskell logging decided to do utf8 checkmarks by default then the container image they built it in didn't have a utf8 locale present... as a non mysql example16:33
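
The same class of failure is easy to reproduce in Python, as an analogy only: the gerritbot is a Haskell program and its actual logging code is not shown in this log. The sketch assumes the checkmark was U+2713 and uses a made-up log message; `can_encode` is a hypothetical helper.

```python
def can_encode(text: str, encoding: str) -> bool:
    """Report whether text can be encoded without error in the given encoding."""
    try:
        text.encode(encoding)
    except UnicodeEncodeError:
        return False
    return True


# Hypothetical log message starting with a checkmark (assumed U+2713).
message = "\u2713 build succeeded"

print(can_encode(message, "ascii"))  # False
print(can_encode(message, "utf-8"))  # True

# A tolerant fallback: replace anything unencodable instead of crashing.
safe = message.encode("ascii", errors="replace").decode("ascii")
print(safe)  # ? build succeeded
```

Whether to fail loudly or substitute a placeholder is a design choice; for logging, degrading the glyph is usually preferable to losing the process.
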
sean-k-mooneyya given how long utf8mb3 has been deprecated i expect most installs to be using 4 byte now but there are a number of possibilities16:33
fungithere are 5 different egyptian snake hieroglyphs alone, all of which are 3-byte codepoints16:34
sean-k-mooneyi dont think this is important enough to consume your time16:34
clarkbI think rhel 7 also doesn't do C.utf816:34
fungistill?16:34
fungioh, right, *7*16:34
fungii keep LC_CTYPE, LC_COLLATE and LC_MEASUREMENT all set to C.UTF-8 on my personal systems16:36
clarkbsean-k-mooney: fwiw on services like gerrit and etherpad we do test 4 byte utf8 works16:36
clarkbthough the gerrit test may have been a one off16:36
clarkbI pushed a change with a ton of different characters to ensure they rendered (and they mostly do)16:36
sean-k-mooneyclarkb: before i fix this up locally to use something else16:37
sean-k-mooneyme posting variations of my modeline until it works wont break anything right16:37
fungiwe first experienced the issue during a design summit when someone crashed the therpad server with a snowman16:37
fungier, the etherpad server16:37
clarkbsean-k-mooney: ya I think that is fine. Mariadb is just rejecting the data in the first place and lodgeit bubbles that up as a 502 apparently16:38
sean-k-mooneythat's impressive16:38
clarkbI don't think it is impacting anything in the db proper16:38
sean-k-mooneyi dont tend to deep dive into a lot of customisation but i wonder if starship has an ascii mode16:39
clarkbhttps://review.opendev.org/c/opendev/system-config/+/900379 the gerrit check16:39
clarkband ya its not automated but instead just something we can check periodically16:39
sean-k-mooneythat would be nice but if not ill just tweak the common things i use it for16:39
sean-k-mooneylike git branch and virtual env to not have unicode16:39
clarkbthe little tab indicators overlap with the chars in some cases which isn't great but it doesn't explode in gerrit either16:40
clarkbwget uses ""s that aren't ascii either16:40
fungiyeah, mixing "east asian width" characters into terminal content often leads to alignment issues for me16:41
clarkbhttps://opendev.org/opendev/system-config/commit/73d2d784e42fa2e04bd513857ca5f2ed62c4cc1e and here is gitea rendering that same file16:42
funginot to mention line-wrapping routines have to take them into account as consuming additional columns16:42
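
A line-wrapping routine can approximate this with the standard library. The sketch below is a simplification under stated assumptions: it counts wide (`W`) and fullwidth (`F`) characters as two columns, skips combining marks, and treats everything else as one column. Real terminals differ, particularly for ambiguous-width characters. `display_width` is a hypothetical helper name.

```python
import unicodedata


def display_width(text: str) -> int:
    """Estimate terminal columns: wide (W) and fullwidth (F) characters count as 2."""
    width = 0
    for ch in text:
        if unicodedata.combining(ch):
            continue  # combining marks share the column of their base character
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width


print(display_width("abc"))               # 3
print(display_width("\u65e5\u672c\u8a9e"))  # 6: three wide CJK characters
```

Padding or wrapping by `len(text)` instead of an estimate like this is what usually produces the misaligned columns described above.
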
fungifunny that it considers line 15 to contain ambiguous characters, but not line 516:43
clarkbif you hover the red outlined glyph it tells you what it is complaining about which is nice16:44
fungiit doesn't render the red outline for me, just the warning triangle16:45
fungimaybe that's a difference with its "dark mode"16:45
clarkbya I'm in light mode16:45
fungior maybe it's just a firefox issue, who knows16:45
clarkbis the second glyph on the line the U16:45
fungiyah, i still get the pop-up, but i had to hover over each character in turn to find the right one16:46
clarkbU+222A can be confused with U+005516:46
fungi0x55 is ascii "U"16:46
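
Rather than hovering over each glyph in turn, the non-ASCII characters in a line can be listed with their names. This is a small sketch using only `unicodedata`; it does not reproduce gitea's confusable detection, and `non_ascii_chars` is a hypothetical helper.

```python
import unicodedata


def non_ascii_chars(line: str) -> list[tuple[int, str, str]]:
    """Return (index, code point, Unicode name) for every non-ASCII character."""
    return [
        (i, f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for i, ch in enumerate(line)
        if ord(ch) > 0x7F
    ]


print(non_ascii_chars("A \u222a B"))  # [(2, 'U+222A', 'UNION')]
print(unicodedata.name("U"))          # LATIN CAPITAL LETTER U
```
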
clarkbmaybe it only considers things that can be confused with ascii because of problems with say urls?16:47
clarkbya16:47
clarkbthe X variants are maybe ok because they don't look like an ascii X?16:47
fungii guess it depends on the font, ❌ and X look pretty similar in my terminal, as do ✗ and x16:47
clarkb∪ and U render sufficiently different in my terminal font (source code pro) but look similar in the web font whatever is being used by gitea16:47
fungiyeah, maybe that's it. i can easily distinguish them all side-by-side in my terminal fwiw, but similarities in the gitea web interface are what matters for the warning, sure16:48
fungigerrit has a similar warning, i seem to recall, but that change isn't tripping it16:49
sean-k-mooneyok well https://paste.opendev.org/show/bh6euk9Dxw21pbJEigb6/ does not break it so i guess im good16:50
sean-k-mooneyfungi: i remember the first time stephen finucan added an emoji to a commit message and gerrit was not a fan16:51
sean-k-mooneythat hasn't been a problem in a very long time16:51
clarkbya I think the specific case I was trying to test was for the code editor. The RO rendering has been good for a long time16:52
sean-k-mooneyi dont tend to use emoji personally. i use emoticons :) but that's about it16:53
fungisame here!16:53
sean-k-mooneyso i rarely hit issues like this16:53
sean-k-mooneyclarkb: fungi  i know we do a lot in public but for future reference i assume the db log is not public right. i.e. i could not have debugged this myself even if i knew where to look16:58
fricklersean-k-mooney: yes that's only local on the server in this case16:59
sean-k-mooneyack cool16:59
sean-k-mooneyim not asking to change that just want to avoid wasting others' time if i can fix it myself16:59
clarkbyup we're generally careful about those logs because software is often bad about hiding secret info like passwords16:59
sean-k-mooneyyep understandable17:00
clarkbbut we're happy to dig into them and share when appropriate. I shared what i believed to be the relevant portion above. Happy to include more if people want it17:00
fungieven openstack itself is not great about hiding sensitive information when logging17:00
sean-k-mooneyespecially if you dont use oslo.config17:01
fungior turn on debug17:01
sean-k-mooneybut if you ever see something that should not be there for nova let me know17:01
sean-k-mooneywell no we try to make debug valid to have in production 17:01
fungiyeah, we consider those security hardening opportunity bugs17:01
sean-k-mooneywithout passwords17:01
sean-k-mooneyyep17:02
fungiit's gotten a lot better in recent years17:02
*** benj_2 is now known as benj_17:39
