paequ2@lemmy.today to Programmer Humor@lemmy.mlEnglish · 3 days agoUse this information wiselylemmy.mlimagemessage-square84fedilinkarrow-up1595arrow-down18
arrow-up1587arrow-down1imageUse this information wiselylemmy.mlpaequ2@lemmy.today to Programmer Humor@lemmy.mlEnglish · 3 days agomessage-square84fedilink
minus-squareAmazingAwesomator@lemmy.worldlinkfedilinkarrow-up123·3 days agoanother good one to sneak in there… thai zero-width space: U+200B cant see it, nothing reads it, and it makes everything error. : D
minus-squareanton@piefed.blahaj.zonelinkfedilinkEnglisharrow-up6·2 days agoThe right to left mark (U+2000F) can also be fun.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up35·3 days agoHmm … we should start collecting these. Anyone know of an existing list?
minus-squarefloquant@lemmy.dbzer0.comlinkfedilinkarrow-up93·3 days agohttps://github.com/minimaxir/big-list-of-naughty-strings/
minus-square∞🏳️⚧️Edie [it/its, she/her, fae/faer, love/loves, null/void, des/pair, none/use name]@lemmy.mllinkfedilinkarrow-up39·3 days agohttps://invisible-characters.com/
minus-squareJohnnyCanuck@lemmy.calinkfedilinkarrow-up3·2 days agoOh ho! I see what you did there!
minus-squareCanadaPlus@lemmy.sdf.orglinkfedilinkarrow-up1·edit-21 day agoIt’s the first one, the ㅤU+3164 Hangul filler, to save everyone else a comment source+browser console+copy-paste+hex editor/search.
minus-squarePartyAt15thAndSummit@lemmy.ziplinkfedilinkarrow-up1·2 days agoI’m not an expert in Glagolitic, but I have a feeling that next-to-none of its letters are supposed to be invisible.
minus-squareS_H_K@lemmy.dbzer0.comlinkfedilinkarrow-up3·2 days agoBefore I went to the comments I wished no one mentioned that. As a DBA I fucking hate you…
minus-squareAmazingAwesomator@lemmy.worldlinkfedilinkarrow-up3·2 days agoi am an SDET. this character destroys DBs… i am sorry :(
minus-squareCallMeAnAI@lemmy.worldlinkfedilinkarrow-up22·3 days agoCame here to say fuck the zero width space. I spent 90 hours in the depths of solr looking for this fucker who brought down our entire search index.
minus-squarederfunkatron@lemmy.worldlinkfedilinkEnglisharrow-up14·3 days agoI deal with shy hyphens a lot. They don’t display unless there’s a line break, so they get copied from various word docs or websites and end up in a database somewhere waiting to piss me off.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up7·3 days agoI’m guessing that they pasted code from inside Microsoft Word.
minus-squareCallMeAnAI@lemmy.worldlinkfedilinkarrow-up4·2 days agoNo. CMS updated to support new character set while solr did not. Not enough sanitization.
minus-squareOnno (VK6FLAB)@lemmy.radiolinkfedilinkarrow-up3·2 days agoI’ve had similar “fun” with the character defaults on MySQL, from memory for a time it was Swedish by default, rather than UTF.
another good one to sneak in there… thai zero-width space: U+200B
cant see it, nothing reads it, and it makes everything error. : D
The right to left mark (U+2000F) can also be fun.
Hmm … we should start collecting these.
Anyone know of an existing list?
https://github.com/minimaxir/big-list-of-naughty-strings/
https://invisible-characters.com/
ᅠ
Oh ho! I see what you did there!
It’s the first one, the ㅤU+3164 Hangul filler, to save everyone else a comment source+browser console+copy-paste+hex editor/search.
I’m not an expert in Glagolitic, but I have a feeling that next-to-none of its letters are supposed to be invisible.
Before I went to the comments I wished no one mentioned that. As a DBA I fucking hate you…
i am an SDET. this character destroys DBs… i am sorry :(
Came here to say fuck the zero width space. I spent 90 hours in the depths of solr looking for this fucker who brought down our entire search index.
I deal with shy hyphens a lot. They don’t display unless there’s a line break, so they get copied from various word docs or websites and end up in a database somewhere waiting to piss me off.
Yup
I’m guessing that they pasted code from inside Microsoft Word.
No. CMS updated to support new character set while solr did not. Not enough sanitization.
I’ve had similar “fun” with the character defaults on MySQL, from memory for a time it was Swedish by default, rather than UTF.