Programming Leftovers

-
Matthieu Caneill: Debsources, python3, and funky file names
Rumors are running that python2 is not a thing anymore.
Well, I'm certainly late to the party, but I'm happy to report that sources.debian.org is now running python3.
[...]
While transitioning to python3 and juggling left and right with str, bytes and unicode for internal objects, files, database entries and HTTP content, I stumbled upon a bug that has been there since day 1.
Quick recap if you're unfamiliar with this tool: Debsources displays the content of the source packages in the Debian archive. In other words, it's a bit like GitHub, but for the Debian source code.
And some pieces of software out there, that ended up in Debian packages, happen to contain files whose names can't be decoded to UTF-8. Interestingly enough, there's no such thing as a standard for file names: with a few exceptions that vary by operating system, any sequence of bytes can be a legit file name. And some sequences of bytes are not valid UTF-8.
Of course those files are rare, and using ASCII characters to name a file is a much more common practice than using bytes in a non-UTF-8 character encoding. But when you deal with almost 100 billion files on which you have no control (those files come from free software projects, and make their way into Debian without any renaming), it happens.
Now back to the bug: when trying to display such a file through the web interface, it would crash because it can't convert the file name to UTF-8, which is needed for the HTML representation of the page.
-
gfldex: Iterative golfing
-
Rakudo Weekly News: 2022.04 Unsigned Merge
This week’s good news is that the about 2 months of work of Stefan Seifert to properly support native unsigned integers, was merged. This work, and Daniel Green‘s work on using the mimalloc memory allocation library, however caused some unexpected ecosystem fallout. Instead of reverting all of this work (which would have been a big task in itself), it was decided to move forward by skipping the 2022.01 Rakudo Compiler release: the next release will be 2022.02. Which has an odd symmetry to it!
-
DIY “Solid State Drive” Puts Four Bytes In Your Pocket | Hackaday
In a relatively short amount of time, the average capacity of USB flash drives has skyrocketed. It wasn’t so long ago that two and four gigabyte drives were considered to be on the high end, but today you can grab a 512 GB drive for less than $50 USD. In fact they’ve gotten so large that it can feel wasteful using them for some tasks, and we occasionally find ourselves wishing we could find some modern USB drives that didn’t rival the storage capacity of our whole computer.
That said, this USB-C tetrabyte drive created by [Glen Akins] might be slightly too small for our tastes. No, that’s not a typo. As in the Greek tetra, this drive can hold a massive four bytes at a time. Even better, you don’t need a computer to write to it: the 32 DIP switches let you key in the content on the fly, bit-by-bit.
-
TSDgeos' blog: Okular: Signing of unsigned signature fields has landed
Up to today, Okular would kind of error out when opening a PDF file that contains a signature field that was unsigned (think like the old space in paper forms saying "sign here")
-
LLVM Clang Now Defaulting To The DWARFv5 Debug Format - Phoronix
Following GCC, the LLVM Clang C/C++ compiler front-end is now defaulting to using the DWARFv5 debugging data format.
DWARFv5 was published in 2017 and offers faster symbol searching, better debugging for optimized code, improved data compression, improve descriptions for some elements of the code, new language codes, and other improvements over the decade old DWARFv4. The DWARFv5 specification and more details can be found at dwarfstd.org.
-

- Login or register to post comments
Printer-friendly version- 1842 reads
PDF version
More in Tux Machines
- Highlights
- Front Page
- Latest Headlines
- Archive
- Recent comments
- All-Time Popular Stories
- Hot Topics
- New Members
digiKam 7.7.0 is released
After three months of active maintenance and another bug triage, the digiKam team is proud to present version 7.7.0 of its open source digital photo manager. See below the list of most important features coming with this release.
|
Dilution and Misuse of the "Linux" Brand
|
Samsung, Red Hat to Work on Linux Drivers for Future Tech
The metaverse is expected to uproot system design as we know it, and Samsung is one of many hardware vendors re-imagining data center infrastructure in preparation for a parallel 3D world.
Samsung is working on new memory technologies that provide faster bandwidth inside hardware for data to travel between CPUs, storage and other computing resources. The company also announced it was partnering with Red Hat to ensure these technologies have Linux compatibility.
|
today's howtos
|








.svg_.png)
Content (where original) is available under CC-BY-SA, copyrighted by original author/s.

Recent comments
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago
1 year 11 weeks ago