This looks really cool! Though it's a bit limited since it is a FUSE module and not a kernel driver, and it's unlikely to become a kernel module since it is written in C++ with large dependencies :-\
Would it be possible to take the core design changes here and apply them to squashfs, and maybe propose a next major version of the squashfs internal format to make all these things possible?
I would use this if it didn't depend on OS-specific features. Squashfs is not portable to Windows, unless you extract it to disk.
I actually prefer the jar/Tomcat model, where the read-only image gets distributed to servers, and when you run the app the image gets unpacked to disk as needed. You could also write I/O wrappers that would obviate the need to extract them to disk, and you could even make compression optional to reduce performance hits.
It seems like all you really need is a virtual filesystem implemented as a userspace I/O wrapper. Basically FUSE, but only for the one app. There's no need for the FUSE kernel shim because only the application is writing to its own virtual filesystem. So this would work on any operating system that lets applications overload system calls.
For example, I would start with this project http://avf.sourceforge.net/ and modify it to run apps bundled with itself. With FUSE installed, other apps could interact with its virtual filesystem, but without FUSE, it could still access its own virtual filesystem in an archive. I would then extend it by shimming in a copy-on-write filesystem to stack modifications in a secondary archive.
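In case it's useful, here's a rough sketch of the syscall-overload half on Linux via LD_PRELOAD; the /vfs/ prefix and the vfs_open_from_archive() helper are made up for illustration:

    /* Hypothetical LD_PRELOAD shim: redirect open() calls under a
       virtual prefix into the bundled archive. */
    #define _GNU_SOURCE
    #include <dlfcn.h>
    #include <fcntl.h>
    #include <stdarg.h>
    #include <string.h>

    extern int vfs_open_from_archive(const char *path); /* hypothetical */

    int open(const char *path, int flags, ...)
    {
        static int (*real_open)(const char *, int, ...);
        if (!real_open)
            real_open = (int (*)(const char *, int, ...))dlsym(RTLD_NEXT, "open");

        /* Paths under the virtual mount point are served from the archive. */
        if (strncmp(path, "/vfs/", 5) == 0)
            return vfs_open_from_archive(path + 5);

        mode_t mode = 0;
        if (flags & O_CREAT) {
            va_list ap;
            va_start(ap, flags);
            mode = va_arg(ap, mode_t);
            va_end(ap);
        }
        return real_open(path, flags, mode);
    }

The copy-on-write layer would then hook the write-side calls the same way and divert them into the secondary archive.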
However, since it's a FUSE-only file system, it's difficult to see how it would be used in embedded system firmware; instead, it could see use as a distribution mechanism. Similar to tar or zip files, but possibly with (much) better performance for random access, should you need only a small portion of the whole archive.
The author indicates a need to keep multiple similar copies of sets of unchanging files on their computer, and made this to reduce the space needed for them while retaining access through the file system. So that is also a use case.
DwarFS may be good, but it's not in the Linux kernel (it depends on FUSE). That makes it less universal, potentially significantly slower for some use cases, and also less thoroughly tested. SquashFS is used by a lot of embedded Linux distros, among other use cases, so we can have pretty high confidence in its correctness.
Awesome work. Did your team evaluate creating a virtual filesystem that could process the SquashFS images without involving the kernel? Having completely independent executables that could run on _any_ system with zero additional install would be sweet.
To clarify - a stub in each XAR would act as a filesystem driver and intercept calls to open/read/etc, redirecting them to the internal data blob.
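If it helps to make that concrete, here's a rough sketch of the redirection half, assuming (purely for illustration) that the build step appends the image to the binary and stores its start offset as a trailing 8-byte footer:

    #include <fcntl.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <unistd.h>

    /* Assumed layout (made up for this sketch): the filesystem image is
       appended to the executable, followed by its start offset as an
       8-byte footer. */
    static off_t image_start(int fd)
    {
        uint64_t off = 0;
        off_t end = lseek(fd, 0, SEEK_END);
        pread(fd, &off, sizeof off, end - (off_t)sizeof off);
        return (off_t)off;
    }

    int main(void)
    {
        int self = open("/proc/self/exe", O_RDONLY); /* Linux-specific */
        off_t base = image_start(self);

        /* An intercepted read at some offset within a file in the image
           becomes a pread at base + that offset into the stub's binary. */
        char buf[4096];
        ssize_t n = pread(self, buf, sizeof buf, base);
        printf("read %zd bytes from the embedded image\n", n);
        return 0;
    }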
We actually started with using "real" squashfs files. This had three main disadvantages:
- We had to maintain our own setuid executable to perform the loopback setup and mount (roughly the steps sketched after this list), rather than relying on the far more tested and secure open source fusermount setuid binary that all FUSE file systems rely on
- Getting loopback devices to behave inside of containers (generally cgroup and mount namespace containers) was a little tricky at times in some of our environments
- We didn't want to have a huge number of extra loopback devices on every host in our fleet
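For the curious, here's a minimal sketch of roughly what such a setuid helper boils down to on Linux (error handling stripped; the privileges involved are exactly why we didn't want to own this code):

    #include <fcntl.h>
    #include <stdio.h>
    #include <sys/ioctl.h>
    #include <sys/mount.h>
    #include <linux/loop.h>

    int main(int argc, char **argv)
    {
        if (argc != 3) {
            fprintf(stderr, "usage: %s image.squashfs mountpoint\n", argv[0]);
            return 1;
        }

        /* Grab a free loop device; this and mount(2) need privileges,
           hence the setuid bit. */
        int ctl = open("/dev/loop-control", O_RDWR);
        long devnr = ioctl(ctl, LOOP_CTL_GET_FREE);

        char dev[64];
        snprintf(dev, sizeof dev, "/dev/loop%ld", devnr);

        /* Bind the image file to the loop device. */
        int img = open(argv[1], O_RDONLY);
        int loopfd = open(dev, O_RDWR);
        ioctl(loopfd, LOOP_SET_FD, img);

        /* Mount the loop device read-only. */
        return mount(dev, argv[2], "squashfs", MS_RDONLY, NULL) ? 1 : 0;
    }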
In fact, after implementing the loopback-based filesystem version, we almost abandoned XAR because the security considerations and in-container behavior weren't ideal. The open source squashfuse FUSE filesystem really is what made it possible.
Another side benefit is that we could iterate far faster with squashfuse -- this let us fix some performance issues, add idle unmounting, and implement zstd-based squashfs files, and then deploy all of that to our fleet faster than we could deploy a new kernel to 100% of hosts.
And squashfs works the same way - mksquashfs takes a directory as input and writes a file as output. That file can then be loopback-mounted to present the readonly filesystem.
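Concretely, something like this (the zstd option assumes squashfs-tools built with zstd support):

    mksquashfs rootdir/ image.squashfs -comp zstd
    mount -o loop,ro image.squashfs /mnt/image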
I haven't had a chance to use it yet, but https://github.com/mhx/dwarfs claims to be many times faster than squashfs, to compress much better, and to have full FUSE support.
Surprisingly, the article doesn't seem to mention SquashFS[1] or EROFS[2].
Both SquashFS and EROFS are filesystems specifically designed for this kind of embedded, read-only use case. The former is optimized for high data density and compression and is already well established. The latter is comparatively new and optimized for high read speed.[3] SquashFS as a rootfs can already be found in many embedded applications using flash storage and is typically combined with tmpfs and persistent storage mount points or overlay mounts.
For both those filesystems, one would build a rootfs image offline. In the Debian ecosystem, there already exists a tool that can bootstrap a Debian image into SquashFS[4].
What a coincidence! For a project that I maintain (Buildbarn, a distributed build cluster for Bazel), I recently generalized all of the FUSE code I had into a generic VFS that can be exposed over both FUSE and NFSv4. The intent was the same: to provide a better out-of-the-box experience on macOS. Here's a design doc I wrote on that change. Slight warning that it's written with some Buildbarn knowledge in mind.
Fortunately, fuse-t doesn't make any of my work unnecessary. Buildbarn uses go-fuse, which talks to the FUSE character device directly instead of going through libfuse. fuse-t would thus not be a drop-in replacement. Phew!
PS: A bit unfortunate that fuse-t isn't Open Source. :-(
Slightly related: we also recently switched to SquashFS for gokrazy.org's root file systems.
If you’re curious about how SquashFS works under the hood, check out https://github.com/gokrazy/internal/blob/master/squashfs/wri.... I also intend to publish a poster about it at some point.