Commit Graph

50 Commits

b49d9e9ece Renamed REPOP* -> RE*
So:

- cfg.gc_repoplookahead_thresh -> cfg.gc_relookahead_thresh
- cfg.gc_repopgbmap_thresh     -> cfg.gc_regbmap_thresh
- cfg.gbmap_repop_thresh       -> cfg.gbmap_re_thresh
- LFS3_*_REPOPLOOKAHEAD        -> LFS3_*_RELOOKAHEAD
- LFS3_*_REPOPGBMAP            -> LFS3_*_REGBMAP

Mainly trying to reduce the mouthful that is REPOPLOOKAHEAD and
REPOPGBMAP.

As a plus this also avoids potential confusion of "repop" as a push/pop
related operation.
2025-10-24 00:16:37 -05:00
ffc40da878 scripts: Reworked tagrepr -> Tag.repr to rely more on self-parsing
This should make tag editing less tedious/error-prone. We already used
self-parsing to generate -l/--list in dbgtag.py, but this extends the
idea to tagrepr (now Tag.repr), which is used in quite a few more
scripts.

To make this work, the little tag encoding spec had to become a bit
more rigorous. Fortunately the only real change was the addition of '+'
characters to mark reserved-but-expected-zero bits.

Example:

  TAG_CKSUM = 0x3000  ## v-11 ---- ++++ +pqq
                         ^--^----^----^--^-^-- valid bit, unmatched
                            '----|----|--|-|-- matches 1
                                 '----|--|-|-- matches 0
                                      '--|-|-- reserved 0, unmatched
                                         '-|-- perturb bit, unmatched
                                           '-- phase bits, unmatched

  dbgtag.py 0x3000  =>  cksumq0
  dbgtag.py 0x3007  =>  cksumq3p
  dbgtag.py 0x3017  =>  cksumq3p 0x10
  dbgtag.py 0x3417  =>  0x3417

Though Tag.repr still does a bit of manual formatting for the
differences between shrub/normal/null/alt tags.

Still, this should reduce the number of things that need to be changed
from 2 -> 1 when adding/editing most new tags.
2025-10-24 00:15:21 -05:00
3f15b61c72 scripts: dbgflags.py: Added LFS3_SEEK_* flags for completeness
This required a bit of a hack: LFS3_seek_MODE, which is marked internal
to try to minimize confusion, but really doesn't exist in the code at
all.

But a hack is probably good enough for now.
2025-10-24 00:14:32 -05:00
0c0643d5d7 scripts: Adopted self-parsing script for dbgflags/err.py encoding
This has just proven much easier to tweak in dbgtag.py, so adopting the
same self-parsing pattern in dbgflags.py/dbgerr.py. This makes editing
easier by (1) not needing to worry about parens/quotes/commas, and
(2) allowing for non-python expressions, such as the mode flags in
dbgflags.py.

The only concern is script startup may be slightly slower, but we really
don't care.
2025-10-24 00:13:40 -05:00
8a58954828 trv: Reduced LFS3_t_CKPOINTED + LFS3_t_MUTATED -> LFS3_t_CKPOINTED
This drops LFS3_t_MUTATED in favor of just using LFS3_t_CKPOINTED
everywhere:

1. These meant roughly the same thing, with LFS3_t_MUTATED being a bit
   tighter at the cost of needing to be explicitly set.

2. The implicit setting of LFS3_t_CKPOINTED by lfs3_alloc_ckpoint -- a
   function that already needs to be called before mutation -- means we
   have one less thing to worry about.

   Implicit properties like LFS3_t_CKPOINTED are great for building a
   reliable system. Manual flags like LFS3_t_MUTATED, not so much.

3. Why use two flags when we can get away with one?

The only downside is we may unnecessarily clobber gc/traversal work when
we don't actually mutate the filesystem. Failed file open calls are a
good example.

However this tradeoff seems well worth it for an overall simpler +
more reliable system.
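
As a rough sketch of what this buys us, checking off gc/traversal work
now reduces to a single flag test (the flag value appears in an earlier
commit below; the struct and helper here are hypothetical):

  #include <stdint.h>
  #include <stdbool.h>

  #define LFS3_t_CKPOINTED 0x01000000  // fs ckpointed during traversal

  typedef struct lfs3_trv {
      uint32_t flags;
  } lfs3_trv_t;

  // gc/traversal work can only be checked off if no allocator ckpoint
  // -- and thus no possible mutation -- happened mid-traversal
  static bool lfs3_trv_cancheckoff(const lfs3_trv_t *trv) {
      return !(trv->flags & LFS3_t_CKPOINTED);
  }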

---

Saves a bit of code:

                 code          stack          ctx
  before:       37220           2352          688
  after:        37160 (-0.2%)   2352 (+0.0%)  688 (+0.0%)

                 code          stack          ctx
  gbmap before: 40184           2368          856
  gbmap after:  40132 (-0.1%)   2368 (+0.0%)  856 (+0.0%)
2025-10-24 00:12:32 -05:00
5d70e47708 trv: Reverted LFS3_t_NOSPC, forward gbmap repop errors
Note: This affects the blocking lfs3_alloc_repopgbmap as well as
incremental gc/traversal repopulations. Now all repop attempts return
LFS3_ERR_NOSPC when we don't have space for the gbmap, motivation below.

This reverts the previous LFS3_t_NOSPC soft error, in which traversals
were allowed to continue some gc/traversal work when encountering
LFS3_ERR_NOSPC. This results in a simpler implementation and fewer error
cases to worry about.

Observation/motivation:

- The main motivation is noticing that when we're in low-space
  conditions, we just start spamming gbmap repops even if they all fail.

  That's really not great! We might as well just mark the flash as dead
  if we're going to start spamming erases!

  At least with an error the user can call rmgbmap to try to make
  progress (see the sketch after these bullets).

- If we're in a low-space condition, something else will probably return
  LFS3_ERR_NOSPC anyways. Might as well report this early and simplify
  our system.

- It's a simpler model, and littlefs3 is already much more complicated
  than littlefs2. Maybe we should lean more towards a simpler system
  at the cost of some niche optimizations.
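
A hedged sketch of that recovery path (lfs3_fs_gc is referenced
elsewhere in this log, while lfs3_fs_rmgbmap is only described as
planned, so both signatures are assumptions):

  #include "lfs3.h"  // assumed header providing lfs3_t, LFS3_ERR_NOSPC

  // if incremental gc can't repopulate the gbmap for lack of space,
  // drop the gbmap entirely so other work can make progress
  static int gc_with_gbmap_fallback(lfs3_t *lfs) {
      int err = lfs3_fs_gc(lfs);
      if (err == LFS3_ERR_NOSPC) {
          err = lfs3_fs_rmgbmap(lfs);
      }
      return err;
  }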

---

This had the side-effect of causing more lfs3_alloc_ckpoints to return
errors during testing, which revealed a bug in our uz/uzd_fuzz tests:

- We weren't flushing after writes to the opened RDWR files, which could
  cause delayed errors to occur during the later read checks in the
  test.

  Fortunately LFS3_O_FLUSH provides a quick and easy fix!

  Note we _don't_ adopt this in all uz/uzd_fuzz tests, only those that
  error. It's good to test both with and without LFS3_O_FLUSH to test
  that read-flushing also works under stress.

Saves a bit of code:

                 code          stack          ctx
  before:       37260           2352          688
  after:        37220 (-0.1%)   2352 (+0.0%)  688 (+0.0%)

                 code          stack          ctx
  gbmap before: 40220           2368          856
  gbmap after:  40184 (-0.1%)   2368 (+0.0%)  856 (+0.0%)
2025-10-24 00:03:14 -05:00
f892d299dd trv: Added LFS3_t_NOSPC, avoid ENOSPC errors in traversals
This relaxes errors encountered during lfs3_mtree_gc to _not_ propagate,
but instead just log a warning and prevent the relevant work from being
checked off during EOT.

The idea is this allows other work to make progress in low-space
conditions.

I originally meant to limit this to gbmap repopulations, to match the
behavior of lfs3_alloc_repopgbmap, but I think extending the idea to all
filesystem mutating operations makes sense (LFS3_T_MKCONSISTENT +
LFS3_T_REPOPGBMAP + LFS3_T_COMPACTMETA).

---

To avoid incorrectly marking traversal work as completed, we need to
track if we hit any ENOSPC errors, thus the new LFS3_t_NOSPC flag:

  LFS3_t_NOSPC  0x00800000  Optional gc work ran out of space

Not the happiest just throwing flags at problems, but I can't think of a
better solution at the moment.

This doesn't differentiate between ENOSPC errors during the different
types of work, but in theory if we're hitting ENOSPC errors whatever
work returns the error is a toss-up anyways.

---

Adds a bit of code:

                 code          stack          ctx
  before:       37208           2352          688
  after:        37248 (+0.1%)   2352 (+0.0%)  688 (+0.0%)

                 code          stack          ctx
  gbmap before: 40120           2368          856
  gbmap after:  40204 (+0.2%)   2368 (+0.0%)  856 (+0.0%)
2025-10-24 00:00:39 -05:00
1f824a029b Renamed LFS3_T_COMPACT -> LFS3_T_COMPACTMETA (and gc_compactmeta_thresh)
- LFS3_T_COMPACT -> LFS3_T_COMPACTMETA
- gc_compact_thresh -> gc_compactmeta_thresh

And friends:

  LFS3_M_COMPACTMETA   0x00000800  Compact metadata logs
  LFS3_GC_COMPACTMETA  0x00000800  Compact metadata logs
  LFS3_I_COMPACTMETA   0x00000800  Filesystem may have uncompacted metadata
  LFS3_T_COMPACTMETA   0x00000800  Compact metadata logs

---

This does two things:

1. Highlights that LFS3_T_COMPACTMETA only interacts with metadata logs,
   and has no effect on data blocks.

2. Better matches the verb+noun names used for other gc/traversal flags
   (REPOPGBMAP, CKMETA, etc).

It is a bit more of a mouthful, but I'm not sure that's entirely a bad
thing. These are pretty low-level flags.
2025-10-23 23:54:57 -05:00
9bdfb25a09 Renamed LFS3_T_LOOKAHEAD -> LFS3_T_REPOPLOOKAHEAD
And friends:

  LFS3_M_REPOPLOOKAHEAD   0x00000200  Repopulate lookahead buffer
  LFS3_GC_REPOPLOOKAHEAD  0x00000200  Repopulate lookahead buffer
  LFS3_I_REPOPLOOKAHEAD   0x00000200  Lookahead buffer is not full
  LFS3_T_REPOPLOOKAHEAD   0x00000200  Repopulate lookahead buffer

To match LFS3_T_REPOPGBMAP, which is more-or-less the same operation.
Though this does turn into quite the mouthful...
2025-10-23 23:54:02 -05:00
3b4e1e9e0b gbmap: Renamed gbmap_rebuild_thresh -> gbmap_repop_thresh
And tweaked a few related comments.

I'm still on the fence with this name, I don't think it's great, but it
at least better describes the "repopulation" operation than
"rebuilding" does. The important distinction is that we don't throw away
information. Bad/erased block info (future) is still carried over into
the new gbmap snapshot, and persists unless you explicitly call
rmgbmap + mkgbmap.

So, adopting gbmap_repop_thresh for now to see if it's just a habit
thing, but may adopt a different name in the future.

As a plus, gbmap_repop_thresh is two characters shorter.
2025-10-23 23:51:18 -05:00
06bc4dff04 trv: Simplified MUTATED/DIRTY flags, no more swapping
A bit less simplified than I hoped, we don't _strictly_ need both
LFS3_t_DIRTY + LFS3_t_MUTATED if we're ok with either (1) making
multiple passes to confirm fixorphans succeeded or (2) clearing the COMPACT
flag after one pass (which may introduce new uncompacted metadata). But
both of these have downsides, and we're not _that_ stressed for flag
space yet...

So keeping all three of:

  LFS3_t_DIRTY      0x04000000  Filesystem modified outside traversal
  LFS3_t_MUTATED    0x02000000  Filesystem modified during traversal
  LFS3_t_CKPOINTED  0x01000000  Filesystem ckpointed during traversal

But I did manage to get rid of the bit swapping by tweaking LFS3_t_DIRTY
to imply LFS3_t_MUTATED instead of being exclusive. This removes the
"failed" gotos in lfs3_mtree_gc and makes things a bit more readable.

---

I also split lfs3_fs/handle_clobber into separate lfs3_fs/handle_clobber
and lfs3_fs/handle_mutate functions. This added a bit of code, but I
think is worth it for a simpler internal API. A confusing internal API
is no good.
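
A minimal sketch of the resulting logic, using the flag values above
(the helper names follow the commit, the struct is hypothetical):

  #include <stdint.h>

  #define LFS3_t_DIRTY   0x04000000  // fs modified outside traversal
  #define LFS3_t_MUTATED 0x02000000  // fs modified during traversal

  typedef struct lfs3_trv {
      uint32_t flags;
  } lfs3_trv_t;

  // DIRTY now implies MUTATED instead of being exclusive, so
  // clobbering is a strict superset of mutating -- no bit swapping
  static void lfs3_handle_mutate(lfs3_trv_t *trv) {
      trv->flags |= LFS3_t_MUTATED;
  }

  static void lfs3_handle_clobber(lfs3_trv_t *trv) {
      trv->flags |= LFS3_t_DIRTY | LFS3_t_MUTATED;
  }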

In total these simplifications saved a bit of code:

                 code          stack          ctx
  before:       37208           2360          684
  after:        37176 (-0.1%)   2360 (+0.0%)  684 (+0.0%)

                 code          stack          ctx
  gbmap before: 40100           2432          848
  gbmap after:  40060 (-0.1%)   2432 (+0.0%)  848 (+0.0%)
2025-10-23 23:41:43 -05:00
f5508a1b6c gbmap: Added LFS3_T_REBUILDGBMAP and friends
This adds LFS3_T_REBUILDGBMAP and friends, and enables incremental gbmap
rebuilds as a part of gc/traversal work:

  LFS3_M_REBUILDGBMAP   0x00000400  Rebuild the gbmap
  LFS3_GC_REBUILDGBMAP  0x00000400  Rebuild the gbmap
  LFS3_I_REBUILDGBMAP   0x00000400  The gbmap is not full
  LFS3_T_REBUILDGBMAP   0x00000400  Rebuild the gbmap

On paper, this is more or less identical to repopulating the lookahead
buffer -- traverse the filesystem, mark blocks as in-use, adopt the new
gbmap/lookahead buffer on success -- but a couple nuances make
rebuilding the gbmap a bit trickier:

- Unlike the lookahead buffer, which is zeroed eagerly during
  allocation, we need an explicit zeroing pass before marking blocks as
  in-use. This means multiple traversals can potentially conflict with
  each other, risking the adoption of a clobbered gbmap.

- The gbmap, which stores information on disk, relies on block
  allocation and the temporary "in-flight window" defined by allocator
  ckpoints to avoid circular block states during gbmap rebuilds. This
  makes gbmap rebuilds sensitive to allocator ckpoints, which we
  consider more-or-less a noop in other parts of the system.

  Though now that I'm writing this, it might have been possible to
  instead include gbmap rebuild snapshots in fs traversals... but that
  would probably have been much more complicated.

- Rebuilding the gbmap requires writing to disk and is generally much
  more expensive/destructive. We want to avoid trying to rebuild the
  gbmap when it's not possible to actually make progress.

On top of this, the current trv-clobber system is a delicate,
error-prone mess.

---

To simplify everything related to gbmap rebuilds, I added a new
internal traversal flag: LFS3_t_CKPOINTED:

  LFS3_t_CKPOINTED  0x04000000  Filesystem ckpointed during traversal

LFS3_t_CKPOINTED is set, unconditionally, on all open traversals in
lfs3_alloc_ckpoint, and provides a simple, robust mechanism for checking
if _any_ allocator checkpoints have occurred since a traversal was
started. Since lfs3_alloc_ckpoint is required before any block
allocation, this provides a strong guarantee that nothing funny happened
to any allocator state during a traversal.
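
A hedged sketch of that mechanism, assuming an intrusive list of open
traversals (the list and field names are hypothetical; the flag and
behavior come from this commit):

  #include <stdint.h>

  #define LFS3_t_CKPOINTED 0x04000000  // fs ckpointed during traversal

  typedef struct lfs3_trv {
      struct lfs3_trv *next;
      uint32_t flags;
  } lfs3_trv_t;

  typedef struct lfs3 {
      lfs3_trv_t *trvs;  // list of open traversals
  } lfs3_t;

  // required before any block allocation, so every open traversal
  // learns that allocator state may have changed underneath it
  static void lfs3_alloc_ckpoint_mark(lfs3_t *lfs) {
      for (lfs3_trv_t *trv = lfs->trvs; trv; trv = trv->next) {
          trv->flags |= LFS3_t_CKPOINTED;
      }
      // ... the rest of the actual ckpoint logic ...
  }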

This makes lfs3_alloc_ckpoint a bit less cheap, but the strong
guarantees that allocator state is unmodified during traversal are well
worth it.

This makes both lookahead and gbmap passes simpler, safer, and easier to
reason about.

I'd like to adopt something similar+stronger for LFS3_t_MUTATED, and
reduce this back to two flags, but that can be a future commit.

---

Unfortunately due to the potential for recursion, this ended up reusing
less logic between lfs3_alloc_rebuildgbmap and lfs3_mtree_gc than I had
hoped, but at least the main chunks (lfs3_alloc_remap,
lfs3_gbmap_setbptr, lfs3_alloc_adoptgbmap) could be split out into
common functions.

The result is a decent chunk of code and stack, but the value is high as
incremental gbmap rebuilds are the only option to reduce the latency
spikes introduced by the gbmap allocator (it's not significantly worse
than the lookahead buffer, but both do require traversing the entire
filesystem):

                 code          stack          ctx
  before:       37164           2352          684
  after:        37208 (+0.1%)   2360 (+0.3%)  684 (+0.0%)

                 code          stack          ctx
  gbmap before: 39708           2376          848
  gbmap after:  40100 (+1.0%)   2432 (+2.4%)  848 (+0.0%)

Note the gbmap build is now measured with LFS3_GBMAP=1, instead of
LFS3_YES_GBMAP=1 (maybe-gbmap) as before. This includes the cost of
mkgbmap, lfs3_f_isgbmap, etc.
2025-10-23 23:39:55 -05:00
9d322741ca bmap: Simplified bmap configs, reduced to one LFS3_F_GBMAP flag
TLDR: This drops the idea of different bmap strategies/modes, and sorts
out most of the compile-time/runtime conditional bmap interactions.

---

Motivation: Benchmarking (at least up to the 32-bit word limit) has
shown the bmap is unlikely to be a significant bottleneck, even on large
disks. The largest disks tend to be NAND, and NAND's ridiculous block
size limits pressure on block allocation.

There are still concerns for areas I haven't measured yet:

- SD/eMMC/FTL - Small blocks, so more pressure on block allocation. In
  theory the logical block size can be artificially increased, but this
  comes with a granularity tradeoff.

- I've only measured throughput, latency is a whole other story.

  However, users have reported lfs3_fs_gc is useful for mitigating this,
  so maybe latency is less of a concern now?

But while there may still be room for improvement via alternative bmap
strategies, they risk a concerning amount of complexity. Yes,
configuration gets more complicated, but the real issue is any bmap
strategy that tries to track _deallocations_ (the original idea being
treediffing) risks leaking blocks if all cases aren't covered.

The current "bmap cache" strategy strikes a really nice balance where it
reduces _amortized_ block allocation -> ~O(log n) without RAM, while
retaining the safe, bug-resistant, single-source-of-truth properties
that come with lookahead-based allocation.

---

So, long story short, dropping other strategies, and now the presence of
the bmap is a boolean flag.

This is also the first format-specific flag:

- Define LFS3_BMAP to enable the bmap logic, but note by default the
  bmap will still not be used.

- Define LFS3_YES_BMAP to force the bmap to be used.

- With LFS3_BMAP, passing LFS3_F_GBMAP to lfs3_format will include the
  on-disk block-map.

- No flag is needed during mount, the presence of the bmap is determined
  by the on-disk wcompat flags (LFS3_WCOMPAT_GBMAP). This also prevents
  rw mounting if the bmap is not supported, but rdonly mounting is
  allowed.

- Users can check if the bmap is in use via lfs3_fs_stat, which reports
  LFS3_I_GBMAP in the flags field (see the sketch below).
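
Putting those rules together, a hedged usage sketch (compiled with
-DLFS3_BMAP; the flag names come from this commit, the exact function
signatures are assumptions):

  #include "lfs3.h"  // assumed header

  static int mkfs_with_gbmap(lfs3_t *lfs,
          const struct lfs3_config *cfg) {
      // include the on-disk block-map at format time
      int err = lfs3_format(lfs, cfg, LFS3_F_GBMAP);
      if (err) {
          return err;
      }
      // no flag needed at mount, the on-disk wcompat flags decide
      err = lfs3_mount(lfs, cfg);
      if (err) {
          return err;
      }
      // confirm the bmap is actually in use
      struct lfs3_fsinfo info;
      err = lfs3_fs_stat(lfs, &info);
      if (err) {
          return err;
      }
      return (info.flags & LFS3_I_GBMAP) ? 0 : LFS3_ERR_INVAL;
  }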

There are still some missing pieces, but these will be a bit more
involved:

- lfs3_fs_grow needs to be made bmap aware!

- We probably want something like lfs3_fs_mkgbmap and lfs3_fs_rmgbmap to
  allow converting between bmap backed/not-backed filesystem images.

Code changes minimal:

                code          stack          ctx
  before:      37172           2352          684
  after:       37172 (+0.0%)   2352 (+0.0%)  684 (+0.0%)

                code          stack          ctx
  bmap before: 38844           2456          800
  bmap after:  38852 (+0.0%)   2456 (+0.0%)  800 (+0.0%)
2025-10-09 14:33:27 -05:00
58c5506e85 Brought back lazy grafting, but not too lazy
Continued benchmarking efforts are indicating this isn't really an
optional optimization.

This brings back lazy grafting, where the file leaf is allowed to fall
out-of-date to minimize bshrub/btree updates. This is controlled by
LFS3_o_UNGRAFT, which is similar to, but independent from, LFS3_o_UNCRYST:

- LFS3_o_UNCRYST - File's leaf not fully crystallized
- LFS3_o_UNGRAFT - File's leaf does not match disk

Note it makes sense for files to be UNGRAFT only, in the case where the
current crystal terminates at the end-of-file but future appends are
likely. And it makes sense for files to be UNCRYST only, in cases where
we graft uncrystallized blocks so the bshrub/btree makes sense.

Which brings us to the main change from the previous lazy-grafting
implementation: lfs3_file_lookupnext no longer includes ungrafted
leaves.

Instead, functions should call lfs3_file_graft if they need
lfs3_file_lookupnext to make sense.
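
A minimal sketch of that internal pattern (field layout and signatures
here are assumptions):

  #include "lfs3.h"  // assumed header

  // graft first if the leaf may be out-of-date, since
  // lfs3_file_lookupnext no longer sees ungrafted leaves
  static int lookupnext_grafted(lfs3_t *lfs, lfs3_file_t *file,
          lfs3_off_t off) {
      if (file->flags & LFS3_o_UNGRAFT) {
          int err = lfs3_file_graft(lfs, file);
          if (err) {
              return err;
          }
      }
      return lfs3_file_lookupnext(lfs, file, off);
  }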

This significantly reduces the code cost of lazy grafting, at the risk
of needing to graft more frequently. Fortunately we don't actually need
to call lfs3_file_graft all that often:

- lfs3_file_read already flushes caches/leaves before attempting any
  bshrub/btree reads for simplicity (heavy interleaved reads/writes are
  not currently considered a priority, if you need this consider
  opening two file handles).

- lfs3_file_flush_ _does_ need to call lfs3_file_graft before the
  crystallization heuristic pokes, but if we can't resume
  crystallization, we would probably need to graft the crystal to
  satisfy the flush anyways.

---

Lazy grafting, i.e. procrastinating on bshrub/btree updates during block
appends, is an optimization previously dropped due to perceived
nicheness:

- We can only lazily graft blocks, inlined data fragments always require
  bshrub/btree updates since they live in the bshrub/btree.

- Sync forces bshrub/btree updates anyways, so lazy grafting has no
  benefit for most logging applications.

- This performance penalty of eagerly grafting goes away if your caches
  are large enough.

Note that the last argument is a non-argument in littlefs's case. The
whole point of littlefs is that you _don't_ need RAM to fix things.

However these arguments are all moot when you consider that the "niche
use case" -- linear file writes -- is the default bottleneck for most
applications. Any file operation becomes a linear write bottleneck when
the arguments are large enough. And this becomes a noticeable issue when
benchmarking.

So... This brings back lazy grafting. But with a more limited scope
w.r.t. internal file operations (the above lfs3_file_lookupnext/
lfs3_file_graft changes).

---

Long story short, lazy grafting is back again, reverting the ~3x
performance regression for linear file writes.

But now with quite a bit less code/stack cost:

           code          stack          ctx
  before: 36820           2368          684
  after:  37032 (+0.6%)   2352 (-0.7%)  684 (+0.0%)
2025-10-01 17:57:01 -05:00
ebae43898e bmap: Changing direction, store bmap mode in wcompat flags
The idea behind separate ctrled+unctrled airspaces was to try to avoid
multiple interpretations of the on-disk bmap, but I'm starting to think
this adds more complexity than it solves.

The main conflict is the meaning of "in-flight" blocks. When using the
"uncontrolled" bmap algorithm, in-flight blocks need to be
double-checked by traversing the filesystem. But in the "controlled"
bmap algorithm, blocks are only marked as "in-flight" while they are
truly in-flight (in-use in RAM, but not yet in use on disk).
Representing these both with the same "in-flight" state risks
incompatible algorithms misinterpreting the bmap across different
mounts.

In theory the separate airspaces solve this, but now all the algorithms
need to know how to convert the bmap from different modes, adding
complexity and code cost.

Well, in theory at least. I'm unsure separate airspaces actually solve
this due to subtleties between what "in-flight" means in the different
algorithms (note both in-use and free blocks are "in-flight" in the
unknown airspace!). It really depends on how the "controlled" algorithm
actually works, which isn't implemented/fully designed yet.

---

Long story short, due to a time crunch, I'm ripping this out for now and
just storing the current algorithm in the wcompat flags:

  LFS3_WCOMPAT_GBMAP       0x00006000  Global block-map in use
  LFS3_WCOMPAT_GBMAPNONE   0x00000000  Gbmap not in use
  LFS3_WCOMPAT_GBMAPCACHE  0x00002000  Gbmap in cache mode
  LFS3_WCOMPAT_GBMAPVFR    0x00004000  Gbmap in VFR mode
  LFS3_WCOMPAT_GBMAPIFR    0x00006000  Gbmap in IFR mode

Note GBMAPVFR/IFR != BMAPSLOW/FAST! At least BMAPSLOW/FAST can share
bmap representations:

- GBMAPVFR => Uncontrolled airspace, i.e. in-flight blocks may or may
  not be in use, need to traverse open files.

- GBMAPIFR => Controlled airspace, i.e. in-flight blocks are in use,
  at least until powerloss, no traversal needed, but requires more bmap
  writes.

- BMAPSLOW => Treediff by checking what blocks are in B but not in A,
  and what blocks are in A but not in B, O(n^2), but minimizes bmap
  updates.

  Can be optimized with a bloom filter.

- BMAPFAST => Treediff by clearing all blocks in A, and then setting all
  blocks in B, O(n), but also writes all blocks to the bmap twice even
  on small changes.

  Can be optimized with a sliding bitmap window (or a block hashtable,
  though a bitmap converges to the same thing in both algorithms when
  >=disk_size).

It will probably be worth unifying the bmap representation later (the
more algorithm-specific flags there are, the harder interop becomes for
users), but for now this opens a path to implementing/experimenting
with bmap algorithms without dealing with this headache.
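
In the meantime, a driver can decode the mode with a simple mask
(values from the table above; the helper itself is illustrative):

  #include <stdint.h>

  #define LFS3_WCOMPAT_GBMAP       0x00006000
  #define LFS3_WCOMPAT_GBMAPNONE   0x00000000
  #define LFS3_WCOMPAT_GBMAPCACHE  0x00002000
  #define LFS3_WCOMPAT_GBMAPVFR    0x00004000
  #define LFS3_WCOMPAT_GBMAPIFR    0x00006000

  // GBMAP is a 2-bit mode field in the wcompat flags, not a bool
  static uint32_t lfs3_wcompat_gbmapmode(uint32_t wcompat) {
      return wcompat & LFS3_WCOMPAT_GBMAP;
  }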
2025-10-01 17:56:08 -05:00
59a4ae6f61 bmap: Taught littlefs how to traverse the gbmap
Fortunately the btree traversal logic is pretty reusable, so this just
required an additional tstate (LFS3_TSTATE_BMAP).

This raises an interesting question: _when_ do we traverse the bmap? We
need to wait until at least mtree traversal completes for gstate to be
reconstructed during lfs3_mount, but I think traversing before file
btrees makes sense.
2025-10-01 17:55:27 -05:00
88180b6081 bmap: Initial scaffolding for on-disk block map
This is pretty exploratory work, so I'm going to try to be less thorough
in commit messages until the dust settles.

---

New tag for gbmapdelta:

  LFS3_TAG_GBMAPDELTA   0x0104  v--- ---1 ---- -1rr

New tags for in-bmap block types:

  LFS3_TAG_BMRANGE      0x033u  v--- --11 --11 uuuu
  LFS3_TAG_BMFREE       0x0330  v--- --11 --11 ----
  LFS3_TAG_BMINFLIGHT   0x0331  v--- --11 --11 ---1
  LFS3_TAG_BMINUSE      0x0332  v--- --11 --11 --1-
  LFS3_TAG_BMBAD        0x0333  v--- --11 --11 --11
  LFS3_TAG_BMERASED     0x0334  v--- --11 --11 -1--

New gstate decoding for gbmap:

  .---+- -+- -+- -+- -. cursor: 1 leb128  <=5 bytes
  | cursor            | known:  1 leb128  <=5 bytes
  +---+- -+- -+- -+- -+ block:  1 leb128  <=5 bytes
  | known             | trunk:  1 leb128  <=4 bytes
  +---+- -+- -+- -+- -+ cksum:  1 le32    4 bytes
  | block             | total:            23 bytes
  +---+- -+- -+- -+- -'
  | trunk         |
  +---+- -+- -+- -+
  |     cksum     |
  '---+---+---+---'
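
A hedged decode sketch matching that layout (the leb128 helper is
written inline here; littlefs's real helpers likely differ):

  #include <stdint.h>
  #include <stddef.h>

  // minimal leb128 decoder, returns bytes consumed, 0 on truncation
  static size_t fromleb128(const uint8_t *p, size_t n, uint32_t *v) {
      *v = 0;
      for (size_t i = 0; i < n && i < 5; i++) {
          *v |= (uint32_t)(p[i] & 0x7f) << (7*i);
          if (!(p[i] & 0x80)) {
              return i+1;
          }
      }
      return 0;
  }

  // decode the <=23-byte gbmap gstate: 4 leb128s + 1 le32 cksum
  static int gbmap_gstate_decode(const uint8_t *p, size_t n,
          uint32_t *cursor, uint32_t *known,
          uint32_t *block, uint32_t *trunk, uint32_t *cksum) {
      uint32_t *fields[4] = {cursor, known, block, trunk};
      size_t off = 0;
      for (int i = 0; i < 4; i++) {
          size_t d = fromleb128(p+off, n-off, fields[i]);
          if (!d) {
              return -1;
          }
          off += d;
      }
      if (n-off < 4) {
          return -1;
      }
      *cksum = (uint32_t)p[off]
              | (uint32_t)p[off+1] << 8
              | (uint32_t)p[off+2] << 16
              | (uint32_t)p[off+3] << 24;
      return 0;
  }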

New bmap node revdbg string:

  vvv---- -111111- -11---1- -11---1-  (62 62 7e v0  bb~r)  bmap node

New mount/format/info flags (still unsure about these):

  LFS3_M_BMAPMODE     0x03000000  On-disk block map mode
  LFS3_M_BMAPNONE     0x00000000  Don't use the bmap
  LFS3_M_BMAPCACHE    0x01000000  Use the bmap to cache lookahead scans
  LFS3_M_BMAPSLOW     0x02000000  Use the slow bmap algorithm
  LFS3_M_BMAPFAST     0x03000000  Use the fast bmap algorithm

New gbmap wcompat flag:

  LFS3_WCOMPAT_GBMAP  0x00002000  Global block-map in use
2025-10-01 17:55:13 -05:00
4b7a5c9201 trv: Renamed OMDIRS -> HANDLES, OBTREE -> HBTREE
Looks like these traversal states were missed in the omdir -> handle
rename. I think HANDLES and HBTREE states make sense:

- LFS3_TSTATE_OMDIRS -> LFS3_TSTATE_HANDLES
- LFS3_TSTATE_OBTREE -> LFS3_TSTATE_HBTREE
2025-07-21 16:47:24 -05:00
457a0c0487 alloc: Added the concept of block allocator flags
Currently this just has one flag that replaces the previous `erase`
argument:

  LFS3_ALLOC_ERASE  0x00000001  Please erase the block

Benefits include:

- Slightly better readability at lfs3_alloc call sites.

- Possibility of more allocator flags in the future:

  - LFS3_ALLOC_EMERGENCY - Use reserved blocks

  - Uh, that's all I can think of right now

No code changes.
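
A hedged before/after at a call site (the argument order and types are
assumptions):

  #include "lfs3.h"  // assumed header

  // previously the call site took a bare bool:
  //   err = lfs3_alloc(lfs, &block, true);
  // with the flag it reads as:
  static int alloc_erased(lfs3_t *lfs, lfs3_block_t *block) {
      return lfs3_alloc(lfs, block, LFS3_ALLOC_ERASE);
  }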
2025-07-18 16:42:45 -05:00
0828fd9bf3 Reverted LFS3_CKDATACKSUMREADS -> LFS3_CKDATACKSUMS
LFS3_CKDATACKSUMREADS is just too much.

The downside is it may not be clear how LFS3_CKDATACKSUMREADS interacts
with the future planned LFS3_CKREADS (LFS3_CKREADS implies
LFS3_CKDATACKSUMS + LFS3_CKMETAREDUND), but on the flip side you may
actually be able to type LFS3_CKDATACKSUMS on the first try.
2025-07-16 14:25:20 -05:00
090611af14 scripts: dbgflags.py: Tweaked internals for readability
Mainly just using 'P_NAME' instead of 'P', 'NAME' in the FLAGS table,
every bit of horizontal spacing helps with these definitions.
2025-07-04 18:08:11 -05:00
19747f691e scripts: dbgflags.py: Reimplemented filters as flags
So instead of:

  $ ./scripts/dbgflags.py o 0x10000003

The filter is now specified as a normal(ish) argparse flag:

  $ ./scripts/dbgflags.py --o 0x10000003

This is a bit easier to interop with in dbg.gdb.py, and I think a bit
more readable.

Though -a and --a now do _very_ different things. I'm sure that won't
confuse anyone...
2025-07-04 18:08:11 -05:00
a85f08cfe3 Dropped lazy grafting, but kept lazy crystallization
This merges LFS3_o_GRAFT into LFS3_o_UNCRYST, simplifying the file write
path and avoiding the mess that is ungrafted leaves.

---

This goes for a different lazy crystallization/grafting strategy that
was overlooked before. Instead of requiring all leaves to be both
crystallized and grafted, we allow leaves to be uncrystallized, but they
_must_ be grafted (in-tree) at all times.

This gets us most of the rewrite performance of lazy-crystallization,
without needing to worry about out-of-date file leaves.

Out-of-date file leaves were a headache for both code cost and concerns
around confusing filesystem states and related bugs.

Note LFS3_o_UNCRYST gets some extra behavior here:

- LFS3_o_UNCRYST indicates when crystallization is _necessary_, and no
  longer when crystallization is _possible_.

  We already keep track of when crystallization is _possible_ via bptr's
  erased-state, and this lets us control recrystallization in
  lfs3_file_flush_ without erased-state-clearing hacks (which probably
  wouldn't work with the future ddtree).

- We opportunistically clear the UNCRYST flag if it's not possible for
  future lfs3_file_crystallize_ calls to make progress:
  - When we crystallize a full block
  - When we hit the end of the file
  - When we hit a hole
  - When we hit an unaligned block

---

Note this does impact performance!

Unlike true lazy grafting, eagerly grafting means we're always
committing to the bshrub/btree more than is strictly necessary, and this
translates to more frequent btree node erases/compactions.

Current simulated benchmarks show a ~3x increase (~20us -> ~60us) in
write times for linear file writes on NOR flash.

However:

- The moment you need unaligned progs, this performance optimization
  goes out the window, as we need to graft bptrs before any padding
  fragments.

- This only kicks in once we start crystallizing. So any writes <
  crystal_thresh (both in new files and in between blocks) are forced
  to commit to the bshrub/btree every flush.

  This risks a difficult to predict performance characteristic.

- If you sync frequently (logging), we're forced to crystallize/graft
  anyways.

- The performance hit can be alleviated with either larger writes or
  larger caches, though I realize this goes against littlefs's
  "RAM-not-required" mantra.

Worst case, we can always bring back "lazy grafting" as a
high-performance option in the future.

Though note the above concerns around in-between/pre crystallization
performance. This may only make sense when cache_size >= both prog_size
and crystal_thresh.

And of course, there's a significant code tradeoff!

           code          stack          ctx
  before: 38020           2456          656
  after:  37588 (-1.1%)   2472 (+0.7%)  656 (+0.0%)

Uh, ignore that stack cost. The simplified logic leads to more functions
being inlined, which makes a mess of our stack measurements because we
don't take shrinkwrapping into account.
2025-07-03 18:04:18 -05:00
f967cad907 kv: Adopted LFS3_o_WRSET for better key-value API integration
This adds LFS3_o_WRSET as an internal-only 3rd file open mode (I knew
that missing open mode would come in handy) that has some _very_
interesting behavior:

- Do _not_ clear the configured file cache. The file cache is prefilled
  with the file's data.

- If the file does _not_ exist and is small, create it immediately in
  lfs3_file_open using the provided file cache.

- If the file _does_ exist or is not small, do nothing and open the file
  normally. lfs3_file_close/sync can do the rest of the work in one
  commit.

This makes it possible to implement one-commit lfs3_set on top of the
file APIs with minimal code impact (sketched after this list):

- All of the metadata commit logic can be handled by lfs3_file_sync_, we
  just call lfs3_file_sync_ with the found did+name in lfs3_file_opencfg
  when WRSET.

- The invariant that lfs3_file_opencfg always reserves an mid remains
  intact, since we go ahead and write the full file if necessary,
  minimizing the impact on lfs3_file_opencfg's internals.
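
A heavily hedged sketch of the shape this enables (every signature and
cfg field here is an assumption, and LFS3_o_WRSET is internal-only, so
this is illustrative rather than a recipe):

  #include "lfs3.h"  // assumed header

  static int set_sketch(lfs3_t *lfs, const char *path,
          void *buffer, lfs3_size_t size) {
      // prefill the file cache with the value; WRSET does not clear it
      struct lfs3_file_config fcfg = {
          .buffer = buffer,
          .buffer_size = size,
      };
      lfs3_file_t file;
      int err = lfs3_file_opencfg(lfs, &file, path,
              LFS3_o_WRSET | LFS3_O_CREAT, &fcfg);
      if (err) {
          return err;
      }
      // small+new files were already created in opencfg, everything
      // else is finished in one commit by close/sync
      return lfs3_file_close(lfs, &file);
  }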

This claws back most of the code cost of the one-commit key-value API:

              code          stack          ctx
  before:    38232           2400          636
  after:     37856 (-1.0%)   2416 (+0.7%)  636 (+0.0%)

  before kv: 37352           2280          636
  after kv:  37856 (+1.3%)   2416 (+6.0%)  636 (+0.0%)

---

I'm quite happy how this turned out. I was worried there for a bit the
key-value API was going to end up an ugly wart for the internals, but
with LFS3_o_WRSET this integrates quite nicely.

It also raises a really interesting question, should LFS3_o_WRSET be
exposed to users?

For now I'm going to play it safe and say no. While potentially useful,
it's still a pretty unintuitive API.

Another thing worth mentioning is that this does have a negative impact
on compile-time gc. Duplication adds code cost when viewing the system
as a whole, but tighter integration can backfire if the user never calls
half the APIs.

Oh well, compile-time opt-out is always an option in the future, and
users seem to care more about pre-linked measurements, probably because
it's an easier thing to find. Still, it's funny how measuring code can
have a negative impact on code. Something something Goodhart's law.
2025-06-22 15:37:07 -05:00
6eba1180c8 Big rename! Renamed lfs -> lfs3 and lfsr -> lfs3 2025-05-28 15:00:04 -05:00
f7e17c8aad Added LFS_T_RDONLY, LFS_T_RDWR, etc
These mimic the relevant LFS_O_* flags, and allow users to assert
whether or not a traversal will mutate the filesystem:

  LFS_T_MODE          0x00000001  The traversal's access mode
  LFS_T_RDWR          0x00000000  Open traversal as read and write
  LFS_T_RDONLY        0x00000001  Open traversal as read only

In theory, these could also change internal allocations, but littlefs
doesn't really work that way.

Note we _don't_ add related LFS_GC_RDONLY, LFS_GC_RDWR, etc flags. These
are sort of implied by the relevant LFS_M_* flags.
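
A small sketch of the assertion these enable, using the mask above (the
helper is illustrative):

  #include <stdint.h>

  #define LFS_T_MODE   0x00000001  // the traversal's access mode
  #define LFS_T_RDWR   0x00000000
  #define LFS_T_RDONLY 0x00000001

  // the mode mask makes "will this traversal mutate?" a cheap check
  static int lfs_t_isrdonly(uint32_t flags) {
      return (flags & LFS_T_MODE) == LFS_T_RDONLY;
  }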

Adds a bit more code, probably because of the slightly more complicated
internal constants for the internal traversals. But I think the
self-documentingness is worth it:

           code          stack          ctx
  before: 37200           2288          636
  after:  37220 (+0.1%)   2288 (+0.0%)  636 (+0.0%)
2025-05-24 23:27:10 -05:00
5b74aafa17 Reworked the flag encoding again
This time to account for the new LFS_o_UNCRYST and LFS_o_UNGRAFT flags.

This required moving the T flags out of the way, which of course
conflicted with TSTATE, so that had to move...

One thing that helped was shoving LFS_O_DESYNC up with the internal
state flags. It's definitely more a state flag than the other public
flags, it just also happens to be user toggleable.

Here's the new jenga:

              8     8     8     8
            .----++----++----++----.
            .-..----..-..-..-------.
  o_flags:  |t|| f  ||o||t||   o   |
            |-||-.--':-:|-|'--.-.--'
            |-||-|.----.|-'--------.
  t_flags:  |t||f||tstt||    t     |
            '-''-''----'|----.-----'
            .----..-.:-:|----|:-:.-.
  m_flags:  | m  ||c||o|| t  ||o||m|
            |----||-|'-'|-.--''-''-'
            |----||-|---|-|.-------.
  f_flags:  | m  ||c|   |t||   f   |
            '----''-'---'-''-------'

This adds a bit of code, but that's not the end of the world:

           code          stack          ctx
  before: 37172           2288          636
  after:  37200 (+0.1%)   2288 (+0.0%)  636 (+0.0%)
2025-05-24 22:21:39 -05:00
f5dd6f69e8 Renamed LFS_CKMETAPARITY and LFS_CKDATACKSUMREADS
- LFS_CKPARITY -> LFS_CKMETAPARITY
- LFS_CKDATACKSUMS -> LFS_CKDATACKSUMREADS

The goal here is to provide hints for 1. what is being checked (META,
DATA, etc), and 2. on what operation (FETCHES, PROGS, READS, etc).

Note that LFS_CKDATACKSUMREADS is intended to eventually be a part of a
set of flags that can pull off closed fully-checked reads:

- LFS_CKMETAREDUNDREADS - Check metadata redund blocks on reads
- LFS_CKDATACKSUMREADS - Check data checksums on reads
- LFS_CKREADS - LFS_CKMETAREDUNDREADS + LFS_CKDATACKSUMREADS

Also it's probably not a bad idea for LFS_CKMETAPARITY to be harder to
use. It's really not worth enabling unless you understand its
limitations (<1 bit of error detection, yay).

No code changes.
2025-05-24 21:55:45 -05:00
1cce0dab5c Reverted limiting file->leaf to reads + erased-state caching
Still on the fence about this, but in hindsight the code/stack
difference is not _that_ much:

           code          stack          ctx
  before: 36460           2280          636
  after:  37092 (+1.7%)   2304 (+1.1%)  636 (+0.0%)

Especially with the potential to significantly speed up linear file
writes/rewrites, which are usually the most common file operation. You
ever just, you know, write a whole file at once?

Note we can still add the previous behavior as an opt-in write strategy
to save code/stack when preferred over linear write/rewrite speed.

This is actually the main reason I think we should prefer
lazy-crystallization by default. Of the theoretical/future write
strategies, lazy-crystallization was the only one trading code/stack
for performance and not vice versa (global-alignment, linear-only,
fully-fragmented, etc).

If we default to a small, but less performant filesystem, it risks users
thinking littlefs is slow when they just haven't turned on the right
flags.

That being said there's a balance here. Users will probably judge
littlefs based on its default code size for the same reason.

---

Note this includes the generalized lfsr_file_crystallize_ API, which
adds a bit of code:

                     code          stack          ctx
  before gen-cryst: 37084           2304          636
  after gen-cryst:  37092 (+0.0%)   2304 (+0.0%)  636 (+0.0%)
2025-05-23 19:48:56 -05:00
22c43124de Limited file->leaf to reads + erased-state caching
This reverts most of the lazy-grafting/crystallization logic, but keeps
the general crystallization algorithm rewrite and file->leaf for caching
read operations and erased-state.

Unfortunately lazy-grafting/crystallization is both a code and stack
heavy feature for a relatively specific write pattern. It doesn't even
help if we're forced to write fragments due to prog alignment.

Dropping lazy-grafting/crystallization trades off linear write/rewrite
performance for code and stack savings:

                           code          stack          ctx
  before:                 37084           2304          636
  after:                  36428 (-1.8%)   2248 (-2.4%)  636 (+0.0%)

But with file->leaf we still keep the improvements to linear read
performance!

Compared to pre-file->leaf:

                           code          stack          ctx
  before file->leaf:      36016           2296          636
  after lazy file->leaf:  37084 (+3.0%)   2304 (+0.3%)  636 (+0.0%)
  after eager file->leaf: 36428 (+1.1%)   2248 (-2.1%)  636 (+0.0%)

I'm still on the fence about this, but lazy-grafting/crystallization is
just a lot of code... And the first 6 letters of littlefs don't spell
"speedy" last time I checked...

At the very least we can always add lazy-grafting/crystallization as an
opt-in write strategy later.
2025-05-23 15:22:33 -05:00
9c3a866508 Reworked crystallization to better use erased-state on rewrites
This adopts lazy crystallization in _addition_ to lazy grafting, managed
by separate LFS_o_UNCRYST and LFS_o_UNGRAFT flags:

  LFS_o_UNCRYST  0x00400000  File's leaf not fully crystallized
  LFS_o_UNGRAFT  0x00800000  File's leaf does not match bshrub/btree

This lets us graft not-fully-crystallized blocks into the tree without
needing to fully crystallize, avoiding repeated recrystallizations when
linearly rewriting a file.

Long story short, this gives file rewrites roughly the same performance
as linear file writes.

---

In theory you could also have fully crystallized but ungrafted blocks
(UNGRAFT + ~UNCRYST), but this doesn't happen with the current logic.
lfsr_file_crystallize eagerly grafts blocks once they're crystallized.

Internally, lfsr_file_crystallize replaces lfsr_file_graft for the
"don't care, gimme file->leaf" operation. This is analogous to
lfsr_file_flush for file->cache.

Note we do _not_ use LFS_o_UNCRYST to track erased-state! If we did,
erased-state wouldn't survive lfsr_file_flush!

---

Of course, this adds even more code. Fortunately not _that_ much
considering how many lines of code changed:

           code          stack          ctx
  before: 37012           2304          636
  after:  37084 (+0.2%)   2304 (+0.0%)  636 (+0.0%)

There is another downside however, and that's that our benchmarked disk
usage is slightly worse during random writes.

I haven't fully investigated this, but I think it's due to more
temporary fragments/blocks in the B-tree before flushing. This can cause
B-tree inner nodes to split earlier than when eagerly recrystallizing.

This also leads to higher disk usage pre-flush since we keep both the
old and new blocks around while uncrystallized, but since most rewrites
are probably going to be CoW on top of committed files, I don't think
this will be a big deal.

Note the disk usage ends up the same after lfsr_file_flush.
2025-05-23 15:13:56 -05:00
9ed326f3d3 Adopted file->leaf, reworked how we track crystallization
TLDR: Added file->leaf, which can track file fragments (read only) and
blocks independently from file->b.shrub. This speeds up linear
read/write performance at a heavy code/stack cost.

The jury is still out on if this ends up reverted.

---

This is another change motivated by benchmarking, specifically the
significant regression in linear reads.

The problem is that CTZ skip-lists are actually _really_ good at
appending blocks! (but only appending blocks) The entire state of the
file is contained in the last block, so file writes can resume without
any reads. With B-trees, we need at least 1 B-tree lookup to resume
appending, and this really adds up when writing extremely large files.

To try to mitigate this, I added file->leaf, a single in-RAM bptr for
tracking the most recent leaf we've operated on. This avoids B-tree
lookups during linear reads, and allowing the leaf to fall out-of-sync
with the B-tree avoids both B-tree lookups and commits during writes.

Unfortunately this isn't a complete win for writes. If we write
fragments, i.e. cache_size < prog_size, we still need to incrementally
commit to the B-tree. Fragments are a bit annoying for caching as any
B-tree commit can discard the block they reside on.

For reading, however, this brings read performance back to roughly the
same as CTZ skip-lists.

---

This also turned into more-or-less a full rewrite of the lfsr_file_flush
-> lfsr_file_crystallize code path, which is probably a good thing. This
code needed some TLC.

file->leaf also replaces the previous eblock/eoff mechanism for
erased-state tracking via the new LFSR_BPTR_ISERASED flag. This should
be useful when exploring more erased-state tracking mechanisms (ddtree).

Unfortunately, all of this additional in-RAM state is very costly. I
think there's some cleanup that can be done (the current impl is a bit
of a mess/proof-of-concept), but this does add a significant chunk of
both code and stack:

           code          stack          ctx
  before: 36016           2296          636
  after:  37228 (+3.4%)   2328 (+1.4%)  636 (+0.0%)

file->leaf also increases the size of lfsr_file_t, but this doesn't show
up in ctx because struct lfs_info dominates:

  lfsr_file_t before: 116
  lfsr_file_t after:  136 (+17.2%)

Hm... Maybe ctx measurements should use a lower LFS_NAME_MAX?
2025-05-23 12:15:13 -05:00
55ea13b994 scripts: Reverted del to resolve shadowed builtins
I don't know how I completely missed that this doesn't actually work!

Using del _does_ work in Python's repl, but it makes sense the repl may
differ from actual function execution in this case.

The problem is Python still thinks the relevant builtin is a local
variable after deletion, raising an UnboundLocalError instead of
performing a global lookup. In theory this would work if the variable
could be made global, but since global/nonlocal statements are lifted,
Python complains with "SyntaxError: name 'list' is parameter and
global".

And that's A-Ok! Intentionally shadowing language builtins already puts
this code deep into ugly hacks territory.
2025-05-15 14:10:42 -05:00
f3cd9802b8 Adopted LFSR_TAG_ORPHAN, simplified internal stickynote handling
This adds LFSR_TAG_ORPHAN, which simplifies quite a bit of the internal
stickynote handling.

Now that we don't have to worry about conflicts with future unknown
types, we can add whatever types we want internally. One useful one
is LFSR_TAG_ORPHAN, which lets us determine a stickynote's orphan status
early (in lfsr_mdir_lookupnext and lfsr_mdir_namelookup):

- non-orphan stickynotes -> LFSR_TAG_STICKYNOTE
- orphan stickynotes     -> LFSR_TAG_ORPHAN

This simplifies all the places where we need to check if a stickynote
really exists, which is most of the high-level functions.

One downside is that this makes stickynote _manipulation_ a bit more
delicate. lfsr_mdir_lookup(LFSR_TAG_ORPHAN) no longer works as expected,
for example.

Fortunately we can sidestep this issue by dropping down to
lfsr_rbyd_lookup when we need to interact with stickynotes directly,
skipping the is-orphan checks.

---

Saves a nice bit of code:

           code          stack          ctx
  before: 35984           2440          640
  after:  35832 (-0.4%)   2440 (+0.0%)  640 (+0.0%)

It got a little muddy since this now includes the unknown-type changes,
but here's the code diff from before we exposed LFSR_TYPE_STICKYNOTE to
users:

           code          stack          ctx
  before: 35740           2440          640
  after:  35832 (+0.3%)   2440 (+0.0%)  640 (+0.0%)
2025-04-24 16:33:24 -05:00
a34bcdb5bf Allowed modification of unknown file types
This drops the requirement that all file types are introduced with a
related wcompat flag. Instead, the wcompat flag is only required if
modification _would_ leak resources, and we treat unknown file types as
though they are regular files.

This allows modification of unknown file types without the risk of
breaking anything.

To compare with before the unknown-type rework:

Before:

> Unknown file types are allowed and may leak resources if modified,
> so attempted modification (rename/remove) will error with
> LFS_ERR_NOTSUP.

Now:

> Unknown file types are allowed but must not leak resources if
> modified. If an unknown file type would leak resources, it should set
> a related wcompat flag to only allow mounting RDONLY.

Note this includes directories, which can leak bookmarks if removed, so
filesystems using directories should set the LFSR_WCOMPAT_DIR flag.

But we no longer need the LFSR_WCOMPAT_REG/LFSR_WCOMPAT_STICKYNOTE
flags.

---

The real tricky part was getting lfsr_rename to work with unknown types,
as this broke the invariant that we only ever commit tags we know about.

Fixing this required:

- Fetching the non-unknown-mapped tag in lfsr_rename

- Mapping all name tags to LFSR_TAG_NAME in lfsr_rbyd_appendrattr_

- Adopting LFSR_RATTR_NAME for bookmark name tags

  This was broken by the above lfsr_rbyd_appendrattr_ change, but it's
  probably good to handle these the same as other name tags anyways.

This adds a bit of code, but not enough to make me think this isn't
worth it (or worth a build-time option):

           code          stack          ctx
  before: 35924           2440          640
  after:  35992 (+0.0%)   2440 (+0.0%)  640 (+0.0%)
2025-04-24 16:31:04 -05:00
09c3749d7a Reworked how unknown file types are handled
This changes how we approach unknown file types.

Before:

> Unknown file types are allowed and may leak resources if modified,
> so attempted modification (rename/remove) will error with
> LFS_ERR_NOTSUP.

Now:

> Unknown file types are only allowed in RDONLY mode. This avoids the
> whole leaking resources headache.

Additionally, unknown types are now mapped to LFS_TYPE_UNKNOWN, instead
of just being forwarded to the user. This allows us to add internal
types/tags to the LFSR_TAG_NAME type space without worrying about
conflicts with future types:

- reg             -> LFS_TYPE_REG
- dir             -> LFS_TYPE_DIR
- stickynote      -> LFS_TYPE_STICKYNOTE
- everything else -> LFS_TYPE_UNKNOWN
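
A minimal sketch of the mapping (the internal tag names are
assumptions, the LFS_TYPE_* results come from this commit):

  #include "lfs.h"  // assumed header

  static uint8_t lfs_type_from_tag(uint16_t tag) {
      switch (tag) {
      case LFSR_TAG_REG:        return LFS_TYPE_REG;
      case LFSR_TAG_DIR:        return LFS_TYPE_DIR;
      case LFSR_TAG_STICKYNOTE: return LFS_TYPE_STICKYNOTE;
      // everything else is reported densely as unknown
      default:                  return LFS_TYPE_UNKNOWN;
      }
  }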

Thinking about potential future types, it seems most (symlinks,
compressed files, etc) can be better implemented via custom attributes.
Using custom attributes doesn't mean the filesystem _can't_ inject
special behavior, and custom attributes allow for perfect backwards
compatibility.

So with future types less likely, forwarding type info to users is less
important (and potentially error prone). Instead, allowing on-disk +
internal types to be represented densely is much more useful.

And it avoids setting an upper bound on future types prematurely.

---

This also includes a minor rcompat/wcompat rework. Since we're probably
going to end up with 32-bit rcompat flags anyways, might as well make
them more human-readable (nibble-aligned):

  LFS_RCOMPAT_NONSTANDARD  0x00000001  Non-standard filesystem format
  LFS_RCOMPAT_WRONLY       0x00000002  Reading is disallowed
  LFS_RCOMPAT_BMOSS        0x00000010  Files may use inlined data
  LFS_RCOMPAT_BSPROUT      0x00000020  Files may use block pointers
  LFS_RCOMPAT_BSHRUB       0x00000040  Files may use inlined btrees
  LFS_RCOMPAT_BTREE        0x00000080  Files may use btrees
  LFS_RCOMPAT_MMOSS        0x00000100  May use an inlined mdir
  LFS_RCOMPAT_MSPROUT      0x00000200  May use an mdir pointer
  LFS_RCOMPAT_MSHRUB       0x00000400  May use an inlined mtree
  LFS_RCOMPAT_MTREE        0x00000800  May use an mdir btree
  LFS_RCOMPAT_GRM          0x00001000  Global-remove in use

  LFS_WCOMPAT_NONSTANDARD  0x00000001  Non-standard filesystem format
  LFS_WCOMPAT_RDONLY       0x00000002  Writing is disallowed
  LFS_WCOMPAT_REG          0x00000010  Regular file types in use
  LFS_WCOMPAT_DIR          0x00000020  Directory file types in use
  LFS_WCOMPAT_STICKYNOTE   0x00000040  Stickynote file types in use
  LFS_WCOMPAT_GCKSUM       0x00001000  Global-checksum in use

---

Code changes:

           code          stack          ctx
  before: 35928           2440          640
  after:  35924 (-0.0%)   2440 (+0.0%)  640 (+0.0%)
2025-04-24 16:29:19 -05:00
7dd473df82 Tweaked LFSR_TAG_STICKYNOTE encoding 0x205 -> 0x203
Now that LFS_TYPE_STICKYNOTE is a real type users can interact with, it
makes sense to group it with REG/DIR. This also has the side-effect of
making these contiguous.

---

LFSR_TAG_BOOKMARKs, however, are still hidden from the user. This
unfortunately means there will be a bit of a jump if we ever add
LFS_TYPE_SYMLINK in the future, but I'm starting to wonder if that's the
best way to approach symlinks in littlefs...

If instead LFS_TYPE_SYMLINK were implied via a custom attribute, you
could avoid the headache that comes with adding a new tag encoding, and
allow perfect compatibility with non-symlink drivers. Win win.

This seems like a better approach for _all_ of the theoretical future
types (compressed files, device files, etc), and avoids the risk of
oversaturating the type space.

---

This had a surprising impact on code for just a minor encoding tweak. I
guess the contiguousness pushed the compiler to use tables/ranges for
more things? Or maybe 3 vs 5 is just an easier constant to encode?

           code          stack          ctx
  before: 35952           2440          640
  after:  35928 (-0.1%)   2440 (+0.0%)  640 (+0.0%)
2025-04-24 14:35:52 -05:00
96eb38c8c2 Added LFS_REVDBG, tweaked LFS_REVNOISE
This tweaks a number of extended revision count things:

- Added LFS_REVDBG, which adds debug info to revision counts.

  This initializes the bottom 12 bits of every revision count with a
  hint based on rbyd type, which may be useful when debugging:

  - 68 69 21 v0 (hi!.) => mroot anchor
  - 6d 72 7e v0 (mr~.) => mroot
  - 6d 64 7e v0 (md~.) => mdir
  - 62 74 7e v0 (bt~.) => file btree node
  - 62 6d 7e v0 (bm~.) => mtree node

  This may be overwritten by the recycle counter if it overlaps, worst
  case the recycle counter takes up the entire revision count, but these
  have been chosen to at least keep some info if partially overwritten.

  To make this work required the LFS_i_INMTREE hack (yay global state),
  but a hack for debug info isn't the end of the world.

  Note we don't have control over data blocks, so there's always a
  chance they end up containing what looks like one of the above
  revision counts.

- Renamed LFS_NOISY -> LFS_REVNOISE

- LFS_REVDBG and LFS_REVNOISE are incompatible, so using both asserts.

  This also frees up the theoretical 0x00000030 state for an additional
  rev mode in the future.

- Adopted LFS_REVNOISE (and LFS_REVDBG) in btree nodes as well.

  If you need rev noise, you probably want it in all rbyds/metadata
  blocks, not just mdirs.

---

This had no effect on the default code size, but did affect
LFS_REVNOISE:

                    code          stack          ctx
  before:          35688           2440          640
  after:           35688 (+0.0%)   2440 (+0.0%)  640 (+0.0%)

  revnoise before: 35744           2440          640
  revnoise after:  35880 (+0.4%)   2440 (+0.0%)  640 (+0.0%)

  default:         35688           2440          640
  revdbg:          35912 (+0.6%)   2448 (+0.3%)  640 (+0.0%)
  revnoise:        35880 (+0.5%)   2440 (+0.0%)  640 (+0.0%)
2025-04-23 23:20:49 -05:00
59251a755c scripts: dbgflags.py: Tweaked to accept lfs-prefixed prefixes
So:

  $ ./scripts/dbgflags.py -l LFS_I

Is equivalent to:

  $ ./scripts/dbgflags.py -l I

This matches some of the implicit prefixing during name lookup:

  $ ./scripts/dbgflags.py LFS_I_SYNC
  $ ./scripts/dbgflags.py I_SYNC
  $ ./scripts/dbgflags.py SYNC
2025-04-16 15:22:09 -05:00
270230a833 scripts: Adopted del to resolve shadowed builtins
So:

  all_ = all; del all

Instead of:

  import builtins
  all_, all = all, builtins.all

The del exposes the globally scoped builtin we accidentally shadow.

This requires less magic, and no module imports, though tbh I'm
surprised it works.

It also works in the case where you change a builtin globally, but
that's a bit too crazy even for me...
2025-04-16 15:22:08 -05:00
262ad7c08e scripts: Simplified dbgtag.py, tweaked -x/--hex decoding
This drops the option to read tags from a disk file. I don't think I've
ever used this, and it requires quite a bit of circuitry to implement.

Also dropped -s/--string, because most tags can't be represented as
strings?

And tweaked -x/--hex flags to correctly parse spaces in arguments, so
now these are equivalent:

- ./scripts/dbgtag.py -x 00 03 00 08
- ./scripts/dbgtag.py -x "00 03 00 08"
2025-04-16 15:21:54 -05:00
415e6325d1 Moved revision count noise behind ifdef LFS_NOISY
littlefs is intentionally designed to not rely on noise, even with cksum
collisions (hello, perturb bit!). So it makes sense for this to be an
optional feature, even if it's a small one.

Disabling revision count noise by default also helps with testing. The
whole point of revision count noise is to make cksum collisions less
likely, which is a bit counterproductive when that's something we want
to test!

This doesn't really change the revision count encoding:

  vvvvrrrr rrrrrrnn nnnnnnnn nnnnnnnn
  '-.''----.----''---------.--------'
    '------|---------------|---------- 4-bit relocation revision
           '---------------|---------- recycle-bits recycle counter
                           '---------- pseudorandom noise (optional)

I considered moving the recycle-bits down when we're not adding noise,
but the extra logic just isn't worth making the revision count a bit
more human-readable.
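
A hedged sketch of packing a revision count per the diagram (assuming
recycle_bits holds the configured recycle-counter width; helper and
argument names are hypothetical, inputs assumed in range):

  #include <stdint.h>

  static uint32_t rev_pack(uint32_t reloc, uint32_t recycle,
          uint32_t noise, unsigned recycle_bits) {
      unsigned noise_bits = 32 - 4 - recycle_bits;
      return (reloc << 28)                         // 4-bit relocation
              | (recycle << noise_bits)            // recycle counter
              | (noise & ((1u << noise_bits)-1));  // optional noise
  }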

---

This saves a small bit of code in the default build, at the cost of some
code for the runtime checks in the LFS_NOISY build. Though I'm hoping
future config work will let users opt-out of these runtime checks:

                    code          stack          ctx
  before:          38548           2624          640
  default after:   38508 (-0.1%)   2624 (+0.0%)  640 (+0.0%)
  LFS_NOISY after: 38568 (+0.1%)   2624 (+0.0%)  640 (+0.0%)

Honestly the thing I'm more worried about is using one of our precious
mount flags for this... There's not that many bits left!
2025-02-08 14:53:47 -06:00
a63b8e1527 Dropped internal LFS_i_UNTIDY pseudo-alias flag
We really shouldn't have two names for the same thing, it just makes
things more confusing, even if the public name doesn't quite match the
internal usage. Especially now that we internally rely on these being
the same flag.

This renames LFS_i_UNTIDY -> LFS_I_MKCONSISTENT and drops the untidy/
mktidy naming internally.

No code changes.
2025-02-08 14:53:47 -06:00
7cd2d4dd11 Added LFSR_WCOMPAT_GCKSUM wcompat flag
The gcksum isn't actually implemented yet, I mostly just wanted to
measure this code cost separately:

           code          stack          ctx
  before: 37768           2608          620
  after:  37796 (+0.1%)   2608 (+0.0%)  620 (+0.0%)

I may be procrastinating a little bit...
2025-01-28 14:41:45 -06:00
d08d254cd2 Switched to writing compat flags as le32s
Most of littlefs's metadata is encoded in leb128s now, with the
exception of tags (be16, sort of), revision counts (le32), cksums
(le32), and flags.

It makes sense for tags to be a special case, these are written and
rewritten _everywhere_, but less so for flags, which are only written to
the mroot and updated infrequently.

We might as well save a bit of code by reusing our le32 machinery.

---

This changes lfsr_format to just write out compat flags as le32s, saving
a tiny bit of code at the cost of a tiny bit of disk usage (the real
benefit being a tiny bit of code simplification):

           code          stack          ctx
  before: 37792           2608          620
  after:  37772 (-0.1%)   2608 (+0.0%)  620 (+0.0%)

Compat flags already need to handle trailing zeros gracefully, so this doesn't
change anything at mount time.

Also had to switch from enums to #defines thanks to C's broken enums.
Wooh. We already use #defines for the other flags for this reason.
2025-01-28 14:41:45 -06:00
e5609c98ec Renamed bsprout -> bmoss, bleaf -> bsprout
I just really don't like saying bleaf. Also I think the term moss
describes inlined data a bit better.
2025-01-28 14:41:45 -06:00
0cab73730e Added LFS_WCOMPAT_RDONLY and LFS_RCOMPAT_WRONLY
LFS_WCOMPAT_RDONLY seems generally useful for tools that just want to
mark a filesystem is read-only. This is a common flag that exists in
other filesystems (RO_COMPAT_READONLY in ext4 for example).

LFS_RCOMPAT_WRONLY, on the other hand, is a bit more of a joke, but
there could be some niche use cases for it (preventing double mounts?).

Fortunately, these flags require no extra code, and fall out naturally
from our wcompat/rcompat handling.

---

Originally, the idea was to also add LFS_F_RDONLY, to match LFS_M_RDONLY
and set the LFS_WCOMPAT_RDONLY flag during format.

But this doesn't really work with the current API, since lfsr_format
would just give you an empty filesystem you can't write to. Which is a
bit silly.

Maybe we should add something like lfsr_fs_mkrdonly in the future? This
is probably low-priority.
2025-01-28 14:41:45 -06:00
af6ea39cca Reworked rcompat flags
Mainly to add LFS_RCOMPAT_MSPROUT. It makes sense that a littlefs driver
may not want to support mroot-inlined mdirs, and this flag would be the
only way to indicate that. (Currently inlined mdir -> mtree is one way,
but this may not always be the case.)

This also makes space for a couple planned features:

  LFS_RCOMPAT_NONSTANDARD  0x00000001  Non-standard filesystem format
  LFS_RCOMPAT_WRONLY*      0x00000002  Reading is disallowed
  LFS_RCOMPAT_GRM          0x00000004  May use a global-remove
  LFS_RCOMPAT_MSPROUT      0x00000010  May use an inlined mdir
  LFS_RCOMPAT_MLEAF        0x00000020  May use a single mdir pointer
  LFS_RCOMPAT_MSHRUB       0x00000040  May use an inlined mtree
  LFS_RCOMPAT_MTREE        0x00000080  May use an mdir btree
  LFS_RCOMPAT_BSPROUT      0x00000100  Files may use inlined data
  LFS_RCOMPAT_BLEAF        0x00000200  Files may use single block pointers
  LFS_RCOMPAT_BSHRUB       0x00000400  Files may use inlined btrees
  LFS_RCOMPAT_BTREE        0x00000800  Files may use btrees

  *Planned

I've gone ahead and included rcompat flags we reserve but don't
currently use (LFS_RCOMPAT_MSHRUB). It seems like a good idea to make
these reservations explicit. Though we should still prohibit their use
until there is a good reason, in case we want to repurpose these flags
in the future.

Code changes minimal (larger literal? compiler noise?):

           code          stack          ctx
  before: 37788           2608          620
  after:  37792 (+0.0%)   2608 (+0.0%)  620 (+0.0%)
2025-01-28 14:41:45 -06:00
5f6dbdcb14 Reworked o/f/m/gc/i/t flags
This is mainly to free up space for flags; we're pretty close to
running out of 32 bits with future planned features:

1. Reduced file type info from 8 -> 4 bits

   We don't really need more than this, but it does mean type info is
   no longer a simple byte load.

2. Moved most internal file-state flags into the next 4 bits

   These are mostly file-type specific (except LFS_o_ZOMBIE), so we
   don't need to worry too much about overlap.

3. Compacted ck-flags into 5 bits:

     LFS_M_CKPROGS       0x00000800
     LFS_M_CKFETCHES     0x00001000
     LFS_M_CKPARITY      0x00002000
     LFS_M_CKMETAREDUND* 0x00004000
     LFS_M_CKDATACKSUMS  0x00008000

     *Planned

   Now that ck-flags are a bit more mature, it's pretty clear we'll
   probably never have CKMETACKSUMS (ckcksums + small tag reads is
   crazy expensive) or CKDATAREDUND (non-trivial parity fanout makes
   this crazy expensive). So reserving bits for these just wastes bits.

This also moves things around so ck-flags no longer overlap with open
flags.

It's a tight fit, and I still think file-specific ck-flags are out-of-
scope, but this at least decreases flag ambiguity.

New jenga:

              8     8     8     8
            .----++----++----++----.
            .-..-..-.-------.------.
  o_flags:  |t||f||t|       |  o   |
            |-||-||-|-------:--.---'
            |-||-||-'--.----.------.
  t_flags:  |t||f|| t  |    | tstt |
            '-''-'|----|----'------'
            .----.|----|.--.:--:.--.
  m_flags:  | f  || t  ||c ||o ||m |
            |----||-.--'|--|'--''--'
            |----||-|---|--|.------.
  f_flags:  | f  ||t|   |c ||  f   |
            '----''-'---'--''------'

Fortunately no major code costs:

           code          stack          ctx
  before: 37792           2608          620
  after:  37788 (-0.0%)   2608 (+0.0%)  620 (+0.0%)
2025-01-28 14:41:45 -06:00
726bf86d21 Added dbgflags.py for easier flag debugging
dbgerr.py and dbgtag.py have proven to be incredibly useful for quick
debugging/introspection, so I figured why not have more of that.

My favorite part is being able to quickly see all flags set on an open
file handle:

  (gdb) p file.o.o.flags
  $2 = 24117517
  (gdb) !./scripts/dbgflags.py o 24117517
  LFS_O_WRONLY   0x00000001  Open a file as write only
  LFS_O_CREAT    0x00000004  Create a file if it does not exist
  LFS_O_EXCL     0x00000008  Fail if a file already exists
  LFS_O_DESYNC   0x00000100  Do not sync or receive file updates
  LFS_o_REG      0x01000000  Type = regular-file
  LFS_o_UNFLUSH  0x00100000  File's data does not match disk
  LFS_o_UNSYNC   0x00200000  File's metadata does not match disk
  LFS_o_UNCREAT  0x00400000  File does not exist yet

The only concern is if dbgflags.py falls out-of-sync often, as I
suspect flag encodings will have quite a bit more churn than errs/tags.
But we can always drop this script in the future if this turns into a
problem.

---

While poking around this also ended up with a bunch of other small
changes:

- Added LFS_*_MODE masks for consistency with other "type<->flag
  embeddings"

- Added compat flag comments

- Adopted lowercase prefix for internal flags (LFS_o_ZOMBIE), though
  not sure if I'll keep this yet...

- Tweaked dbgerr.py to also match ERR_ prefixes and to ignore case
2025-01-28 14:41:45 -06:00