littlefs

mirror of https://github.com/littlefs-project/littlefs.git synced 2025-10-29 19:47:49 +00:00

Author	SHA1	Message	Date
Christopher Haster	384a498762	Extend dir seek tests to include seeking to end of directory	2023-04-18 14:55:43 -05:00
Christopher Haster	b0a4a44e5b	Added explicit assert for minimum block size of 128 bytes There was already an assert for this, but because it included the underlying equation for the requirement it was too confusing for users that had no prior knowledge for why the assert could trigger. The math works out such that 128 bytes is a reasonable minimum requirement, so I've added that number as an explicit assert. Hopefully this makes this sort of situation easier to debug. Note that this requirement would need to be increased to 512 bytes if block addresses are ever increased to 64-bits. DESIGN.md goes into more detail why this is.	2023-04-17 19:58:09 -05:00
Christopher Haster	aae897ffd0	Added an assert for truthy-preserving bool conversions This has caught enough people that an explicit assert is warranted. How littlefs, a c99 project, should be integrated with c89 projects is still an open question, but no one deserves to debug this sort of undetected casting issue. Found by johnernberg and XinStellaris	2023-04-17 19:19:42 -05:00
Christopher Haster	e57402c8e9	Added ability to revert to inline file in lfs_file_truncate Before, once converted to a CTZ skip-list, a file would remain a CTZ skip-list even if truncated back to a size that could be inlined. This was just a shortcut in implementation. And since the fix for boundary truncates needed special handling for size==0, it made sense to extend this special condition to allow reverting to inline files. --- The only case I can think of, where reverting to an inline file would be detrimental, is if it's a readonly file that you would otherwise not need to pay the metadata overhead for. But as a tradeoff, inlining the file would free up the block it was on, so it's unclear if this really is a net loss. If the truncate is followed by a write, reverting to an inline file will always be beneficial. We assume writes will change the data, so in the non-inlined case there's no way to avoid copying the underlying block. Even if we assume padding issues are solved.	2023-04-17 18:18:06 -05:00
Christopher Haster	6dc18c38c1	Fixed block-boundary truncate issue There has been a bug in the filesystem for a while where truncating to a block boundary suffers from an off-by-one mistake that corrupts the internal representation of the CTZ skip-list. This mostly appears when the file_size == block_size, as file_size > block_size includes CTZ skip-list metadata, so the underlying block boundaries appear at slightly different offsets. --- The reason for off-by-one issue is a nuance in lfs_ctz_find that we sort of abuse to get two different behaviors. Consider the situation where this bug occurs: block 0 block 1 .--------. .--------. \| abcdef \|<-\| {ptr0} \| \| ghijkl \| \| yzabcd \| \| mnopqr \| \| \| \| stuvwx \| \| \| '--------' '--------' With these 24-byte blocks, there's an ambiguity if we wanted to point to offset 24. We could point before the block boundary, or we could point after the block boundary Before: block 0 block 1 .--------. .--------. \| abcdef \|<-\| {ptr0} \| \| ghijkl \| \| yzabcd \| \| mnopqr \| \| \| \| stuvwx \| \| \| '-------^' '--------' '-- off=24 is here After: block 0 block 1 .--------. .--------. \| abcdef \|<-\| {ptr0} \| \| ghijkl \| \| yzabcd \| \| mnopqr \| \| ^ \| \| stuvwx \| \| \| \| '--------' '-\|------' '-- off=24 is here When we want these two offsets depends on the context. We want the offset to be conservative if it represents a size, but eager if it is being used to prepare a block for writing. The workaround/hack is to prefer the eager offset, after the block boundary, but use `size-1` as the argument if we need the conservative offset. This finds the correct block, but is off-by-one in the calculated block-offset. Fortunately we happen to not use the block-offset in the places we need this workaround/hack. --- To get back to the bug, the wrong mode of lfs_ctz_find was used in lfs_file_truncate, leading to internal corruption of the CTZ skip-list. The correct behavior is size-1, with care to avoid underflow. Also I've tweaked the code to make it clear the calculated block-offset goes unused in these situations. Thanks to ghost, ajaybhargav, and others for reporting the issue, colin-foster-advantage for a reproducible test case, and rvanschoren, hgspbs for the initial solution.	2023-04-17 17:49:57 -05:00
Christopher Haster	d5dc4872cb	Expanded truncate tests to test more corner cases Removed the weird alignment requirement from the general truncate tests. This explicitly hid off-by-one truncation errors. These tests now reveal the same issue as the block-sized truncation test while also testing for other potential off-by-one errors.	2023-04-17 12:10:19 -05:00
Sosthène Guédon	24795e6b74	Add missing iterations in tests	2023-03-13 11:39:06 +01:00
Colin Foster	7b151e1abb	Add test scenario for truncating to a block size When truncation is done on a file to the block size, there seems to be an error where it points to an incorrect block. Perform a write / truncate / readback operation to verify this issue. Signed-off-by: Colin Foster <colin.foster@in-advantage.com>	2023-01-26 11:55:53 -08:00
Christopher Haster	ba1c76435a	Fixed issue where deorphan could get stuck circling between two half-orphans This of course should never happen normally, two half-orphans requires two parents, which is disallowed in littlefs for this reason. But it can happen if there is an outdated half-orphan later in the metadata linked-list. The two half-orphans can cause the deorphan step to get stuck, constantly "fixing" the first half-orphan before it has a chance to remove the problematic, outdated half-orphan later in the list. The solution here is to do a full check for half-orphans before restarting the half-orphan loop. This strategy has the potential to visit more metadata blocks unnecessarily, but avoids situations where removing a later half-orphan will eventually cause an earlier half-orphan to resolve itself. Found with heuristic powerloss testing with test_relocations_reentrant_renames after 192 nested powerlosses.	2022-12-17 12:42:05 -06:00
Christopher Haster	d1b254da2c	Reverted removal of 1-bit counter threaded through tags Initially I thought the fcrc would be sufficient for all of the end-of-commit context, since indicating that there is a new commit is a simple as invalidating the fcrc. But it turns out there are cases that make this impossible. The surprising, and actually common, case, is that of an fcrc that will end up containing a full commit. This is common as soon as the prog_size is big, as small commits are padded to the prog_size at minimum. .------------------. \ \| metadata \| \| \| \| \| \| \| +-. \|------------------\| \| \| \| foward CRC ------------. \|------------------\| / \| \| \| commit CRC -----' \| \|------------------\| \| \| padding \| \| \| \| \| \|------------------\| \ \ \| \| metadata \| \| \| \| \| \| +-. \| \| \| \| \| \| +-' \|------------------\| / \| \| \| commit CRC --------' \| \|------------------\| \| \| \| / '------------------' When the commit + crc is all contained in the fcrc, something silly happens with the math behind crcs. Everything in the commit gets canceled out: crc(m) = m(x) x^\|P\|-1 mod P(x) m ++ crc(m) = m(x) x^\|P\|-1 + (m(x) x^\|P\|-1 mod P(x)) crc(m ++ crc(m)) = (m(x) x^\|P\|-1 + (m(x) x^\|P\|-1 mod P(x))) x^\|P\|-1 mod P(x) crc(m ++ crc(m)) = (m(x) x^\|P\|-1 + m(x) x^\|P\|-1) x^\|P\|-1 mod P(x) crc(m ++ crc(m)) = 0 * x^\|P\|-1 mod P(x) This is the reason the crc of a message + naive crc is zero. Even with an initializer/bit-fiddling, the crc of the whole commit ends up as some constant. So no manipulation of the commit can change the fcrc... But even if this did work, or we changed this scheme to use two different checksums, it would still require calculating the fcrc of the whole commit to know if we need to tweak the first bit to invalidate the unlikely-but-problematic case where we happen to match the fcrc. This would add a large amount of complexity to the commit code. It's much simpler and cheaper to keep the 1-bit counter in the tag, even if it adds another moving part to the system.	2022-12-17 12:42:05 -06:00
Christopher Haster	2f26966710	Continued implementation of forward-crcs, adopted new test runners This fixes most of the remaining bugs (except one with multiple padding commits + noop erases in test_badblocks), with some other code tweaks. The biggest change was dropping reliance on end-of-block commits to know when to stop parsing commits. We can just continue to parse tags and rely on the crc for catch bad commits, avoiding a backwards-compatiblity hiccup. So no new commit tag. Also renamed nprogcrc -> fcrc and commitcrc -> ccrc and made naming in the code a bit more consistent.	2022-12-17 12:42:05 -06:00
Christopher Haster	b4091c6871	Switched to separate-tag encoding of forward-looking CRCs Previously forward-looking CRCs was just two new CRC types, one for commits with forward-looking CRCs, one without. These both contained the CRC needed to complete the current commit (note that the commit CRC must come last!). [-- 32 --\|-- 32 --\|-- 32 --\|-- 32 --] with: [ crc3 tag \| nprog size \| nprog crc \| commit crc ] without: [ crc2 tag \| commit crc ] This meant there had to be several checks for the two possible structure sizes, messying up the implementation. [-- 32 --\|-- 32 --\|-- 32 --\|-- 32 --\|-- 32 --] with: [nprogcrc tag\| nprog size \| nprog crc \| commit tag \| commit crc ] without: [ commit tag \| commit crc ] But we already have a mechanism for storing optional metadata! The different metadata tags! So why not use a separate tage for the forward-looking CRC, separate from the commit CRC? I wasn't sure this would actually help that much, there are still necessary conditions for wether or not a forward-looking CRC is there, but in the end it simplified the code quite nicely, and resulted in a ~200 byte code-cost saving.	2022-12-17 12:42:05 -06:00
Christopher Haster	91ad673c45	Cleaned up a few additional commit corner cases - General cleanup from integration, including cleaning up some older commit code - Partial-prog tests do not make sense when prog_size == block_size (there can't be partial-progs!) - Fixed signed-comparison issue in modified filebd	2022-12-17 12:42:05 -06:00
Christopher Haster	52dd83096b	Initial implementation of forward-looking erase-state CRCs This change is necessary to handle out-of-order writes found by pjsg's fuzzing work. The problem is that it is possible for (non-NOR) block devices to write pages in any order, or to even write random data in the case of a power-loss. This breaks littlefs's use of the first bit in a page to indicate the erase-state. pjsg notes this behavior is documented in the W25Q here: https://community.cypress.com/docs/DOC-10507 --- The basic idea here is to CRC the next page, and use this "erase-state CRC" to check if the next page is erased and ready to accept programs. .------------------. \ commit \| metadata \| \| \| \| +---. \| \| \| \| \|------------------\| \| \| \| erase-state CRC -----. \| \|------------------\| \| \| \| \| commit CRC ---\|-\|-' \|------------------\| / \| \| padding \| \| padding (doesn't need CRC) \| \| \| \|------------------\| \ \| next prog \| erased? \| +-' \| \| \| \| \| v \| / \| \| \| \| '------------------' This is made a bit annoying since littlefs doesn't actually store the page (prog_size) in the superblock, since it doesn't need to know the size for any other operation. We can work around this by storing both the CRC and size of the next page when necessary. Another interesting note is that we don't need to any bit tweaking information, since we read the next page every time we would need to know how to clobber the erase-state CRC. And since we only read prog_size, this works really well with our caching, since the caches must be a multiple of prog_size. This also brings back the internal lfs_bd_crc function, in which we can use some optimizations added to lfs_bd_cmp. Needs some cleanup but the idea is passing most relevant tests.	2022-12-17 12:42:05 -06:00
Christopher Haster	1278ec1d08	Adopted Brent's algorithm for cycle detection The previous cycle detection algorithm (a naive check against the largest possible tail list) is simple and gets the job done, but has the potential to take a very long time on disks with many blocks. Brent's algorithm, on the other hand, takes at most 2x the number of blocks in the tail list. Originally naive cycle detection was chosen over Floyd's algorithm to avoid the extra complexity of managing two desynced traversals for every traversal of the tail list, but Brent's algorithm is very well suited for our use case, requiring only we keep track of an additional mdir pointer on the stack as we traverse.	2022-12-17 12:41:39 -06:00
Christopher Haster	c2147c45ee	Added --gdb-pl to test.py for breaking on specific powerlosses This allows debugging strategies such as binary searching for the point of "failure", which may be more complex than simply failing an assert.	2022-12-17 12:39:42 -06:00
Christopher Haster	801cf278ef	Tweaked/fixed a number of small runner things after a bit of use - Added support for negative numbers in the leb16 encoding with an optional 'w' prefix. - Changed prettyasserts.py rule to .a.c => .c, allowing other .a.c files in the future. - Updated .gitignore with missing generated files (tags, .csv). - Removed suite-namespacing of test symbols, these are no longer needed. - Changed test define overrides to have higher priority than explicit defines encoded in test ids. So: ./runners/bench_runner bench_dir_open:0f1g12gg2b8c8dgg4e0 -DREAD_SIZE=16 Behaves as expected. Otherwise it's not easy to experiment with known failing test cases. - Fixed issue where the -b flag ignored explicit test/bench ids.	2022-12-17 12:35:44 -06:00
Christopher Haster	1f37eb5563	Adopted --subplot* in plot.py As well as --legend* and --*ticklabels. Mostly for close feature parity, making it easier to move plots between plot.py and plotmpl.py.	2022-12-16 16:47:42 -06:00
Christopher Haster	cfd4e6029a	Added --subplot* to plotmpl.py Driven primarily by a want to compare measurements of different runtime complexities (it's difficult to fit O(n) and O(log n) on the same plot), this adds the ability to nest subplots in the same .svg which try to align as much as possible. This turned out to be surprisingly complicated. As a part of this, adopted matplotlib's relatively recent constrained_layout, which behaves much more consistently. Also dropped --legend-left, no one should really be using that.	2022-12-16 16:47:30 -06:00
Christopher Haster	2d2dd8b2eb	Added plotmpl.py --github flag to match the website's foreground/background The difference between ggplot's gray and GitHub's gray was a bit jarring. This also adds --foreground and --font-color for this sort of additional color control without needing to add a new flag for every color scheme out there.	2022-12-11 23:41:36 -06:00
Christopher Haster	b0382fa891	Added BENCH/TEST_PRNG, replacing other ad-hoc sources of randomness When you add a function to every benchmark suite, you know if should probably be provided by the benchmark runner itself. That being said, randomness in tests/benchmarks is a bit tricky because it needs to be strictly controlled and reproducible. No global state is used, allowing tests/benches to maintain multiple randomness stream which can be useful for checking results during a run. There's an argument for having global prng state in that the prng could be preserved across power-loss, but I have yet to see a use for this, and it would add a significant requirement to any future test/bench runner.	2022-12-06 23:09:07 -06:00
Christopher Haster	d8e7ffb7fd	Changed lfs_emubd_get* -> lfs_emubd_* lfs_emubd_getreaded -> lfs_emubd_readed lfs_emubd_getproged -> lfs_emubd_proged lfs_emubd_geterased -> lfs_emubd_erased lfs_emubd_getwear -> lfs_emubd_wear lfs_emubd_getpowercycles -> lfs_emubd_powercycles	2022-12-06 23:09:07 -06:00
Christopher Haster	cda2f6f1da	Changed test_runner to run with -Pnone,linear by default The linear powerloss heuristic provides very good powerloss coverage without a significant runtime hit, so there's really no reason to run the tests without -Plinear. Previous behavior can be accomplished with an explicit -Pnone.	2022-12-06 23:09:07 -06:00
Christopher Haster	9b687dd96a	Added make benchmarks/testmarks rules Mostly for benchmarking, this makes it easy to view and compare runner results similarly to other csv results.	2022-12-06 23:09:07 -06:00
Christopher Haster	c4b3e9d826	A couple of script changes after CI integration - Renamed struct_.py -> structs.py again. - Removed lfs.csv, instead prefering script specific csv files. - Added *-diff make rules for quick comparison against a previous result, results are now implicitly written on each run. For example, `make code` creates lfs.code.csv and prints the summary, which can be followed by `make code-diff` to compare changes against the saved lfs.code.csv without overwriting. - Added nargs=? support for -s and -S, now uses a per-result _sort attribute to decide sort if fields are unspecified.	2022-12-06 23:09:07 -06:00
Christopher Haster	9990342440	Fixed Clang testing in CI, removed override vars in Makefile Two flags introduced: -fcallgraph-info=su for stack analysis, and -ftrack-macro-expansions=0 for cleaner prettyassert.py warnings, are unfortunately not supported in Clang. The override vars in the Makefile meant it wasn't actually possible to remove these flags for Clang testing, so this commit changes those vars to normal, non-overriding vars. This means `make CFLAGS=-Werror` and `CFLAGS=-Werror make` behave _very_ differently, but this is just an unfortunate quirk of make that needs to be worked around.	2022-12-06 23:09:07 -06:00
Christopher Haster	0c781dd822	Merge remote-tracking branch 'origin/master' into test-and-bench-runners	2022-12-06 23:08:53 -06:00
Christopher Haster	4a209344d4	Fixed bench workflow + changeprefix issue in prefix releases changeprefix.py only works on prefixes, which is a bit of a problem for flags in the workflow scripts, requiring extra handling to not hide the prefix from changeprefix.py	2022-12-06 23:07:28 -06:00
Christopher Haster	a659c02bbd	Added a bot-generated PR-comment with a simple status table The littlefs CI is actually in a nice state that generates a lot of information about PRs (code/stack/struct changes, line/branch coverage changes, benchmark changes), but GitHub's UI has changed overtime to make CI statuses harder to find for some reason. This bot comment should hopefully make this information easy to find without creating too much noise in the discussion. If not, this can always be changed later.	2022-12-06 23:07:28 -06:00
Christopher Haster	397aa27181	Removed unnecessarily heavy RAM usage from logs in bench/test.py For long running processes (testing with >1pls) these logs can grow into multiple gigabytes, humorously we never access more than the last n lines as requested by --context. Piping the stdout with --stdout does not use additional RAM.	2022-12-06 23:07:28 -06:00
Christopher Haster	65923cdfb4	Adopted script changes in GitHub Actions - Moved to Ubuntu 22.04 This notably means we no longer have to bend over backwards to install GCC 10! - Changed shell in gha to include the verbose/undefined flags, making debugging gha a bit less painful - Adopted the new test.py/test_runners framework, which means no more heavy recompilation for different configurations. This reduces the test job runtime from >1 hour to ~15 minutes, while increasing the number of geometries we are testing. - Added exhaustive powerloss testing, because of time constraints this is at most 1pls for general tests, 2pls for a subset of useful tests. - Limited coverage measurements to `make test` Originally I tried to maximize coverage numbers by including coverage from every possible source, including the more elaborate CI jobs which provide an extra level of fuzzing. But this missed the purpose of coverage measurements, which is to find areas where test cases can be improved. We don't want to improve coverage by just shoving more fuzz tests into CI, we want to improve coverage by adding specific, intentioned test cases, that, if they fail, highlight the reason for the failure. With this perspective, maximizing coverage measurement in CI is counter-productive. This changes makes it so the reported coverage is always less than actual CI coverage, but acts as a more useful metric. This also simplifies coverage collection, so that's an extra plus. - Added benchmarks to CI Note this doesn't suffer from inconsistent CPU performance because our benchmarks are based on purely simulated read/prog/erase measurements. - Updated the generated markdown table to include line+branch coverage info and benchmark results.	2022-12-06 23:07:21 -06:00
Christopher Haster	387cf6f6e0	Fixed a couple corner cases in scripts when fields are empty - Fixed added/removed count in scripts when an entry has no field in the expected results - Fixed a python-sort-type issue when by-field is missing in a result	2022-11-28 12:51:18 -06:00
Christopher Haster	0b11ce03b7	Fixed incorrect calculation of extra space needed in mdir blocks Despite the comment being correct, the calculation is somehow off by a word, meaning something must have been missed. Maybe the space for the move-delete was missed since that was added later to avoid losing move-deletes during relocations. This was found with the new exhaustive power-loss searching added to the test framework with -P2. The exact failure was test_dirs_many_reentrant:2gg2cb:k4o6. This must be the first test that ends up with all possible extra state in a single mdir block.	2022-11-28 12:51:18 -06:00
Christopher Haster	eba5553314	Fixed hidden orphans by separating deorphan search into two passes This happens in rare situations where there is a failed mdir relocation, interrupted by a power-loss, containing the destination of a directory rename operation, where the directory being renamed preceded the relocating mdir in the mdir tail-list. This requires at some point for a previous directory rename to create a cycle. If this happens, it's possible for the half-orphan to contain the only reference to the renamed directory. Since half-orphans contain outdated state when viewed through the mdir tail-list, the renamed directory appears to be a full-orphan until we fix the relocating half-orphan. This causes littlefs to incorrectly remove the renamed directory from the mdir tail-list, causes catastrophic problems down the line. The source of the problem is that the two different types of orphans really operate on two different levels of abstraction: half-orphans fix failed mdir commits, while full-orphans fix directory removes/renames. Conflating the two leads to situations where we attempt to fix assumed problems about the directory tree before we have fixed problems with the mdir state. The fix here is to separate out the deorphan search into two passes: one to fix half-orphans and correct any mdir-commits, restoring the mdirs and gstate to a known good state, then two to fix failed removes/renames. --- This was found with the -Plinear heuristic powerloss testing, which now runs on more geometries. The failing case was: test_relocations_reentrant_renames:112gg261dk1e3f3:123456789abcdefg1h1i1j1k1 l1m1n1o1p1q1r1s1t1u1v1g2h2i2j2k2l2m2n2o2p2q2r2s2t2 Also fixed/tweaked some parts of the test framework as a part of finding this bug: - Fixed off-by-one in exhaustive powerloss state encoding. - Added --gdb-powerloss-before and --gdb-powerloss-after to help debug state changes through a failing powerloss, maybe this should be expanded to any arbitrary powerloss number in the future. - Added lfs_emubd_crc and lfs_emubd_bdcrc to get block/bd crcs for quick state comparisons while debugging. - Fixed bd read/prog/erase counts not being copied during exhaustive powerloss testing. - Fixed small typo in lfs_emubd trace.	2022-11-28 12:51:18 -06:00
Christopher Haster	f89d758444	Fixed test out-of-space issues with powerloss testing These are just incorrect limits in the tests that can be triggered by powerloss testing, which can end up with more metadata-pairs than without powerloss testing due to orphans.	2022-11-28 12:51:18 -06:00
Christopher Haster	6c18b4dfb6	Added a simple help rule to the Makefile To run: $ make help	2022-11-17 10:36:20 -06:00
Christopher Haster	f73494151a	Changed default build target lfs.a -> liblfs.a This is the name expected if you are actually linking against littlefs. The use as a default build rule is mostly for linting. Most uses of littlefs likely compile directly with the sources (it is only several K of code), or use their own build system, and the previous name would have made linking a bit of a challenge. Still, this might cause some breakage for someone...	2022-11-17 10:27:00 -06:00
Christopher Haster	bcc88f52f4	A couple Makefile-related tweaks - Changed --(tool)-tool to --(tool)-path in scripts, this seems to be a more common name for this sort of flag. - Changed BUILDDIR to not have implicit slash, makes Makefile internals a bit more readable. - Fixed some outdated names hidden in less-often used ifdefs.	2022-11-17 10:26:26 -06:00
Christopher Haster	e35e078943	Renamed prefix.py -> changeprefix.py and updated to use argparse Added a couple flags to make the script a bit more flexible, and removed littlefs-specific default in line with the other scripts which aren't really littlefs-specific. (These defaults can be moved to the littlefs-specific Makefile easily enough). The original behavior can be reproduced like so: ./script/changeprefix.py lfs lfs2 --git	2022-11-16 10:46:26 -06:00
Christopher Haster	1a07c2ce0d	A number of small script fixes/tweaks from usage - Fixed prettyasserts.py parsing when '->' is in expr - Made prettyasserts.py failures not crash (yay dynamic typing) - Fixed the initial state of the emubd disk file to match the internal state in RAM - Fixed true/false getting changed to True/False in test.py/bench.py defines - Fixed accidental substring matching in plot.py's --by comparison - Fixed a missed LFS_BLOCk_CYCLES in test_superblocks.toml that was missed - Changed test.py/bench.py -v to only show commands being run Including the test output is still possible with test.py -v -O-, making the implicit inclusion redundant and noisy. - Added license comments to bench_runner/test_runner	2022-11-15 13:42:07 -06:00
Christopher Haster	6fce9e5156	Changed plotmpl.py/plot.py to not treat missing values as discontinuities	2022-11-15 13:38:13 -06:00
Christopher Haster	559e174660	Added plotmpl.py for creating svg/png plots with matplotlib Note that plotmpl.py tries to share many arguments with plot.py, allowing plot.py to act as a sort of draft mode for previewing plots before creating an svg.	2022-11-15 13:38:13 -06:00
Christopher Haster	b2a2cc9a19	Added teepipe.py and watch.py	2022-11-15 13:38:13 -06:00
Christopher Haster	3a33c3795b	Added perfbd.py and block device performance sampling in bench-runner Based loosely on Linux's perf tool, perfbd.py uses trace output with backtraces to aggregate and show the block device usage of all functions in a program, propagating block devices operation cost up the backtrace for each operation. This combined with --trace-period and --trace-freq for sampling/filtering trace events allow the bench-runner to very efficiently record the general cost of block device operations with very little overhead. Adopted this as the default side-effect of make bench, replacing cycle-based performance measurements which are less important for littlefs.	2022-11-15 13:38:13 -06:00
Christopher Haster	29cbafeb67	Renamed coverage.py -> cov.py	2022-11-15 13:38:13 -06:00
Christopher Haster	df283aeb48	Added recursive results to perf.py This adds -P/--propagate and -Z/--depth to perf.py for showing recursive results, making it easy to narrow down on where spikes in performance come from. This ended up being a bit different from stack.py's recursive results, as we end up with different (diminishing) numbers as we descend.	2022-11-15 13:38:13 -06:00
Christopher Haster	490e1c4616	Added perf.py a wrapper around Linux's perf tool for perf sampling This provides 2 things: 1. perf integration with the bench/test runners - This is a bit tricky with perf as it doesn't have its own way to combine perf measurements across multiple processes. perf.py works around this by writing everything to a zip file, using flock to synchronize. As a plus, free compression! 2. Parsing and presentation of perf results in a format consistent with the other CSV-based tools. This actually ran into a surprising number of issues: - We need to process raw events to get the information we want, this ends up being a lot of data (~16MiB at 100Hz uncompressed), so we paralellize the parsing of each decompressed perf file. - perf reports raw addresses post-ASLR. It does provide sym+off which is very useful, but to find the source of static functions we need to reverse the ASLR by finding the delta the produces the best symbol<->addr matches. - This isn't related to perf, but decoding dwarf line-numbers is really complicated. You basically need to write a tiny VM. This also turns on perf measurement by default for the bench-runner, but at a low frequency (100 Hz). This can be decreased or removed in the future if it causes any slowdown.	2022-11-15 13:38:13 -06:00
Christopher Haster	ca66993812	Tweaked scripts to share more code, added coverage calls/hits The main change is requiring field names for -b/-f/-s/-S, this is a bit more powerful, and supports hidden extra fields, but can require a bit more typing in some cases.	2022-11-15 13:38:13 -06:00
Christopher Haster	296c5afea7	Renamed bench_read/prog/erased -> bench_readed/proged/erased Yes this isn't really correct english anymore, but these names avoid the read/read ambiguity.	2022-11-15 13:38:13 -06:00
Christopher Haster	274222b518	Added some automatic sizing for field-names in scripts/runners	2022-11-15 13:38:13 -06:00

... 2 3 4 5 6 ...

877 Commits