zen - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	fix ChunkIndexToChunkHash indexing (#621)	Stefan Boberg	2023-12-19	1	-1/+1
\| \| \|	would previously index into a reserved-but-not-sized vector which is bad but not crash-inducing bad
*	Don't use copy of Payloads array when fetching memcached payload in GC (#609)	Dan Engelbrecht	2023-12-13	1	-1/+1
\| \| \|	* Don't use copy of Payloads array when fetching memcached payload in GC
*	improve trace (#606)	Dan Engelbrecht	2023-12-13	1	-34/+53
\| \| \| \| \|	* Adding some more trace scopes for better visiblity * Removed spammy trace scope when replaying oplogs * Remove "::Disk" from trace scopes - redundant now that we have merge disk and memory layers
*	mem cache perf improvements (#592)	Dan Engelbrecht	2023-12-11	1	-104/+132
\| \| \| \| \| \| \| \|	- Improvement: Refactor memory cache for faster trimming and correct trim reporting - Improvement: Added trace scopes for memory cache trimming Adding a link back to the cache item payload on the memory cache item allows us to iterate over only the items cached in memory instead of over the entire index. This also allows us to do efficient compact of the memory cache array when trimming. It adds 4 bytes of overhead to each item cached in memory.
*	fix deadlock at bucket creation (#598)	Dan Engelbrecht	2023-12-11	1	-163/+175
\| \| \| \| \| \|	- Make sure we don't hold the namespace bucket lock when we create buckets to avoid deadlock - Pass lock scope to helper functions to clarify locking rules - Block flush and gc operations for a bucket that is not yet initialized - Add ZenCacheDiskLayer::GetOrCreateBucket to avoid code duplication
*	Use correct iterator index when looking up memcached payload in ↵	Dan Engelbrecht	2023-12-05	1	-5/+4
\| \| \| \| \|	GatherReferences (#591) * Use correct iterator index when looking up memcached payload in gatherreferences
*	reserve vectors in gcv2 upfront / load factor for robin_map (#582)	Dan Engelbrecht	2023-12-04	1	-5/+20
\| \| \| \| \|	* reserve vectors in gcv2 upfront * set max load factor for robin_map indexes to reduce memory usage * set min load factor for robin_map indexes to allow them to shrink
*	memory usage estimation for memcached entries (#586)	Dan Engelbrecht	2023-12-04	1	-5/+24
\| \| \| \|	* do a more accurate memory usage estimation for memcached entries * early exit when checking memcache usage
*	use 32 bit offset and size in BlockStoreLocation (#581)	Dan Engelbrecht	2023-12-01	1	-38/+75
\| \| \|	- Improvement: Reduce memory usage in GC and diskbucket flush
*	add separate PreCache step for GcReferenceChecker (#578)	Dan Engelbrecht	2023-12-01	1	-169/+314
\| \| \| \| \| \|	- Improvement: GCv2: Use separate PreCache step to improve concurrency when checking references - Improvement: GCv2: Improved verbose logging - Improvement: GCv2: Sort chunks to read by block/offset when finding references - Improvement: GCv2: Exit as soon as no more unreferenced items are left
*	global thread worker pools (#577)	Dan Engelbrecht	2023-11-29	1	-10/+5
\| \| \|	- Improvement: Use two global worker thread pools instead of ad-hoc creation of worker pools
*	tracing for gcv2 (#574)	Dan Engelbrecht	2023-11-28	1	-0/+14
\| \| \| \| \| \|	- Improvement: Added more trace scopes for GCv2 - Bugfix: Make sure we can override flags to "false" when running `zen gc` commmand - `smallobjects`, `skipcid`, `skipdelete`, `verbose`
*	optimized index snapshot reading/writing (#561)	Stefan Boberg	2023-11-27	1	-441/+772
\| \| \| \| \|	the previous implementation of in-memory index snapshots serialise data to memory before writing to disk and vice versa when reading. This leads to some memory spikes which end up pushing useful data out of system cache and also cause stalls on I/O operations. this change moves more code to a streaming serialisation approach which scales better from a memory usage perspective and also performs much better
*	Add GC Cancel/Stop (#568)	Dan Engelbrecht	2023-11-24	1	-11/+60
\| \| \| \|	- GcScheduler will now cancel any running GC when it shuts down. - Old GC is rather limited in when it reacts to cancel of GC. GCv2 is more responsive.
*	reduce work when there are no blocks to compact (#558)	Dan Engelbrecht	2023-11-22	1	-54/+61
\| \| \| \|	* reduce work when there are no blocks to compact * fix lock scopes
*	add command line options for compact block threshold and gc verbose (#557)	Dan Engelbrecht	2023-11-21	1	-5/+24
\| \| \| \| \| \| \| \| \| \| \|	- Feature: Added new options to zenserver for GC V2 - `--gc-compactblock-threshold` GCV2 - how much of a compact block should be used to skip compacting the block, default is 90% - `--gc-verbose` GCV2 - enable more verbose output when running a GC pass - Feature: Added new options to `zen gc` command for GC V2 - `--compactblockthreshold` GCV2 - how much of a compact block should be used to skip compacting the block, default is 90% - `--verbose` GCV2 - enable more verbose output when running a GC pass - Feature: Added new parameters for endpoint `admin/gc` (PUT) - `compactblockthreshold` GCV2 - how much of a compact block should be used to skip compacting the block, default is 90% - `verbose` GCV2 - enable more verbose output when running a GC pass
*	compact separate for gc referencer (#533)	Dan Engelbrecht	2023-11-21	1	-149/+231
\| \| \| \| \|	- Refactor GCV2 so GcReferencer::RemoveExpiredData returns a store compactor, moving out the actual disk work from deleting items in the index. - Refactor GCV2 GcResult to reuse GcCompactStoreStats and GcStats - Make Compacting of stores non-parallell to not eat all the disk I/O when running GC
*	blocking queue fix (#550)	Dan Engelbrecht	2023-11-16	1	-15/+28
\| \| \| \| \| \| \| \| \|	* make BlockingQueue::m_CompleteAdding non-atomic * ZenCacheDiskLayer::Flush logging * name worker threads in ZenCacheDiskLayer::DiscoverBuckets * name worker threads in gcv2 * improved logging in ZenServerInstance * scrub threadpool naming * remove waitpid handling, we should just call wait to kill zombie processes
*	fix index out of bounds in CacheBucket::CompactState (#532)	Dan Engelbrecht	2023-11-14	1	-25/+24
\| \| \| \| \|	* use PayloadIndex for indexing into payload array * naming cleanup * fix metadata index in CacheBucket::CompactState
*	fix potential logic error in bucket manifest read	Stefan Boberg	2023-11-13	1	-17/+21
\|
*	fix bad access to unlocked state (#527)	Dan Engelbrecht	2023-11-10	1	-16/+22
\| \| \| \|	* don't touch non-locked data when creating manifest * safety assert for test dir
*	reduce number of files generated on shared instances (#524)	Stefan Boberg	2023-11-09	1	-1/+3
\|
*	disk layer gc and error/warnings cleanup (#515)	Dan Engelbrecht	2023-11-08	1	-34/+81
\| \| \| \| \| \| \|	- Improvement: Use GC reserve when writing index/manifest for a disk cache bucket when disk is low when available - Improvement: Demote errors to warning for issues that are not critical and we handle gracefully - Improvement: Treat more out of memory errors from windows as Out Of Memory errors Fixed wrong sizeof() statement for compactcas index (luckily the two structs are of same size)
*	Don't put cache entries into the memory cache on Put, only on Get (#518)	Dan Engelbrecht	2023-11-07	1	-16/+3
\|
*	gc v2 tests (#512)	Dan Engelbrecht	2023-11-06	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	* set MaxBlockCount at init * properly calculate total size * basic blockstore compact blocks test * correct detection of block swap * Use one implementation for CreateRandomBlob * reduce some data sets to increase speed of tests * reduce test time * rename BlockStoreCompactState::AddBlock -> BlockStoreCompactState::IncludeBlock
*	reduce cachebucket mem (#509)	Dan Engelbrecht	2023-11-06	1	-260/+354
\| \| \| \| \| \| \|	* reduce memory footprint for disk cache separate dense arrays for rawhash+rawsize and memcache buffer * don't write RawHash/RawSize for buckets with no such metadata * helper functions * make index into metadata and cached payload type safe * helper functions for memcached
*	multithread cache bucket (#508)	Dan Engelbrecht	2023-11-06	1	-21/+66
\| \| \| \|	* Multithread init and flush of cache bucket * tweaked threading cound for bucket discovery, disklayer flush and gc v2
*	individual gc stats (#506)	Dan Engelbrecht	2023-10-30	1	-20/+45
\| \| \| \| \|	- Feature: New parameter for endpoint `admin/gc` (GET) `details=true` which gives details stats on GC operation when using GC V2 - Feature: New options for zen command `gc-status` - `--details` that enables the detailed output from the last GC operation when using GC V2
*	New GC implementation (#459)	Dan Engelbrecht	2023-10-30	1	-86/+547
\| \| \|	- Feature: New garbage collection implementation, still in evaluation mode. Enabled by `--gc-v2` command line option
*	fix CacheBucket::CollectGarbage removing standalone entries without an ↵	Dan Engelbrecht	2023-10-27	1	-4/+7
\| \| \| \|	exclusive lock (#502)
*	merge disk and memory layers (#493)	Dan Engelbrecht	2023-10-24	1	-291/+535
\| \| \| \|	- Feature: Added `--cache-memlayer-sizethreshold` option to zenserver to control at which size cache entries get cached in memory - Changed: Merged cache memory layer with cache disk layer to reduce memory and cpu overhead
*	Remove any unreferenced blocks in block store on open (#492)	Dan Engelbrecht	2023-10-23	1	-1/+1
\| \| \|	* Remove any unreferenced blocks in block store on open
*	Filter expired cache entries against ExpiredKeys - not CAS entries to retain ↵	Dan Engelbrecht	2023-10-23	1	-40/+23
\| \| \| \|	(#491)
*	Don't prune block locations due to missing blocks a startup (#487)	Dan Engelbrecht	2023-10-20	1	-57/+3
\| \| \| \| \| \|	* Don't prune block locations due to missing blocks a startup This makes the behaviour consistent with FileCas - you can have an index that is not fully backed by data. Asking for a location that is not backed by data results in getting an empty result back Also, don't try to GC blocks that are unknown to the block store at the time of snapshot (to avoid removing data that comes in after GatherReferences in GC)
*	Add --skip-delete option to gc command (#484)	Dan Engelbrecht	2023-10-20	1	-1/+1
\| \| \| \|	- Feature: Add `--skip-delete` option to gc command - Bugfix: Fix implementation when claiming GC reserve during GC
*	minor - fix references size array	Dan Engelbrecht	2023-10-17	1	-0/+2
\|
*	don't call compact references if caching is not enabled (#478)	Dan Engelbrecht	2023-10-17	1	-6/+12
\|
*	cache reference tracking (#455)	Dan Engelbrecht	2023-10-10	1	-56/+387
\| \| \| \| \|	- Feature: Add caching of referenced CId content for structured cache records, this avoid disk thrashing when gathering references for GC - disabled by default, enable with `--cache-reference-cache-enabled` - Improvement: Faster collection of referenced CId content in project store
*	reject known bad bucket names in structured cache (#452)v0.2.27-pre0	Stefan Boberg	2023-10-06	1	-1/+32
\| \| \| \| \| \| \|	* added string_view helpers for ParseHexBytes/ParseHexNumber * reject known bad buckets in structured cache put handler (32-character hex bucket names are rejected) * also added bucket rejection logic to bucket discovery * added rejected_writes stat to HttpStructuredCache
*	Fix curruption of disk cache bucket index on GC (#448)	Dan Engelbrecht	2023-10-05	1	-44/+51
\| \| \| \| \| \| \| \| \|	* make sure we hold the index lock when reading payload data in reclaim space * don't use index snapshot when updating index in reclaim space * check that things have not moved under our feet * don't touch m_Payloads without a lock * start write block index on the highest block index * we don't need to bump writeblockindex when stopping write to a block, we will bump appropriately when we start a new block * changelog
*	reduce lock in disklayer (#447)	Dan Engelbrecht	2023-10-05	1	-10/+22
\| \| \|	* Don't block all write access to all buckets when doing GatherReferences/CollectGarbage
*	refactor comapactcas index (#443)	Dan Engelbrecht	2023-10-04	1	-3/+10
\| \| \| \| \|	- Bugfix: Fix scrub messing up payload and access time in disk cache bucket when compacting index - Improvement: Split up disk cache bucket index into hash lookup and payload array to improve performance - Improvement: Reserve space up front for compact binary output when saving cache bucket manifest to improve performance
*	faster accesstime save restore (#439)	Dan Engelbrecht	2023-10-03	1	-93/+207
\| \| \| \| \| \| \| \| \| \|	- Improvement: Reduce time a cache bucket is locked for write when flushing/garbage collecting - Change format for faster read/write and reduced size on disk - Don't lock index while writing manifest to disk - Skip garbage collect if we are currently in a Flush operation - BlockStore::Flush no longer terminates currently writing block - Garbage collect references to currently writing block but keep the block as new data may be added - Fix BlockStore::Prune used disk space calculation - Don't materialize data in filecas when we just need the size
*	Handle OOM and OOD more gracefully to not spam Sentry with error reports (#434)	Dan Engelbrecht	2023-10-02	1	-10/+25
\| \| \| \| \| \|	- Improvement: Catch Out Of Memory and Out Of Disk exceptions and report back to reqeuster without reporting an error to Sentry - Improvement: If creating bucket fails when storing and item in the structured cache, log a warning and propagate error to requester without reporting an error to Sentry - Improvement: Make an explicit flush of the active block written to in blockstore flush - Improvement: Make sure cache and cas MakeIndexSnapshot does not throw exception on failure which would cause and abnormal termniation at exit
*	lightweight gc (#431)	Dan Engelbrecht	2023-10-02	1	-0/+5
\| \| \| \| \| \|	- Feature: Add lightweight GC that only removes items from cache/project store without cleaning up data referenced in Cid store - Add `skipcid` parameter to http endpoint `admin/gc`, defaults to "false" - Add `--skipcid` option to `zen gc` command, defaults to false - Add `--gc-lightweight-interval-seconds` option to zenserver
*	adding more stats (#429)	Dan Engelbrecht	2023-09-28	1	-8/+51
\| \| \| \| \|	- Feature: Add detailed stats on requests and data sizes on a per-bucket level, use parameter `cachestorestats=true` on the `/stats/z$` endpoint to enable - Feature: Add detailed stats on requests and data sizes on cidstore, use parameter `cidstorestats=true` on the `/stats/z$` endpoint to enable - Feature: Dashboard now accepts parameters in the URL which is passed on to the `/stats/z$` endpoint
*	VFS implementation for local storage service (#396)	Stefan Boberg	2023-09-20	1	-0/+25
\| \| \|	currently, only Windows (using Projected File System) is supported
*	add more trace scopes (#362)	Dan Engelbrecht	2023-09-15	1	-13/+33
\| \| \| \| \|	* more trace scopes * Make sure ReplayLogEntries uses the correct size for oplog buffer * changelog
*	Increase retry logic (#325)	Dan Engelbrecht	2023-06-05	1	-13/+19
\| \| \| \|	* Increase timeout and number of retries in CacheBucket::PutStandaloneCacheValue when moving temporary file into place * changelog
*	fix for commented-out code which was never meant to be checked in	Stefan Boberg	2023-05-17	1	-6/+6
\|