Add support for dependency caching #2383
base: main
Conversation
Looks good so far, some initial comments:
```typescript
interface CacheConfig {
  paths: string[];
  hash: string[];
```
Maybe `key` is more descriptive? Consider adding a short doc comment to explain what these fields do.
I have added docs to this, but I have left it as `hash` for now. `key` may be misleading, since the cache key is comprised of more than just the hash calculated from the files that match these patterns. Perhaps `hashFilePatterns` would be better?
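To illustrate the naming concern, the full cache key might be composed from several inputs beyond the file hash. This is a hypothetical sketch: the key format, prefix, and component names below are assumptions for illustration, not the action's actual code.

```typescript
// Hypothetical sketch of cache-key composition. Only the idea that the
// key contains more than the file hash comes from the discussion; the
// specific components here are illustrative assumptions.
function makeCacheKey(
  language: string,
  codeqlVersion: string,
  filesHash: string,
): string {
  return `codeql-dependencies-${language}-${codeqlVersion}-${filesHash}`;
}
```

Since the patterns in `hash` contribute only the final component of such a key, a name like `hashFilePatterns` would make the field's narrower role clearer.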
No strong feelings; I think with the doc, `hash` is fine.
```typescript
const cacheConfig = CODEQL_DEFAULT_CACHE_CONFIG[language];

if (cacheConfig === undefined) {
  logger.info(
```
These should probably be debug statements before we merge this PR.
```typescript
// with an empty string.
const globber = await makeGlobber(cacheConfig.hash);

if ((await globber.glob()).length === 0) {
```
Does this mean we list files twice (once here and once in `cacheKey`)? Might not be a performance problem in practice.
Yes, it does. I looked at this before your review but, unfortunately, the `hashFiles` implementation in `@actions/glob` isn't exposed in a way that lets us just throw an existing array of paths at it, so we'd have to copy the implementation (or a variant of it, depending on how much we care about the intricacies of theirs).
```typescript
  continue;
}

const size = await getTotalCacheSize(cacheConfig.paths, logger);
```
If the size is very small, should we skip the upload?
- Pro for skipping: for small sizes, caching may slow down the run
- Pro for uploading: this will let us deal better with registry outages or network issues
Whichever way we choose, let's document why.
Another consideration: if the size is very large, should we also skip? Larger caches are more likely to push out other cache entries. Another thing we could do is look at the existing cache usage and selectively upload based on that. This is something we can address in a followup PR.
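If we did decide to gate the upload on size, it could look like the following sketch. The threshold values are purely illustrative assumptions; the PR does not currently implement any size-based skipping.

```typescript
// Hypothetical thresholds for the trade-off discussed above.
const MIN_CACHE_SIZE_BYTES = 1024; // below this, caching overhead may outweigh the benefit
const MAX_CACHE_SIZE_BYTES = 5 * 1024 ** 3; // above this, we risk evicting other cache entries

// Decide whether a cache of the given size is worth uploading.
function shouldUploadCache(sizeBytes: number): boolean {
  return sizeBytes >= MIN_CACHE_SIZE_BYTES && sizeBytes <= MAX_CACHE_SIZE_BYTES;
}
```

Whatever thresholds (if any) are chosen, a comment next to them explaining the reasoning would address the "let's document why" point.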
I should have noted this in the description, but I have left this kind of thing out of the implementation for now: while we are in the early stages of testing, I want to stick to a minimal viable implementation. I agree that we should look at this before we merge or ship anything, and your points are good.
> If the size is very small, should we skip the upload?
My feeling is that we probably shouldn't skip the upload if the cache is small. As you say, the benefit of dealing with third-party outages is useful, and storing/restoring a small cache should be fairly quick.
> Another consideration: if the size is very large, should we also skip? Larger caches are more likely to push out other cache entries. Another thing we could do is look at the existing cache usage and selectively upload based on that. This is something we can address in a followup PR.
Agreed. I think I touched on this in the EDR as well.
04e457d to c07c5b0
Work-in-progress to add support for dependency caching to the `init` Action.

Merge / deployment checklist