# gradle
r
Can anyone advise on how Kotlin incremental compilation integrates with Gradle? Specifically, if I want to benefit from incremental compilation in a CI environment, which directories (if any) under `$PROJECT_DIR/build` do I need to cache between runs? Or can I just maintain the build cache (`~/.gradle/caches/build-cache-1`) and still benefit from incremental compilation? Testing suggests not... This part of the question is not really Kotlin-specific (and I have asked it on the Gradle Slack), but if you maintain the whole of `$PROJECT_DIR/build` between runs, is there any point in maintaining the build cache (`~/.gradle/caches/build-cache-1`), or indeed using the build cache at all?
d
Doesn't the build cache feature allow artifacts to be stored in a centralized repository (Gradle Enterprise?), letting a CI worker farm look up and retrieve cached artifacts across multiple CI invocations/runs? I believe this is the purpose of using it at scale; if you have a single CI system it may not seem as useful as more manual copy/restore approaches
Sorry, I don't know about incremental compilation; it's not something I have considered in CI myself, as usually you want repeatable builds from a known initial state, and usually the cost of all the CI machinery in resources/time is far more than compiling a few hundred extra classes each time. Many people use `git`, and it does not preserve timestamps; the key to incremental compilation is going to be preservation of timestamps as well as data.
m
There are 3 concepts:
1. up-to-date checks: do not run a task if its inputs did not change
2. build cache: copy the task output from a previous run that used the same inputs
3. incremental tasks: only run a portion of a task because only a portion of its inputs changed
Usually "Kotlin incremental compilation" refers to 3 (doc), but this is super confusing because in Gradle terms, "incremental" refers to 1 (doc). So incremental builds are quite different from incremental tasks.
Not sure what folders you need to keep around but I'd recommend using the remote cache. This way, you end up downloading only those outputs that you can actually reuse.
➕ 1
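As a concrete illustration of concept 2, here is a minimal `settings.gradle.kts` sketch enabling both the local and a remote build cache; the URL is a placeholder, and the push condition is just one common convention, not anything prescribed by the thread:

```kotlin
// settings.gradle.kts -- minimal build cache setup (concept 2 above).
// The remote URL is a placeholder; point it at your own cache server.
buildCache {
    local {
        // The on-disk cache (~/.gradle/caches/build-cache-1 by default).
        isEnabled = true
    }
    remote<HttpBuildCache> {
        url = uri("https://cache.example.com/cache/")
        // A common convention: only CI populates the remote cache,
        // developer machines only read from it.
        isPush = System.getenv("CI") == "true"
    }
}
```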
t
Remote cache per se will not help with incremental compilation (IC) on CI. Indeed, the new JVM approach to IC has fixed the issue of incremental compilation after a cache hit in a dependency sub-project. But to be able to use IC you need to have some previous state of task execution. If you want to use IC on CI, probably the easiest solution would be to save the state of the repo after the build is finished and, on the next run, restore this state, apply the git commit, and run the build. You could try to save state only for Kotlin compilation tasks, but we don't provide any guarantees on path stability or which paths need to be saved.
s
@mbonnin / @Rob Elliot -- not sure if this can help, as it's quite new, but at Buildless (https://less.build) we just added a GitHub Action; it's a drop-in remote cache based on Cloudflare
it scales to zero and may be useful since it eliminates the slow up-front download pattern used by GHA
very nice 1
we've had good success with it internally; if anybody here would like to try it, let me know and i can provide keys
c
How does it work technically? Is it just a SaaS Build Cache server?
If that's the case, for everyone reading it: the Build Cache server is free to self-host: https://docs.gradle.com/build-cache-node/ (Docker Compose & Kubernetes configs are provided)
👀 1
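If you do self-host a cache node, the client side is plain Gradle config; a hedged sketch for `settings.gradle.kts`, where the host and the credential environment variable names are placeholders for your own deployment:

```kotlin
// settings.gradle.kts -- pointing Gradle at a self-hosted build cache node.
// Host and env var names below are placeholders for your deployment.
buildCache {
    remote<HttpBuildCache> {
        url = uri("https://build-cache.example.internal/cache/")
        credentials {
            username = System.getenv("BUILD_CACHE_USER")
            password = System.getenv("BUILD_CACHE_PASSWORD")
        }
        // Only CI writes; server-side auth is configured on the node itself.
        isPush = System.getenv("CI") == "true"
    }
}
```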
s
It's a SaaS build cache server, written from scratch 🙂 with support for many tools, not just Gradle. It's also a local Agent, so you can deploy it to your own machine. We're working on other deployment options as well (private cloud, self-host). The Agent is free forever, and the Cloud is also free during beta.
Naturally the build cache itself is written in Gradle/Kotlin, so we actually cache Buildless with Buildless, and that has afforded a lot of testing.
Especially over HTTP/3, and with smarter connection management/compression, build caching really flies. Our new GitHub Action also extends these benefits to CI. So yes, the build cache node Docker image is free, and totally has a place. It's useful, but to a limit; at least when I used it last, managing it was rather tough. Keeping space available, but not too much, while keeping it online, wasn't our core business; now it is ours, so it doesn't have to be yours.
@mbonnin if you're using the Build Cache node and have any needs that aren't being met, let us know 🙂 so long as your build is fast then we're happy lol
c
@Samuel Gammon can you generate read-only API keys? For my projects, only CI on the `main` branch and on tags is allowed to write to the remote cache, but everyone (including external contributors) should be able to pull from it
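That policy can also be expressed with plain Gradle config; a sketch for `settings.gradle.kts`, assuming GitHub Actions conventions (`GITHUB_REF`, `CI`) and a placeholder cache URL:

```kotlin
// settings.gradle.kts -- remote cache is read-only for everyone except
// CI builds of the main branch or tags. GITHUB_REF and CI are GitHub
// Actions conventions; adjust for your CI provider.
val ref = System.getenv("GITHUB_REF").orEmpty()
val trustedRef = ref == "refs/heads/main" || ref.startsWith("refs/tags/")

buildCache {
    remote<HttpBuildCache> {
        url = uri("https://cache.example.com/cache/") // placeholder
        isPush = System.getenv("CI") == "true" && trustedRef
    }
}
```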
s
Yes! You can, using a new feature landing soon called Cache Projects. It lets you segment your cache objects, and those can be set to a public mode, which doesn't require an API key at all. We also have read-only API keys.
(But for OSS projects, the ability to distribute objects publicly is awesome)
c
Nice, the Gradle plugin README doesn't mention how to set it up 🙂
s
Ah, good catch 😅 The plugin is just a configuration client in this case, which is either detecting and using the Agent, locally, or the Cloud, if you have `BUILDLESS_APIKEY` set in your env
You can find the full doc here https://docs.less.build/docs/gradle
👀 1
we'll make sure to make that more prominent
c
Does this mean the Gradle plugin doesn't work if the user hasn't installed the Agent?
s
The plugin is designed to be inert if the Cloud or Agent are both unavailable or disabled, in which case it falls back to Gradle's built-in caching
the Agent is nice because you get a local in-memory cache, and it also helps handle network blips
if you're on a higher latency connection, it's also helpful that it defers uploads to the cloud (so uploads are always fast during your builds)
c
Ah, that's a shame; I would have expected the Gradle plugin to bundle the Agent and be able to start it automatically, rather than it being a separate install
s
That's great feedback and we could probably do that 🙂 I'll keep that in mind
c
> it's also helpful that it defers uploads to the cloud
can't that cause ordering issues if you have a task that tries to close a Sonatype repository?
s
I'm not sure I follow?
c
If the upload is deferred, a subsequent request (which depends on the previous one having finished) may fail?
s
We aren't actually taking control of any Sonatype publishing or any other uploads for that matter
c
Oh, it's specifically build cache uploads?
s
the Agent, even if you use it with newer dependency acceleration features, is only about downloads and build caching
Yes, sorry, build cache uploads are deferred
Between the local agent and cloud
c
I see, that's a great feature
s
Why, thank you 🙂
it's so helpful when people contribute to the cache, but unfair to ask people behind higher-latency links to do so all the time
this helps even that divide; sorry, I should have been clearer 😅
we are definitely still learning and figuring out where we can be most impactful
c
Do you expect all devs to contribute to the cache?
s
oh, no, it's something any individual dev can disable -- there are several switches (env, config, gradle script, etc)
c
At the moment I don't consider anything other than the main branch and tags to be trustworthy, to ensure I will never cache some broken code somehow
πŸ‘ 1
s
we do enable it by default, though, and hope to optimize it enough that they keep it on
we've had that concern too but it hasn't yet materialized
c
^ that is, the cache is always read-only on devs' machines
I see, good
s
not that it couldn't, i don't know if i fully trust gradle's keys
as in, we do see errant cache misses but i've never seen it mixup two pieces of code
c
Well, I much prefer a cache miss to it caching the wrong stuff
s
of course, and in some ways buildless is designed to be a database, but with relaxed requirements for consistency
that's what enables that deferred upload feature, for example
we prefer a cache miss over a slow large hit and download, for example, and sensible object caps are set for higher latency connections
in any case, build caching is still sort of half art and half science; thankfully Gradle's reports help, and our view is you should get all those options 🙂 but with sensible defaults that make it a little less thorny / miss-prone.
c
You mentioned the Agent is a GraalVM image, I think?
s
yes 😄 we are big graalvm fans as well
c
If I were to create a project that would benefit from build caching, should I include the Agent as a library, or separately start it as its own process and communicate with it however the Gradle plugin currently does?
("project" as in "something that looks like a build tool")
s
normally, the agent runs as a background service; you can start/stop/manage it with the same command line that hosts the agent itself
so it's a CLI and an agent: `buildless agent start` starts it, etc etc, and you can get stats with `buildless agent stats`
this is, effectively, a little web server, doing the caching and/or proxying; it's exposing JSON endpoints, which we plan to document
[Screenshot: Screenshot 2023-12-21 at 2.18.50 AM.png]
quick run of `buildless --help` on latest
c
Do you have stats on the Agent vs local Gradle build cache? When not connected to the cloud at all, is the agent still faster?
s
we don't have super clear data yet, the agent is pretty new -- we're on `rc2`, that's why we're looking for beta users 🙂
truthfully, i don't know where the agent will sit as compared to local on-disk caching; but, the drawback of on-disk is that it can never be shared
πŸ‘ 1
soon, you'll be able to use the agent entirely for free, then upload to a free-tier cloud account, etc., and share it with friends within reasonable limits
so at least there is an escape hatch there, you know? but yeah, in both cases you're bound by local resources and it's disk vs disk anyway, since the agent will try to use a unix socket where it can
soon we can support HTTP/2 or HTTP/3 from the gradle plugin itself, which might impact that scenario
c
Do you expect to have a tier with unlimited read-only access for unauthenticated users? (for OSS contributors)
s
but it requires much deeper changes to gradle
hm, "expect" is hard to promise, but that's what we're shooting for 🙂
πŸ‘ 1
if we got the adoption and people were into it, certain companies or investors are already interested
so... call it a, idk, fuzzy yes? a hopeful yes 😄
c
Makes sense, thanks for the info 🙂
s
of course 🙂 thanks for asking, it really helps to hear where people's thoughts are
c
Personally, I have two kinds of projects:
• OSS: it's important that it also works for contributors, even if they haven't set up anything
• proprietary: a SaaS cache is a tough sell…
If it works out, I'll probably end up trying it with OSS projects, and then giving some kind of presentation at my day job
s
okay! 😄 thanks, that would be huge! on both points. Cloud is free during beta, so if you want keys to try it for OSS projects, drop me a line at sam@less.build and we can get them provisioned for you. re/security, I totally understand the concern there but we want to get it right.
build cache objects, i think, can be intrinsically trusted in many cases, because it's a content-addressable hash under the hood which is, in essence, self-verifying
that's a qualified sentence as in, it may or may not apply to this or that tool
😂 1
not calling out names lol
but, you know, we want to leverage that and make sure things are completely encrypted, verified, signed, etc, and transparent
c
Sorry, I wasn't clear 😅 My worry was more about external actors accessing the codebase/artifacts than corrupting them
s
yes, they should be encrypted so we cannot read them
we actually don't need to; any telemetry readable within the gradle cache blob could be transmitted otherwise
and we don't want to lol
πŸ‘ 1
we can do all sorts of compression, or what not, without ever understanding or decrypting those blobs.
(so long as we pick algorithms correctly)
but, i digress; the point is, that part needs to be carefully done so it's trustable.
c
Do you have a worry that people will start using your service to upload anything they want?
s
let me give one example: our dependency proxy will (hopefully) soon gain sigstore verification support
hm, i mean that's always a risk with UGC for sure; we apply reasonable limits and protections where we can, like any saas business, and writes are always identified anyway
πŸ‘ 1
if someone wanted to distribute something nasty, they would probably have an easier time doing it elsewhere; but, again, we're always working to strengthen our posture there
πŸ‘ 1
(for instance, we can speak various protocols, but we deliberately refuse to serve to browsers)