# kotest-contributors
a
I've been doing some digging into some of the slow tests and I found something interesting that I hope someone can double-check for me. `ForAllExhaustivesIterationTest#forAll with 21 exhaustives should run for each cross product` is slow, taking over 4 mins on CI. I ran the IntelliJ profiler, and I found that the main cause of the slowdown appeared to be that the seed file was being deleted with `deleteRecursively()`, even though it's a file, not a directory. https://github.com/kotest/kotest/blob/a95078d987f5416f815f96d292a80407c9242277/kotest-property/src/jvmMain/kotlin/io/kotest/property/seed/seedio.kt#L55

So, I replaced it with `f.deleteIfExists()`, but it was still causing a lot of slowdown. I then checked where `clearSeed()` is being called from, and the only non-test invocation is in `test()`: https://github.com/kotest/kotest/blob/42af0ff23307398e2723618449feda0f5646352e/kotest-property/src/commonMain/kotlin/io/kotest/property/internal/test.kt#L52

This looks suspicious to me, because it looks like `clearSeed()` is always being called, while I would expect it to be called only if the test fails. Is my understanding correct? In any case, even if the seed should be cleared after a successful test, it is probably not good that Kotest attempts to delete the seed file so often, even when it doesn't exist. Is there a better way of doing this?
here's the CPU flamegraph with `deleteRecursively()` and with the update to `deleteIfExists()`, which is still slower than I'd like
Found the answer to my questions.

> This looks suspicious to me, because it looks like clearSeed() is always being called, while I would expect that it should only be called if the test fails. Is my understanding correct?

My understanding wasn't correct. The seed should be deleted after a successful test.

> Is there a better way of doing this?

Yes: instead of eagerly deleting the file, mark files for deletion in-memory (in a mutable set) and delete them when the test has completed. WIP solution here: https://github.com/kotest/kotest/pull/4183
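A minimal sketch of that approach (hypothetical names, not the actual code in PR 4183): each iteration only records the seed path in a set, and the filesystem is touched once at the end of the test.

```kotlin
import java.nio.file.Files
import java.nio.file.Path

// Hypothetical sketch: defer seed-file deletion instead of hitting the
// filesystem on every property-test iteration.
object SeedCleanup {
    private val pendingDeletion = mutableSetOf<Path>()

    /** Called per iteration: just records the path, no filesystem access. */
    fun markForDeletion(seedFile: Path) {
        pendingDeletion += seedFile
    }

    /** Called once after the test completes: performs the actual deletes. */
    fun deleteMarked() {
        pendingDeletion.forEach { Files.deleteIfExists(it) }
        pendingDeletion.clear()
    }
}
```

Since the set de-duplicates paths, marking the same seed file millions of times is effectively free, and the single `deleteIfExists` at the end replaces millions of existence checks.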
s
I'm confused. Why would writing a few seed files cause a dramatic slowdown?
a
I am not completely certain, but it's not writing the files that is the problem, it's checking whether the file exists before deleting it
and it's not just a few seed files: it's checking whether the single seed file for the current test exists over 4 million times (in the 'forAll with 22 exhaustives...' test)
new build scan for PR 4183 shows `forAll with 22 exhaustives should run for each cross product` is 2x faster: 4m33s vs 2m20s
s
Is it perhaps checking the seed file on every iteration instead of just once? There aren't 4 million tests, but there might be 4 million iterations.
a
yes that's right
s
Maybe the better fix would be for that, then
I am very interested in better fixes 🙏
s
Clear failed seed can move out of there to the place that invokes test
See if that plus your change improve on the 2m
a
it would be nice to make it faster! But when I profile the tests I don't see any obvious places to improve
s
Clearing the failed seed definitely only needs to run once, before the start of the iterations, right?
Won't make it slower to do that
But maybe you made that method so quick it won't improve things further either
a
I think that the slowest part is checking if the result of `property(...)` is a boolean, since `shouldBe` uses generics and has branches
> Clearing the failed seed definitely only needs to run once, before the start of the iterations, right?

So, PBT needs to load the seed (if it exists) before the iterations start. Could it be a problem if PBT loads the seed, then deletes the file, and then the entire process crashes? The seed would be deleted, even though the test failed.
s
I think it's fine to nuke the seed first
This is test code not production
a
true
although, it should be easy to just move the 'delete seed file' code to be done after all PBT iterations are done
all that's needed is the TestPath...
that would make my PR simpler
s
Yeah that works
a
> I think that the slowest part is checking if the result of `property(...)` is a boolean, since `shouldBe` uses generics and has branches
I've looked into this more. I don't think `shouldBe` is slow. I guess it's just that the testFn is a suspend function, so launching it is slow? Also, `BeforePropertyContextElement` and `AfterPropertyContextElement` are a bit slow. But I think they're being deprecated?
s
Once we split out the property test stuff I'm going to revisit property test hooks for 6.0
I can't split the projects though while there are 30 PRs in flight, as it would break too many PRs
my linux machine really doesn't like the size of our project. It's just completely killed my machine again running check
a
whoa, crazy
that sounds like something more significant than the size of the project... I wonder
s
its all the native compilation
it doesn't like doing the linux native builds
a
huh, interesting...
usually it's Mac machines that are slower, since they build all targets
s
yeah but people tend to have powerful Macs using M3 or whatever
not my Pentium 100 Linux machine
256Mb ram
1gb hard drive
a
whoa yeah, okay, I can see how those specs would be a factor
s
I'm exaggerating how old it is
but it's slow
a
how is performance if you run `./gradlew jvmTest`?
s
still slow
I tend to run from intellij tho
a
could you share a Gradle build scan? I can take a look and see if anything jumps out as a quick fix
s
sure, remind me how to trigger a build scan again
`--scan`?
a
yes
something I'm waiting for is Gradle finishing the feature that automatically sets the JDK used to run the Gradle daemon. At the moment Gradle just picks JAVA_HOME, and if that's different from what's used on CI to populate the remote build cache, then there will be a lot of cache misses. If the JDK is stable, that would mean more cache hits.
that's not a complete fix, but it'd help
s
didn't they adjust that in 8.9?
a
yeah there's an option, but it's incubating, and it doesn't auto-download missing toolchains (although it does auto-discover them)
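for reference (from memory, so worth double-checking against the Gradle docs): the incubating feature is the "daemon JVM criteria". Running `./gradlew updateDaemonJvm --jvm-version=17` generates a `gradle/gradle-daemon-jvm.properties` file along these lines, which pins the JVM used to run the daemon:

```properties
# gradle/gradle-daemon-jvm.properties (generated by updateDaemonJvm)
toolchainVersion=17
```

as of 8.9 it auto-discovers matching local toolchains but won't download a missing one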
s
ah ok I was thinking of this I think
a
in my `~/.gradle/gradle.properties` I set java home, which I think helps:

```properties
# Try to help with Build Cache re-use by always using the same JDK
org.gradle.java.home=/Library/Java/JavaVirtualMachines/temurin-17.jdk/Contents/Home/
```
s
how would my local JAVA_HOME be affected by the remote cache?
a
it's the other way around - different JDKs mean different cache entries
s
you mean if different JDKs are used for building locally, then the local cache isn't going to be as effective?
a
If CI uses Java 17 to run Gradle, then the buildscript classpath is considered completely different from when I locally use Java 21. The buildscript classpath is part of the cache-key computation for basically everything.
yeah exactly
it doesn't break anything, but it does hurt
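a toy illustration of why it hurts (simplified, not Gradle's actual key computation): if the JVM running the daemon feeds into the cache key, then the same task inputs hashed under Java 17 and Java 21 produce different keys, so entries written by CI can never be hit locally.

```kotlin
import java.security.MessageDigest

// Simplified illustration, not Gradle's real algorithm: the cache key is a
// hash over the task's inputs plus environment details such as the JVM
// version of the daemon. Different JVM -> different key -> cache miss.
fun cacheKey(taskInputsHash: String, daemonJvmVersion: String): String {
    val md = MessageDigest.getInstance("SHA-256")
    md.update(taskInputsHash.toByteArray())
    md.update(daemonJvmVersion.toByteArray())
    return md.digest().joinToString("") { "%02x".format(it) }
}
```

so nothing is wrong per se, the outputs are just stored under keys your machine never computes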
s
I'm confused though like why would I care what the CI is doing? Like how does it affect my local build speed ?
a
CI populates the remote build cache, which your machine can read
s
oh that's interesting
makes sense now
a
in theory, if the master branch is green, and you do a fresh checkout and run `gradle check`, then all tasks should be instantly up-to-date!
👍🏻 1
s
my machine might not be taking advantage of the latest remote cache entries
a
exactly
s
so once that new Gradle feature is ready, you can be explicit about the version of the JDK used for both the Gradle daemon and the Java tasks it forks
a
yup!
s
clever
a
yeah, Gradle can be really great sometimes
s
it's super complicated though
💯 1
I feel that the Gradle DSL is the worst DSL ever written. But that's partly because it all started out in Groovy and they took "advantage" of some of Groovy's quirks. By moving to Kotlin it's gradually becoming clearer.
a
definitely
s
it failed on a test tho
lol my machine just tanked again, had to power off and on. I really want to split this project out tonight.
a
hmm, I don't see anything obviously wrong in the build scan, apart from the fact that you're using Java 21
using Java 17 (the same as the GitHub runners) would help, but not significantly
s
Just my rubbish machine then
And all the expensive native targets
a
you could try limiting the Gradle workers - add this to `~/.gradle/gradle.properties`:

```properties
org.gradle.workers.max=2
```
s
I have 32 cores
But I will try that
a
by default max-workers will be 32 then, but yeah, you can play around with the number
it's not a great workaround though
s
Better than a forced reboot though lol
a
haha yes
I'm surprised that the native targets are so heavy though...
do you get similar problems on other projects?
🚫 1
the build scan only sees 16 cores 🤔 https://scans.gradle.com/s/enrjchrn2ivlu#infrastructure
s
Maybe I only have 16 I'll check again
It's an i9
a
yeah, so 32 cores...
I wonder if something is misconfigured in the hardware? But that doesn't really make sense.
I'm just thinking about how sometimes the RAM clock speed is configurable and sometimes people forget to set it correctly
s
I can build other giant projects like intellij
a
the GitHub runners have 4 cores so it really shouldn't be a problem
hmmmmm interesting
do the crashes only happen when running Gradle through IntelliJ? Could that be related?
because yeah, IntelliJ is huge
s
Happens in regular Gradle too
It's intermittent
a
hmm okay, so I think it's either a problem with Gradle or with Kotlin Gradle Plugin 🤔
possibly the Kotlin compiler daemon
if you have time, could you make a report? https://kotl.in/issue
s
you think it's an issue with the Kotlin compiler vs my machine just being underpowered for the size of the project?
a
yes
the GitHub Action runner machines have fewer resources, and while they're slow, they don't crash
maybe it's a hardware issue. If it's an older laptop, maybe something is loose or it overheats?
s
It's a desktop, and the gfx card is truly shit. I think perhaps when I'm trying to do too many things inside Xfce it just gives up.
a
but I think it's more likely to be something with Gradle or Kotlin, since you can build IntelliJ
Maybe enabling Kotlin build reports might reveal something https://kotlinlang.org/docs/gradle-compilation-and-caches.html#build-reports
s
ok let me give that a go and do a full build scan
a
xfce.... I wonder if it could be a problem with the JS tests using headless Chrome? Since that launches a new application.
s
perhaps
I'm running an old Ubuntu too
22.04
a
ah okay, that should be fine? I don't really know though
s
should be since it's LTS
a
yeah, definitely
s
I still wanna split the project up though; it's too complicated for people to dive into, given the 40 modules, IMO
a
Gradle has file-system watching (a VFS watch), but that should be fine. Still, you could try disabling it: https://docs.gradle.org/current/userguide/file_system_watching.html#supported_operating_systems
s
I'm trying to fix something atm, so will run these suggestions later this evening. Don't want my machine to crash mid zone 🙂
a
for sure!
I work on the Kotlin Gradle Plugin team, so I'm doing my part in trying to find problems 😄
s
oh you do
that's pretty cool
anyone at jetbrains use kotest?
a
umm kind of, I know Arrow uses it
I'm merging Dokkatoo into Dokka, and that uses Kotest
s
not heard of dokkatoo
I guess it's a play on dokka2?
a
exactly
basically the official Dokka Gradle Plugin is not very compatible with Gradle. It's quite outdated. So I wrote Dokkatoo, just as a better wrapper around the actual Dokka Engine
s
ok makes sense
I'm excited to get pushing on kotest 6
I have a much better intellij plugin experience almost ready to go
a
that's fantastic
going back to the original topic: `forAll with 22 exhaustives should run for each cross product` now runs 4 times faster than before! 4m33s vs 1m00s https://scans.gradle.com/s/42qggo5p6kaxm/tests/slowest-tests#forall-with-22-exhaustives-should-run-for-each-cross-product-2
s
Sick