Has anybody dug into any details around what makes Kotlin us kotlinlang #announcements

Has anybody dug into any details around what makes...

Thorkild

07/05/2020, 10:33 AM

Has anybody dug into any details around what makes Kotlin use a lot of memory when compiling? I've got a project that consists of a mix of generated and handwritten code and I've had to bump the compiler to 6GB memory to see it complete.. and I am curious. The project is only about 48000 lines (with whitespace) and 160-170 classes (+ a few data classes here and there). The largest kotlin class is 2600 lines long (with whitespace). (targeting jvm -- server style application)

👀 4

🙀 1

dmitriy.novozhilov

07/05/2020, 10:04 PM

Hello Can you share your project, if it's open source, so we can investigate this problem?

LeoColman

07/06/2020, 3:17 AM

The compiler usually takes a lot and uses a lot of memory when I'm using a lot of lambdas ans high order functions

LeoColman

07/06/2020, 3:17 AM

At #CT0G9SD7Z some classes takes 40s to show syntax errors in IntelliJ

LeoColman

07/06/2020, 3:17 AM

So I'd guess that the compiler is not very optimized when dealing with big classes and functions

Thorkild

07/06/2020, 4:21 AM

@dmitriy.novozhilov Sadly, no, it is a closed source project 😞 During an earlier run, I did jmap a bit on what consumed memory, and I was slightly surprised by the large increase in character array usage. Over a few snapshots, the "leader board" on memory usage seemed pretty stable. Attached is a jmap dump from when it was hitting the VM ceiling in an earlier run. I should rerun it now after it has went from 4GB to 6GB and see if it is similar.

kotlin-maven-high-memory-usage-during-compilation.txt

Thorkild

07/06/2020, 4:23 AM

This code doesn't really use that much advanced concepts, since a lot of it is highly boilerplated serialization/deserialization (but due to the lack of a helpful object model, it is cumbersome to do it through reflection etc., and instead code generation has been a good way to ensure we get typed interfaces with the data).

Thorkild

07/06/2020, 4:24 AM

(this is a kotlin-maven project, btw, if that would make any difference -- the excessive memory usage is visible both when building it from Intellij and straight maven -- not that this is surprising)

mikhail.zarechenskiy

07/06/2020, 7:20 AM

Just a guess, does your code contain many warnings?

mikhail.zarechenskiy

07/06/2020, 7:26 AM

Also, maybe you can provide just a part of your generated code? Without its actual semantics / naming, just to see which constructs do you use

sdeleuze

07/06/2020, 9:52 AM

@Thorkild What version of Kotlin compiler are you using?

Thorkild

07/06/2020, 10:41 AM

@sdeleuze 1.3.72

louiscad

07/06/2020, 10:43 AM

@Thorkild And about the first question of Mikhail at least… many code warnings or not?

Thorkild

07/06/2020, 10:49 AM

Working on checking. Maven shows none, but I find that highly unlikely that there isnt a single one, so I am checking if it is swallowing the warnings. I do use suppress annotations due to casting (unchecked_cast), which I am wondering if maybe generates a warning internally, and then the suppress just stops it from outputting the warning.

louiscad

07/06/2020, 10:50 AM

So, the Kotlin compilation is started from a maven build, not a Gradle one, right?

Thorkild

07/06/2020, 10:52 AM

Yes. The same problem exists when I Run it from IntelliJ, though, but I am unsure if it runs it through maven or not.

Thorkild

07/06/2020, 11:27 AM

310 unchecked cast suppressions in the code. I removed some of the suppressions, and maven then told me about the warnings, so I do not seem to have any warnings other than the ones suppressed. I tend to prefer to see warnings as errors , and avoid getting into the habit of seeing warnings as normal, so I tend to jump on warnings. So, it seems, if the problem is number of warnings, then the challenge is that the suppress doesn't make it ignore it completely.

Thorkild

07/06/2020, 11:45 AM

I now have to use 7GB to compile it, since I am now using more of those classes, so I am guessing it escalates for every time I use the classes containing the suppressed warnings.

mikhail.zarechenskiy

07/06/2020, 12:26 PM

No, 370 or 1000 warnings isn't bad at all, it'd be bad if there were several thousands of them

Matteo Mirk

07/06/2020, 12:34 PM

Since you have generated code, one thing you may try to relieve the compilation burden is to package the generated sources a separate module and import that as a dependency, so that you don’t have to recompile those files every time. A simple maven multi-module build will do the trick.

Matteo Mirk

07/06/2020, 12:46 PM

Moreover, your numbers for LoC and classes don’t seem too problematic, so the culprit of the problem must be somewhere else. Meanwhile, you can try to use this maven extension to enable incremental compilation and further reduce build time: https://github.com/takari/takari-lifecycle

Thorkild

07/06/2020, 1:34 PM

I have split out part of the code, but for this one it isn't the best thing to do (but I will have to if I have to). I dumped the heap of it and looked at the object memory usage now (through visualvm), but I dumped it too early so I think I did it before the stage where it really goes off the rails, so I am doing it again.

Matteo Mirk

07/06/2020, 3:04 PM

(sorry about Takari, I used it in the past but didn’t think it was only for Java)

Thorkild

07/08/2020, 11:13 AM

After giving up on visualvm's calculations (it ran for 24 hours without finishing), I tried MemoryAnalyser (Mat) from Eclipse (I have this feeling of dejavu of doing the exact same order of tests before for other things..). The main culprit is char[], but that's not the root cause of course, but the "leak" is divided across 66000 entries of org.jetbrains.kotlin.load.java.structure.impl.classFiles.BinaryJavaMethod . Trying to dig down further a bit, but on random sample, they all retain the same amount of bytes, which is 99% in the returnType "org.jetbrains.kotlin.load.java.structure.impl.classFiles.PlainJavaClassifierType",

Thorkild

07/08/2020, 11:13 AM

I think it might disregard the char[] referenced in this leak part of the leak analysis

Thorkild

07/08/2020, 11:15 AM

I have through through what is "special" with the code I have, and one thing I have thought of might be a problem (but I have for now not found evidence for it) is that this is an API that tries to give a typed layer on an underlying structure where field names etc. are text strings. So it has a very simple data class with two fields (field name and class-reference).. and there are a little over 5000 of those classes spread across companion objects across the 130-160 classes

Thorkild

07/08/2020, 11:15 AM

so if there is an assumption about few companion objects per class or some caching of that.... that might be a killer here

Matteo Mirk

07/13/2020, 1:07 PM

Does your project use kotlinx.html? Could be the cause with compiler 1.3.72 https://github.com/Kotlin/kotlinx.html/issues/147

8 Views

Open in Slack

Previous Next