Day 15 solution thread kotlin intensifies kotlinlang #advent-of-code

Join Slack

Day 15 solution thread :kotlin-intensifies:

# advent-of-code

adamratzman

12/15/2020, 4:55 AM

Day 15 solution thread K

David Whittaker

12/15/2020, 6:03 AM

Looking forward to seeing your functional solution.

Copy code

private fun runProgram() {
    val last = mutableMapOf<Int,Int>()
    val previous = mutableMapOf<Int,Int>()

    last[2] = 1
    last[0] = 2
    last[1] = 3
    last[7] = 4
    last[4] = 5
    last[14] = 6
    last[18] = 7

    var wasfirst = true
    var current = 18
    var pos = 8
    while (pos <= 30000000) {
        if (wasfirst) {
            last[current] = pos - 1
            current = 0
            if (last.containsKey(0)) {
                previous[0] = last[0]!!
            }
            last[0] = pos
            wasfirst = false
        } else {
            val d = last[current]!! - previous[current]!!
            if (last.containsKey(d)) {
                previous[d] = last[d]!!
            } else {
                wasfirst = true
            }
            current = d
            last[d] = pos
        }
        pos++
    }

    println("Puzzla answer: ${current}")
}

adamratzman

12/15/2020, 6:04 AM

Just trying to clean it up now 😄 Yours looks clean!

🙏 1

David Whittaker

12/15/2020, 6:06 AM

Hahaha -- oops left that

in there - all my solutions start with single letter variables to save time. Forgot to change that one.

😄 1

adamratzman

12/15/2020, 6:58 AM

My solution (MutableMap.insertOrAppend is a utility function)

Copy code

package aoc2020

import common.*

class Aoc2020Day15 : Problem(2020, 15) {
    val numbers = input.split(",").map { it.toInt() }

    val numberGenerator = sequence {
        val memo = mutableMapOf<Int, MutableList<Int>>()
        numbers.forEachIndexed { index, i -> memo.insertOrAppend(i, index); yield(i) }
        var current = numbers.last()
        var i = numbers.size
        while (true) {
            val currentIndices = memo.getValue(current)
            val difference = if (currentIndices.size != 1) currentIndices[currentIndices.lastIndex] - currentIndices[currentIndices.lastIndex - 1]
            else 0
            yield(difference)
            memo.insertOrAppend(difference, i)
            memo[difference] = memo[difference]!!.takeLast(2).toMutableList()
            current = difference
            i++
        }
    }


    override fun solvePart1(): Any {
        return numberGenerator.take(2020).last()
    }

    override fun solvePart2(): Any {
        return numberGenerator.take(30000000).last()

    }
}

fun main() {
    Aoc2020Day15().solve()
}

andyb

12/15/2020, 9:46 AM

That was pretty straightforward today

Day 15

andyb

12/15/2020, 10:04 AM

Just realised that I can simplify my code by using

Copy code

val num = counter - previousNumbers.getOrDefault(speak, counter)

Jakub Gwóźdź

12/15/2020, 12:09 PM

I did the first approach storing prev entries in HashMap and it was... 2.5s in JVM, but over 5 minutes in JS-Browser. It's probably because Kotlin transpilation to JS is heavily flawed regarding to maps. Then I thought "f... it, memory is cheap". And preallocated IntArray(30000000). Works instantly under 0.6s both on JS and JVM Stupid puzzle today 😕

ephemient

12/15/2020, 1:47 PM

https://github.com/ephemient/aoc2020/blob/main/kt/src/main/kotlin/io/github/ephemient/aoc2020/Day15.kt array-based

ephemient

12/15/2020, 1:49 PM

https://github.com/ephemient/aoc2020/blob/dcca540f34bcd04aa16de7fd5b2d386a1d5b9487/kt/src/main/kotlin/io/github/ephemient/aoc2020/Day15.kt map-based version in git history was a little more elegant, but either way this is very little code (although it did take quite a while to figure out)

ephemient

12/15/2020, 1:50 PM

really, this was all it took…

Untitled

Nir

12/15/2020, 2:58 PM

Copy code

val input = listOf(8,0,17,4,1,12)

fun run(lastTurn: Int): Int {
    val state = mutableMapOf<Int, Int>().also { map ->
        input.subList(0, input.size-1).withIndex().associateTo(map) { it.value to it.index+1 }
    }
    var lastNumber = input.last()

    for (turn in (input.size) until lastTurn) {
        val curNumber = turn - state.getOrPut(lastNumber) { turn }
        lastNumber = curNumber
    }

    return lastNumber
}

fun part1() = run(2020)
fun part2() = run(30000000)

todd.ginsberg

12/15/2020, 4:07 PM

I ended up writing a sequence:

Copy code

class Day15(input: String) {

    private val startingNumbers = input.split(",").map { it.toInt() }

    fun solve(turns: Int): Int =
        memoryGame().drop(turns-1).first()

    private fun memoryGame(): Sequence<Int> = sequence {
        yieldAll(startingNumbers)
        val memory = startingNumbers.mapIndexed { index, i -> i to index }.toMap().toMutableMap()
        var turns = startingNumbers.size
        var sayNext = 0
        while(true) {
            yield(sayNext)
            val lastTimeSpoken = memory[sayNext] ?: turns
            memory[sayNext] = turns
            sayNext = turns - lastTimeSpoken
            turns++
        }
    }
}

todd.ginsberg

12/15/2020, 4:08 PM

Blog: https://todd.ginsberg.com/post/advent-of-code/2020/day15/

🙌 2

Fredrik Rødland

12/15/2020, 4:42 PM

https://github.com/fmmr/advent/blob/master/src/main/kotlin/no/rodland/advent_2020/Day15.kt

bjonnh

12/15/2020, 7:44 PM

https://github.com/bjonnh/advent_of_code/blob/master/src/main/kotlin/y2020/day15/main.kt

bjonnh

12/15/2020, 7:44 PM

not happy because it is slow

bjonnh

12/15/2020, 7:44 PM

but it works

Nir

12/15/2020, 7:45 PM

You can get a roughly x2 speedup simply by only accessing the cache once

Nir

12/15/2020, 7:45 PM

Copy code

lastNum = cache[lastNum]?.let { pos - it } ?: 0L
        cache[oldNum] = pos

Nir

12/15/2020, 7:45 PM

I had this initially too but you can have a single

getOrPut

call instead

Nir

12/15/2020, 7:46 PM

as a bonus it even looks nicer

bjonnh

12/15/2020, 8:53 PM

I don't see how to replace that with a getorput

bjonnh

12/15/2020, 8:53 PM

oh i see

bjonnh

12/15/2020, 8:53 PM

nevermind

Nir

12/15/2020, 8:54 PM

yeah it wasn't that obvious to me at first either

Nir

12/15/2020, 8:58 PM

hmm actually now I'm doubting myself... 🙂

Nir

12/15/2020, 8:59 PM

i could have sworn I ran it with getOrPut and got the same answer, but now I'm wondering how that can work, will need to double checkl ater

bjonnh

12/15/2020, 8:59 PM

your map construction is complicated

bjonnh

12/15/2020, 8:59 PM

there is a much simpler way

bjonnh

12/15/2020, 9:00 PM

val map = input.mapIndexed { idx, it -> it to idx }.toMap().toMutableMap()

Nir

12/15/2020, 9:01 PM

toMap.toMutableMap?

Nir

12/15/2020, 9:01 PM

can't just do toMutableMap?

bjonnh

12/15/2020, 9:01 PM

I don't think so

Nir

12/15/2020, 9:01 PM

Anyhow, I didn't do it that way on purpose, I don't like creating the extra map

bjonnh

12/15/2020, 9:02 PM

well you made a sublist…

Nir

12/15/2020, 9:02 PM

yes

Nir

12/15/2020, 9:02 PM

that doesn't create an extra list

Nir

12/15/2020, 9:02 PM

creates a view

bjonnh

12/15/2020, 9:02 PM

hmm

bjonnh

12/15/2020, 9:03 PM

didn't know that, cool

Nir

12/15/2020, 9:03 PM

Yeah, it's like a slice in python basically with more awkward syntax 🙂

Nir

12/15/2020, 9:04 PM

I have been trying to write kotlin where I'm not dumping tons of extra copies of data structures around even if it is not necessary

Nir

12/15/2020, 9:05 PM

Sorry it's not getOrPut, but simply put

Nir

12/15/2020, 9:05 PM

that's needed

bjonnh

12/15/2020, 9:06 PM

wouldn't mutableMapOf<Int,Int>().also { itputAll(list.mapIndexed { idx,it->it to idx})}

bjonnh

12/15/2020, 9:07 PM

be as fast as your solution anyway?

bjonnh

12/15/2020, 9:09 PM

put wouldn't work either

bjonnh

12/15/2020, 9:09 PM

oh if I use ?:pos

bjonnh

12/15/2020, 9:09 PM

yeah

Nir

12/15/2020, 9:10 PM

yeah, I did

turn - (state.put(lastNumber, turn) ?: turn)

Nir

12/15/2020, 9:10 PM

or obviously

state.put(lastNumber, turn)?.let { turn - it } ?: 0

bjonnh

12/15/2020, 9:11 PM

makes me win ~500ms not bad

Nir

12/15/2020, 9:12 PM

That's surprising, still takes me a good 5-6 seconds for part 2 even with this trick

bjonnh

12/15/2020, 9:13 PM

oh takes 3 on my machine

bjonnh

12/15/2020, 9:13 PM

(the full thing)

Nir

12/15/2020, 9:13 PM

yeah makes sense. With your mapIndexed, you'll still have to insert it into the map afterwards though so I don't think you're gaining anything

Nir

12/15/2020, 9:14 PM

I think you pretty much need to have

associateTo

there unless there's another function that does that

bjonnh

12/15/2020, 9:14 PM

what's your CPU?

Nir

12/15/2020, 9:14 PM

that was on a laptop

Nir

12/15/2020, 9:14 PM

i dunno, maybe an i7 or something

Nir

12/15/2020, 9:15 PM

I didn't benchmark or anything, just looking at how long it says in grade 🙂

bjonnh

12/15/2020, 9:15 PM

Copy code

fun
    val cache = mutableMapOf<Int, Int>().also { it.putAll(list.mapIndexed { idx, it -> it to idx }) }
    val num = turns - 1
    if (num < list.size) return list[num]
    var lastNum = list.last()
    (list.size - 1 until num).forEach { pos -> lastNum = pos - (cache.put(lastNum, pos) ?: pos) }
    return lastNum
}

fun main() {
    val input = listOf(9, 3, 1, 0, 8, 4)
    println(finalNumCached(input, 2020))
    println(finalNumCached(input, 30000000))
}

bjonnh

12/15/2020, 9:15 PM

gradle tells me 3s but that's a ryzen 3700X that's a pretty fast machine

Nir

12/15/2020, 9:17 PM

What would you name this function

Nir

12/15/2020, 9:17 PM

Copy code

fun MutableMap<K, V>.foo(k: K, v: V) = put(k, v) ?: v

bjonnh

12/15/2020, 9:20 PM

updateOrDefault?

bjonnh

12/15/2020, 9:25 PM

also I can run it in ~2s by disabling the GC and giving it 2g of ram (Epsilon GC)

Nir

12/15/2020, 9:33 PM

maybe updateAndDefault

Nir

12/15/2020, 9:33 PM

it always does the update is the thing

Nir

12/15/2020, 9:33 PM

getAndUpdate maybe

Nir

12/15/2020, 9:33 PM

since it's doing the get first in the typical use case

bjonnh

12/15/2020, 9:35 PM

yeah getThenUpdate

bjonnh

12/15/2020, 9:35 PM

maybe And is clear enough though

Nir

12/15/2020, 9:36 PM

I think maybe actually putOrDefault; in a vacuum it's less clear but given that

put

already exists it makes sense

bjonnh

12/15/2020, 9:42 PM

putWithDefault?

Nir

12/15/2020, 9:45 PM

yeah that's probably better actually

Nir

12/15/2020, 9:45 PM

👍

Nir

12/15/2020, 9:45 PM

I love extension functions

Nir

12/15/2020, 9:46 PM

been using this as well:

Copy code

fun <K, V> MutableMap<K, V>.update(k: K, transform: (V?) -> V) = set(k, transform(get(k)))

bjonnh

12/15/2020, 9:47 PM

Using graalVM, 5.4s 😄

Nir

12/15/2020, 9:47 PM

very useful, e.g. incrementing

foo.update(key) { (it ?: 0) + 1}

Nir

12/15/2020, 9:56 PM

for loop vs for each? Interesting. technically for each does not need to support break or continue really, right... though you could do qualified returns from the lambda I believe, so I'm not sure

bjonnh

12/15/2020, 9:57 PM

no I deleted 😄

bjonnh

12/15/2020, 9:57 PM

that was a mistake

bjonnh

12/15/2020, 9:57 PM

I didn't use the same GC parameters that's why

Nir

12/15/2020, 9:58 PM

hah ok

Nir

12/15/2020, 9:58 PM

makes sense

bjonnh

12/15/2020, 9:58 PM

they have the exact same performance

bjonnh

12/15/2020, 10:00 PM

Using large pages I won another 500ms

bjonnh

12/15/2020, 10:01 PM

I may reach the below 2s 😄

bjonnh

12/15/2020, 10:12 PM

nope

bjonnh

12/15/2020, 10:12 PM

too bad 😄

Nir

12/15/2020, 10:13 PM

just use a giant array instead of a map

Nir

12/15/2020, 10:14 PM

it's kind of gross because you need to preallocate it and if it's not big enough then it's no bueno

Nir

12/15/2020, 10:14 PM

but it's pretty fast if it is big enough

bjonnh

12/15/2020, 10:28 PM

1.1s

bjonnh

12/15/2020, 10:28 PM

yep that's fast

Nir

12/15/2020, 10:29 PM

hard to imagine how to really beat that at this point; it's an O(N) algorithm with just a couple of very cheap steps in the main loop

bjonnh

12/15/2020, 10:30 PM

yeah in the jvm probably not

bjonnh

12/15/2020, 10:33 PM

oh 0.7s

bjonnh

12/15/2020, 10:33 PM

IntArray instead of Array

bjonnh

12/15/2020, 10:33 PM

0.356s

bjonnh

12/15/2020, 10:33 PM

using the epsilonGC and quite a few other optimizations (large pages, compressed class pointers and compressed oops)

bjonnh

12/15/2020, 10:38 PM

I'll stop here, but probably using jaotc would go even faster

bjonnh

12/16/2020, 12:08 AM

interesting, the few rust solutions I found are all slower than that

Nir

12/16/2020, 12:25 AM

with an identical approach?

bjonnh

12/16/2020, 12:30 AM

seems so I don't know rust enough

bjonnh

12/16/2020, 12:30 AM

but they are preallocating yes

Nir

12/16/2020, 12:38 AM

pretty strange. maybe they forgot to turn on optimizations 😂

Nir

12/16/2020, 12:39 AM

a lot of people come to rust from languages like ruby and actually forget this

bjonnh

12/16/2020, 12:40 AM

no I did compile it and turned them on

bjonnh

12/16/2020, 12:41 AM

but yeah I was expecting it to be 2-5 times faster

bjonnh

12/16/2020, 12:41 AM

but in term of number of operations (CPU measured using perf), solutions are pretty similar

bjonnh

12/16/2020, 12:41 AM

and the code is small, so I expect the AOT to have optimized it pretty well

Nir

12/16/2020, 12:47 AM

I wouldn't expect 5 times, the JVM tends to actually do well on this kind of microbenchmark

Nir

12/16/2020, 12:47 AM

But it is strange it's not at least moderately faster

Nir

12/16/2020, 12:47 AM

The big question is did they have hash table API so that they only have to go through it once

Nir

12/16/2020, 12:48 AM

This problem is basically a hash table benchmark :-)

Nir

12/16/2020, 12:48 AM

Unless of course they used Vec in which case it's definitely surprising

bjonnh

12/16/2020, 12:49 AM

looks like vec to me

bjonnh

12/16/2020, 12:49 AM

I couldn't find any solution that runs in less than 400ms

bjonnh

12/16/2020, 12:49 AM

(in rust)

ephemient

12/16/2020, 12:23 PM

@bjonnh at least for my solution, Rust debug vs Rust release makes a huge difference

Untitled

ephemient

12/16/2020, 12:24 PM

using Linux's

perf stat

to measure a bare-bones C solution I whipped up, I'm pretty sure it's not possible for me to do better than that on this hardware

Untitled

ephemient

12/16/2020, 12:26 PM

all the time is going into L1 cache misses

bjonnh

12/16/2020, 5:16 PM

Yes I did run every code I found in rust as opt level 3

bjonnh

12/16/2020, 5:16 PM

didn't even dare to try non optimized

bjonnh

12/16/2020, 5:17 PM

@ephemient Can you give me your c solution so I test on my machine ? see how it compares to my kotlin optimized version?

ephemient

12/16/2020, 5:18 PM

https://gist.github.com/ephemient/d3c9a554361ab349a3df76f5d4da71c4

bjonnh

12/16/2020, 5:19 PM

that's fast 😄

bjonnh

12/16/2020, 5:20 PM

with my input you are at 670 million instructions

bjonnh

12/16/2020, 5:20 PM

my kotlin optimized version is 1 billion

bjonnh

12/16/2020, 5:20 PM

so it is not that far

bjonnh

12/16/2020, 5:21 PM

Perf says 350 ms for my kotlin solution, 342ms for your C version

ephemient

12/16/2020, 5:22 PM

perf stat -d

will give some more detail (such as L1 hit rates)

bjonnh

12/16/2020, 5:23 PM

7.83% misses for the kotlin version 14.6% for your version

bjonnh

12/16/2020, 5:24 PM

there is a slight difference that you are loading the file on disk, I hard coded the solution though

bjonnh

12/16/2020, 5:24 PM

so that may play a little

ephemient

12/16/2020, 5:24 PM

tiny input file so that shouldn't account for much

ephemient

12/16/2020, 5:25 PM

so I'd estimate that JVM is touching more local memory to do its own housekeeping work, which isn't actually taking much more time

ephemient

12/16/2020, 5:25 PM

but the root of the problem is that there's a lot of data to work through and that is limited by hardware regardless of the language

bjonnh

12/16/2020, 5:26 PM

(I compiled with -O3 -march=native for your program)

bjonnh

12/16/2020, 5:26 PM

JVM I disabled the GC (epsilonGC)

bjonnh

12/16/2020, 5:27 PM

interestingly, doing nums[x]=y for all my inputs adds 30ms to the program compared to reading the file

bjonnh

12/16/2020, 5:28 PM

I'm a bit confused as why but it seems consistent

bjonnh

12/16/2020, 5:28 PM

oh and also I'm running it two times in my kotlin program

bjonnh

12/16/2020, 5:28 PM

didn't thought about that, I run it twice for 2020 and 30m

bjonnh

12/16/2020, 5:28 PM

that's another optimization right here

ephemient

12/16/2020, 5:29 PM

well, in theory the extra branch could be bad

ephemient

12/16/2020, 5:29 PM

in practice it's always* predicted correctly

bjonnh

12/16/2020, 5:31 PM

yeah the 2020 was so fast that it doesn't change much

bjonnh

12/16/2020, 5:31 PM

I get the same time (within error range)

bjonnh

12/16/2020, 5:32 PM

I could cheat and duplicate the code to remove the branch

bjonnh

12/16/2020, 5:33 PM

no difference except an higher cache miss rate (but still within error)

11 Views

Open in Slack

Previous Next