<https adventofcode com 2024 day 22|Advent of Code 2024 day kotlinlang #advent-of-code

Join Slack

<Advent of Code 2024 day 22> (spoilers) :thread:

# advent-of-code

Advent of Code 2023 day 22

12/22/2024, 5:00 AM

Advent of Code 2024 day 22 (spoilers) 🧵

Dan Fingal-Surma

12/22/2024, 5:54 AM

What I'm doing right now is, for every secret number, compute the map of 4 deltas to price. That gives you a candidate set of delta sequences. Then find the one that maximizes price using those same maps. This works on the test input but has failed not the real input. I have to come back to debugging later.

Renette Ros

12/22/2024, 5:56 AM

Part 1 was fairly straightforward - just using generateSequence. I got the wrong answer originally because I misunderstood the 3 numbers as 3 separate new secrets.

Dan Fingal-Surma

12/22/2024, 5:56 AM

Kotlin makes this easy to do using generate sequence, windowed, zipWithNext, zip, etc

👍 4

Michael de Kaste

12/22/2024, 5:59 AM

glory to kotlins sequence/iterable functions 🙏

Day 22 Michael.cpp

🙌 2

Renette Ros

12/22/2024, 6:00 AM

Part 2, building on the sequence generated in part 1, use zipWithNext + windowed + groupingBy to calculate the total price for each sequence of deltas. I originally missed the rule that each buyer only wants to buy one hiding spot, fixed that by adding a distinctBy to the inner sequences.

Dan Fingal-Surma

12/22/2024, 6:00 AM

Thank you, that is my bug

👍 1

Dan Fingal-Surma

12/22/2024, 6:01 AM

I'm like this is obviously logically correct lol

Michael de Kaste

12/22/2024, 6:01 AM

I really wish there was some sort of 'associateBy' with an

onConflict: (T, T) -> T

function

Michael de Kaste

12/22/2024, 6:02 AM

and I also wish Maps had a merge function on their own instead of calling nested merges on their keys. Feels very stdlib

Dan Fingal-Surma

12/22/2024, 6:03 AM

So in Guava they have buildOrThrow() on ImmutableMap.Builder for this reason

Dan Fingal-Surma

12/22/2024, 6:03 AM

You probably don't want to silently lose data

Michael de Kaste

12/22/2024, 6:07 AM

Iirc, C# actually just throws an exception if you try to remap a key I'm fine with how Java/Kotlin does it, because

map[x] = y

is functionally the same as

array[x] = y

but I just think association could use a 'what if you want to recalculate the value`. I understand that, this is what 'groupingBy' does -> do not actually use a list, but define what you want. I just feel like groupingBy was introduced at Kotlin's start lifecycle and never got retouched with the same love as the normal Sequence/Iteratable functions do

Dan Fingal-Surma

12/22/2024, 6:07 AM

I think

reveresed

will produce the first one wins behavior. Will try it when I'm back in a computer

👍 1

Michael de Kaste

12/22/2024, 6:08 AM

yeah, but then you would lose the advantages of using a sequence, because then you would need to collect them and then reverse iterator order them.

Dan Fingal-Surma

12/22/2024, 6:08 AM

Hmm good point

Dan Fingal-Surma

12/22/2024, 6:09 AM

At first I thought you said C++ and I was like what are you talking about 😂

Michael de Kaste

12/22/2024, 6:10 AM

I had a professional Java -> C# -> Kotlin growth and C# REALLLLLY annoyed me with their lack of data structures and how they used them. Linq was great though

Dan Fingal-Surma

12/22/2024, 6:13 AM

Alright so forEach-as-collect

Dan Fingal-Surma

12/22/2024, 6:13 AM

I've done about half C++ and half Java but now I'm a Kotlin stan

Dan Fingal-Surma

12/22/2024, 6:27 AM

That worked. It bothers me that

forEach

and

collect

have different names

bj0

12/22/2024, 6:35 AM

holy smokes, took me 1.5 hours to think of what to try, but once i did it took about 5 min to solve

bj0

12/22/2024, 6:37 AM

looks like most people thought of it an hour and a half before i did laugh cry face palm

bj0

12/22/2024, 6:40 AM

i need to learn how to use grouping better, i never think of it

Marcin Wisniowski

12/22/2024, 6:41 AM

My part 2

kingsley

12/22/2024, 6:43 AM

I was doing

shl 10

instead of

shl 11

for the multiplied by 2048 and this ended up taking over 30 minutes to debug 🤦‍♂️ I tried to be smart with part 2 and failed woefully, so my final solution ends up taking about 4 seconds to compute. I'm curious if there's a smart way to speed this up

Max Thiele

12/22/2024, 6:46 AM

After some cleanup it doesn't look too terrible

Jonathan Kolberg

12/22/2024, 6:49 AM

Solved part 2, but was a stupid brute force, not really proud of the code, but hey it got me the stars: https://github.com/bulldog98/advent-of-code/blob/main/advent2024/src/main/kotlin/year2024/Day22.kt I have to watch again, how Sebastian did his parallelization of code.

Marcin Wisniowski

12/22/2024, 6:52 AM

@Jonathan Kolberg I think you want to switch to

Dispatchers.Default

, otherwise your

async

is not doing much.

👍 2

Jonathan Kolberg

12/22/2024, 6:54 AM

@Marcin Wisniowski thanks for the tip, I noticed it but I'm using coroutines for the first real time, so yeah.

Jakub Gwóźdź

12/22/2024, 7:04 AM

Today finally was the day when I gave up and brute forced it. 12 cores for twenty-something minutes went brrrrr. I'll make it proper solution later, I already think I know how.

Jakub Gwóźdź

12/22/2024, 7:05 AM

Copy code

fun <T, R> List<T>.mapParallel(op: (T) -> R) = runBlocking { map { async(Dispatchers.Default) { op(it) } }.awaitAll() }

for the rescue 🙂

Dan Fingal-Surma

12/22/2024, 7:06 AM

https://github.com/dfings/advent-of-code/blob/main/src/2024/problem_22.main.kts

Dan Fingal-Surma

12/22/2024, 7:07 AM

Takes a few seconds

Dan Fingal-Surma

12/22/2024, 7:18 AM

I’d be surprised if you find a faster algorithm but I’ve been surprised before

ephemient

12/22/2024, 7:30 AM

https://github.com/ephemient/aoc2024/blob/main/kt/aoc2024-lib/src/commonMain/kotlin/com/github/ephemient/aoc2024/Day22.kt

Anirudh

12/22/2024, 7:43 AM

I did a much more explicit union-all-"first"-keys and then brute force maxOf each key. also used a

data class Changes(a, b, c, d)

but I think it would be better to just build a Unified map on the first pass itself 🤔 with the values being added to a list, or just the sum directly 💡 will do that now

HCP

12/22/2024, 7:44 AM

Today was certainly much easier than yesterday! (I finally finished yesterday about 30 minutes after todays puzzles were released) I too had the issue where things worked for the example, but not the input... because I was maximising the price when getting the sequences from each buyer, rather than just the first instance of the sequence. It also sounds like I must also go and learn about all the fancy generate sequence, windows, zip stuff... as I am guessing I did a lot of "manual" work here... such as:

Copy code

fun getSequences(priceList: List<List<Int>>): List<Map<List<Int>, Int>> {
        val sequences = mutableListOf<MutableMap<List<Int>, Int>>()
        priceList.forEachIndexed { i, list ->
            sequences.add(mutableMapOf())
            list.forEachIndexed { j, price ->
                if (j >= 4) {
                    val sequence = listOf(
                        list[j-3]-list[j-4],
                        list[j-2]-list[j-3],
                        list[j-1]-list[j-2],
                        list[j]-list[j-1])
                    if (!sequences[i].containsKey(sequence)) sequences[i][sequence] = price
                }
            }
        }
        return sequences
    }

which is easily solved using built-in Kotlin functionality?

Jakub Gwóźdź

12/22/2024, 8:12 AM

I'd say it's abuse of stdlib at this point, but single-threaded it fits under 1s. What is not surprising: map operations are noticeable faster when keys are

.toString()

-ed

Untitled.kt

👍 1

Anirudh

12/22/2024, 8:14 AM

> just build a Unified map on the first pass ... sum directly 💡 ok, this way was MUCH faster. I didn't time the previous way, will commit & Ctrl+Z a bit Old Part Two time: 7.109126799s New Part Two time: 1.074562800s not quite 10x but in that range

Michael de Kaste

12/22/2024, 8:16 AM

@Jakub Gwóźdź what about

.reduce{ acc, it -> acc shl 5 + it + 9 }

Michael de Kaste

12/22/2024, 8:16 AM

at least I think that would work 🤔

Jakub Gwóźdź

12/22/2024, 8:19 AM

for code calculation?

phldavies

12/22/2024, 8:19 AM

urgh - Advent of Reading Comprehension again! I spent far too long debugging part 2 when I was using the example from part1 and expecting the part2 answer 🤦‍♂️

same 1

Jakub Gwóźdź

12/22/2024, 8:19 AM

yeah I tried reduce, similar performance as toString 🙂

Jakub Gwóźdź

12/22/2024, 8:20 AM

(I don't know why, intuition says Map<Long,...> should be faster than Map<String,...>, but experiments says nope

Jakub Gwóźdź

12/22/2024, 8:21 AM

@Michael de Kaste even

acc*100+it

works for that reduction

Anirudh

12/22/2024, 8:25 AM

for me, I changed

Copy code

Changes(a, b, c, d) to chg.last().first

Copy code

"$a,$b,$c,$d" to chg.last().first

and it's gone up to 1.3 secs, from 1.0 secs. but I'm not benchmarking correctly. just using

measureTime

Jakub Gwóźdź

12/22/2024, 8:27 AM

30% of change can happen from various things outside your program

👍 1

Jakub Gwóźdź

12/22/2024, 8:28 AM

I'm using measureTime as well, good enough for AoC 🙂

Anirudh

12/22/2024, 8:30 AM

yup, I see that variation too now, goes from 1.0 to 1.2 with the data class as key

Anirudh

12/22/2024, 8:32 AM

my part Two code with unified map on first pass. so only need to hold one buyer's sequence at a time in memory. and of course the keys of the unified map goes up to about 41k keys (around 1900-1950 keys from each buyer) but the values are Int's

Untitled.kt

phldavies

12/22/2024, 8:52 AM

interestingly I see destructuring the result of

zipWithNext(Long::minus)

into

"$a $b $c $d"

is the fastest (I'm using

distinctBy

to take only the first instance seen in the sequence)

Copy code

BenchDay22.dataClass                  avgt    3  1.065 ± 0.129   s/op
BenchDay22.destructureToString        avgt    3  0.516 ± 0.473   s/op
BenchDay22.joinToString               avgt    3  0.867 ± 0.142   s/op
BenchDay22.list                       avgt    3  1.462 ± 0.142   s/op
BenchDay22.reduce                     avgt    3  0.705 ± 0.227   s/op

Dan Fingal-Surma

12/22/2024, 9:34 AM

HashMap beats mutableMapOf (aka LinkedHashMap)

👍 1

Dan Fingal-Surma

12/22/2024, 10:09 AM

This is a real heap hog,

hash map entries

Jakub Gwóźdź

12/22/2024, 10:11 AM

How is that possible, @Dan Fingal-Surma ? Five digits can make 100000 max entries in worst case…

Dan Fingal-Surma

12/22/2024, 10:12 AM

I have 1 map per input line

Dan Fingal-Surma

12/22/2024, 10:13 AM

sequence price map for that input

Jakub Gwóźdź

12/22/2024, 10:15 AM

Ok, so c.a 1500 lines times 2000 entries worst case…

max?

Dan Fingal-Surma

12/22/2024, 10:16 AM

2256 * 1922

👍 2

Anirudh

12/22/2024, 10:16 AM

ah yes, I too did a List<Map...> in my first run. so yeah, each run would have 1997 entries max (but usually 1900-1950 due to repeats). so I guess you have approx 2200 ~~bots~~ monkeys. (I had approx 1500 ~~bots~~ monkeys)

Anirudh

12/22/2024, 10:19 AM

but the "union" of all keys (also for my first run) will have about 40k entries. so about 100 times smaller. so if you process for 'sums' or lists while generating the list-of-deltas and don't store it, you'll only need to have 1950 + 40k max entries at a time.

👍 1

Dan Fingal-Surma

12/22/2024, 10:25 AM

Yeah that’s a good idea

Jakub Gwóźdź

12/22/2024, 10:45 AM

yeah I have input 1571 lines, last hashMap is 40951 entries big. Which gives ~40% of all possibilities, probably way more as the digits like 89898 and 45454 give the same sequence

1,-1,1,-1,1

. But it's big enough percentage to justify counting in

IntArray(20*20*20*20*20) { 0 }

instead of a hashMap

Dan Fingal-Surma

12/22/2024, 10:46 AM

Much faster with one map

👍 1

phldavies

12/22/2024, 10:46 AM

@Jakub Gwóźdź I've literally just done that ( but with an

Array<Array<Array<IntArray>>>

)

Copy code

Warming up 2 puzzles for 10s each for year 2024 day 22...
	Acc warmed up with 74 iterations
	Default warmed up with 21 iterations
year 2024 day 22 part 1
	 Array took 22.518514ms 👑: 16039090236
	 Default took 24.194791ms (1.07x): 16039090236
year 2024 day 22 part 2
	 Array took 134.197583ms 👑: 1808
	 Default took 455.922416ms (3.40x): 1808

Jonathan Kolberg

12/22/2024, 10:46 AM

There are only 19 possiblities for the price difference, so an

IntArray(19*19*19*19*19) { 0 }

is enought

Jakub Gwóźdź

12/22/2024, 10:47 AM

yes, but 20x is easier to debug than 19x 🙂

😀 1

phldavies

12/22/2024, 10:47 AM

using two arrays, one to accumulate the banana count and one to accumulate the last seen buyer for that sequence - the latter is used to filter for just the first per buyer before incrementing the banana count

Jakub Gwóźdź

12/22/2024, 10:48 AM

honestly, I have this feeling that with

Copy code

private fun Long.nextSecret(): Long = step1().step2().step3()

private fun Long.step1() = shl(6) xor this and 0xFFFFFF
private fun Long.step2() = shr(5) xor this and 0xFFFFFF
private fun Long.step3() = shl(11) xor this and 0xFFFFFF

it might be enlightening to print all the numbers in sequence in binary 🙂

Anirudh

12/22/2024, 10:52 AM

I was going to check the binary of the output for part 1 but then just decide to finish part 1 and see if that investigation is needed for part 2.

Dan Fingal-Surma

12/22/2024, 10:52 AM

Copy code

fun encode(deltas: List<Int>) = ((deltas[0] + 10) shl 15) or ((deltas[1] + 10) shl 10) or ((deltas[2] + 10) shl 5) or ((deltas[3] + 10))

Dan Fingal-Surma

12/22/2024, 10:52 AM

(playing around)

phldavies

12/22/2024, 10:54 AM

why not

deltas.reduce { acc, delta -> acc shl 5 or (delta and 0x1F) }

Michael de Kaste

12/22/2024, 10:54 AM

this is why I stopped doing leetcode aswell, its micro-optimilization that takes away from the beauty of languages, but you could do this:

Copy code

part2 {
    val bananas = Array(19){ Array(19){ Array(19){ IntArray(19) } } }
    val visited = Array(19){ Array(19){ Array(19){ Array(19){ BitSet(secrets.size) } } } }
    var max = 0
    secrets.forEachIndexed { index, line ->
        line.map { it % 10 }.windowed(5){ (a,b,c,d,e) ->
            val ax = b - a + 9
            val bx = c - b + 9
            val cx = d - c + 9
            val dx = e - d + 9
            if(!visited[ax][bx][cx][dx].get(index)){
                bananas[ax][bx][cx][dx] += e
                visited[ax][bx][cx][dx].set(index)
                if(bananas[ax][bx][cx][dx] > max){
                    max = bananas[ax][bx][cx][dx]
                }
            }
        }
    }
    max
}

It speeds up my part2 to 213.419400ms Could be even faster with classic for loops

👍 1

phldavies

12/22/2024, 10:55 AM

Copy code

context(PuzzleInput) fun part2(): Int {
        val bananas = Array(19) { Array(19) { Array(19) { IntArray(19) } } }
        val seen = Array(19) { Array(19) { Array(19) { IntArray(19) { -1 } } } }
        var max = 0

        Parsers.Longs().forEachIndexed { buyer, secret ->
            generateSequence(secret) { it.evolve() }
                .map { it % 10 }.take(2001)
                .windowed(5) { (a, b, c, d, e) ->
                    val i = (e - d).mod(19)
                    val j = (d - c).mod(19)
                    val k = (c - b).mod(19)
                    val l = (b - a).mod(19)
                    if (seen[i][j][k][l] != buyer) {
                        seen[i][j][k][l] = buyer
                        bananas[i][j][k][l] += e.toInt()
                        max = maxOf(bananas[i][j][k][l], max)
                    }
                }
                .count()
        }

        return max
    }

was my take, @Michael de Kaste

👍 1

Michael de Kaste

12/22/2024, 10:56 AM

something something great minds 😂

Jakub Gwóźdź

12/22/2024, 10:58 AM

yeah I'm now around 400ms, I see some places for improvement, but need to do some other things 🙂

Jakub Gwóźdź

12/22/2024, 11:04 AM

ok, introducing buyer table took me to 300ms, but made the whole solution less readable 🙂

Dan Fingal-Surma

12/22/2024, 11:05 AM

The array approach is too ugly for me to continue with

Dan Fingal-Surma

12/22/2024, 11:05 AM

If I wanted to write C++ I’d write C++

Jakub Gwóźdź

12/22/2024, 11:05 AM

same 🙂

Jakub Gwóźdź

12/22/2024, 11:05 AM

and if I wanted super speed I wouldn't be using sequences anyway

Dan Fingal-Surma

12/22/2024, 11:19 AM

going to

Copy code

.map { (a, b, c, d) -> a.mod(19) * 6859 + b.mod(19) * 361 + c.mod(19) * 19 + d.mod(19) }

from

Copy code

.map { (a, b, c, d) -> "$a $b $c $d" }

saves 300-400ms

Dan Fingal-Surma

12/22/2024, 11:21 AM

IntArray(19 * 19 * 19 * 19) { 0 }

is clean enough and saves a further 250 ms

Jakub Gwóźdź

12/22/2024, 11:24 AM

yeah my

Copy code

fun part2(input: Input): Any {
    val array = IntArray(19 * 19 * 19 * 19 * 19)
    val done = BooleanArray(19 * 19 * 19 * 19 * 19)
    input.forEach { seed ->
        done.fill(false)
        generateSequence(seed, Long::nextSecret)
            .map { it % 10 }
            .take(2001)
            .windowed(5)
            .forEach { last5 ->
                val code = last5.zipWithNext { a, b -> b - a }.fold(0L) { acc, l -> acc * 19 + l + 9 }.toInt()
                if (!done[code]) {
                    done[code] = true
                    array[code] += last5.last().toInt()
                }
            }
    }
    return array.max()
}

makes it in 300ms (+/- 10%)

Max Thiele

12/22/2024, 11:29 AM

Moved it all to arrays and computing everything in a single pass. Part 1 is still my old implementation. Part 2 is almost equally fast (but uses coroutines...) Not very idiomatic anymore, but pretty fast... part2Fast:

Copy code

Part 1 median:      7.035 ms  (709 benchmark iterations)
Part 2 median:      7.179 ms  (676 benchmark iterations)

Dan Fingal-Surma

12/22/2024, 11:30 AM

whoops I was calculating prices twice due to re-using the sequence instead of converting to list

Dan Fingal-Surma

12/22/2024, 11:44 AM

Around 600ms from 10s

Dan Fingal-Surma

12/22/2024, 12:01 PM

Always make sure the code still gives the right answer before posting lol

😁 1

Jakub Gwóźdź

12/22/2024, 12:05 PM

@Dan Fingal-Surma oh that’s nothing. A few days ago I posted a visualization I was so proud of just to find out later that because of multiple optimizations on the way it was no longer correct 😁

Dan Fingal-Surma

12/22/2024, 12:10 PM

175ish ms, goodnight

👍 1

Dan Fingal-Surma

12/22/2024, 12:11 PM

(I was missing a + on line 18 which somehow compiled and produced the wrong answer??)

Dan Fingal-Surma

12/22/2024, 12:11 PM

oh it was dropping the final clause, treating it as a standalone expression

kingsley

12/22/2024, 12:47 PM

Tidied up a bit more and I think I'm done for this day. Got it down to:

Copy code

Cold
Part1: ~24ms. Part2: ~180ms

Warm
Part1: ~7ms. Part2: ~94ms

kingsley

12/22/2024, 12:54 PM

Also replaced

list.slice(a..b)

with

list.subList(a, b+1)

and saved a few extra milliseconds

ephemient

12/22/2024, 1:56 PM

@Jakub Gwóźdź

it might be enlightening to print all the numbers in sequence in binary 🙂

good luck with that 😉 https://en.wikipedia.org/wiki/Xorshift

today i learned 1

Jakub Gwóźdź

12/22/2024, 2:04 PM

@ephemient oh wait so this is a real PRNG? 😮 TIL

plus one 1

phldavies

12/22/2024, 4:23 PM

Copy code

Warming up 2 puzzles for 10s each for year 2024 day 22...
	Array warmed up with 520 iterations
	Default warmed up with 22 iterations
year 2024 day 22 part 1
	 Array took 7.885264ms 👑: 16039090236
	 Default took 15.524028ms (1.97x): 16039090236
year 2024 day 22 part 2
	 Array took 11.130472ms 👑: 1808
	 Default took 449.944569ms (40.42x): 1808

I think I'm done now - single threaded, no coroutines.

Jakub Gwóźdź

12/22/2024, 4:24 PM

no fancy window(5) / zipWithNext neither, I presume? 🙂

phldavies

12/22/2024, 4:26 PM

Copy code

context(PuzzleInput) fun part2(): Int {
        val len = 0xFFFFF
        val bananas = IntArray(len)
        val seen = IntArray(len) { -1 }
        var max = 0

        for ((buyer, secret) in Parsers.Longs().withIndex()) {
            var s = secret
            var prevPrice = (s % 10).toInt()
            var delta = 0
            for (i in 0..<2000) {
                s = s.evolve()
                val price = (s % 10).toInt()
                delta = (delta shl 5 and 0xFFFFF) or (9 + price - prevPrice)
                prevPrice = price
                if (i >= 3 && seen[delta] < buyer) {
                    seen[delta] = buyer
                    val newBananas = bananas[delta] + price
                    bananas[delta] = newBananas
                    max = maxOf(newBananas, max)
                }
            }
        }

        return max
    }

nope - just abusing a nice large sparse array and some bit twiddling

👍 1

Dan Fingal-Surma

12/22/2024, 6:54 PM

I was gonna say, looks like a hash function to me

phldavies

12/22/2024, 6:56 PM

Each delta is (made to be) in the range of 0..18, which means they all fit in 5 bits. Four would take up 20 bits. So I just store them all in a single Int. shl 5 keeping the lowest 20 bits keeps just the last for rolling deltas.

🧠 1

kingsley

12/22/2024, 7:03 PM

I did exactly the same thing with the 5 bits. Though I used just 1 array. And I was able to reduce the size from 0xfffff to about half of that since 0..18 doesn't necessarily use up all 5 bits

phldavies

12/22/2024, 7:05 PM

How did you go about ensuring you only count a single instance of each with only a single array?

kingsley

12/22/2024, 7:06 PM

Copy code

val map = IntArray(608850)
for (n in input) for ((k, v) in sequences(n)) map[k] += v
println(map.max())

phldavies

12/22/2024, 7:07 PM

Ah you track the state per sequence. I was attempting to avoid any sequences so I used two arrays total for the entire solution.

phldavies

12/22/2024, 7:08 PM

Tracked the last input index to avoid needing to clear the "seen" array between buyers

kingsley

12/22/2024, 7:08 PM

Yea. I tried that but it became slower. I guess using the extra seen array should avoid the issue I was having. Though it also felt dirty creating such a large array

phldavies

12/22/2024, 7:09 PM

"large" is relative. It's 2.5MB of ints.

kingsley

12/22/2024, 7:10 PM

True. My thinking was relative to how much of it actually gets filled. I think I only had about 18k or so unique keys (can't quite remember now)

phldavies

12/22/2024, 7:12 PM

I did try a more compact 19*18*18*18 array layout but shifting was faster. Figured I could spare a few more KB

kingsley

12/22/2024, 7:14 PM

Haha. I also did the same. But the memory saving here ended up costing more in runtime when finding the actual max. The flat array also looks nicer, so whatever

kingsley

12/22/2024, 7:15 PM

Ah. I could have just done the max inline

👍 1

phldavies

12/22/2024, 7:15 PM

I maintained the max value as I increased each key, to avoid needing a second scan. I only use the array to keep track of individual sums

➕ 2

kingsley

12/22/2024, 7:16 PM

Makes sense

Dan Fingal-Surma

12/22/2024, 8:55 PM

Looking at my last screenshot above. Somehow if inline the addPrices function into the loop, I go from 175ms to 250ms. Any idea why? (live link: https://github.com/dfings/advent-of-code/blob/main/src/2024/problem_22.main.kts)

Dan Fingal-Surma

12/22/2024, 8:58 PM

incrementally computing the max only saves me about 2-5ms

Dan Fingal-Surma

12/22/2024, 8:58 PM

Hmm these numbers are not warmed

Dan Fingal-Surma

12/22/2024, 8:59 PM

Still a 50ms penalty for inlining even with warming

Dan Fingal-Surma

12/22/2024, 9:00 PM

currently hitting 110ms warmed

Dan Fingal-Surma

12/22/2024, 9:05 PM

might be stack allocating the boolean array?

kingsley

12/22/2024, 9:15 PM

You could try removing the

{ 0 }

and

{ false }

when initializing the int and Boolean arrays

Dan Fingal-Surma

12/22/2024, 9:16 PM

Right but that's the same in both cases?

kingsley

12/22/2024, 9:18 PM

The initializer does an extra fill operation that might be a bit heavy, especially for the Boolean array which also happens for every entry in the initial values

Dan Fingal-Surma

12/22/2024, 9:51 PM

Maybe saves 3ms? Hard to say. You could expect such a redundant initialization to be optimized out

Dan Fingal-Surma

12/22/2024, 9:53 PM

ok but adding

.toIntArray()

after

.toList()

on prices saves a good 15-20ms

Dan Fingal-Surma

12/22/2024, 9:53 PM

now I’m sub 100

Dan Fingal-Surma

12/22/2024, 9:54 PM

doing

+ 9

instead of

.mod(19)

also saves another 15-20ms

👍 1

Dan Fingal-Surma

12/22/2024, 10:03 PM

ok and then another 20ms by removing the sequence

Dan Fingal-Surma

12/22/2024, 10:03 PM

Copy code

val prices = IntArray(2001)
    var v = s
    for (i in 0..2000) {
        prices[i] = (v % 10).toInt()
        v = next(v)
    }

Dan Fingal-Surma

12/22/2024, 10:39 PM

now the inlining penalty has gone away

Dan Fingal-Surma

12/22/2024, 10:46 PM

Another 20 ms:

Copy code

(v xor s) % (1 shl 24)

Copy code

(v xor s) and 0xFFFFFF

Dan Fingal-Surma

12/22/2024, 10:47 PM

in the 35-40 range now

👏 2

Dan Fingal-Surma

12/22/2024, 10:57 PM

good time to stop

19 Views

Open in Slack

Previous Next