# advent-of-code
a
⚠️ Day 8 Solution Thread  ⚠️
d
At least I got sub 300 for part 1, that makes me happy!
`when` to the rescue yet again!!
a
Ikr, `when` is really nice here!
Copy code
private val input = readInput("input8.txt")

data class Instruction(val name: String, val argument: Int)

fun main() {
    val instructions = input.split("\n").map { it.split(" ") }.map { Instruction(it[0], it[1].toInt()) }
    fun part1(): Int {
        var accumulator = 0
        var currentLine = 0
        val linesRun = mutableListOf<Int>()

        while (currentLine < instructions.size && currentLine !in linesRun) {
            linesRun += currentLine
            val instruction = instructions[currentLine]
            when (instruction.name) {
                "acc" -> {
                    accumulator += instruction.argument
                    currentLine++
                }
                "nop" -> currentLine++
                "jmp" -> currentLine += instruction.argument
            }
        }
        return accumulator
    }
    println("Part 1: ${part1()}")

    fun part2() {
        instructions
            .mapIndexed { index, instruction -> index to instruction }
            .filter { it.second.name in listOf("jmp", "nop") }
            .map { it.first }.forEach { a ->
                var accumulator = 0
                val testInstructions = instructions.toMutableList().apply {
                    val oldInstruction = this[a]
                    this[a] = oldInstruction.copy(name = if (oldInstruction.name == "jmp") "nop" else "jmp")
                }

                var currentLine = 0
                val linesRun = mutableListOf<Int>()
                while (currentLine < testInstructions.size && currentLine !in linesRun) {
                    val instruction = testInstructions[currentLine]
                    linesRun += currentLine
                    when (instruction.name) {
                        "acc" -> {
                            accumulator += instruction.argument
                            currentLine++
                        }
                        "nop" -> currentLine++
                        "jmp" -> currentLine += instruction.argument
                    }
                }
                if (currentLine == testInstructions.size) {
                    println("Part 2: $accumulator")
                    return@forEach
                }
            }
    }

    part2()

}
Did the naive solution of going through every jmp/nop, switching it, and trying. Still ran in under a second
d
I took a gamble and only did nop -> jmp, but that had no solution so I had to change it to jmp -> nop. Sigh
😥 1
a
Unlucky 😞
d
ikr. 😞 wait - do you have an inner `input` shadowing your outer `input`? lol -- naming is hard, naming fast is super hard
a
Yep @David Whittaker, I do 😂 I forgot that I named the actual file input “input”. I’m going to clean this one up though
e
My solution. Kept wondering if there is a more optimal way to do part 2 instead of replacing every `jmp`/`nop` with the opposite, checking for loops, trying the next instruction, etc. I suppose you could backtrack from the point where it loops back, but part 2 runs in under 2ms (avg. of 20k runs), so... good enough!
j
Yeah, brute forcing is more than fast enough. I considered launching simulators in parallel coroutines, but it probably wouldn't solve much in this case.
Copy code
| Platform         | Average (ms) | Measurements (ms) |
| -----------------| ------------:|------------------:|
| GraalVM          | 8.7±9.5      | `47, 15, 16, 7, 5, 5, 3, 4, 5, 6, 5, 5, 7, 5, 5, 5, 5, 6, 6, 4` |
| Node JS          | 23.3±14.1    | `81, 37, 20, 22, 24, 17, 20, 17, 18, 23, 15, 18, 17, 18, 15, 16, 16, 20, 19, 23` |
| Native           | 124±17       | `106, 134, 141, 112, 136, 105, 139, 106, 134, 107, 153, 108, 128, 135, 117, 119, 154, 90, 118, 124` |
separate module in case of reuse... not sure if we should expect that or not
n
Yeah I didn't see an obviously better way than brute force, would be interested to see
e
@Nir I think one could backtrack from the point where a loop "connects" and change, for example, the previous command and test it; if it's still broken, change the one before that, and so on. Basically, when you detect a loop, run the program in reverse, changing the instructions and trying again from that point.
n
I thought about that but it's a bit vague, you don't really know which instruction in the loop sequence to change
So at best it seems like more guided brute force
e
All of them, one at a time. Much like brute-forcing from the top. Except a bit different.
Like:
Copy code
detect loop
pointer--
replace instruction if necessary // decrement accumulator if it's an acc
run code
if detects loop, put the original instruction back, pointer--, try again
else hooray!
Of course, you'd need to run the altered instructions in a different "computer". Or save which instruction you changed and what address you were at.
Though, I think the worst-case time complexity is still essentially the same.
n
There isn't really any way to run it backwards; you'd need to keep all the state
So it's complicating the data structures and such considerably for maybe a better heuristic... Might make the worst case worse
e
Yeah. Alternatively, you could let it loop once more to see which commands are executed in the loop and then try those one by one going forwards. But then I guess it is possible that a `jmp` will jump you into a loop that you otherwise wouldn't have got to.
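Roughly what that narrowing could look like in Kotlin (a sketch only; `Instruction` matches the data class from the solutions above, the helper names are made up):
Copy code
// Only instructions the original, looping run visits can be the one to flip:
// execution is identical up to the flipped instruction, and the original run
// never leaves its visited set.
data class Instruction(val name: String, val argument: Int)

// Runs the program; returns the accumulator if it terminates, or null if it loops.
fun runToEnd(program: List<Instruction>): Int? {
    var acc = 0
    var pc = 0
    val seen = mutableSetOf<Int>()
    while (pc < program.size) {
        if (!seen.add(pc)) return null // revisited an instruction -> infinite loop
        val (name, arg) = program[pc]
        when (name) {
            "acc" -> { acc += arg; pc++ }
            "jmp" -> pc += arg
            else -> pc++ // "nop"
        }
    }
    return acc
}

fun part2Narrowed(program: List<Instruction>): Int? {
    // Collect the instructions the unmodified (looping) run actually executes.
    val visited = mutableSetOf<Int>()
    var pc = 0
    while (pc < program.size && visited.add(pc)) {
        pc += if (program[pc].name == "jmp") program[pc].argument else 1
    }
    // Flip only jmp/nop instructions inside that set and re-run each candidate.
    return visited
        .filter { program[it].name != "acc" }
        .firstNotNullOfOrNull { i ->
            val flipped = program.toMutableList()
            flipped[i] = program[i].copy(name = if (program[i].name == "jmp") "nop" else "jmp")
            runToEnd(flipped)
        }
}
A flipped `jmp` can still land you in some other loop, but `runToEnd` just reports that and the next candidate gets tried.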
t
I brute forced part 2. Basically I generate new instruction sets one at a time and try them out until one terminates successfully rather than failing. Meh. I did write a data class for the instructions ("instruction, flip thine self!") and the computer ("run until you stop and tell me how you stopped"). From there it was just mapping data. I'll probably revise it before I post it, but work calls for now!
e
I implemented it in Haskell (was easier there), will port to Kotlin later - doing a graph search for part 2 is way faster than brute force
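For what it's worth, the graph-search idea can look roughly like this in Kotlin (a sketch, not a port of the Haskell version; `Instruction` is the same data class as in the snippets above, and it relies on the puzzle's guarantee that flipping exactly one jmp/nop fixes the program):
Copy code
fun part2GraphSearch(program: List<Instruction>): Int {
    val n = program.size
    fun next(i: Int, name: String) = if (name == "jmp") i + program[i].argument else i + 1

    // Reverse edges: predecessors[j] = every i whose (unflipped) successor is j.
    val predecessors = Array(n + 1) { mutableListOf<Int>() }
    for (i in 0 until n) {
        val j = next(i, program[i].name)
        if (j in 0..n) predecessors[j] += i
    }

    // BFS backwards from the "terminated" node (index n): every index that can reach it.
    val terminates = BooleanArray(n + 1)
    terminates[n] = true
    val queue = ArrayDeque(listOf(n))
    while (queue.isNotEmpty()) {
        val j = queue.removeFirst()
        for (i in predecessors[j]) if (!terminates[i]) { terminates[i] = true; queue += i }
    }

    // Walk the original program; the first jmp/nop whose *flipped* successor terminates
    // is the one to change. After flipping once, just keep running to the end.
    var acc = 0
    var pc = 0
    var flipped = false
    while (pc < n) {
        val inst = program[pc]
        if (!flipped && inst.name != "acc") {
            val alt = next(pc, if (inst.name == "jmp") "nop" else "jmp")
            if (alt in 0..n && terminates[alt]) {
                flipped = true
                pc = alt
                continue
            }
        }
        when (inst.name) {
            "acc" -> { acc += inst.argument; pc++ }
            "jmp" -> pc += inst.argument
            else -> pc++
        }
    }
    return acc
}
The reverse BFS and the final walk each touch every instruction at most a constant number of times, so the whole thing is linear in the program size.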
t
My brute force:
Copy code
fun solvePart2(): Int =
        instructions
            .indices
            .asSequence()
            .mapNotNull { index -> instructions.flipIndexOrNull(index) }
            .mapNotNull { inst ->
                Computer(inst).run {
                    if (runUntilTerminate() == Computer.ExecutionState.TERMINATED) accumulator
                    else null
                }
            }.first()
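In case anyone wants to read that snippet standalone, the pieces it references could look roughly like this (a sketch only, not the actual `Computer`/`flipIndexOrNull` behind the code above):
Copy code
data class Instruction(val name: String, val argument: Int) {
    // "instruction, flip thine self!" - returns the flipped version, or null for acc.
    fun flipOrNull(): Instruction? = when (name) {
        "jmp" -> copy(name = "nop")
        "nop" -> copy(name = "jmp")
        else -> null
    }
}

// Copy of the program with the instruction at index flipped, or null if it's an acc.
fun List<Instruction>.flipIndexOrNull(index: Int): List<Instruction>? =
    this[index].flipOrNull()?.let { flippedInst -> toMutableList().also { it[index] = flippedInst } }

class Computer(private val program: List<Instruction>) {
    enum class ExecutionState { TERMINATED, LOOPED }

    var accumulator = 0
        private set

    // "run until you stop and tell me how you stopped"
    fun runUntilTerminate(): ExecutionState {
        val seen = mutableSetOf<Int>()
        var pc = 0
        while (pc < program.size) {
            if (!seen.add(pc)) return ExecutionState.LOOPED
            val inst = program[pc]
            when (inst.name) {
                "acc" -> { accumulator += inst.argument; pc++ }
                "jmp" -> pc += inst.argument
                else -> pc++
            }
        }
        return ExecutionState.TERMINATED
    }
}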
n
@Edgars I was actually wrong, it's almost certainly possible to do it in linear time, just the approach we were discussing isn't the way
Made another thread for it
e
I saw. Looks like way too much effort. Unless you want to run all AoC solutions in 9ms or something.
n
Yeah I took a stab at getting it to work, and my approach was wrong.
Trying to give up on it but the nagging voice in my head won't leave me alone, haha
e
Don't forget, all work and no play makes Jack a dull boy.
n
eh I'm more interested in chasing the algorithm than raw perf, in kotlin
reasoning about constants in performance in languages like kotlin, python, etc just hurts my brain 🙂
rewriting it in C++ (or Rust) is the best improvement
e
I'm writing and running equivalent benchmarks in 4 languages, so I think I've got a decent handle on the constant factors.
n
what ratios of constant factors are you typically seeing between these 4 languages?
e
obviously, it'll depend on the task. for AoC thus far it's roughly: Haskell ~2-8x slower than Kotlin/JVM (much worse when handling strings, due to the non-packed default representation; I know how to fix that but I'm lazy); Python ~3-10x slower than the JVM (Python 3.9 is faster than whatever version I was using last year); Rust about the same to 2x faster than the JVM
n
interesting
b
I have some ideas for a bit less brute-forcy approach (kind of backtracking, keeping state when jumping or nop-ing)
But my solution takes 42ms (including loading the file)
so meh
b
Day8Bench.naive                sample  6372  0.784 ± 0.004  ms/op
Day8Bench.naive:naive·p0.00    sample        0.685          ms/op
Day8Bench.naive:naive·p0.50    sample        0.766          ms/op
Day8Bench.naive:naive·p0.90    sample        0.825          ms/op
Day8Bench.naive:naive·p0.95    sample        0.911          ms/op
Day8Bench.naive:naive·p0.99    sample        1.332          ms/op
Day8Bench.naive:naive·p0.999   sample        1.606          ms/op
Day8Bench.naive:naive·p0.9999  sample        2.544          ms/op
Day8Bench.naive:naive·p1.00    sample        2.544          ms/op
that's much faster than what I thought
(this includes loading the file)
n
I wonder if JVM based programs have a bit of an unfair advantage in a microbenchmark situation like this
the JVM will be running continuously as the benchmark loops, probably never returning the memory used in the solution to the OS
b
well in real life you don't leave your JVM after each operation anyway
except if you do some kind of serverless stuff
n
Sure, but in real life non-naive C++ code would never allocate arrays in a hot path, for example
b
I can try by asking the JVM to do a GC after
n
it just makes me wonder what the best way to really compare apples to apples is
b
I mean I can check how long it takes to run it
Benchmark                      Mode    Cnt   Score ± Error   Units
Day8Bench.naive                sample  749   6.732 ± 0.312   ms/op
Day8Bench.naive:naive·p0.00    sample         4.801           ms/op
Day8Bench.naive:naive·p0.50    sample         5.603           ms/op
Day8Bench.naive:naive·p0.90    sample        11.731           ms/op
Day8Bench.naive:naive·p0.95    sample        12.362           ms/op
Day8Bench.naive:naive·p0.99    sample        15.827           ms/op
Day8Bench.naive:naive·p0.999   sample        21.856           ms/op
Day8Bench.naive:naive·p0.9999  sample        21.856           ms/op
Day8Bench.naive:naive·p1.00    sample        21.856           ms/op
that's with a GC cycle at each run
(this is part 1 and part 2 together)
e
Makes me wonder if I should gc after each benchmark as well then. Otherwise I get, like, 20k runs and 2ms for the whole thing.
b
well 6ms for the whole thing is not bad either 😄
and I didn't tweak the GC
also I'm storing a lot of useless data
this is absolutely not an optimized solution
e
Just wondering if running the same thing over and over again for a minute is a reasonable benchmark. I used to do just 5 runs, I bet I'd get, like, 20ms then. 😄
b
that's why you use jmh… to try to account for that
it did run it 749 times here for example
I could ask it to do more if I want
also the machine is pretty busy
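For anyone who hasn't used it, a JMH benchmark in Kotlin looks roughly like this (a sketch; the class name mirrors the Day8Bench output above, but the body is a placeholder rather than the real solver, and it's typically wired up through a Gradle plugin such as kotlinx-benchmark):
Copy code
import org.openjdk.jmh.annotations.*
import java.io.File
import java.util.concurrent.TimeUnit

@State(Scope.Benchmark)
@BenchmarkMode(Mode.SampleTime) // "sample" mode, matching the percentile output above
@OutputTimeUnit(TimeUnit.MILLISECONDS)
@Fork(1)
@Warmup(iterations = 3, time = 1)
@Measurement(iterations = 5, time = 5)
open class Day8Bench { // benchmark classes must be non-final, hence open (or use the allopen plugin)

    @Benchmark
    fun naive(): Int {
        // The file read stays inside the measured body, since the numbers above include it.
        // The sum is just a stand-in for "parse the program and run part 1 + part 2".
        val lines = File("input8.txt").readLines()
        return lines.sumOf { it.length }
    }
}
JMH handles the warmup and measurement iterations and reports the percentiles, which is what accounts for the run-to-run variance being discussed here.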
e
TIL. Will check that out.
b
Copy code
% time ./gradlew run                                                                                                                                                                       
> Task :run
(1859, 1235)

BUILD SUCCESSFUL in 528ms
2 actionable tasks: 1 executed, 1 up-to-date
./gradlew run  0.84s user 0.05s system 98% cpu 0.902 total
that includes… gradle
Copy code
time ./bin/advent-of-code                                                                                                                                                             
(1859, 1235)
./bin/advent-of-code  0.17s user 0.04s system 127% cpu 0.170 total
After clearing all the caches (on Linux, dropping caches of types 1, 2 and 3):
Copy code
time ./bin/advent-of-code                                                                                                                                                             
(1859, 1235)
./bin/advent-of-code  0.15s user 0.03s system 105% cpu 0.179 total
so yeah there is some JVM start time overhead, but really that's not as bad as people say
export JAVA_OPTS="-XX:+UnlockExperimentalVMOptions -XX:+UseZGC -Xmx5m -Xlog:gc" ; time ./bin/advent-of-code
hah, that even works with -Xmx5m
and it is not that slow either
I'm trying with aotc
that may help even more; I already gained 20ms by using -Xshare:dump and -Xshare:on
won 20ms more with AOTC
n
yeah these startup times are really not bad
makes me think that command line utilities in kotlin (even JVM) could be a decent choice
b
yes
they absolutely are
and these are full run times 😄
n
it's funny because if you exclude the JVM there aren't really any other obvious choices for command line utilities in linux. python is super slow, Go is super.... well, Go.
And C/C++/Rust are overkill 99% of the time
b
I like rust
but I found that I write Kotlin so much faster than Rust
and that it is fast enough for 99.9% of my problems
jaotc is heavy :D
Copy code
export JAVA_OPTS="-Xshare:on -XX:SharedArchiveFile=classes.jsa  -XX:+UnlockExperimentalVMOptions -XX:AOTLibrary=./java_base.so " ; time ./bin/advent-of-code                          
Helloworld
./bin/advent-of-code  0.03s user 0.02s system 106% cpu 0.049 total
just displaying hello world
so there is quite an overhead still
Copy code
/tmp/o  0.00s user 0.00s system 77% cpu 0.002 total
a C hello world
Copy code
python /tmp/o.py  0.02s user 0.01s system 96% cpu 0.021 total
python starts faster than I thought
n
yeah not having GC is a big productivity penalty
b
(I'm doing 10 runs, not clearing caches, and showing lowest result)
I disabled the GC in Java to see if it helped
doesn't change anything in terms of speed
(using the Epsilon GC)
n
said as someone who's been writing C++ professionally for close to a decade. I'd never opt for a no-GC language in an application where I could tolerate a GC.
which is practically all of them
b
so it is not the GC that slows that program down
I think it has to do with class loading
I can reach 70ms on the command line
with Kotlin and AOT
142ms fresh start
that's still pretty bad
40ms for the hello world, that's pretty close to python 😄
it is heavy on processing though…
that would work for a command, but painful for a script
n
I finally fully understood an O(N) solution
was not easy for me
e
Copy code
$ cc -nostdlib -ohello hello.S
$ time ./hello
Hello, world!
0.00user 0.00system 0:00.00elapsed 66%CPU (0avgtext+0avgdata 344maxresident)k
0inputs+0outputs (0major+27minor)pagefaults 0swaps
b
With caches cleared: Python 56 ms, Kotlin (AOTC and all) 80 ms
also I'm using perf now to get real stats
That's the stats of today's parts1 and 2:
Copy code
114.79 msec task-clock                #    1.517 CPUs utilized            ( +-  2.38% )
               430      context-switches          #    0.004 M/sec                    ( +-  0.73% )
                12      cpu-migrations            #    0.104 K/sec                    ( +-  3.51% )
             6,018      page-faults               #    0.052 M/sec                    ( +-  0.29% )
       457,879,005      cycles                    #    3.989 GHz                      ( +-  0.51% )
       499,379,616      instructions              #    1.09  insn per cycle           ( +-  0.17% )
        96,479,627      branches                  #  840.454 M/sec                    ( +-  0.17% )
         4,003,302      branch-misses             #    4.15% of all branches          ( +-  0.24% )

           0.07567 +- 0.00199 seconds time elapsed  ( +-  2.63% )
the JVM is FAST