`a mapIndexed index value gt value to myList index toMap ` kotlinlang #announcements

`a.mapIndexed { index, value -> value to myList...

trevjones

11/21/2019, 6:05 PM

a.mapIndexed { index, value -> value to myList[index] }.toMap()

Hullaballoonatic

11/21/2019, 6:05 PM

this also creates an extra interim list of pairs

trevjones

11/21/2019, 6:06 PM

yup. probably need to roll your own if that intermediate collections/pairs are going to be an issue

Hullaballoonatic

11/21/2019, 6:06 PM

it's also identical to

a.zip(b)

trevjones

11/21/2019, 6:08 PM

maybe could do it as a sequence and wrap all call to associate to track the index?

Hullaballoonatic

11/21/2019, 6:08 PM

sequences inherently have additional collection creation overhead. i'm hoping they get an update to be optimized for equal performance with collections

👌 1

karelpeeters

11/21/2019, 7:28 PM

a.withIndex().associateBy({ (_, v) -> v }, { (i, _) -> b[i] })

looks terrible but doesn't have any overhead.

trevjones

11/21/2019, 7:38 PM

I like that:

Copy code

inline fun <K, V> Iterable<K>.associateByIndex(valueSelector: (index: Int) -> V): Map<K, V> {
    return withIndex().associateBy({ (_, k) -> k}, { (i, _) -> valueSelector(i) })
}

val a = listOf(1,2,3)
val b = listOf("one", "two", "three")
val c = a.associateByIndex { b[it] }
println(c.toString())

karelpeeters

11/21/2019, 7:39 PM

This should be called

associateWith

, no? Also, are you sure

crossinline

is necessary? It kind of defeats the purpose of

inline

trevjones

11/21/2019, 8:02 PM

it is associating a key with a value via the index, not to the index? and you are right no need for a crossinline.

trevjones

11/21/2019, 8:12 PM

allocations is roughly

2N+3

where N is the size of the IterableReceiver assuming the valueSelector doesn't allocate

Hullaballoonatic

11/21/2019, 8:47 PM

it would be called

associateWithIndexed

according to stdlib naming convention if including the indices in the params of the transform

Arkady Bazhanov

11/21/2019, 8:47 PM

another possible solution would be:

Copy code

fun <K, V> Iterable<K>.associateWith(values: Iterable<V>): Map<K, V> {
    return values.iterator().run {
        associateWith { next() }
    }
}

Hullaballoonatic

11/21/2019, 8:47 PM

I was almost certain

withIndex

had overhead...

trevjones

11/21/2019, 8:48 PM

allocates an iterator and indexed value per item in the iterable. so it is N+1 allocations

Arkady Bazhanov

11/21/2019, 8:51 PM

I guess my solution has no such overhead

karelpeeters

11/21/2019, 9:01 PM

If allocating an iterator is a problem you're definitely using the wrong language.

😂 3

karelpeeters

11/21/2019, 9:43 PM

A JMH benchmark: https://pastebin.com/raw/VViiXKYF Results:

Copy code

Benchmark                              Mode  Cnt     Score     Error  Units
MapBenchmarks.iterator                thrpt   10  5095.020 ± 105.089  ops/s
MapBenchmarks.manualAlloc             thrpt   10  4190.481 ± 209.660  ops/s
MapBenchmarks.manual                  thrpt   10  2686.919 ±  60.247  ops/s
MapBenchmarks.mapIndexed              thrpt   10  3820.297 ±  34.474  ops/s
MapBenchmarks.sequenceZip             thrpt   10  2415.201 ±  60.417  ops/s
MapBenchmarks.withIndexAssociate      thrpt   10  2360.287 ±  43.676  ops/s
MapBenchmarks.withIndexAssociateByTo  thrpt   10  2311.393 ±  19.049  ops/s
MapBenchmarks.zip                     thrpt   10  4021.002 ± 127.191  ops/s

I can't really think of a reason why

manualAlloc

is slower than

iterator

, but I've spent too much time on this already 🙂

Hullaballoonatic

11/21/2019, 10:36 PM

hahahahaha. you're the best, Karel

Hullaballoonatic

11/21/2019, 10:37 PM

since it was the best version, mind posting the code for that one?

Hullaballoonatic

11/21/2019, 10:37 PM

copy/paste instead of thinking is my specialty!

karelpeeters

11/21/2019, 10:38 PM

All of the code is on pastebin.

Hullaballoonatic

11/21/2019, 10:41 PM

oh 😛

Hullaballoonatic

11/21/2019, 10:41 PM

somehow missed that. too interested in the benchmarks

trevjones

11/21/2019, 10:41 PM

perhaps manualAlloc requires more hash calculations than iterator. but yea can't see it either

karelpeeters

11/21/2019, 10:44 PM

But they both just call

Map.put

for each element.

Hullaballoonatic

11/21/2019, 10:44 PM

What prize does @Arkady Bazhanov win?

karelpeeters

11/21/2019, 10:45 PM

He has to explain why it's faster first!

Hullaballoonatic

11/21/2019, 10:45 PM

🎁

Hullaballoonatic

11/21/2019, 10:48 PM

maybe somehow it's the request of the indices.

Hullaballoonatic

11/21/2019, 10:48 PM

or probably the memory reallocation involved with the mutablelist

Hullaballoonatic

11/21/2019, 10:49 PM

iterator approach might set aside the exact block of memory from outset

Hullaballoonatic

11/21/2019, 10:49 PM

🤷‍♂️

karelpeeters

11/21/2019, 10:49 PM

for (i in listA.indiches)

gets to compiled to a java-style

int size = listA.size; for(int i = 0; i < size; i++)

so that should be fine.

karelpeeters

11/21/2019, 10:50 PM

And I preallocate the map manually as well so that should be fine too.

karelpeeters

11/21/2019, 10:50 PM

I'll actually profile tomorrow.

trevjones

11/21/2019, 10:50 PM

List(10_000) { Random.nextInt() }

ends up backed by ArrayList so those lookups should be O(1) cheap. my thinking is JVM impl is crazy good with iterators?

Hullaballoonatic

11/21/2019, 10:50 PM

does mutableMap not have to recallocate a new block after reaching a certain size?

trevjones

11/21/2019, 10:50 PM

oh hey what if

trevjones

11/21/2019, 10:51 PM

well jmh should even it out. but maybe it was more hash collisions in one vs the next?

karelpeeters

11/21/2019, 10:51 PM

@Hullaballoonatic Yes, but

manualAlloc

preallocates the right size.

karelpeeters

11/21/2019, 10:52 PM

@trevjones I ran everything twice, and got similar results.

Hullaballoonatic

11/21/2019, 10:52 PM

How does it perform with non-random numbers?

trevjones

11/21/2019, 10:52 PM

I figured as much, but the psuedo random could throw results in a single run.

karelpeeters

11/21/2019, 11:15 PM

Non-random numbers:

Copy code

MapBenchmarks.iterator                thrpt   10  6338.874 � 1001.794  ops/s
MapBenchmarks.manual                  thrpt   10  6469.501 �   79.655  ops/s
MapBenchmarks.manualAlloc             thrpt   10  7929.022 �  107.842  ops/s
MapBenchmarks.mapIndexed              thrpt   10  4913.874 �   63.817  ops/s
MapBenchmarks.sequenceZip             thrpt   10  5423.528 �  329.522  ops/s
MapBenchmarks.withIndexAssociate      thrpt   10  5299.546 �   98.017  ops/s
MapBenchmarks.withIndexAssociateByTo  thrpt   10  4802.971 �   98.678  ops/s
MapBenchmarks.zip                     thrpt   10  4801.796 �  337.945  ops/s

Random numbers again:

Copy code

MapBenchmarks.iterator                thrpt   10  5128.317 � 151.363  ops/s
MapBenchmarks.manual                  thrpt   10  2673.094 �  27.443  ops/s
MapBenchmarks.manualAlloc             thrpt   10  5484.122 � 123.091  ops/s
MapBenchmarks.mapIndexed              thrpt   10  3889.280 �  38.294  ops/s
MapBenchmarks.sequenceZip             thrpt   10  2499.712 �  24.050  ops/s
MapBenchmarks.withIndexAssociate      thrpt   10  2383.809 �  26.758  ops/s
MapBenchmarks.withIndexAssociateByTo  thrpt   10  2353.679 �  62.273  ops/s
MapBenchmarks.zip                     thrpt   10  4128.566 � 105.939  ops/s

Now

manualAlloc

is beating everything as I would expect... Typical benchmarking 🙂

👏 1

ilya.gorbunov

11/22/2019, 2:55 AM

@karelpeeters the difference between

iterator

and

manualAlloc

can be in the capacity of the map:

manualAlloc

uses the exact collection size, and

associateWith

(in

iterator

) takes in account the default map fill factor of 0.75, so it uses the capacity equal to 4/3 times of the size.

ilya.gorbunov

11/22/2019, 2:59 AM

We have had a request for such function and even considered it for 1.3, but haven't found a good name for that, and

assocateWith

has been already taken by the overload with value selector. https://youtrack.jetbrains.com/issue/KT-21587

karelpeeters

11/22/2019, 1:35 PM

Ah right, completely forgot about map fill factors. Thanks!

Open in Slack

Previous Next