Is there a nice way to define a list of extension functions kotlinlang #announcements


nkiesel

03/15/2021, 5:11 PM

Is there a nice way to define a list of extension functions? Background: in one of our user group discussions we discussed various ways to code "replace or add" for lists. We then wanted to test all of the proposed solutions using a `listOfProposals.forEach { f -> ... }` but could not find a way to define such a list. The best I have right now is to use a list of non-generic anonymous functions, where each item calls one of the proposed solutions. But that does not look "nice":

fun <T> List<T>.replaceOrAdd1(item: T, predicate: (T) -> Boolean): List<T> { ... }
fun <T> List<T>.replaceOrAdd2(item: T, predicate: (T) -> Boolean): List<T> { ... }
data class Person(val name: String, val age: Int, val email: String, val address: String = "")
val listOfProposals = listOf(
    fun(list: List<Person>, person: Person, predicate: (Person) -> Boolean) = list.replaceOrAdd1(person, predicate),
    fun(list: List<Person>, person: Person, predicate: (Person) -> Boolean) = list.replaceOrAdd2(person, predicate),
)
// <https://pl.kotl.in/GvbQWnFEq> is the link to the working example

Nir

03/15/2021, 5:20 PM

You can write

val functions = listOf(
    List<Person>::replaceOrAdd1,
    List<Person>::replaceOrAdd2,
    List<Person>::replaceOrAdd3,
    List<Person>::replaceOrAdd4,
)
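To make the suggestion above concrete, here is a minimal sketch of such a list being exercised. The two function bodies are hypothetical stand-ins (the thread elides the real proposals with `{ ... }`), `Person` is trimmed to two fields, and `runProposals` is a helper name invented for this illustration.

```kotlin
data class Person(val name: String, val age: Int)

// Hypothetical stand-in: find the first match, then copy and overwrite, or append.
fun <T> List<T>.replaceOrAdd1(item: T, predicate: (T) -> Boolean): List<T> {
    val index = indexOfFirst(predicate)
    return if (index < 0) this + item else toMutableList().also { it[index] = item }
}

// Hypothetical stand-in: a single imperative pass that replaces only the first match.
fun <T> List<T>.replaceOrAdd2(item: T, predicate: (T) -> Boolean): List<T> {
    val result = ArrayList<T>(size + 1)
    var replaced = false
    for (element in this) {
        if (!replaced && predicate(element)) {
            result.add(item)
            replaced = true
        } else {
            result.add(element)
        }
    }
    if (!replaced) result.add(item)
    return result
}

// Spelling out the receiver type List<Person> lets the compiler pick T = Person.
val proposals = listOf(
    List<Person>::replaceOrAdd1,
    List<Person>::replaceOrAdd2,
)

// Runs every proposal with the same input; each reference is called like a
// three-argument function (receiver first, then item, then predicate).
fun runProposals(people: List<Person>, person: Person): List<List<Person>> =
    proposals.map { f -> f(people, person) { it.name == person.name } }
```

With `people = listOf(Person("Ann", 30), Person("Bob", 41))` and `Person("Bob", 42)`, both proposals should yield the same list with Bob's entry replaced.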

Nir

03/15/2021, 5:21 PM

seems to compile without changing anything else

Nir

03/15/2021, 5:21 PM

is that the improvement you are looking for? Not completely certain what the part is that you think isn't nice

Nir

03/15/2021, 5:22 PM

I guess you are also concerned that it is not generic?

nkiesel

03/15/2021, 5:23 PM

That was my first idea but does not compile for me. Which Kotlin version and backend?

Youssef Shoaib [MOD]

03/15/2021, 5:23 PM

Here's a neater version based on the code that you provided: https://pl.kotl.in/x6z-_JUPe

Nir

03/15/2021, 5:23 PM

i just changed it at the link you provided me

Nir

03/15/2021, 5:23 PM

and it compiles

Nir

03/15/2021, 5:23 PM

Also, if you want to make it generic, simply make it a function instead of a variable

Nir

03/15/2021, 5:23 PM

inline fun <reified T> functions() = listOf(
    List<T>::replaceOrAdd1,
    List<T>::replaceOrAdd2,
    List<T>::replaceOrAdd3,
    List<T>::replaceOrAdd4,
)

Youssef Shoaib [MOD]

03/15/2021, 5:24 PM

It is basically what @Nir suggested but for some reason including the List type makes it compile

nkiesel

03/15/2021, 5:24 PM

I get "replaceOrAdd1 is a member and an extension at the same time. References to such elements are not allowed"

Youssef Shoaib [MOD]

03/15/2021, 5:25 PM

is replaceOrAdd1 inside of a different class? as in is it a member of a class?

Nir

03/15/2021, 5:25 PM

https://pl.kotl.in/RracLWi5N

nkiesel

03/15/2021, 5:26 PM

hmm. that works for me as well. But my "scratch file in IntelliJ" gave me that error I posted above.

nkiesel

03/15/2021, 5:32 PM

"using the List type" was the missing part of my attempts. Perhaps I have too much trust in Kotlin type inference because I did not even try that. I also like the `inline fun <reified T> functions()` approach

nkiesel

03/15/2021, 5:34 PM

btw: which of the 4 implementations would get your vote? As a reference, our lists will at most contain a few dozen entries, so elegance and understandability rate a bit higher than performance for me here.

Nir

03/15/2021, 5:34 PM

yeah, the inline approach is the only way to keep it generic, which IMHO is pretty useful, you may want to test on different types and there's no real downside

Nir

03/15/2021, 5:36 PM

For readability + reasonable performance, I would say that 4 is probably the best

Nir

03/15/2021, 5:36 PM

It's more efficient than 1, and about equally readable

Nir

03/15/2021, 5:37 PM

2 is, IMHO, just bad

nkiesel

03/15/2021, 5:37 PM

I just tried the solutions in my scratch file and I still get the same error. Full disclosure: the scratch file does not have the `fun main()` wrapper so that I can execute the code. But that seems to have side effects. I just started using these scratch files and perhaps do not yet understand their limitations
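A plausible explanation, offered as an assumption since the thread never confirms it: scratch files are compiled as Kotlin scripts, where top-level declarations become members of the script class. An extension function declared there is then both a member and an extension, and Kotlin does not allow callable references to those. A sketch of a workaround that avoids callable references by wrapping each call in a lambda with an explicit function type (the two function bodies are hypothetical stand-ins):

```kotlin
data class Person(val name: String, val age: Int)

// Hypothetical stand-ins for the proposals under test.
fun <T> List<T>.replaceOrAdd1(item: T, predicate: (T) -> Boolean): List<T> {
    val index = indexOfFirst(predicate)
    return if (index < 0) this + item else toMutableList().also { it[index] = item }
}

fun <T> List<T>.replaceOrAdd2(item: T, predicate: (T) -> Boolean): List<T> =
    takeWhile { !predicate(it) } + item + dropWhile { !predicate(it) }.drop(1)

// The explicit element type lets the compiler infer each lambda's parameters,
// so no `::` reference to an extension function is needed.
val lambdaProposals: List<(List<Person>, Person, (Person) -> Boolean) -> List<Person>> = listOf(
    { list, item, predicate -> list.replaceOrAdd1(item, predicate) },
    { list, item, predicate -> list.replaceOrAdd2(item, predicate) },
)
```

This is close to the anonymous-function list from the original question, only shorter, so it trades some of the "niceness" for working in more contexts.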

Nir

03/15/2021, 5:38 PM

2 is both harder to read, and it's actually pretty inefficient (fully copy the list twice if replacing does not occur)

Nir

03/15/2021, 5:39 PM

4 is probably the most efficient, I think. But it would be a big difference.

Nir

03/15/2021, 5:39 PM

not be a big difference sorry

Nir

03/15/2021, 5:39 PM

Ugh sorry, 3 is the most efficient

nkiesel

03/15/2021, 5:39 PM

#2: yes, looked a tiny bit better until it was realized that w/o the `notReplaced` in the loop it would perform a "replaceAll" instead of "replaceFirst"

Nir

03/15/2021, 5:39 PM

getting numbers mixed up

Nir

03/15/2021, 5:40 PM

basically, only 3 and 4 are even worth considering here

Nir

03/15/2021, 5:40 PM

4 being a bit more readable and concise, 3 being a bit more efficient, I'd choose 4.

Nir

03/15/2021, 5:40 PM

1 and 2 are dominated by other solutions

nkiesel

03/15/2021, 5:41 PM

3 and 4 are essentially identical from performance POV, no?

Nir

03/15/2021, 5:41 PM

not quite

nkiesel

03/15/2021, 5:41 PM

I like 4 better just because it's easier to understand for me

Nir

03/15/2021, 5:41 PM

yeah, 4 is the most readable for sure.

Nir

03/15/2021, 5:41 PM

4 is two-pass

Nir

03/15/2021, 5:41 PM

3 is, most likely, one pass

nkiesel

03/15/2021, 5:42 PM

yes, agreed

Nir

03/15/2021, 5:43 PM

but, I dunno, once the JVM is done with it, I'm on very thin ice to speculate which will be faster, if there will even be a difference... in C++ for a large vector (list), I'd be pretty confident that 3 would be fastest.

Nir

03/15/2021, 5:43 PM

3 can be ensured to be faster by reserving enough space beforehand

Nir

03/15/2021, 5:43 PM

if you use arrayList directly

nkiesel

03/15/2021, 5:43 PM

yeah. but 3 really looks like "trying too hard" for me.

Nir

03/15/2021, 5:43 PM

but we're really getting into the weeds for questionable benefit here. So I'd just stick to 4.

Nir

03/15/2021, 5:44 PM

It's trying too hard for the level of benefit

nkiesel

03/15/2021, 5:44 PM

🙂

Nir

03/15/2021, 5:44 PM

mutable data structures are definitely necessary sometimes, I wouldn't allow something to become N^2 instead of N just to avoid a local mutable variable

Nir

03/15/2021, 5:44 PM

but here it's all just linear anyway, one extra pass, who cares

nkiesel

03/15/2021, 5:45 PM

yup. thx for your feedback

Nir

03/15/2021, 5:45 PM

👍

Nir

03/15/2021, 5:45 PM

minor comment, if you supply 4 functions like that, I'd really suggest doing the 1-2-3-4 in written order

Nir

03/15/2021, 5:46 PM

lol, I kept getting confused because my brain just assumed that, that's why I kept saying the wrong number before

ephemient

03/15/2021, 5:46 PM

I'd go for something like 4 but with `buildList`

nkiesel

03/15/2021, 5:46 PM

yeah, should have cleaned it up a bit more.

nkiesel

03/15/2021, 5:47 PM

was collected from a larger discussion chat

ephemient

03/15/2021, 5:49 PM

fun <T> List<T>.replaceOrAdd(replacement: T, predicate: (T) -> Boolean): List<T> = buildList {
    var replaced = false
    for (item in this@replaceOrAdd) {
        if (!replaced && predicate(item)) {
            add(replacement)
            replaced = true
        } else {
            add(item)
        }
    }
    if (!replaced) {
        add(replacement)
    }
}

ephemient

03/15/2021, 5:49 PM

not the shortest, but to me I'd rather write imperative code than have a mix of mutability and functional code

nkiesel

03/15/2021, 5:51 PM

needs a

in the last if... never mind, you just corrected

Nir

03/15/2021, 5:52 PM

eh, I dunno, to me it seems very natural to first establish whether you'll be replacing, or adding, as it leads to two very different approaches

ephemient

03/15/2021, 5:52 PM

actually does make me think of trickier solution though:

fun <T> List<T>.replaceOrAdd(replacement: T, predicate: (T) -> Boolean): List<T> = buildList {
    val iterator = this@replaceOrAdd.iterator()
    for (item in iterator) {
        if (predicate(item)) break
        add(item)
    }
    add(replacement)
    for (item in iterator) add(item)
}

nkiesel

03/15/2021, 5:53 PM

def. scores high on the "tricky" scale

nkiesel

03/15/2021, 5:54 PM

but neat

Nir

03/15/2021, 5:54 PM

if you like to work with lists and create new data structures a lot (as this code seems to indicate) then you may also want to add a `replaceAt` function

Nir

03/15/2021, 5:54 PM

List<T>.replaceAt(index: Int, value: T): List<T>

Nir

03/15/2021, 5:55 PM

basically the non-mutating version of []
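A minimal sketch of what such a `replaceAt` could look like, assuming the signature given above. The body is one possible implementation, not something posted in the thread.

```kotlin
// Returns a copy of the list with the element at `index` replaced by `value`.
// The receiver is left untouched; an out-of-range index throws
// IndexOutOfBoundsException, just like indexed assignment on a MutableList.
fun <T> List<T>.replaceAt(index: Int, value: T): List<T> =
    toMutableList().also { it[index] = value }
```

For example, `listOf(1, 2, 3).replaceAt(1, 9)` gives `[1, 9, 3]` while the original list still holds `[1, 2, 3]`.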

nkiesel

03/15/2021, 5:55 PM

one of my team members brought that up because he ran into such a "replace or add" situation. Don't think it's very common

Nir

03/15/2021, 5:55 PM

maybe not, but to be fair, this `replaceOrAdd` function is arguably quite a bit more natural (and efficient) on a mutable list to start with
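As an illustration of that point, here is a sketch of an in-place variant on `MutableList`. The function is an assumption made for this example and does not appear in the thread.

```kotlin
// In-place "replace or add": overwrites the first matching element,
// or appends the item when nothing matches. No copy of the list is made.
fun <T> MutableList<T>.replaceOrAdd(item: T, predicate: (T) -> Boolean) {
    val index = indexOfFirst(predicate)
    if (index >= 0) {
        this[index] = item
    } else {
        add(item)
    }
}
```

The whole operation is one search plus one write, which is why the immutable versions need more ceremony to achieve the same result.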

Joel

03/15/2021, 6:32 PM

@ephemient I think that will always add the replacement even if the predicate returns false for every element. Not 100% sure if that's desirable as I only briefly skimmed the prior 66 messages. 🙂

nkiesel

03/15/2021, 6:33 PM

yes, that is the idea: either replace (if matching), or else add at the end of the list. Kind of an "upsert"

👍 1
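The "upsert" semantics described here can be pinned down with three cases: one match, several matches (only the first is replaced), and no match (the item is appended). A minimal sketch, using a two-pass `indexOfFirst` implementation as an assumed stand-in, since the thread does not show proposals 1 to 4:

```kotlin
// Assumed stand-in implementation: locate the first match, then copy and overwrite or append.
fun <T> List<T>.replaceOrAdd(item: T, predicate: (T) -> Boolean): List<T> {
    val index = indexOfFirst(predicate)
    return if (index < 0) this + item else toMutableList().also { it[index] = item }
}

val oneMatch = listOf("a", "b", "c").replaceOrAdd("X") { it == "b" }   // [a, X, c]
val firstOnly = listOf("a", "b", "b").replaceOrAdd("X") { it == "b" }  // [a, X, b]
val noMatch = listOf("a", "b").replaceOrAdd("X") { it == "z" }         // [a, b, X]
```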

Joel

03/15/2021, 6:37 PM

fun <T> List<T>.replaceOrAdd(item: T, predicate: (T) -> Boolean): List<T> {
    val left = takeWhile { !predicate(it) }
    return left + item + (this - left).drop(1)
}

I submit my readable but untested slightly suboptimal version.

Nir

03/15/2021, 6:39 PM

maybe I'm misreading minus but that seems incorrect

ephemient

03/15/2021, 6:40 PM

sure, inefficient one-liner:

takeWhile { !predicate(it) } + item + dropWhile { !predicate(it) }.drop(1)

👆 1

Nir

03/15/2021, 6:40 PM

I'm probably liable to steer clear of anything involving List - List

Joel

03/15/2021, 6:41 PM

Yep that's what I was thinking. For optimal I like the `buildList` version you proposed.

Nir

03/15/2021, 6:41 PM

it seems like List - element and List - List even differ significantly in unexpected ways. List - element only removes the first occurrence. List - List removes all occurrences.

nkiesel

03/15/2021, 6:43 PM

I read through these `takeWhile` versions 3 times now and still can't confirm their correctness in my head. E.g., I wonder if this really implements the "only replace the first match" condition (which actually was never really specified, but all the other implementations do that)

ephemient

03/15/2021, 6:44 PM

mine does, @Joel’s doesn't
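One way to check the two `takeWhile` versions is to run them on a list with a duplicate before the match. The sketch below copies both bodies from the thread; the function names `replaceOrAddMinus` and `replaceOrAddDropWhile` are invented here to tell them apart.

```kotlin
// Version built on List - List, as posted.
fun <T> List<T>.replaceOrAddMinus(item: T, predicate: (T) -> Boolean): List<T> {
    val left = takeWhile { !predicate(it) }
    return left + item + (this - left).drop(1)
}

// One-liner built on takeWhile / dropWhile, as posted.
fun <T> List<T>.replaceOrAddDropWhile(item: T, predicate: (T) -> Boolean): List<T> =
    takeWhile { !predicate(it) } + item + dropWhile { !predicate(it) }.drop(1)

val input = listOf(1, 2, 1, 3)

// left = [1]; input - left drops every 1, leaving [2, 3]; drop(1) leaves [3].
val viaMinus = input.replaceOrAddMinus(9) { it == 2 }          // [1, 9, 3]

// dropWhile leaves [2, 1, 3]; drop(1) removes only the matched element.
val viaDropWhile = input.replaceOrAddDropWhile(9) { it == 2 }  // [1, 9, 1, 3]
```

The `minus` version silently loses the second `1`, because `List - List` removes every occurrence of the elements in `left`, not just the prefix.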

Joel

03/15/2021, 6:44 PM

Where's the javadoc when you need one??

nkiesel

03/15/2021, 6:44 PM

in the graveyard, with KDoc dancing on its grave...

ephemient

03/15/2021, 6:45 PM

https://kotlinlang.org/api/latest/jvm/stdlib/kotlin.collections/minus.html have to look at the right overload... as @Nir says, the behavior is different between .minus(element) and .minus(elements)

Nir

03/15/2021, 6:46 PM

IMHO overloading minus for List - element is borderline; the List - List one, though, is really not too cool

Nir

03/15/2021, 6:46 PM

I can't recall ever seeing that in another language

Joel

03/15/2021, 6:46 PM

Does List.remove take out duplicates?

Nir

03/15/2021, 6:47 PM

You mean duplicates in the List, that aren't on the list of removals? I don't think so.

Joel

03/15/2021, 6:47 PM

mutableListOf(1,2,3,2,1).remove(2)

ephemient

03/15/2021, 6:47 PM

well, it's that there isn't a particular overload for .minus(elements: List<T>), it's for .minus(elements: Iterable<T>)

Joel

03/15/2021, 6:48 PM

Alright fine throw my solution in the trash, see if I care

Joel

03/15/2021, 6:49 PM

It's suboptimal anyways 🤣

ephemient

03/15/2021, 6:51 PM

now, of course the latter could be implemented as

fun <T> Iterable<T>.minus(elements: Iterable<T>): List<T> {
    val remainingElementsToRemove = elements.toMutableSet()
    return this.filterNot { remainingElementsToRemove.remove(it) }
}

and it would have the "remove only once" behavior

ephemient

03/15/2021, 6:51 PM

but it's not, it doesn't convert a Set to a MutableSet, so it doesn't keep track of how many times it has removed an element

ephemient

03/15/2021, 6:51 PM

thus, it removes all repetitions

ephemient

03/15/2021, 6:52 PM

.minus(element) = .remove(element), .minus(elements) = .removeAll(elements)

Joel

03/15/2021, 6:53 PM

The default implementation of minus does not, correct?

Joel

03/15/2021, 6:54 PM

I like that minus implementation, need to write that one down somewhere

ephemient

03/15/2021, 6:54 PM

@Joel not sure what you mean.

listOf(1, 2, 3, 1) - 1 == listOf(2, 3, 1); listOf(1, 2, 3, 1) - listOf(1) == listOf(2, 3)

ephemient

03/15/2021, 6:54 PM

that's what's in stdlib
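The stdlib behavior discussed here, together with the `remove` / `removeAll` analogy from earlier in the thread, can be checked with a short sketch. The variable names are chosen for illustration only.

```kotlin
val numbers = listOf(1, 2, 3, 1)

// List - element removes only the first occurrence.
val minusElement = numbers - 1           // [2, 3, 1]

// List - Iterable removes every occurrence of each listed element.
val minusList = numbers - listOf(1)      // [2, 3]

// The mutable counterparts behave the same way.
val afterRemove = mutableListOf(1, 2, 3, 2, 1).apply { remove(2) }               // [1, 3, 2, 1]
val afterRemoveAll = mutableListOf(1, 2, 3, 2, 1).apply { removeAll(listOf(2)) } // [1, 3, 1]
```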

Joel

03/15/2021, 6:54 PM

Yep

ephemient

03/15/2021, 6:54 PM

the one I wrote is not

Joel

03/15/2021, 6:55 PM

It would return `listOf(2,3,1)`

ephemient

03/15/2021, 6:55 PM

stdlib has a set optimization which mine doesn't

ephemient

03/15/2021, 6:55 PM

hence it has different behavior

Joel

03/15/2021, 6:55 PM

Well, that assumes that `remainingElementsToRemove.remove(it)` only removes once

Joel

03/15/2021, 6:55 PM

And breaks early if it finds the element

ephemient

03/15/2021, 6:55 PM

well, if you convert it to a set there is only one element to remove

ephemient

03/15/2021, 6:56 PM

if you wanted `listOf(1, 1, 1) - listOf(1, 1) == listOf(1)`, well mine doesn't do that. you'd need a multiset (or map) to keep track.
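A sketch of the map-based bookkeeping mentioned here, assuming "remove each listed element as many times as it is listed" is the desired behavior. The name `minusCounted` is hypothetical and this is not how the stdlib `minus` works.

```kotlin
// Multiset-style subtraction: every occurrence in `elements` cancels
// at most one occurrence in the receiver, scanning left to right.
fun <T> Iterable<T>.minusCounted(elements: Iterable<T>): List<T> {
    // How many more times each element may still be removed.
    val remaining = elements.groupingBy { it }.eachCount().toMutableMap()
    return filterNot { item ->
        val count = remaining[item] ?: 0
        if (count > 0) {
            remaining[item] = count - 1
            true   // consume one pending removal and drop this item
        } else {
            false  // nothing left to remove for this item, keep it
        }
    }
}
```

With this, `listOf(1, 1, 1).minusCounted(listOf(1, 1))` gives `[1]`, and `listOf(1, 2, 3, 1).minusCounted(listOf(1))` gives `[2, 3, 1]`.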

ephemient

03/15/2021, 6:57 PM

Well, that assumes that
remainingElementsToRemove.remove(it)
only removes once

And breaks early if it finds the element

I don't understand what you mean by that

Nir

03/15/2021, 6:57 PM

if the stdlib uses a set, doesn't it remove all repetitions that are unrelated to the elements that are requested for removal?

Nir

03/15/2021, 6:57 PM

That seems pretty surprising

ephemient

03/15/2021, 6:58 PM

it uses a set on the `elements` argument, not `this`

Nir

03/15/2021, 6:58 PM

Ah I see

ephemient

03/15/2021, 6:58 PM

internal implementation detail, it could just not convert and perform the `in`/`.contains` query each time, to the same effect (except for runtime)

Nir

03/15/2021, 6:59 PM

I am open to seeing examples otherwise, but list - list feels really smelly, like anytime you end up using it, there's some earlier issue with your data structures

ephemient

03/15/2021, 7:00 PM

Set - Iterable makes sense, but I tend to agree - Iterable - Iterable is less good. but it's the same implementation either way, so I guess they just generalized it...

Joel

03/15/2021, 7:03 PM

I think there are a number of inconsistencies. If it's using a set under the hood, then the elements have to map to the same hashcode (and equals? I always forget). If they map to the same hashcode (and equals) then it could be argued that the caller does want to remove all instances of the element because it's the "same" element. So what you deem incorrect may be totally correct for someone else's use case.

ephemient

03/15/2021, 7:06 PM

if your elements have a hashCode() that is inconsistent with equals() they're breaking the Object contract in all sorts of other ways, we can disregard that as a possibility

ephemient

03/15/2021, 7:06 PM

using hashCode() is meant to be an optimization over using just equals(). it still uses equals(), it just uses hashCode() first

ephemient

03/15/2021, 7:07 PM

that's how HashSet works

Joel

03/15/2021, 7:07 PM

If I ask the machine to do `listOf(1,1,1) - listOf(1)`, then `emptyList()` and `listOf(1,1)` are both valid answers

Joel

03/15/2021, 7:08 PM

So it's not wrong, it's just a very important implementation detail

ephemient

03/15/2021, 7:09 PM

it's documented

Joel

03/15/2021, 7:09 PM

Excellent, the ~~prosecution~~ defense rests

ephemient

03/15/2021, 7:10 PM

it is somewhat unfortunate that `.removeAll()` isn't as well documented on the Kotlin side... but it behaves just like Java's `.removeAll()` so shrug just go read the original Javadoc I guess

Nir

03/15/2021, 7:25 PM

Set - Iterable is fine. That's just very clearly set subtraction; I can't imagine this operation could raise any major question marks about either behavior or implementation here
