Learning data structures in kotlin and was curious if why th kotlinlang #getting-started

Learning data structures in kotlin and was curious...

Colton Idle

12/29/2022, 10:02 PM

Learning data structures in kotlin and was curious if/why there wasn't a LinkedList impl in kotlin. Anyone know what

@elizarov

was talking about here that itd be "harmful" to have it?

ephemient

12/29/2022, 10:03 PM

in theory, there could be some unique advantages to

LinkedList

. however, by conforming to the

List

interface, none of those are realized in Java at all

ephemient

12/29/2022, 10:03 PM

instead, you have a strange list where

get()

is linear-time, unlike every other list in existence

Chris Lee

12/29/2022, 10:04 PM

in addition to the extra memory overhead for storage - each entry requires additional references to the next item.

ephemient

12/29/2022, 10:05 PM

correct, the memory overhead per element is ~quadruple what an ArrayList needs (with allocator metadata included), and the additional reference-following required is more overhead as well

ephemient

12/29/2022, 10:07 PM

LinkedBlockingQueue shows some of what you can do with a linked list that doesn't conform to the List interface

Colton Idle

12/29/2022, 10:07 PM

So... if I needed to use a linked list... is the general concensus that you'd just implement your own?

ephemient

12/29/2022, 10:07 PM

but in Kotlin we can just treat that as an internal implementation detail of Channel

Colton Idle

12/29/2022, 10:07 PM

Oh LinkedBlockingQueue is a thing?

Chris Lee

12/29/2022, 10:07 PM

depends what the need really is

ephemient

12/29/2022, 10:08 PM

in almost all circumstances I'd just use an ArrayList (e.g. normal list in Kotlin), or possibly an ArrayDeque if you have mutation at both ends

Colton Idle

12/29/2022, 10:08 PM

I suppose so 😂 Right now... I'm studying data structures for whiteboarding style interview questions (unfortunately)

Chris Lee

12/29/2022, 10:09 PM

or possibly a Kotlin channel if the structure is intended to drive concurrent workloads.

Colton Idle

12/29/2022, 10:14 PM

So if someone said "does kotlin have a linkedList impl" the answer would be no?

Colton Idle

12/29/2022, 10:14 PM

What about stack or queue? I suppose you can use a Deque and it can be a stack, queue, or LL?

ephemient

12/29/2022, 10:14 PM

I would say no, it does not.

❤️ 1

ephemient

12/29/2022, 10:14 PM

a MutableList is a completely serviceable stack

➕ 1

Colton Idle

12/29/2022, 10:15 PM

Thanks for teaching btw. Datastructures scare me. 🙃 so interviewing has not been fun. lol

Chris Lee

12/29/2022, 10:15 PM

at least not part of the Kotlin standard library. You can use Java’s LinkedList or other 3rd party libs that implement it.

👍 1

ephemient

12/29/2022, 10:15 PM

and ArrayDeque is a completely serviceable queue (or stack, but you don't need its double-ended-ness)

Kristian Nedrevold

12/29/2022, 10:16 PM

You have Stack in Java right? But it uses the vector interface and should not be used?

ephemient

12/29/2022, 10:17 PM

yes, Vector and other old Java collections such as Stack have internal synchronization which both unnecessary for almost all purposes and insufficient when it actually matters

Chris Lee

12/29/2022, 10:18 PM

along with their sickly cousin, Hashtable…

ephemient

12/29/2022, 10:18 PM

feels like Java 1.0's classpath library was made from a CS101 course 😓

Chris Lee

12/29/2022, 10:19 PM

textbook -> java library. 🤦

ephemient

12/29/2022, 10:21 PM

something I do think is missing from kotlin stdlib is a priority queue (usually implemented with a binary heap). but you can usually just use Java's, and it's like 30 lines of code if you can't, so it's not that bad

👍 1

Kristian Nedrevold

12/29/2022, 10:23 PM

Yeah, that is probably the only one I have used outside of the kotlin stdlib.

ephemient

12/29/2022, 10:23 PM

well, it took me 39LoC I guess https://github.com/ephemient/aoc2021/blob/main/kt/src/nonJvmMain/kotlin/com/github/ephemient/aoc2021/PriorityQueue.kt

🙌 2

Francesc

12/29/2022, 10:56 PM

Nitpick,

storage

should be

val

in that gist

Colton Idle

12/29/2022, 10:59 PM

wow. thanks everyone. learning a ton. i very much am brushing up on datastructures and learning more about java and kotlin

Javier

12/29/2022, 11:42 PM

In that Kotlin thread Elizarov explains why indeed

Javier

12/29/2022, 11:43 PM

@sandeep549 I cannot agree with you. All deques (incl. queues and stack) are much faster when they are based on arrays. You should never use linked lists for that purpose, especially if you do competitive programming where performance is important.

You only need linked lists when you need to insert into the middle in O(1). This almost never happens in practice and extremely rarely needed in competitive programming. When it does happen you usually need an “intrusive list” which Java’s
LinkedList
does not provide anyway.

Toddobryan

12/30/2022, 3:49 AM

As long as you can increase the size of the arrays in amortized constant time (which ArrayList does), and can keep track of where you are at each end, a doubly-linked list can be faked with an array very nicely. The only disadvantage is that you could end up with an array where lots of the elements are unused. At that point, you have a choice between resizing or wasting the memory. So one advantage of a true linked list implementation is that adding and deleting elements is constant time for every operation, whereas they’ll be mostly constant time for an array-based implementation with occasional pauses when the array needs to be resized either larger or smaller. If you don’t mind those occasional hiccups, use the array.

ephemient

12/30/2022, 4:07 AM

allocating the memory for a new link node can also result in pauses, so that isn't as large of an advantage as it sounds

ephemient

12/30/2022, 4:09 AM

being able to atomically insert without locks is a thing that linked lists could do over array lists, but that is not something that Java's LinkedList is capable of due to its interface

Albert Chang

12/30/2022, 4:23 AM

https://youtu.be/YQs6IC-vgmo▾

paulpaul1076

12/30/2022, 4:25 AM

In functional programming linked lists are always used as the #1 data structure.

Toddobryan

12/30/2022, 5:03 AM

ephemient [8:07 PM]

allocating the memory for a new link node can also result in pauses, so that isn’t as large of an advantage as it sounds

But only if there’s garbage collection or a similar issue. If you have an array representing a list of 10k items, and you’ve run out of space, the default code for ArrayList is going to allocate an array of size 20k and copy all 10k elements of the first array into it. Adding to either end of a linked list (assuming you have references to both ends) will involve allocation of a new node and the setting of three references: the previous end gets a reference to the new node, the new node gets a reference to the previous end, and the new node gets a reference to whatever is used to indicated end-ness, null or a dummy reference. The work is literally constant every time. It’s true that the JVM could decide to do something else during that time, but it’s a guarantee with an array implementation that every so often, you will be doing a lot more work for some operations because of the way that add works on ArrayList.

👍 1

ephemient

12/30/2022, 5:10 AM

by the time you've gotten to 10k elements, a linked list is much more likely to have added significant memory pressure to the rest of your program than an ArrayList (which is a lot less work for GC to scan). the work is "constant", but that's a simplification. amortized work is what makes GC faster at allocation under most common circumstances.

ephemient

12/30/2022, 5:20 AM

Pavel, true in that it works well as an immutable, recursive data structure. but a. that is not what is presented by LinkedList, and b. even in functional programming languages, we recognize that (linked) lists are not a good data structure for many purposes. as a long-time Haskell programmer, I can confidently state that the most important use of a linked list there is for control flow. you should expect that most "lists" get fused and thus are never materialized in full; if a non-trivial list grows to any significant size, it is a code smell or a design error

👍 1

Albert Chang

12/30/2022, 7:14 AM

The most important reason is that because elements of linked lists are not compact in memory, linked lists greatly increase cache misses, which is why even traversing linked lists (or searching for the position to insert or delete) is much slower than traversing array lists (50~100 times slower according to the talk I posted above). This makes the usage of linked lists very very limited.

Klitos Kyriacou

12/30/2022, 10:22 AM

On the subject of cache misses, simply traversing an ArrayList, without actually looking at the content of each element, does indeed make good use of the CPU cache. However, on the JVM an ArrayList is a list of Objects; even integers are boxed when put into an ArrayList. This means that reading each element of an ArrayList can end up in lots of cache misses because the elements' contents may be stored all over the heap, instead of being contiguous. For better performance there are primitive collection libraries, such as Trove, Fastutil and Eclipse Collections.

paulpaul1076

12/31/2022, 3:31 AM

@ephemient how often do you deal with arrays of 10k elements anyways? Maybe once a year you will write a program that does that. I never do that and I work in big data. In Scala I always use lists.

ephemient

12/31/2022, 3:39 AM

back when I worked with Scala, we used

Seq

for almost everything. but even so: Scala's

List

isn't

SeqLike

so it doesn't suffer the stupid API issues that Java's

LinkedList

does

paulpaul1076

01/01/2023, 12:05 AM

@ephemient java’s linked list mostly useless indeed, although i hear that LinkedHashMap uses it to preserve insertion order, you can remove a node from LinkedList in O(1) time

Chris Lee

01/01/2023, 12:07 AM

Java’s LinkedHashMap doesn’t use LinkedList; the code shows it’s own, internal, use-case-specific linking structure.

paulpaul1076

01/01/2023, 12:11 AM

@Chris Lee thanks for pointing out, but still, that structure is a linked list, even though it isn’t java.util.LinkedList, I meant that in general there are use cases for linked lists,

ephemient

01/01/2023, 12:15 AM

there are uses for data structures with internal links. but what LinkedHashMap does with its nodes cannot be done with an external linked list, so a LinkedList ADT has very little utility

ephemient

01/01/2023, 12:16 AM

this is consistent with Roman's position in the thread as well

Colton Idle

01/01/2023, 3:41 PM

ADT?

ephemient

01/01/2023, 4:38 PM

https://en.wikipedia.org/wiki/Abstract_data_type

ephemient

01/01/2023, 4:39 PM

as opposed to https://fuchsia.dev/fuchsia-src/development/languages/c-cpp/fbl_containers_guide/introduction for example - that's much more useful for linked lists

ephemient

01/01/2023, 4:41 PM

but it's not something that can be represented in Java/Kotlin in a way that fits in with the rest of the language

4 Views

Open in Slack

Previous Next