# datascience
e
If anyone needs an automatic differentiation engine in Kotlin, you can use this gist of mine (I don’t have time to turn it into an actual library). It is not the most optimized implementation in terms of performance, but it is quite easy to use in terms of API and has a very concise and simple implementation + Kotlin DSL FTW. https://gist.github.com/elizarov/1ad3a8583e88cb6ea7a0ad09bb591d3d It’s backward-mode (back-propagation), so you get the derivative of one output value with respect to all input values in one pass. That is the thing you’d need to do gradient descent. Use it like this:
```kotlin
val x = D(2)                          // define variable(s) and their values
val y = grad { sqr(x) + 5 * x + 3 }   // write formulae in grad context
assertEquals(17.0, y.x)               // the value of the result (y)
assertEquals(9.0, x.d)                // dy/dx
```
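For illustration, a minimal sketch with two inputs (variable names are mine; it assumes the same `D`/`grad`/`sqr` API used above), showing that a single backward pass yields the derivative for every input at once:
```kotlin
val a = D(3)
val b = D(4)
val f = grad { sqr(a + b) }   // f = (a + b)^2
assertEquals(49.0, f.x)       // value at a = 3, b = 4
assertEquals(14.0, a.d)       // df/da = 2 * (a + b)
assertEquals(14.0, b.d)       // df/db = 2 * (a + b)
```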
👍 3
a
I have autodiff in kmath for the JVM via Commons Math with a similar design. I can add it as an MPP implementation with an appropriate remark about the author.
👍 1
e
Yeah. I don’t like “stringly-typed” designs. The one in my gist is 100% type-safe and fully composable.
a
Basic DerivativeStructure works exactly like your example (only with derivatives of arbitrary order). Your example declares the expression together with its parameters; I would like to declare the expression and define the parameters in a separate place. I have not found a way to do so without string keys.
e
I don’t see why you would need forward declarations. You can just declare functions with parameters, call them, and have the engine compute derivatives as a side effect, too.
And no, this is a completely different approach. You seem to explicitly build an expression tree and then compute symbolic derivatives?
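A hedged sketch of that idea, reusing the `AD` receiver convention from the gist (the function name `parabola` is illustrative):
```kotlin
fun AD.parabola(x: D): D = sqr(x) + 5 * x + 3   // declared once, anywhere

val x = D(2)
val y = grad { parabola(x) }   // calling it inside grad fills x.d as a side effect
assertEquals(9.0, x.d)         // dy/dx = 2x + 5 at x = 2
```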
a
What I wanted is to be able to define the function and its parameters in different places. Currently I can do both using either DerivativeStructureField or ExpressionContext.
e
My engine does not. It is an algorithmic differentiation engine. It computes the result and the derivatives in one go, without explicitly building an expression tree. It is compatible with any logic (ifs, whens, loops) and scales to computations of extremely big sizes (like neural networks).
You can compare performance on some fairly large expression to get a feel for the difference.
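A sketch of what that buys you, using only the operations shown above (the `piecewise` name and values are illustrative): since no expression tree is built, ordinary control flow can branch on the current values, and only the branch that actually runs is differentiated.
```kotlin
fun AD.piecewise(x: D): D =
    if (x.x > 0) sqr(x) + 5 * x + 3   // branch on the current value of x
    else 3 * x + 3

val x = D(-1)
val y = grad { piecewise(x) }   // the else branch is taken at x = -1
assertEquals(0.0, y.x)          // 3 * (-1) + 3
assertEquals(3.0, x.d)          // derivative of the branch that actually ran
```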
a
I do understand what it does. Exactly the same as DerivativeStructure in Commons Math. I can use your implementation for MPP without loading Commons Math.
It does not create an expression tree.
e
Commons Math DerivativeStructure is too inefficient. Compare, for example, how much abstraction its definition of the `sin()` function uses and how concrete and concise mine is:
```kotlin
fun AD.sin(x: D): D = derive(D(sin(x.x))) { z ->
    x.d += z.d * cos(x.x)   // chain rule: d(sin x) = cos(x) dx
}
```
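For comparison, the same pattern extends to other primitives; here is a hedged sketch of `exp` written in the same style (it assumes `kotlin.math.exp` is in scope and is not part of the quoted gist):
```kotlin
fun AD.exp(x: D): D = derive(D(exp(x.x))) { z ->
    x.d += z.d * exp(x.x)   // chain rule: d(exp x) = exp(x) dx
}
```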
a
Sorry, I am on my phone. I will write in a bit more detail when I get home. I will be happy to add your implementation and then compare performance.
e
Actually, I have not even started to optimize my code. I brought it to the state of the most beautiful API, not maximal performance.
a
I need an MPP API anyway.
Your API is almost identical to kmath’s.
e
Note, though, mine is hard-coded for backward mode. There is no support for forward mode. So it’s only efficient when your functions map R^n to R^m with n >> m.
K API
a
Yep. Kotlin just asks for this design style.
Finally got home. The problem with your implementation compared with Commons Math is that it allows neither higher derivatives (and second derivatives are quite a common use case) nor partial derivatives. So it is strictly a univariate first-derivative engine. The function `grad` is a bit confusing, since a gradient is usually taken of a scalar multivariate function and produces a vector, not a scalar. It is still nice to have; I will generalize it for different types of objects using `ExtendedField`, so it will be applicable to complex numbers and (in the future) to vectors of numbers.
e
Indeed. It is tailored for gradient descent. It computes the gradient of the result with respect to all input parameters.
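A minimal gradient-descent sketch against that API (the target function, step size, and iteration count are illustrative, not from the gist):
```kotlin
var w = 10.0
val lr = 0.1
repeat(100) {
    val wd = D(w)
    grad { sqr(wd) + 5 * wd + 3 }   // one backward pass fills wd.d with df/dw
    w -= lr * wd.d                  // step against the gradient
}
// w approaches the minimum of w^2 + 5w + 3 at w = -2.5
```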
a
@elizarov I've integrated the code into kmath, but I really did not like accessing the derivatives via the mutable state of the inputs, so I've changed the API to store derivatives only during the actual calculation and not leak them outside. The experimental code is here: https://github.com/mipt-npm/kmath/blob/autodiff-experiment/kmath-core/src/commonMain/kotlin/scientifik/kmath/operations/AutoDiff.kt. I've also added actual gradient and divergence calculation to the result. My solution will probably have a minor overhead due to derivative lookup, but I think it will be rather small.