altavir
01/24/2021, 7:37 AMBoxing addition completed in 22157 millis
Specialized addition completed in 1840 millis
Nd4j specialized addition completed in 1309 millis
Viktor addition completed in 1966 millis
Parallel stream addition completed in 1457 millis
Automatic field addition completed in 1773 millis
Lazy addition completed in 14157 millisND4J uses OpenBlas under the hood. And I think @Iaroslav Postovalov told me that is uses parallel execution. I wonder if there is a large overhead on top of BLAS. Because the results are very close.
Iaroslav Postovalov
01/24/2021, 12:22 PMaltavir
01/24/2021, 12:23 PMIaroslav Postovalov
01/24/2021, 12:23 PMaltavir
01/24/2021, 12:25 PM