Guys, a question. What big data is big enough for ...
# datascience
a
Guys, a question. What big data is big enough for you. I plan to re-implement some descriptive statistics routines for kotlin-statistics and it is important to know what data sample sizes are used. For example if data is of TB size, one should go for different algorithm (there is an ongoing discussion in commons-math email list).
h
My usage of it only involves thousands to tens of thousands
I do think it'd be nice to have both "storage" and "storeless" versions like commons math does.
t
Same here. Maximum I'll ever do is hundreds of thousands