hey mates, i am new student of university, i am learning big-data analyse, does anybody know which lib i can learn and tools for Various algorithms, but also other machine learning or deep learning algorithms, such as: random forest, specific data analysis and data visualization
There must be a process of data preprocessing, and corresponding visualization assistance must be carried out during data preprocessing, and the characteristics of the dataset must be observed and analyzed (for example, drawing a box plot or violin plot to observe whether there are outliers in the data features, drawing a scatter plot or heat map matrix to observe the state between the features of the dataset, drawing a histogram or histogram to understand the distribution of data features, etc.).
now i know
https://github.com/JetBrains/kotlin-spark-api and
https://github.com/breandan/kotlingrad