Any way to load a RoBERTa model using Kotlin? (kotlinlang #datascience)

Ananiya

07/26/2022, 3:12 PM

Is there any way to load a RoBERTa model using Kotlin? I am aware that certain languages have implementations to vectorize the text, like `RobertaTokenizerFast` in Python. In Kotlin I managed to get the size and some information using `OnnxInferenceModel` (KotlinDL), but I am quite confused about vectorizing the sentence, because I don't have access to the vocabulary, the approach used to vectorize the sentence, or the dimensions needed to implement the vectorizer manually. So I was wondering if there's any library or workaround for this.

Ilya Muradyan

07/26/2022, 4:30 PM

@Julia Beliaeva @zaleslaw

zaleslaw

07/27/2022, 12:06 PM

At this moment I could not find a way to port tokenizers

Ananiya

07/27/2022, 2:23 PM

Thanks @zaleslaw for the response. Is there any other model that I can use instead of RoBERTa? I tried importing a pretrained Keras model, which doesn't work due to the lack of an Embedding layer. Or maybe I should wait until the layers are ready.

roman.belov

07/29/2022, 10:26 AM

@Ananiya I guess https://github.com/londogard/londogard-nlp-toolkit/ should work for preprocessing

Ananiya

07/29/2022, 4:44 PM

@roman.belov thanks for the response! I looked at this library earlier, but it doesn't seem to provide the same vectorizer as RoBERTa.

Hampus Londögård

08/01/2022, 5:50 AM

@Ananiya it’s something I’m working on adding through using DJL. You should be able to use DJL directly for now 🙂 It supports HuggingFace Tokenizers.
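
For anyone reading along, here is a minimal sketch of what using DJL's HuggingFace tokenizer from Kotlin might look like. It assumes the `ai.djl.huggingface:tokenizers` dependency is on the classpath and that the `roberta-base` tokenizer can be downloaded from the HuggingFace Hub. The `EncodedText` class, the `padTo` and `encode` helpers, the fixed length of 16, and the pad id of 1 (the `<pad>` entry in the `roberta-base` vocabulary) are illustrative assumptions, not something from this thread; check them against your own model export.

```kotlin
// Assumed Gradle dependency: ai.djl.huggingface:tokenizers (use a current release).
import ai.djl.huggingface.tokenizers.HuggingFaceTokenizer

/** Hypothetical holder for the two inputs a RoBERTa ONNX export typically expects. */
class EncodedText(val inputIds: LongArray, val attentionMask: LongArray)

/**
 * Right-pads an array to a fixed length. If the input is longer, it is cut off,
 * which also drops the closing </s> token, so pick a length that is large enough.
 */
fun padTo(values: LongArray, length: Int, padValue: Long): LongArray =
    LongArray(length) { i -> if (i < values.size) values[i] else padValue }

/** Tokenizes the text and pads ids and mask to maxLength. */
fun encode(tokenizer: HuggingFaceTokenizer, text: String, maxLength: Int): EncodedText {
    val encoding = tokenizer.encode(text)
    return EncodedText(
        // Assumption: 1 is the <pad> id, as in the roberta-base vocabulary.
        inputIds = padTo(encoding.ids, maxLength, 1L),
        // Padded positions get mask 0 so the model ignores them.
        attentionMask = padTo(encoding.attentionMask, maxLength, 0L)
    )
}

fun main() {
    // Downloads tokenizer.json for "roberta-base" on first use (needs network access).
    HuggingFaceTokenizer.newInstance("roberta-base").use { tokenizer ->
        val encoded = encode(tokenizer, "Kotlin is fun", maxLength = 16)
        println(encoded.inputIds.joinToString())
        println(encoded.attentionMask.joinToString())
    }
}
```

The resulting `inputIds` and `attentionMask` arrays are what you would then feed to the ONNX model as its inputs; the exact input names and shapes depend on how the model was exported.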

Ananiya

08/01/2022, 5:52 AM

Interesting! thanks @Hampus Londögård for letting me know

Hampus Londögård

08/29/2022, 8:01 AM

@Ananiya I’ve added support for `ClassifierPipeline` and `TokenClassificationPipeline`, which means that Text Classification and Token Classification (e.g. NER) are supported now 👍 Files can be loaded from the file system (ONNX & PyTorch) and from HuggingFace Hub (ONNX).

Ananiya

08/29/2022, 8:20 AM

Fantastic @Hampus Londögård ! Excited to test it
