very specific, but that’s an area that is not well...
# datascience
c
very specific, but that’s an area that is not well-covered, e.g. Tika addresses this but only in the most simplistic way by allowing the user to define regular expressions for types that are sought in the text…