Can you tell us a bit about the project?

Apache Tika is an open-source content detection and analysis framework written in Java. It detects and extracts metadata and text from over a thousand different file types. In addition to providing a Java library, Tika has server and command-line editions suitable for use from other programming […]
Read more: ASF Project Spotlight: Apache Tika