Skip to main content

Micha Kops: Content Detection, Metadata and Content Extraction with Apache Tika

Content Detection, Metadata and Content Extraction with Apache Tika: Encountering the situation that you want to extract meta-data or content from a file – be it an office document, a spreadsheet or even an mp3 or image – or you’d like to detect the content type for a given file, then Apache Tika might be a helpful tool for you. It supports a variety of document formats and...

Community: Java Tools