Micha Kops: Content Detection, Metadata and Content Extraction with Apache Tika