如何在 Android 上使用 Apache Tika

How to use Apache Tika on Android

我正在尝试使用 Apache tika 来解析一些文档,但它给了我很多错误和警告。

build.gradle

dependencies {
    ...
    compile ('org.apache.tika:tika-parsers:1.14'){
        exclude group: 'org.json', module: 'json'
        exclude group: 'commons-logging', module: 'commons-logging'
        exclude group: 'org.apache.httpcomponents', module: 'httpclient'}
...
}

为了让这个问题更容易阅读,我不会在这里粘贴警告(它们是 600+)并且我会忽略它们但错误是这些:

Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/Location;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/ReferenceResolver;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/XMLEvent;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/XMLInputStream;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/XMLName;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/XMLStreamException;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/utils/NestedThrowable;
Error:Uncaught translation error: java.lang.IllegalArgumentException: already added: Lorg/apache/xmlbeans/xml/stream/utils/NestedThrowable$Util;
Error:8 errors; aborting
Error:Execution failed for task ':app:transformClassesWithDexForDebug'.
> com.android.build.api.transform.TransformException: java.lang.RuntimeException: java.lang.RuntimeException: com.android.ide.common.process.ProcessException: java.util.concurrent.ExecutionException: com.android.ide.common.process.ProcessException: Return code 1 for dex process
Information:BUILD FAILED

请问您有解决办法吗?或者更好的是,您知道我可以用来提取文档元数据的更好的库吗?

尝试使用这个,它对我有用。

compile group: 'org.apache.tika', name: 'tika-parsers', version: '1.14'

致遇到此问题的任何人

该解决方案在新版本

已弃用

当前解决方案

Depedencies 中放这个。

classpath 'org.apache.tika:tika-parsers:1.14'