89
Tika Auto Detector Parser
Tika AutoDetectParser is a class which automatically figure out what kind of content a file has, and then calls appropriate parser.
This class contains constructors and methods that are given below.
Tika AutoDetectParser Constructors
Constructor | Description |
---|---|
public AutoDetectParser() | It creates an auto-detecting parser instance using the default Tika configuration. |
public AutoDetectParser(Detector detector) | It creates an auto-detecting parser instance. |
public AutoDetectParser(Parser… parsers) | It creates an auto-detecting parser instance using the specified set of parser. |
public AutoDetectParser(Detector detector, Parser… parsers) | It creates an auto-detecting parser instance using the specified detector and parser. |
public AutoDetectParser(TikaConfig config) | Creates an auto-detecting parser instance using the specified set tika config. |
Tika AutoDetectParser Methods
Following are the methods of Tika AutoDetectParser Class.
Method | Description |
---|---|
public Detector getDetector() | It returns the type detector used by this parser to auto-detect the type of a document. |
public void setDetector(Detector detector) | It sets the type detector used by this parser to auto-detect the type of a document. |
public void parse(InputStream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException | It is used to parse the input file. |
public void parse(InputStream stream, ContentHandler handler, Metadata metadata) throws IOException, SAXException, TikaException | It calls the Parser.parse(InputStream, ContentHandler, Metadata, ParseContext) method with an empty ParseContext. |
Tika AutoDetectParser Example
Output:
Following is the content of hello.txt file after extraction.
Hello Welcome to Tutor Aspire
Next TopicTika Parsing Document to Plain Text