Package org.apache.poi.extractor.ole2
Class OLE2ScratchpadExtractorFactory
java.lang.Object
org.apache.poi.extractor.ole2.OLE2ScratchpadExtractorFactory
Scratchpad-specific logic for
OLE2ExtractorFactory
and
org.apache.poi.extractor.ExtractorFactory
, which permit the other two to run with
no Scratchpad jar (though without functionality!)
Note - should not be used standalone, always use via the other two classes
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionstatic POITextExtractor
createExtractor
(DirectoryNode poifsDir) Look for certain entries in the stream, to figure it out what format is desired Note - doesn't check for core-supported formats! Note - doesn't check for OOXML-supported formatsstatic void
identifyEmbeddedResources
(POIOLE2TextExtractor ext, List<Entry> dirs, List<InputStream> nonPOIFS) Returns an array of text extractors, one for each of the embedded documents in the file (if there are any).
-
Constructor Details
-
OLE2ScratchpadExtractorFactory
public OLE2ScratchpadExtractorFactory()
-
-
Method Details
-
createExtractor
Look for certain entries in the stream, to figure it out what format is desired Note - doesn't check for core-supported formats! Note - doesn't check for OOXML-supported formats- Parameters:
poifsDir
- the directory node to be inspected- Returns:
- the format specific text extractor
- Throws:
IOException
- when the format specific extraction fails because of invalid entires
-
identifyEmbeddedResources
public static void identifyEmbeddedResources(POIOLE2TextExtractor ext, List<Entry> dirs, List<InputStream> nonPOIFS) throws IOException Returns an array of text extractors, one for each of the embedded documents in the file (if there are any). If there are no embedded documents, you'll get back an empty array. Otherwise, you'll get one openPOITextExtractor
for each embedded file.- Parameters:
ext
- the extractor holding the directory to start parsingdirs
- a list to be filled with directory references holding embeddednonPOIFS
- a list to be filled with streams which aren't based on POIFS entries- Throws:
IOException
- when the format specific extraction fails because of invalid entires
-