Nuxeo ECM Projects 5.4.3-SNAPSHOT

org.nuxeo.ecm.core.convert.plugins.text.extractors
Class Html2TextConverter

java.lang.Object
  extended by org.nuxeo.ecm.core.convert.plugins.text.extractors.Html2TextConverter
All Implemented Interfaces:
Converter

public class Html2TextConverter
extends Object
implements Converter

Extracts the text content of HTML documents while trying to preserve the paragraph structure.

Author:
Thomas Roger, Olivier Grisel

Constructor Summary
Html2TextConverter()
           
 
Method Summary
 BlobHolder convert(BlobHolder blobHolder, Map<String,Serializable> parameters)
          Main method that performs the actual conversion job.
 void init(ConverterDescriptor descriptor)
          Initializes the Converter.
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

Html2TextConverter

public Html2TextConverter()
Method Detail

convert

public BlobHolder convert(BlobHolder blobHolder,
                          Map<String,Serializable> parameters)
                   throws ConversionException
Description copied from interface: Converter
Main method that performs the actual conversion job.

The returned BlobHolder must implement CachableBlobHolder; otherwise the result will not be cached.

Specified by:
convert in interface Converter
Throws:
ConversionException
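
The class description says the converter extracts text while trying to respect paragraph structure. The following is a minimal, self-contained sketch of that general idea using only the JDK. It is not the Nuxeo implementation: the class name HtmlTextSketch, the list of block-level tags, and the regex-based approach are all assumptions made for illustration, and a real converter would more likely rely on an HTML parser.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the Nuxeo API.
class HtmlTextSketch {

    // Assumed, non-exhaustive list of tags whose boundaries count as paragraph breaks.
    private static final Pattern BLOCK = Pattern.compile(
            "(?i)</?(p|div|br|h[1-6]|li|tr|ul|ol|table|blockquote)\\b[^>]*>");

    // Script and style elements carry no readable text, so they are dropped whole.
    private static final Pattern SCRIPT_STYLE = Pattern.compile(
            "(?is)<(script|style)\\b[^>]*>.*?</\\1\\s*>");

    // Any remaining tag is removed without adding a break.
    private static final Pattern TAG = Pattern.compile("(?s)<[^>]+>");

    static String extract(String html) {
        String s = SCRIPT_STYLE.matcher(html).replaceAll(" ");
        // Whitespace in HTML source is not significant: collapse it first.
        s = s.replaceAll("\\s+", " ");
        // Turn block-level boundaries into blank lines, then strip inline tags.
        s = BLOCK.matcher(s).replaceAll("\n\n");
        s = TAG.matcher(s).replaceAll("");
        // Decode a handful of common entities; "&amp;" goes last to avoid double decoding.
        s = s.replace("&nbsp;", " ").replace("&lt;", "<").replace("&gt;", ">")
             .replace("&quot;", "\"").replace("&amp;", "&");
        // Normalize every run of line breaks to exactly one blank line.
        s = s.replaceAll("[ \\t]*\\n[ \\t\\n]*", "\n\n");
        s = s.replaceAll("[ \\t]+", " ");
        return s.trim();
    }

    public static void main(String[] args) {
        System.out.println(extract(
                "<html><body><h1>Title</h1><p>First <b>bold</b> paragraph.</p>"
                + "<p>Second &amp; last.</p></body></html>"));
    }
}
```

In this sketch inline tags such as <b> disappear without introducing a break, while each block-level boundary becomes a single blank line, so paragraphs in the output stay separated.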

init

public void init(ConverterDescriptor descriptor)
Description copied from interface: Converter
Initializes the Converter.

This can be used to retrieve configuration information from the XMap descriptor.

Specified by:
init in interface Converter
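
As a rough illustration of reading configuration during init, the sketch below uses a stand-in interface rather than the real ConverterDescriptor, so it compiles without Nuxeo on the classpath. DescriptorLike, the assumption that the descriptor exposes its parameters as a string map, and the paragraphSeparator parameter name are all hypothetical; nothing on this page says Html2TextConverter reads any parameter.

```java
import java.util.Map;

// Hypothetical illustration only; not part of the Nuxeo API.
class InitSketch {

    // Stand-in for a descriptor, assumed to expose parameters as a string map.
    interface DescriptorLike {
        Map<String, String> getParameters();
    }

    static final String DEFAULT_SEPARATOR = "\n\n";

    private String separator = DEFAULT_SEPARATOR;

    // Reads an optional, hypothetical "paragraphSeparator" parameter and keeps
    // the default when the descriptor, the map, or the value is missing.
    void init(DescriptorLike descriptor) {
        Map<String, String> params =
                descriptor == null ? null : descriptor.getParameters();
        if (params != null) {
            String value = params.get("paragraphSeparator");
            if (value != null && !value.isEmpty()) {
                separator = value;
            }
        }
    }

    String getSeparator() {
        return separator;
    }
}
```

Falling back to a default keeps the converter usable when the contribution declares no parameters.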

Copyright © 2011 Nuxeo SAS. All Rights Reserved.