
Convert HTML and HTML String to Word in Java


In this article, I demonstrate two ways to convert HTML to Word in a Java application:

  • Convert HTML to Word
  • Convert HTML String to Word

Add Dependencies

This task uses the Free Spire.Doc for Java library. If you use Maven, add the following repository and dependency to your project's pom.xml file to include Free Spire.Doc for Java in your project.

<repositories>   
    <repository>   
        <id>com.e-iceblue</id>   
        <name>e-iceblue</name>   
        <url>http://repo.e-iceblue.com/nexus/content/groups/public/</url>   
    </repository>   
</repositories>   
<dependencies>   
    <dependency>   
        <groupId>e-iceblue</groupId>
        <artifactId>spire.doc.free</artifactId>   
        <version>3.9.0</version>   
    </dependency>   
</dependencies>

For non-Maven projects, download the Free Spire.Doc for Java package from this website and add Spire.Doc.jar from its lib folder to your project as a dependency.

Convert HTML to Word

The Free Spire.Doc for Java library provides a Document class that represents a Word document. To convert an HTML file, load it with loadFromFile(String, FileFormat, XHTMLValidationType) and then write it out as a Word document with saveToFile(String, FileFormat).

[Image: the input HTML document]


import com.spire.doc.Document;
import com.spire.doc.FileFormat;
import com.spire.doc.documents.XHTMLValidationType;

public class HtmlToWord {
    public static void main(String[] args) {
        // Create an empty Document instance
        Document document = new Document();

        // Load the HTML file; XHTMLValidationType.None skips XHTML validation
        document.loadFromFile("Input.html", FileFormat.Html, XHTMLValidationType.None);

        // Save the loaded content as a Word document in the Docx_2013 format
        document.saveToFile("HtmlToWord.docx", FileFormat.Docx_2013);
    }
}
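
When several HTML files need converting, the output name can be derived from the input name instead of being hard-coded as in the example above. The following is a minimal sketch that uses only java.nio.file; the class OutputNames and the method toDocxPath are hypothetical helpers introduced here for illustration and are not part of Free Spire.Doc. It assumes the input path ends in a file name.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper (not part of Free Spire.Doc):
// derives a .docx output path from an HTML input path.
class OutputNames {

    static Path toDocxPath(Path htmlFile) {
        String name = htmlFile.getFileName().toString();
        int dot = name.lastIndexOf('.');
        // Drop the extension if there is one; a name that only starts
        // with a dot (for example ".html") is kept whole.
        String base = dot > 0 ? name.substring(0, dot) : name;
        // resolveSibling places the output next to the input file
        return htmlFile.resolveSibling(base + ".docx");
    }

    public static void main(String[] args) {
        Path input = Paths.get("reports", "Input.html");
        // Prints the derived output path for the sample input
        System.out.println(toDocxPath(input));
    }
}
```

The string form of the returned path (toDocxPath(input).toString()) could then be passed as the first argument of saveToFile.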

[Image: the output Word document]


Convert HTML String to Word

To convert an HTML string to Word, add a section and a paragraph to the document, then call the Paragraph.appendHTML(String) method on that paragraph.

import com.spire.doc.Document;
import com.spire.doc.FileFormat;
import com.spire.doc.Section;

public class HtmlStringToWord {
    public static void main(String[] args) {
        // Create an empty Document instance
        Document document = new Document();

        // Add a section to hold the content
        Section sec = document.addSection();

        // The HTML to convert, held in a string
        String htmlString = "<html><head/><body> <h1>Html Heading</h1><p>This is an html document in a string literal.</p></body></html>";

        // Add a paragraph to the section and append the HTML string to it
        sec.addParagraph().appendHTML(htmlString);

        // Save the result as a Word document in the Docx_2013 format
        document.saveToFile("HTMLstringToWord.docx", FileFormat.Docx_2013);
    }
}
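
The example above uses a fixed string literal. If the HTML string is instead assembled from dynamic text, characters such as <, > and & should be escaped first so that they are not read as markup. The following is a minimal sketch using only the Java standard library; the class HtmlSnippets and its methods are hypothetical helpers introduced here for illustration and are not part of Free Spire.Doc.

```java
// Hypothetical helper (not part of Free Spire.Doc):
// builds an HTML string from plain text values.
class HtmlSnippets {

    // Replaces the characters that have a special meaning in HTML
    // with their entity references.
    static String escape(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        for (char c : text.toCharArray()) {
            switch (c) {
                case '&': sb.append("&amp;"); break;
                case '<': sb.append("&lt;"); break;
                case '>': sb.append("&gt;"); break;
                case '"': sb.append("&quot;"); break;
                default: sb.append(c);
            }
        }
        return sb.toString();
    }

    // Wraps an escaped heading and an escaped body paragraph
    // in the same skeleton as the string literal used above.
    static String headingAndParagraph(String heading, String body) {
        return "<html><head/><body><h1>" + escape(heading)
                + "</h1><p>" + escape(body) + "</p></body></html>";
    }

    public static void main(String[] args) {
        // Prints the assembled HTML for a sample heading and body
        System.out.println(headingAndParagraph("Q&A", "Use <p> for paragraphs."));
    }
}
```

The returned string could be passed to appendHTML in place of the htmlString literal.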

[Image: the output Word document]







