devxlogo

Calling JTidy from Java to Convert HTML to XHTML

The open source JTidy project does an excellent job of converting HTML files to the newer XHTML standard. The following code shows how to invoke JTidy programmatically from Java:

/*In:  C:Data_Localxmldocs	est.htmlOut: C:Data_Localxmldocs	estXHTML.xml*/import org.w3c.tidy.Tidy;import java.io.FileInputStream;import java.io.FileOutputStream;import org.w3c.dom.Document;public class HTML_to_XHTML{   public static void main(String[] args){      try{         FileInputStream FIS=new FileInputStream("C://Data_Local            //xml//docs//test.html");         FileOutputStream FOS=new FileOutputStream("C://Data_Local            //xml//docs//testXHTML.xml");            Tidy T=new Tidy();         Document D=T.parseDOM(FIS,FOS);         }      catch (java.io.FileNotFoundException e)         {System.out.println(e.getMessage());}         }   }}

Charlie has over a decade of experience in website administration and technology management. As the site admin, he oversees all technical aspects of running a high-traffic online platform, ensuring optimal performance, security, and user experience.

See also  Seven Service Boundary Mistakes That Create Technical Debt

About Our Editorial Process

At DevX, we’re dedicated to tech entrepreneurship. Our team closely follows industry shifts, new products, AI breakthroughs, technology trends, and funding announcements. Articles undergo thorough editing to ensure accuracy and clarity, reflecting DevX’s style and supporting entrepreneurs in the tech sphere.

See our full editorial policy.