When you create an HTML document in Office 2000, Office-specific markup tags are embedded in it. These tags help "round-trip" the document for editing purposes.
You will find this filter and the article about it at:
Once you have finished editing an HTML document in Word 2000 or Excel 2000, you can use the Office HTML Filter to remove the Office-specific markup tags from the final copy of the HTML document. Removing the tags reduces the size of the document, which in turn reduces both the amount of space used on Web servers and the time it takes to download the page.
Here is some VB.NET code that will let you apply this filter to your current Word document.
The code seems simple enough, but it took me a while to figure out that I had to use Chr(34) rather than a literal quote, or quotes surrounding quotes, around the file name. That is one of the reasons I am sharing this code.
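' file_name holds the full path of the HTML document to filter.
Dim myProcess As New System.Diagnostics.Process()
myProcess.StartInfo.FileName = "filter.exe"
' Chr(34) is the double-quote character; wrapping the path keeps spaces in it intact.
myProcess.StartInfo.Arguments = "-bcmt " & Chr(34) & file_name & Chr(34)
' Copy of the argument string for checking the quoting (strTemp1 is a String declared elsewhere).
strTemp1 = myProcess.StartInfo.Arguments
myProcess.StartInfo.WindowStyle = System.Diagnostics.ProcessWindowStyle.Hidden
' Launch the filter and wait for it to finish.
myProcess.Start()
myProcess.WaitForExit()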
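In case it helps anyone who gets stuck on the same quoting problem, here is a small sketch of what the Chr(34) calls above produce. It is only an illustration: the QuotePath helper, the QuoteDemo module, and the sample path are made up for this example and are not part of the filter or of the code above. It also shows the doubled-quote form, which in VB builds the same string.

```vbnet
Imports System
Imports Microsoft.VisualBasic

Module QuoteDemo
    ' Hypothetical helper (not from the original code): wraps a path in
    ' double quotes so a path with spaces is passed as a single argument.
    Function QuotePath(ByVal path As String) As String
        Return Chr(34) & path & Chr(34)
    End Function

    Sub Main()
        ' Made-up sample path containing a space.
        Dim file_name As String = "C:\My Documents\report.htm"

        ' Same form as the code above: Chr(34) is the double-quote character.
        Dim viaChr As String = "-bcmt " & QuotePath(file_name)

        ' Equivalent form: inside a VB string literal, "" stands for one quote,
        ' so """" is a one-character string holding just a double quote.
        Dim viaDoubled As String = "-bcmt " & """" & file_name & """"

        ' Show the resulting argument string and confirm both forms match.
        Console.WriteLine(viaChr)
        Console.WriteLine(viaChr = viaDoubled)
    End Sub
End Module
```

Run as a console program, this should print -bcmt "C:\My Documents\report.htm" followed by True, so either form works; Chr(34) is just easier to read than four quotes in a row.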
I hope this might be of value to someone.