Clean Options
Advance Items
Actually, there's much garbage in a html file, especially for local files. The options in this group are most common useless information for local files. But be sure, it's mainly for local file, not for online pages. The information is useless for local file but maybe useful online.
Embedded elementsEmbedded elements are usually some media embedded in a page, such as flash, movie, music, and other media or ActiveX controls. Generally it's useless for your local files. If you want to retain the element please don't select it.
ScriptsAfter you downloaded a page, but when you open it again, error occurs, it shows so slowly, or ask you to go online, so many troubles. Do you experience it? Why? that's scripts in your page.
JavaScripts and VBScripts are usually for interactive actions. Generally it's garbage for a local HTML file, except it's a special page,such as a web page game or widget. The scripts often do some irrelevant work, such as go online, download advertisement, it's nothing about the content in the page. So remove them is strongly recommended.
Online pages and special active/dynamic pages, don't select this.
Meta infoMeta infos are stored in the HTML head, it's irrelevant to page body - what you see on the page (except charset). Most meta infos are keywords and descriptions for online pages to be searched easily. They are useful for online pages but useless for local pages. In some website, their page's meta info maybe larger than their content for they put too much keywords there.
If this option selected, all meta info except page coding will be removed.
StyleIn modern web pages, style sheet is very important. But some website put the CSS sheet directly in the page, instead of a linked style sheet file. Thus, if you download a lot of pages of the same site, the same style sheet will stored in all the pages. In this version, I have no time to implement the function to extract all the style sheet to a linked file outside. So if you don't care much about the page looking, you can try to remove it.(clean in single file mode or simulation in multi files mode) if the result can be accept, then remove it.
FormsForms are always for online page's interactiion. Maybe you download some pages form a forum, but there's a login form and a comment form in every page you stored. Obviously, they are useless. Remove them.
LinksHyperlink is the basic fact of web pages. But for downloaded pages, if you only want the text and the layout, or the links are broken or online addresses, the links can also be removed. This option is especially useful for some over-linked pages, such as a page from Wikipedia.
ImagesIf you saved a web page as a single htm file, without images. There will be dead images in the page when you open it. You can remove all the image links in the page by select this option.
You can read more details and see examples of Advance Cleaning on our website here.
Basic Items
White SpaceIf there's some useless space in the page code, it will be removed.
Line BreaksIf you needn't edit the page, line breaks and formats or source code are useless. Remove them can save some space.
But if you want to edit them in the future, don't select this.
CommentsComments are marks generated by page editors. I you don't want to edit the page's source code, remove it will save some space.
Blank TagsBlank tags, futile tags, are often generated by some web page software, such as Frontpage, Dreamweaver, or converted from some document file, such as MS Word. There are much more garbage code in such pages, but they are too complex. In this version, I could only do some simple work to this, only remove blank tags, including tags embraced spaces or styles imposed to them.
DeclaresIn some page there're some declares generated by page editing software. Such as <!doctype... and <html nls.... It's useless for local pages.
You can read more details and see examples of Basic Cleaning, please visit our website here
Clean Mode Selector
There are three prepared Clean Mode choices, with specified cleaning items. Basic Mode is mainly for HTML source code optimiztion, not advance items checked. Advance Mode is mainly for extremely cleaning for some type elements in pages. Customise Mode is a combination of above two You can modified them all freely.
Caution: Options when use filter
If you have checked to use filter, then be care of the items you want to retain. For example, if you use a filter, clean a page, the result should be with images, links and styles remained, so you should not check these three boxed up in this panel.
