WebArchiveX Component Set
Overview
Samples |
Changelog |
API
Ever thought how "Save as Web Archive (MHT)" works?
Ever thought how "Save as Web Archive (MHT)" works?
Ever wanted to extract this functionality from Internet Explorer and incorporate
it in your own applications? Ever needed a super-easy-to-install module that
doesn't depend on any other program or library? If the answer is yes WebArchiveX™
Component was designed especially for you.
WebArchiveX builds web archives (MHT)*
and it does it much
better than "Save as web archive
MHT" of Microsoft Internet Explorer! Rich WebArchiveX
API lets you tap the functionality of professional web
archiving from your own application within minutes. WebArchiveX COM is a full-featured flexible tool that,
unlike Internet
Explorer, works
independently of any other program software
or library.
WebArchiveX can be used in any type of application and is compatible with a wide range of programming languages
and environments. This help provides samples for .NET C#, C++, Java Script, Visual Basic and
ASP (Active Server Pages). WebArchiveX package includes sample projects
for C++ and Visual Basic.
Please see the WebArchiveX samples page to get a
quick impression of what WebArchiveX does.
Kindly refer to Appendix A for a brief history of WebArchiveX.
With WebArchiveX you can:
- Build web archive MHT files including HTML, style sheets, JavaScript, XML,
XSL, Sound files (e.g. WAV), Office files (e.g. DOC), PDF files and many
many more
- Build self-updateable forms using MakeArchiveFromDoc
(for details please see "Testform" sample in the "Samples
\ JavaScript" directory of the WebArchiveX distribution)
- Create MHT web archive files "on the fly" in ASP and write them
directly into Response using MakeArchiveStream
- Encrypt, Decrypt
and Email web archive files
- Detect and archive dynamically loaded images in HTML and scripts (using
the Image object)
- Apply custom changes using AddPreParsingFilter
and AddPostParsingFilter methods
- Make MHT from a dynamically modified HTML document object using MakeArchiveFromDoc
method
- Add files manually using AddFile method
- Use your own custom tags using AddResourceTag
method
- Use custom MIME types using AddMimeType
method
- Use custom encoding using SetCharset method
- Control level of spidering using SpiderLevel
property
- Control logging using LogLevel and LogFile
properties
- Receive progress callbacks using CallbackWindow,
CallbackMessageID, InfoWindow
and InfoMessageID properties
WebArchiveX API fully supports:
- Dynamic HTML
- All MIME types
- External and internal frames
- External and internal scripts and style sheets
- Multi-threaded environments
- All programming languages that support COM
- All character sets
Minimal Requirements
Please note that WebArchiveX doesn't use MFC.
- Windows 95/98/Me/NT/2000/XP/2003
- Internet Explorer 4.1 or higher
In order to compile our test projects you will need Microsoft Visual Studio 6.0 (SP5) or higher.
Acknowledgments
We would like to jointly thank the following people for their valuable help:
PRIMAsoftware
(for finding a very elusive bug in spidering mechanism)
R&D team of C.D.I. Systems (1992) Ltd
(for an extensive testing in ASP environment)
Mark Malamut from Fulcrum Point
Technologies (for the idea of MakeArchiveFromDoc)
Cheng Tan from Bay Prospector
(for pointing out the problem of retrieving files from ISAPI. The bug was
fixed in v4.0)
George Gueorgiev from Verity, Inc.
(for help and understanding during development of version 5.5)
Kevin Huck (for testing betas of version 5.5)
Elmar Denkmann from Baywotch, GmbH.
(for excellent advises and pedantic tests of version 6.0)
* Web Archive MHT is a single
file that, unlike regular HTML, embodies all needed resources such as frames,
linked documents, images, style sheets and scripts. In fact you can store whole
websites into a single web archive MHT. Web archive MHT is an Internet standard for sending HTML documents within MIME formatted
archives/messages. HTML documents, images, scripts etc. are included as body parts and are referenced by URLs in the
archive/message. Packing web pages into MHT web archive avoids problems such as missing
images, styles or scripts when you publish your documents on the Web or send
them by email.
-
WebArchiveX
API - WebArchiveX
Website - C Systems Website -
C Systems - Creative software solutions since 1996. All rights reserved. Terms of use.