Whoa, that's a huge question you're asking. The implementation could be quite extensive. Why do you need to read these files? Do you need to render their contents, or just read them for some other reason (e.g., searching)?
Dan East
How do I read HTML, PDF, RTF, CHM, or DOC files for input?