1.
Nisarg N Pathak
2.
Dr. Nilesh K Modi
3.
Dr. S M Shah
Abstract
The Web harbors a large number of communities — groups of content-creators sharing a common interest — each of which manifests it self as a set of interlinked Web pages. New groups and commercial Web directories together contain of the order of 20,000 such communities; our particular interest here is on particular topic based communities. There is a type of information called Unexpected Information, which is of great interest. Finding unexpected information is useful in many applications. For example, it is useful for a company to find unexpected information about its competitors, e.g., unexpected services and products that its competitors offer. With this information, the company can learn from its competitors and/or design counter measures to improve its competitiveness. The research tries to form a group of common objective web sites and then derive information by comparing those web sites. The research proposes a methodology through which we can group all those same type of web sites and can find out some unexpected information from it.
Keywords: Web Unite, Web Mining, Information Extraction