SemNet was developed in spring 2011 as part of a bachelor's thesis at the Faculty of Informatics, Masaryk University (FI MU).

Building of domain-specific semantic networks from web pages

Abstract

The aim of the thesis is to describe the issues involved in designing semantic networks focused on a specific domain and to develop tools for creating, maintaining, and querying such networks. In particular, the crawling and scraping of HTML web pages is described in detail. The implemented tools are modular and extensible, forming a framework for further development.

Outline

  1. Introduction
  2. Data, information, knowledge
    1. Knowledge representation
      1. Metadata
      2. Ontology
      3. Upper ontology
    2. Semantic networks
      1. History
      2. Types
    3. The road to the semantic web
      1. Contemporary web
      2. Web as a graph
      3. Resource identification
      4. RDF(S)
      5. OWL
  3. Semantic network building toolset
    1. Requirements
    2. Platform
    3. Third-party software
    4. Piped object processor
      1. Architecture
      2. Development
    5. Crawler
      1. Addressed issues
      2. Design
    6. Other modules
      1. Scrapers
      2. StatementMapper
      3. SesameWriter
    7. Usage
  4. Semantic network of artworks
    1. Design
      1. Selected sources
    2. Properties
  5. Conclusion