Compile GALILEI Project

Pascal Francq

January 5, 2012 (January 11, 2011)

Abstract

This document describes how to download, compile and install the GALILEI platform.

1 Introduction

Currently, there is only a development release:
  1. The GALILEI 1.2 release is obsolete.
  2. Some modifications in the API are still going on.
Several steps must be followed to download, compile and install the GALILEI platform. All the source code is available through the subversion server svn.otlet-institute.org.

2 Project Management

All official libraries and applications related to the GALILEI platform are managed through cmake. To build a given project, go to its directory, create a sub-directory (for example ’build’) and go into it. In that sub-directory, type:
cmake .. [OPTIONS]
make
To see the full command lines executed during the build, use make VERBOSE=1 instead of make.
If you are developing, a useful option is ’-DCMAKE_BUILD_TYPE=Debugfull’, which generates all the necessary debugging symbols.
If you want to install the library:
sudo make install
In practice, it is not necessary to install most projects:
  • The R and GALILEI libraries can be managed through environment variables.
  • The plug-ins must never be installed.
  • KGALILEICenter can be run from the directory where it is compiled.
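The build steps described in this section can be sketched as a small shell helper. This is only a minimal sketch and not part of the GALILEI sources: the names run, build_project and DRY_RUN are hypothetical, and by default the commands are only printed so that the sequence can be inspected first.

```shell
#!/bin/sh
# Hypothetical helper for the out-of-source cmake build described above.
# DRY_RUN=1 (the default in this sketch) prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Print a command in dry-run mode, execute it otherwise.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Usage: build_project <source-dir> [cmake options...]
build_project() {
    src_dir="$1"
    shift
    run mkdir -p "$src_dir/build" || return 1
    (
        # The sub-directory may not exist in dry-run mode, so only print the cd.
        if [ "$DRY_RUN" = "1" ]; then
            echo "cd $src_dir/build"
        else
            cd "$src_dir/build" || exit 1
        fi
        run cmake .. "$@" && run make
    )
}
```

For example, build_project r -DCMAKE_BUILD_TYPE=Debugfull prints the four commands for a checkout located in the directory r. Setting DRY_RUN=0 in the environment runs them instead. The installation step (sudo make install) is deliberately left out, since most projects do not need it.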

3 R Library

First, the R library must be installed. The following libraries, with their development files, are required to compile R:
  • SQLite.
  • MySQL.
  • libcurl.
  • Qt and KDE (to compile graphical widgets).
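Before compiling, it can help to check that the command-line tools used in this document are available. The sketch below is an illustration only: missing_tools is a hypothetical helper, and it only tests whether a command is found on the PATH. It does not verify that the development files of SQLite, Qt or KDE are installed.

```shell
#!/bin/sh
# Hypothetical helper: print every command name from the arguments
# that cannot be found on the PATH. Returns 1 if at least one is missing.
missing_tools() {
    status=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool"
            status=1
        fi
    done
    return "$status"
}
```

A possible call is missing_tools svn cmake make curl-config mysql_config. The last two names are assumptions: they are helper scripts that are commonly shipped with the development packages of libcurl and MySQL, but their presence depends on the distribution.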
You can download the source code from subversion:
svn co svn://svn.otlet-institute.org/home/subversion/r/trunk r
You may create an environment variable R_LIB pointing to the root directory containing R.

4 GALILEI Library

Next, you must install the GALILEI library. The Qt library with its development files is necessary if you want to compile graphical widgets. You can download the source code from subversion:
svn co svn://svn.otlet-institute.org/home/subversion/galilei/trunk galilei
You may create an environment variable GALILEI_LIB pointing to the root directory containing GALILEI.
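The two environment variables R_LIB and GALILEI_LIB could be set as in the following sketch. The directory $HOME/src and the names SRC_ROOT and check_lib_dir are assumptions made for the illustration; adapt the paths to the place where the sources were actually checked out.

```shell
#!/bin/sh
# Assumed layout: both checkouts live under $HOME/src (hypothetical path).
SRC_ROOT="${SRC_ROOT:-$HOME/src}"
export R_LIB="$SRC_ROOT/r"
export GALILEI_LIB="$SRC_ROOT/galilei"

# Hypothetical helper: report whether a variable points to an existing directory.
# $1 is the variable name, $2 is its value.
check_lib_dir() {
    if [ -d "$2" ]; then
        echo "$1 ok: $2"
    else
        echo "$1 missing: $2"
        return 1
    fi
}
```

Calling check_lib_dir R_LIB "$R_LIB" and check_lib_dir GALILEI_LIB "$GALILEI_LIB" before building the other projects gives a quick confirmation that both variables point to existing directories. Placing the two export lines in the shell start-up file keeps them available in every session.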

5 Plug-in Projects

Several plug-in projects are available; each project provides one or several plug-ins. The compilation procedure is identical for all the plug-ins. Qt and KDE are necessary to compile the graphical parts (configuration dialog boxes). To download the source code of a plug-in project named plugin (assuming that all projects are downloaded into the same root directory galilei_plugins):
svn co svn://svn.otlet-institute.org/home/subversion/plugin/trunk galilei_plugins/plugin
Currently, the "official" plug-in projects are:
clustering-evaluation It proposes a plug-in that implements three measures to evaluate the different clustering algorithms for validation purposes: the adjusted Rand index, the recall and the precision. This plug-in is only useful for researchers.
feedback It provides a plug-in that computes the profile descriptions.
filters It supplies several plug-ins; each plug-in provides a filter (e-mail, HTML, MS-DOC, PDF, PostScript, RTF and plain text) used to build the document descriptions. To compile this project, the libwv2 library must be installed.
gca It provides plug-ins for document and profile clustering. Two plug-ins use the Similarity-based Grouping Genetic Algorithm, and the two others use the Nearest Neighbors Grouping Genetic Algorithm.
gmysql It proposes a plug-in to manage a MySQL database server. It is the only storage medium currently supported.
gravitation It provides two plug-ins to compute the community and topic descriptions.
featureseval It supplies a plug-in that computes some concept weights based on the document corpus (for example the mutual information).
kmeans It provides two plug-ins that implement a k-Means algorithm to cluster documents and profiles.
langs It proposes various plug-ins for different languages (Arabic, German, Danish, English, Spanish, Finnish, French, Hungarian, Italian, Dutch, Norwegian, Portuguese, Romanian, Russian, Swedish and Turkish). Each language plug-in provides a stemming algorithm and a set of stopwords.
multi-space It supplies several plug-ins; each plug-in implements the similarity measure of the tensor space model for the documents, the profiles and/or the communities.
statfeatures It proposes a plug-in to compute some statistics on the concepts extracted from the documents. This plug-in is only useful for researchers.
statsims It provides a plug-in to compute statistics on the similarities between objects. This plug-in is only useful for researchers.
subslevel It supplies a plug-in that implements a simple method to rank documents and profiles in a community.
sugs It proposes a plug-in that implements a simple document suggestion method for profiles.
textanalyze It proposes different plug-ins that extract tokens from textual content, reduce the indexing space by stemming them and, optionally, filter them. These plug-ins are used when the document descriptions must be computed.
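Since every plug-in project follows the checkout pattern given earlier in this section, the commands for several projects can be generated in a loop. The sketch below assumes that the repository name of each project is identical to the project name listed above; the names SVN_ROOT, PLUGIN_ROOT and plugin_checkout_cmds are hypothetical. The function only prints the commands, it does not contact the server.

```shell
#!/bin/sh
# Server path taken from the checkout commands in this document.
SVN_ROOT="svn://svn.otlet-institute.org/home/subversion"
# Common root directory for all plug-in projects.
PLUGIN_ROOT="${PLUGIN_ROOT:-galilei_plugins}"

# Hypothetical helper: print one checkout command per project name.
# Assumption: the repository name equals the project name.
plugin_checkout_cmds() {
    for project in "$@"; do
        echo "svn co $SVN_ROOT/$project/trunk $PLUGIN_ROOT/$project"
    done
}
```

For example, plugin_checkout_cmds filters langs gmysql prints three checkout commands. After reviewing them, the output can be piped to sh to perform the checkouts.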

6 KGALILEICenter

KGALILEICenter is a KDE-based application intended to help developers monitor a GALILEI instance. You can get the source code from subversion:
svn co svn://svn.otlet-institute.org/home/subversion/kgalileicenter/trunk kgalileicenter

7 UpGALILEI

UpGALILEI is a program that runs a script (using the internal scripting language). It may be useful to launch tests or automatic processes. You can get the source code from subversion:
svn co svn://svn.otlet-institute.org/home/subversion/upgalilei/trunk upgalilei