Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 5 additions & 5 deletions .classpath
Original file line number Diff line number Diff line change
Expand Up @@ -21,10 +21,10 @@
<attribute name="maven.pomderived" value="true"/>
</attributes>
</classpathentry>
<classpathentry kind="lib" path="target/Libraries/align.jar"/>
<classpathentry kind="lib" path="target/Libraries/alignsvc.jar"/>
<classpathentry kind="lib" path="target/Libraries/ontowrap.jar"/>
<classpathentry kind="lib" path="target/Libraries/procalign.jar"/>
<classpathentry kind="lib" path="target/Libraries/spmf.jar"/>
<classpathentry kind="lib" path="lib/align.jar"/>
<classpathentry kind="lib" path="lib/alignsvc.jar"/>
<classpathentry kind="lib" path="lib/ontowrap.jar"/>
<classpathentry kind="lib" path="lib/procalign.jar"/>
<classpathentry kind="lib" path="lib/spmf.jar"/>
<classpathentry kind="output" path="target/classes"/>
</classpath>
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1 +1,2 @@
/.apt_generated/
target/
23 changes: 21 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,9 +10,28 @@ An automatic data linking tool developed by [DOREMUS][3].
3. **Indexing and Instance matching**: We apply standard NLP techniques to index the instance profiles by using a term frequency vector model. The threshold value of Legato applies to the similarity computed at this stage. Low thresholds are recommended to ensure high recall (default 0.2).
4. **Link repairing**: A post-processing step to repair erroneous links generated in the matching step by clustering highly similar instances together and applying a key-identification adn ranking algorthims.

<b> How to run <i> Legato </i> </b>
<b> How to build <i> Legato </i> </b>
========
For running <b> <i> Legato </i> </b> through the GUI, please run the "main.java" class in the "legato" package. Then, select the source, the target and a reference alignement (if availble). Then, you can choose between two treatment's modes:
### Using Maven
The first time you build this project, run the following command to generate Maven dependencies:
```
mvn initialize
```
Then, everytime you want to build:
```
mvn clean package
```
To run:
```
mvn exec:java
```

### Manually
For running <b> <i> Legato </i> </b> through the GUI, please run the "main.java" class in the "legato" package.

<b> How to use <i> Legato </i> </b>
========
Select the source, the target and a reference alignement (if availble). Then, you can choose between two treatment's modes:
- Automatic allows to filter resources by fixing only the classes to compare.
- Manual allows to filter resources by classe and comparate by a set of selected properties.
The field "threshold value" allows to define Legato's threshold in the Instance matching step. Legato will consider only resources with a similarity higher than the threshold value.
Expand Down
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
163 changes: 163 additions & 0 deletions pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,134 @@
<target>1.8</target>
</configuration>
</plugin>
<plugin>
<groupId>org.codehaus.mojo</groupId>
<artifactId>exec-maven-plugin</artifactId>
<version>1.6.0</version>
<executions>
<execution>
<goals>
<goal>java</goal>
</goals>
</execution>
</executions>
<configuration>
<mainClass>legato.Main</mainClass>
</configuration>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-shade-plugin</artifactId>
<version>3.1.1</version>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>shade</goal>
</goals>
</execution>
</executions>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-jar-plugin</artifactId>
<version>2.4</version>
<configuration>
<archive>
<manifest>
<mainClass>legato.Main</mainClass>
<addClasspath>true</addClasspath>
<classpathPrefix>libs/</classpathPrefix>
</manifest>
</archive>
</configuration>
</plugin>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-install-plugin</artifactId>
<version>2.5.2</version>
<executions>
<!-- SPMF -->
<execution>
<id>install-spmf</id>
<phase>initialize</phase>
<goals>
<goal>install-file</goal>
</goals>
<configuration>
<file>lib/spmf.jar</file>
<groupId>ca.pfv.spmf</groupId>
<artifactId>SPMF</artifactId>
<version>1.0</version>
<packaging>jar</packaging>
</configuration>
</execution>

<!-- align -->
<execution>
<id>install-align</id>
<phase>initialize</phase>
<goals>
<goal>install-file</goal>
</goals>
<configuration>
<file>lib/align.jar</file>
<groupId>org.semanticweb.owl</groupId>
<artifactId>align</artifactId>
<version>1.0</version>
<packaging>jar</packaging>
</configuration>
</execution>

<!-- alignsvc -->
<execution>
<id>install-alignsvc</id>
<phase>initialize</phase>
<goals>
<goal>install-file</goal>
</goals>
<configuration>
<file>lib/alignsvc.jar</file>
<groupId>org.semanticweb.owl</groupId>
<artifactId>alignsvc</artifactId>
<version>1.0</version>
<packaging>jar</packaging>
</configuration>
</execution>

<!-- ontowrap -->
<execution>
<id>install-ontowrap</id>
<phase>initialize</phase>
<goals>
<goal>install-file</goal>
</goals>
<configuration>
<file>lib/ontowrap.jar</file>
<groupId>org.semanticweb.owl</groupId>
<artifactId>ontowrap</artifactId>
<version>1.0</version>
<packaging>jar</packaging>
</configuration>
</execution>

<!-- proc-align -->
<execution>
<id>install-proc-align</id>
<phase>initialize</phase>
<goals>
<goal>install-file</goal>
</goals>
<configuration>
<file>lib/procalign.jar</file>
<groupId>org.semanticweb.owl</groupId>
<artifactId>proc-align</artifactId>
<version>1.0</version>
<packaging>jar</packaging>
</configuration>
</execution>
</executions>
</plugin>
</plugins>
</build>

Expand Down Expand Up @@ -119,6 +247,41 @@
</dependency>


<!-- lib/spmf.jar -->
<dependency>
<groupId>ca.pfv.spmf</groupId>
<artifactId>SPMF</artifactId>
<version>1.0</version>
</dependency>

<!-- lib/align.jar -->
<dependency>
<groupId>org.semanticweb.owl</groupId>
<artifactId>align</artifactId>
<version>1.0</version>
</dependency>

<!-- lib/alignsvc.jar -->
<dependency>
<groupId>org.semanticweb.owl</groupId>
<artifactId>alignsvc</artifactId>
<version>1.0</version>
</dependency>

<!-- lib/ontowrap.jar -->
<dependency>
<groupId>org.semanticweb.owl</groupId>
<artifactId>ontowrap</artifactId>
<version>1.0</version>
</dependency>

<!-- lib/procalign.jar -->
<dependency>
<groupId>org.semanticweb.owl</groupId>
<artifactId>proc-align</artifactId>
<version>1.0</version>
</dependency>

</dependencies>

</project>
2 changes: 1 addition & 1 deletion src/legato/indexer/VectorGenerator.java
Original file line number Diff line number Diff line change
Expand Up @@ -42,7 +42,7 @@ public DocVector[] GetDocumentVectors() throws IOException {
for (int docId = 0; docId < totalNoOfDocumentInIndex; docId++) {
Terms vector = indexReader.getTermVector(docId, LEGATO.getInstance().FIELD_CONTENT);
TermsEnum termsEnum = null;
//Verification integrit� de vector
//Verification integrité de vector
try
{
termsEnum = vector.iterator(termsEnum);
Expand Down
124 changes: 0 additions & 124 deletions target/Libraries/pom.xml

This file was deleted.

1 change: 0 additions & 1 deletion target/classes/.gitignore

This file was deleted.

Binary file removed target/classes/legato/LEGATO.class
Binary file not shown.
Binary file removed target/classes/legato/Main.class
Binary file not shown.
Binary file removed target/classes/legato/cluster/Cluster.class
Binary file not shown.
Binary file removed target/classes/legato/cluster/ClusterList.class
Binary file not shown.
Binary file removed target/classes/legato/cluster/Clustering.class
Binary file not shown.
Binary file removed target/classes/legato/cluster/DocVec.class
Binary file not shown.
Binary file not shown.
Binary file removed target/classes/legato/cluster/Vecteur.class
Binary file not shown.
Binary file removed target/classes/legato/document/CBDBuilder.class
Binary file not shown.
Binary file removed target/classes/legato/document/DocumentBuilder.class
Binary file not shown.
Binary file removed target/classes/legato/document/FileManager.class
Binary file not shown.
Binary file removed target/classes/legato/gui/DatasetFileChooser$1.class
Binary file not shown.
Binary file removed target/classes/legato/gui/DatasetFileChooser.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$1.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$10.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$11.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$12.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$13.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$14.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$2.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$3.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$4.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$5.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$6.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$7.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$8.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI$9.class
Binary file not shown.
Binary file removed target/classes/legato/gui/GUI.class
Binary file not shown.
Binary file removed target/classes/legato/indexer/AllTerms.class
Binary file not shown.
Binary file removed target/classes/legato/indexer/DocVector.class
Binary file not shown.
Binary file removed target/classes/legato/indexer/IndexOpener.class
Binary file not shown.
Binary file removed target/classes/legato/indexer/Indexer.class
Binary file not shown.
Binary file removed target/classes/legato/indexer/VectorGenerator.class
Binary file not shown.
Binary file removed target/classes/legato/keys/KeysClassifier.class
Binary file not shown.
Binary file removed target/classes/legato/keys/SILK.class
Binary file not shown.
Binary file removed target/classes/legato/keys/Sakey.class
Binary file not shown.
Binary file removed target/classes/legato/keys/SilkConfig.class
Binary file not shown.
Binary file removed target/classes/legato/keys/SupportMergedKeys.class
Binary file not shown.
Binary file removed target/classes/legato/keys/def/Key.class
Binary file not shown.
Binary file removed target/classes/legato/keys/def/KeyList.class
Binary file not shown.
Binary file removed target/classes/legato/match/Align.class
Binary file not shown.
Binary file removed target/classes/legato/match/Map.class
Binary file not shown.
Binary file removed target/classes/legato/match/MapList.class
Binary file not shown.
Binary file removed target/classes/legato/match/Matchifier.class
Binary file not shown.
Binary file not shown.
Binary file removed target/classes/legato/rdf/ModelManager.class
Binary file not shown.
Binary file removed target/classes/legato/rdf/PathManager$Path.class
Binary file not shown.
Binary file removed target/classes/legato/rdf/PathManager.class
Binary file not shown.
Binary file removed target/classes/legato/rdf/Prop.class
Binary file not shown.
Binary file removed target/classes/legato/rdf/PropList.class
Binary file not shown.
Binary file removed target/classes/legato/utils/A.class
Binary file not shown.
Binary file removed target/classes/legato/utils/PropertyHandler.class
Binary file not shown.
Binary file removed target/classes/legato/utils/Stemmer.class
Binary file not shown.
Binary file removed target/classes/legato/utils/StopWords.class
Binary file not shown.
Binary file removed target/classes/legato/utils/ValueComparator.class
Binary file not shown.
Binary file removed target/legato-0.0.1-SNAPSHOT.jar
Binary file not shown.
5 changes: 0 additions & 5 deletions target/maven-archiver/pom.properties

This file was deleted.

Loading