TortugaAttack committed Mar 13, 2020
2 parents 1676107 + 2cc0ea5 commit 876beab
Showing 5 changed files with 146 additions and 0 deletions.
24 changes: 24 additions & 0 deletions .github/workflows/maven.yml
@@ -0,0 +1,24 @@
# This workflow will build a Java project with Maven
# For more information see: https://help.github.com/actions/language-and-framework-guides/building-and-testing-java-with-maven

name: Java CI with Maven

on:
  push:
    branches: [ master ]
  pull_request:
    branches: [ master ]

jobs:
  build:

    runs-on: ubuntu-latest

    steps:
    - uses: actions/checkout@v2
    - name: Set up JDK 1.11
      uses: actions/setup-java@v1
      with:
        java-version: 1.11
    - name: Build with Maven
      run: mvn -B package --file pom.xml
30 changes: 30 additions & 0 deletions .github/workflows/mavenpublish.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
# This workflow will build a package using Maven and then publish it to GitHub packages when a release is created
# For more information see: https://github.com/actions/setup-java#apache-maven-with-a-settings-path

name: Maven Package

on:
  release:
    types: [created]

jobs:
  build:

    runs-on: ubuntu-latest

    steps:
    - uses: actions/checkout@v2
    - name: Set up JDK 1.8
      uses: actions/setup-java@v1
      with:
        java-version: 1.8
        server-id: github # Value of the distributionManagement/repository/id field of the pom.xml
        settings-path: ${{ github.workspace }} # location for the settings.xml file

    - name: Build with Maven
      run: mvn -B package --file pom.xml

    - name: Publish to GitHub Packages Apache Maven
      run: mvn deploy -s $GITHUB_WORKSPACE/settings.xml
      env:
        GITHUB_TOKEN: ${{ github.token }}
1 change: 1 addition & 0 deletions CNAME
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
rdfrepair.org
90 changes: 90 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,90 @@
![Logo][logo]

[logo]: https://github.com/dice-group/GraphCompression/raw/develop/logo.png "RDFRePair Logo"

RDFRePair compresses RDF graphs using the gRePair algorithm.
It additionally provides an even more compact compression using KD2 trees.

# Download and install

### General

Download standalone.zip from the latest release.
```bash
wget https://.../standalone.zip
unzip standalone.zip
cd rdfrepair
```

### Ubuntu and Debian:

Download the deb package and install it using:
```
not yet ready
```

### Arch Linux:

NOT READY YET!
```bash
yaourt -S rdfrepair
```

# Usage

Compression using KD2 Trees
```bash
rdfrepair -c -tkd2 IN_FILE.ttl OUT_FILE.grp
```

Compression using gRePair algorithm
```bash
rdfrepair -c -tkd2 -digrams IN_FILE.ttl OUT_FILE.grp
```

Decompression (for both KD2 and the gRePair algorithm) to TURTLE format (N-TRIPLE, RDF/XML, or TURTLE are possible)
```bash
rdfrepair -d -out TURTLE -tkd2 IN_FILE.grp OUT_FILE.ttl
```

To adjust the RAM usage to 4GB use the following:

```
export RDF_RE_PAIR_XMX=4g
```
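
Putting the commands above together, a round trip (compress, then decompress) might look like the following minimal sketch. It assumes `rdfrepair` is on the `PATH`. The function name `kd2_roundtrip` and the derived file names are hypothetical, and only the flags shown above are used.

```bash
# Hypothetical helper: compress a Turtle file with KD2 trees, then decompress it again.
# Assumes rdfrepair is installed and on the PATH; uses only the flags shown above.
kd2_roundtrip() {
  local in_file="$1"                              # e.g. data.ttl
  local grp_file="${in_file%.ttl}.grp"            # compressed output (name is an assumption)
  local out_file="${in_file%.ttl}.roundtrip.ttl"  # decompressed output (name is an assumption)

  export RDF_RE_PAIR_XMX=4g                       # RAM setting from above

  # Decompress only if compression succeeded.
  rdfrepair -c -tkd2 "$in_file" "$grp_file" &&
    rdfrepair -d -out TURTLE -tkd2 "$grp_file" "$out_file"
}
```

Calling `kd2_roundtrip data.ttl` would then write `data.grp` and `data.roundtrip.ttl` next to the input file.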

# Querying KD2 with Fuseki

### Prepacked

Download fuseki-kd2.tar.gz from the latest release

```bash
wget http://.../fuseki-kd2.tar.gz
tar -xzvf fuseki-kd2.tar.gz
cd apache-jena-fuseki-3.13.1/
cp YOUR_KD2_COMPRESSED_FILE file.grp
./fuseki-server --config=kd2_example.ttl
```


### From scratch

1. Download the latest [Apache Jena Fuseki](https://jena.apache.org/download/#jena-fuseki).
2. Copy `rdfrepair-1.0.jar` from the latest release to `run/extra/` in the Fuseki folder.
3. Download the [fuseki_example.ttl](https://github.com/dice-group/GraphCompression/blob/develop/fuseki_example.ttl).
4. Change the `fileName` value on the last line to the file name of your KD2-compressed file.
5. Start Fuseki using `fuseki-server --config=fuseki_example.ttl`.

You can reach the read-only service at [http://localhost:3030/kd2/sparql](http://localhost:3030/kd2/sparql)

Depending on the size of the compressed file, the service may take a while to load.
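
Once the service is up, it can be queried over the standard SPARQL protocol. The following is a minimal sketch using `curl`, assuming the server runs locally at the address given above. The function name `kd2_query` and the variable `KD2_ENDPOINT` are illustrative and not part of rdfrepair.

```bash
# Endpoint of the read-only service described above.
KD2_ENDPOINT="http://localhost:3030/kd2/sparql"

# Hypothetical helper: send a SPARQL query via GET and ask for JSON results.
kd2_query() {
  curl -s -G "$KD2_ENDPOINT" \
    -H 'Accept: application/sparql-results+json' \
    --data-urlencode "query=$1"
}
```

For example, `kd2_query 'SELECT * WHERE { ?s ?p ?o } LIMIT 10'` should print up to ten result bindings as SPARQL JSON.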


# Download precompressed files
Name | Size | Triples | Link
------------ | ------------- | ------------- | -------------
ArchivesHub | | 1.361.815 |
Jamendo | | 1.047.950 |
ScholarlyData (rich) | | 859.840 |
DBLP 2017 | | 88.150.324 |
1 change: 1 addition & 0 deletions _config.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
theme: jekyll-theme-slate
