
ESLM: Improving entity summarization by leveraging language models

ESLM is designed to perform entity summarization efficiently. At its core, it combines a pre-trained language model with an attention mechanism, enabling context-aware processing of the triples that describe an entity; the attention mechanism ensures that the model focuses on the most relevant parts of the input sequences. A Multi-Layer Perceptron (MLP) then scores each triple, and the highest-scoring triples are selected for the entity's summary. To further strengthen the model, we enrich it with knowledge graph embeddings (KGE), which contribute rich semantic information.
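The sketch below illustrates this pipeline under stated assumptions: each triple is verbalized as text, encoded with a pre-trained language model, passed through self-attention over the entity's triples, and scored by an MLP. It is a minimal illustration, not the authors' exact code; the class name `TripleScorer` and the example triples are hypothetical.

```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class TripleScorer(nn.Module):
    """Hypothetical sketch: LM encoder + self-attention + MLP triple scoring."""

    def __init__(self, model_name="bert-base-uncased"):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size  # 768 for bert-base
        # Self-attention lets each triple attend to the entity's other triples.
        self.attention = nn.MultiheadAttention(hidden, num_heads=8)
        # MLP head mapping each triple representation to a relevance score.
        self.mlp = nn.Sequential(
            nn.Linear(hidden, hidden), nn.ReLU(), nn.Linear(hidden, 1)
        )

    def forward(self, input_ids, attention_mask):
        # One tokenized sequence per triple; use [CLS] as the triple embedding.
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0, :]         # (n_triples, hidden)
        seq = cls.unsqueeze(1)                       # (n_triples, 1, hidden)
        attended, _ = self.attention(seq, seq, seq)  # attend across triples
        return self.mlp(attended).reshape(-1)        # one score per triple

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
# Hypothetical verbalized triples for a single entity.
texts = ["Berlin capital of Germany", "Berlin population 3664088"]
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
scores = TripleScorer()(batch["input_ids"], batch["attention_mask"])
print(scores)  # higher score = triple more likely to enter the summary
```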

Dataset

In our experiments, we used two datasets:

  1. ESBM (version 1.2), which includes 175 entities: 150 from DBpedia and 25 from LinkedMDB.
  2. FACES, which consists of 50 entities.

Environment and Dependency

Environment

  • Ubuntu 20.04.2 LTS
  • Python 3.6+
  • PyTorch 1.7.0

Installation

git clone https://github.com/u2018/ESLM.git

ESLM Framework

Currently, the ESLM framework includes the following three language models for entity summarization tasks:

  1. BERT (Bidirectional Encoder Representations from Transformers)
  2. ERNIE (Enhanced Representation through kNowledge Integration)
  3. T5 (Text-to-Text Transfer Transformer)

To use one of these models in the ESLM framework, simply reference it by its lowercase name on the command line, as in the example below:

  1. To use BERT, type bert.
  2. For ERNIE, type ernie.
  3. For T5, type t5.
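
For example, to train and evaluate ESLM with BERT (see the full usage summary below):

python main.py --model bert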

Usage information

main.py [-h] [--train] [--no-train] [--test] [--no-test] [--enrichment] [--no-enrichment] [--model MODEL] [--max_length MAX_LENGTH] [--epochs EPOCHS] [--learning_rate LEARNING_RATE]
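
For reference, a minimal reconstruction of this interface with Python's argparse might look as follows; the defaults shown are assumptions, not values taken from the repository:

```python
import argparse

parser = argparse.ArgumentParser(prog="main.py", description="ESLM entity summarization")
# BooleanOptionalAction (Python >= 3.9) produces paired --flag / --no-flag options.
parser.add_argument("--train", action=argparse.BooleanOptionalAction, default=True)
parser.add_argument("--test", action=argparse.BooleanOptionalAction, default=True)
parser.add_argument("--enrichment", action=argparse.BooleanOptionalAction, default=True)
parser.add_argument("--model", default="bert", choices=["bert", "ernie", "t5"])
parser.add_argument("--max_length", type=int)
parser.add_argument("--epochs", type=int)
parser.add_argument("--learning_rate", type=float)

args = parser.parse_args()
print(args)
```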

ESLM

Training and evaluating the model using the standard settings

python main.py --model t5 --no-enrichment

Evaluating only

python main.py --model t5 --no-enrichment --no-train

ESLM Enrichment with KGE

Training and evaluating the model using the standard settings

python main.py --model t5

Evaluating only

python main.py --model t5 --no-train
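
Conceptually, the enrichment combines pre-trained knowledge graph embeddings of a triple's subject, predicate, and object with its language-model representation before scoring. The fusion below is a hedged sketch under assumed dimensions; ESLM's exact integration scheme is described in the paper.

```python
import torch
import torch.nn as nn

lm_dim, kge_dim = 768, 100  # assumed widths for the LM and KGE vectors

# Project the concatenated [LM; subject; predicate; object] vector back to the
# LM width, then score it with the same style of MLP head used for triples.
fuse = nn.Linear(lm_dim + 3 * kge_dim, lm_dim)
mlp = nn.Sequential(nn.Linear(lm_dim, lm_dim), nn.ReLU(), nn.Linear(lm_dim, 1))

lm_vec = torch.randn(1, lm_dim)  # LM embedding of one verbalized triple
s, p, o = (torch.randn(1, kge_dim) for _ in range(3))  # KGE vectors (e.g. TransE)
enriched = fuse(torch.cat([lm_vec, s, p, o], dim=-1))
print(mlp(enriched))  # relevance score for the KGE-enriched triple
```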

How to Cite

@inproceedings{ESLM2024,
  author = {Firmansyah, Asep Fajar and Moussallem, Diego and Ngonga Ngomo, Axel-Cyrille},
  booktitle = {The Semantic Web},
  title = {ESLM: Improving Entity Summarization by Leveraging Language Models},
  url = {https://doi.org/10.1007/978-3-031-60626-7_9},
  year = 2024
}

Contact

If you have any questions or feedback, feel free to contact us at asep.fajar.firmansyah@upb.de.
