This is the code for the paper "Lin: Unsupervised Extraction of Tasks from Textual Communication", presented at COLING 2020. You can find the paper at https://www.aclweb.org/anthology/2020.coling-main.164.
In our experiments, Lin outperforms strong baselines, namely BERT, FastText, and Universal Sentence Encoder + SVM, across multiple domains.
### Results on email dataset

| Model | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|
| SVM + USE | 89.35 | 54.40 | 82.42 | 65.51 |
| FastText | 69.53 | 69.95 | 68.62 | 69.25 |
| BERT | 89.17 | 74.82 | 82.85 | 78.58 |
| Lin Syntax | 93.34 | 74.48 | 69.80 | 72.06 |
| Lin Semantics | 91.08 | 58.62 | 93.36 | 72.01 |
| Lin | 95.36 | 83.82 | 77.29 | 80.42 |
### Results on chat dataset (baselines trained on email dataset)

| Model | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|
| SVM + USE | 83.42 | 70.09 | 72.11 | 71.09 |
| FastText | 78.80 | 71.66 | 41.34 | 52.43 |
| BERT | 92.68 | 85.32 | 89.42 | 87.32 |
| Lin Syntax | 92.12 | 87.12 | 84.61 | 85.85 |
| Lin Semantics | 89.40 | 74.07 | 96.15 | 83.68 |
| Lin | 94.85 | 94.73 | 86.53 | 90.45 |
- Download and extract this repo.
- Install dependencies:
  ```
  pip install -r requirements.txt
  ```
- Download the stanfordnlp English models:
  ```python
  import stanfordnlp
  stanfordnlp.download('en')
  ```
- Download VerbNet 3 and the stopwords corpus in NLTK:
  ```python
  import nltk
  nltk.download('verbnet3')
  nltk.download('stopwords')
  ```
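As an optional sanity check that these resources are in place, a short illustrative snippet (not part of this repo; the example sentence and the `nltk.data.find` paths are assumptions):

```python
# Optional sanity check that the downloads above succeeded.
import stanfordnlp
from nltk.data import find

nlp = stanfordnlp.Pipeline()                     # loads the English models downloaded above
doc = nlp("Please send the report by Friday.")   # hypothetical example sentence
doc.sentences[0].print_dependencies()            # Lin relies on dependency parses like this

find('corpora/verbnet3')    # raises LookupError if the VerbNet 3 download failed
find('corpora/stopwords')   # likewise for the stopwords corpus
```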
To extract tasks from one of the provided datasets, run:

```
python main.py --dataset=<DATASET>
```

`<DATASET>` can be either `chat` or `email`. On the provided dataset, the code extracts tasks, stores them in the same Excel file under the column `Tasks`, and reports accuracy, precision, recall, and F1 score.
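The stored tasks can then be inspected with pandas, for example (illustrative; the path below is a placeholder for wherever the dataset's Excel file lives in the repo):

```python
# Illustrative: read back the Excel file that main.py wrote tasks into.
import pandas as pd

df = pd.read_excel('data/email.xlsx')        # placeholder path; use the actual dataset file
print(df[['Sentence', 'Tasks']].head())      # sentences alongside their extracted tasks
```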
To run Lin on a custom dataset:

```
python main.py --dataset=<path to your dataset>
```

Please note that your dataset must be an Excel file with a `Sentence` column containing the sentences from which tasks are to be extracted. Running Lin on a custom dataset also extracts and stores tasks the same way, but does not report accuracy, precision, recall, or F1 score.
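A minimal sketch of preparing such a file, assuming pandas with an Excel writer (e.g., openpyxl) is installed; the file name and sentences are arbitrary:

```python
# Build a custom dataset in the expected format: an Excel file
# with a 'Sentence' column holding the text to extract tasks from.
import pandas as pd

sentences = [
    "Can you review the draft by Monday?",
    "I will set up the meeting room before the call.",
]
pd.DataFrame({'Sentence': sentences}).to_excel('my_dataset.xlsx', index=False)
```

Then run `python main.py --dataset=my_dataset.xlsx`.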
To train a baseline model:

```
python train.py --model=<MODEL>
```

`<MODEL>` can be `bert`, `fasttext`, or `svm`. The selected model is trained on the email dataset using 5-fold cross-validation, and the fold model with the best F1 score is stored in `data/TrainedModels`. The average accuracy, precision, recall, and F1 score across folds are reported. Training a model multiple times will overwrite the previously trained model.
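The fold-selection logic described above looks roughly like the following sketch (illustrative, not the repository's code; `X`, `y`, `make_model`, and `save_model` are hypothetical stand-ins):

```python
# Sketch: 5-fold cross-validation, keeping the fold model with the best F1.
import numpy as np
from sklearn.model_selection import KFold
from sklearn.metrics import f1_score

def select_best_model(X, y, make_model, save_model):
    best_f1, best_model, scores = -1.0, None, []
    for train_idx, test_idx in KFold(n_splits=5, shuffle=True).split(X):
        model = make_model()                      # hypothetical: builds bert/fasttext/svm
        model.fit(X[train_idx], y[train_idx])
        f1 = f1_score(y[test_idx], model.predict(X[test_idx]))
        scores.append(f1)
        if f1 > best_f1:
            best_f1, best_model = f1, model
    save_model(best_model, 'data/TrainedModels')  # hypothetical persistence helper
    return np.mean(scores)                        # average metric, as reported
```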
To evaluate a trained baseline:

```
python evaluate.py --model=<MODEL> --dataset=<DATASET>
```

`<DATASET>` can be either `chat` or `email`. The metrics reported are accuracy, precision, recall, and F1 score.
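For reference, these four metrics follow the standard scikit-learn definitions (illustrative; `y_true` and `y_pred` are placeholders):

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0]   # placeholder gold labels (task / no task)
y_pred = [1, 0, 0, 1, 1]   # placeholder model predictions
print(accuracy_score(y_true, y_pred))   # 0.6
print(precision_score(y_true, y_pred))  # 2/3 ≈ 0.67
print(recall_score(y_true, y_pred))     # 2/3 ≈ 0.67
print(f1_score(y_true, y_pred))         # ≈ 0.67
```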
If you use Lin in your paper, please cite us:
```bibtex
@inproceedings{diwanji-etal-2020-lin,
    title = "Lin: Unsupervised Extraction of Tasks from Textual Communication",
    author = "Diwanji, Parth and
      Guo, Hui and
      Singh, Munindar and
      Kalia, Anup",
    booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
    month = dec,
    year = "2020",
    address = "Barcelona, Spain (Online)",
    publisher = "International Committee on Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.coling-main.164",
    pages = "1815--1819",
    abstract = "Commitments and requests are a hallmark of collaborative communication, especially in team settings. Identifying specific tasks being committed to or request from emails and chat messages can enable important downstream tasks, such as producing todo lists, reminders, and calendar entries. State-of-the-art approaches for task identification rely on large annotated datasets, which are not always available, especially for domain-specific tasks. Accordingly, we propose Lin, an unsupervised approach of identifying tasks that leverages dependency parsing and VerbNet. Our evaluations show that Lin yields comparable or more accurate results than supervised models on domains with large training sets, and maintains its excellent performance on unseen domains.",
}
```