Skip to content

Latest commit

 

History

History
156 lines (115 loc) · 5.4 KB

README.md

File metadata and controls

156 lines (115 loc) · 5.4 KB

A Literature Abstract Annotation Tool for Argumentation Mining

In this repository, we share a literature abstract annotation tool for argumentation mining. It is also the annotation web tool mentioned in the paper "A Structure-Aware Argument Encoder for Literature Discourse Analysis". . You can watch a tutorial on how to use the tool at https://www.youtube.com/watch?v=xILPAn5EZH4.

The tool is a complete annotation suite based on Django that includes the following features.

  • User (Annotator)
    • login
    • register
    • sign-out
    • setting
  • Reviewer
    • login
    • rectification
  • Task distribution
    • system distribution
    • user application
  • Argumentation Mining
    • argument type classification
    • argument relation recognition
  • Admin
  • Leaderboard
  • Search Engine

Our annotation process is that users (annotators) need to register and log in to their accounts first, and then managers can assign corresponding jobs to any user, and users who meet the permissions can also apply jobs freely. Reviewers can make corrections on major disagreements. User permissions can be manually updated by the administrator in the admin site, and all annotated records can be viewed.

1 2 3 4 5 6

Requirements

  • python==3.7
  • django==3.0
  • nltk==3.7
  • pymysql==1.0.2
  • tqdm==4.64.0
  • numpy
  • sklearn
# create conda environment
conda create -n django3 python=3.7
source activate django3

# install packages
pip install -r requirements.txt

Create Database

Please first install mysql or other database for storing user data, we take mysql for example.

DROP DATABASE if exists amcl;
CREATE DATABASE if not exists amcl DEFAULT CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci;

Django Setting

please set up database and e-mail in amcl/settings.py. Besides, you also need to set the IP address and port in annotation/utils/mail.py, which should be consistent with the project deployment.

Django Migration

python manage.py makemigrations
python manage.py migrate
python manage.py createsuperuser

Django Server

python manage.py runserver 127.0.0.1:8888

When everything is ready, you can visit annotation homepage in http://127.0.0.1:8888/annotation, and admin homepage in http://127.0.0.1:8888/admin/. Based on this code, you have the flexibility to add tasks or functions that you want to add.

Task Assignment

  • Insert Data
python insert_data.py
  • Create Virtual User
python create_user.py
  • Apply Job

You can log in the website with the created test user accounts now.

# annotator
username/password: test1/test1

# reviewer
username/password: test2/test2

Annotator can also apply for jobs freely, and administrators can modify the assign function in annotation/utils/assign to change the distributed logic.

Reviewer handles disagreements in multiple annotators, and there is an additional "review" column on their homepage.

  • Assign Job to User (optional)

You may want to assign jobs to users manually, you can do it as follows.

from annotation.utils.job import assign

# assign 10 abstracts to user with uid 0
assign(uid=1, job_size=10, job_type=0)

We provide a service website for testing, which host on http://210.16.187.147:60026/amcl/. We created two new users, annotator and reviewer, where the username and password are both annotator or reviewer.

For some reasons, this website can only be accessed through Fudan University's intranet.

How to Cite

If you extend or use this work, please cite the paper where it was introduced.

@inproceedings{li-etal-2022-structure,
    title = "A Structure-Aware Argument Encoder for Literature Discourse Analysis",
    author = "Li, Yinzi  and
      Chen, Wei  and
      Wei, Zhongyu  and
      Huang, Yujun  and
      Wang, Chujun  and
      Wang, Siyuan  and
      Zhang, Qi  and
      Huang, Xuanjing  and
      Wu, Libo",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.619",
    pages = "7093--7098",
    abstract = "Existing research for argument representation learning mainly treats tokens in the sentence equally and ignores the implied structure information of argumentative context. In this paper, we propose to separate tokens into two groups, namely framing tokens and topic ones, to capture structural information of arguments. In addition, we consider high-level structure by incorporating paragraph-level position information. A novel structure-aware argument encoder is proposed for literature discourse analysis. Experimental results on both a self-constructed corpus and a public corpus show the effectiveness of our model. Resources are available at https://github.com/lemuria-wchen/SAE.",
}

Get Data

In our paper, we introduce a new argumentation mining corpus that focuses on the field of climate change, which we host in our new repository.