CourseProject

This project is part of the CS410 Fall21 course.

Project Topic

Retrieving & ranking customer communication/complaints for optimal routing & resolution of customer concerns

Description

This project delivers a demo tool that takes a public dataset of 2000 complaints about a telecom company and ranks them using ranking rules and query retrieval. The dataset covers complaints from 2000 to 2006 about Internet, TV/Cable, Phone and Service. For demo purposes, the user enters a year and month, and the tool ranks the complaints from that period. The tool uses pre-configured query terms and product/sentiment/churn weights to rank and prioritize complaints.

Below are the Product and Sentiment/Churn categories and the query terms used for each.

{"Internet":"internet xfinity"},{"TV":"tv television cable channel dish"},{"Phone":"phone cellphone wireless voice"},{"Bundle":"bundle"},{"Service":"service outage appointment schedule called hold times"}

{"Negative":"disappointing multiple annoying worst negative suck terrible rude ignorant"},{"Churn":"long time years loyal used to changing cutting"}
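The configuration above can be loaded into a lookup table before any ranking happens. The following is a minimal sketch, assuming the terms are kept as comma-separated JSON objects exactly as listed; the names `parse_terms`, `product_terms` and `signal_terms` are hypothetical and do not come from main.py.

```python
import json

# Query-term configuration copied from the lists above.
PRODUCT_TERMS_RAW = (
    '{"Internet":"internet xfinity"},'
    '{"TV":"tv television cable channel dish"},'
    '{"Phone":"phone cellphone wireless voice"},'
    '{"Bundle":"bundle"},'
    '{"Service":"service outage appointment schedule called hold times"}'
)
SIGNAL_TERMS_RAW = (
    '{"Negative":"disappointing multiple annoying worst negative suck terrible rude ignorant"},'
    '{"Churn":"long time years loyal used to changing cutting"}'
)


def parse_terms(raw):
    """Turn '{"A":"x y"},{"B":"z"}' into {"A": ["x", "y"], "B": ["z"]}."""
    merged = {}
    # Wrapping the text in brackets makes it a valid JSON array of objects.
    for entry in json.loads("[" + raw + "]"):
        for label, terms in entry.items():
            merged[label] = terms.split()
    return merged


product_terms = parse_terms(PRODUCT_TERMS_RAW)  # hypothetical name
signal_terms = parse_terms(SIGNAL_TERMS_RAW)    # hypothetical name
```

Each label then maps to a list of tokens that can be passed to a query ranker as a tokenized query.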

Ranking Approach used

The tool uses the product terms to identify the related product(s), and the product weights to generate a product score for each complaint.
The tool tags complaints as Negative (sentiment) and Churn based on the word terms above and two weights: a sentiment weight and a churn weight.
A final score for the complaint is then computed with the scoring function: Final score = sum of product scores + churn weight * churn score + sentiment weight * sentiment score
The complaints for the selected year and month are ranked by this final score.
The ranked complaints are written to an output file with their product tags and scores for reference.
In addition to writing the output file, the tool displays the top 5 complaints for the year and month entered, along with their scores.
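The scoring function and the top-5 selection described above can be sketched as follows. This is an illustration under stated assumptions, not the code in main.py: the function names, the dictionary layout, the weights and all numeric values are made up, and the product scores are assumed to already include their product weights.

```python
def final_score(product_scores, churn_score, sentiment_score,
                churn_weight, sentiment_weight):
    """Final score = sum of product scores
    + churn weight * churn score + sentiment weight * sentiment score."""
    return (sum(product_scores.values())
            + churn_weight * churn_score
            + sentiment_weight * sentiment_score)


def top_complaints(complaints, churn_weight, sentiment_weight, k=5):
    """Return the k highest (score, id) pairs, best first."""
    scored = [
        (final_score(c["product_scores"], c["churn_score"],
                     c["sentiment_score"], churn_weight, sentiment_weight),
         c["id"])
        for c in complaints
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Made-up complaints and scores, for illustration only.
complaints = [
    {"id": "A", "product_scores": {"Internet": 1.0, "Service": 0.5},
     "churn_score": 0.0, "sentiment_score": 1.0},
    {"id": "B", "product_scores": {"TV": 0.5},
     "churn_score": 2.0, "sentiment_score": 0.0},
    {"id": "C", "product_scores": {"Phone": 0.25},
     "churn_score": 0.0, "sentiment_score": 0.0},
]

# Hypothetical weights; the tool's configured values are not shown here.
ranked = top_complaints(complaints, churn_weight=2.0, sentiment_weight=1.5)
for score, complaint_id in ranked:
    print(complaint_id, score)
```

With these made-up inputs, complaint B ranks first because its churn score is multiplied by the churn weight, which outweighs A's higher product score.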

Opportunities to Improve

There is an opportunity to redesign or improve the scoring. Because the tool uses a BM25 query ranker, which normalizes for document length, a long complaint (more total words) that uses the query terms only a few times gets a lower product score, while a short complaint (fewer total words) that uses the query terms often gets a higher one.
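The length effect can be seen with a small self-contained BM25 sketch. This is a textbook variant written for illustration, with commonly used defaults `k1=1.5` and `b=0.75` and an IDF that is always positive; it is not the rank_bm25 implementation the tool calls, and the sample complaints are invented.

```python
import math


def bm25_scores(corpus, query, k1=1.5, b=0.75):
    """Score every tokenized document in corpus against the query tokens."""
    n_docs = len(corpus)
    avgdl = sum(len(doc) for doc in corpus) / n_docs
    scores = []
    for doc in corpus:
        score = 0.0
        for term in query:
            df = sum(1 for d in corpus if term in d)  # document frequency
            if df == 0:
                continue
            idf = math.log((n_docs - df + 0.5) / (df + 0.5) + 1.0)
            tf = doc.count(term)
            # Longer-than-average documents enlarge the denominator.
            length_norm = k1 * (1 - b + b * len(doc) / avgdl)
            score += idf * tf * (k1 + 1) / (tf + length_norm)
        scores.append(score)
    return scores


# Invented complaints: both mention "internet" exactly once.
short_doc = "internet outage again".split()
long_doc = ("i called support many times last week about billing "
            "and then the internet went down").split()
other_doc = "tv channel missing".split()

scores = bm25_scores([short_doc, long_doc, other_doc], ["internet"])
short_score, long_score, other_score = scores
print(short_score > long_score)  # the shorter complaint scores higher
```

Both complaints contain the query term once, yet the shorter one scores higher purely because of length normalization. Lowering `b` toward 0 weakens that effect, which is one possible direction for the redesign mentioned above.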

To run this tool

Download the data and src folders and run main.py. The tool uses the rank_bm25 library, so install it with pip (pip install rank_bm25) along with the other packages imported in main.py.
