word2vec_medical_record

This project aim to utilize neural network to analyze sequence input from structured free-text medical records. At this stage, models are trained to give categorical output and optimized with categorical loss. In addition, a word embedding can be generated during training.

Antoine Bordes, et al. "Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing." AISTATS(2012) https://www.hds.utc.fr/~bordesan/dokuwiki/lib/exe/fetch.php?id=en%3Apubli&cache=cache&media=en:bordes12aistats.pdf

Alexis Conneau, et al. "Very Deep Convolutional Networks for Natural Language Processing." (2016) https://arxiv.org/abs/1606.01781

Preprocessing the original data

The original data was stored in .xls document, extracted from a medical record database. We prepare our training data with the following procedures:

Remove any training data with missing item
Remove non-english characters
Remove all next-line character '\r\n'
Assign an UUID to each set of training data.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.ipynb_checkpoints		.ipynb_checkpoints
20170101_430.xls		20170101_430.xls
20170431_831.xls		20170431_831.xls
201709_1231.xls		201709_1231.xls
20180101_0430.xls		20180101_0430.xls
CT_protocols_(2018).pdf		CT_protocols_(2018).pdf
README.md		README.md
data_processer.ipynb		data_processer.ipynb
proccessed.xls		proccessed.xls

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

word2vec_medical_record

Preprocessing the original data

About

Releases

Packages

Contributors 2

Languages

yeahshow/word2vec_medical_record

Folders and files

Latest commit

History

Repository files navigation

word2vec_medical_record

Preprocessing the original data

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages