This repo is a collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Directions, which is the first to comprehensively review time series analysis in educational contexts, covering mainstream methods such as forecasting, classification, clustering, and anomaly detection to the best of our knowledge. We will regularly collect and update the latest resources in this repo.
Fig. 1 The hierarchical categorization of the key components of time series analysis in education.
Time Series Analysis for Education: Methods, Applications, and Future Directions
Authors: Shengzhong Mao, Chaoli Zhang, Yichi Song, Jindong Wang, Xiao-Jun Zeng, Zenglin Xu (IEEE Senior Member), and Qingsong Wen (IEEE Senior Member)
Fig. 2 Comparison of our survey with existing educational surveys (last 5 years).
The paper introduces and elaborates on four primary time series methods—forecasting, classification, clustering, and anomaly detection—highlighting their specific applications in educational contexts, as presented in Fig. 3. By examining how these methods are utilized, we can discover valuable applications and guide future research in this evolving field.
Fig. 3 The overview of time series analysis in educational contexts.
If you find this repository helpful for your work, please kindly cite our survey paper.
@article{xxx,
title={Time Series Analysis for Education: Methods, Applications, and Future Directions},
author={xxx},
journal={arXiv preprint arXiv:xxx},
year={2024}
}
- Educational data mining
- Educational Tasks and Applications
- Public educational datasets
- Educational venues
- 2024: A Comprehensive Survey on Deep Learning Techniques in Educational Data Mining. [Paper]
- 2023: Classification technique and its combination with clustering and association rule mining in educational data mining—A survey. [Paper]
- 2023: Handling Big Data in Education: A Review of Educational Data Mining Techniques for Specific Educational Problems. [Paper]
- 2022: Educational data mining: A bibliometric analysis of an emerging field. [Paper]
- 2022: Educational Data Mining: A Comprehensive Review and Future Challenges. [Paper]
- 2021: Educational Data Mining: A Review. [Paper]
- 2020: Big Educational Data & Analytics: Survey, Architecture and Challenges. [Paper]
- 2020: Educational data mining: a systematic review of research and emerging trends. [Paper]
- 2020: Educational data mining and learning analytics: An updated survey. [Paper]
- 2019: A systematic review of deep learning approaches to educational data mining. [Paper]
- 2018: Educational data mining applications and tasks: A survey of the last 10 years. [Paper]
- 2024: Educational data mining and learning analytics: A review of educational management in e-learning. [Paper]
- 2024: Reviewing the differences between learning analytics and educational data mining: Towards educational data science. [Paper]
- 2023: Educational data mining versus learning analytics: A review of publications from 2015 to 2019. [Paper]
- 2021: Educational Data Mining versus Learning Analytics: A Review of Publications From 2015 to 2019. [Paper]
- 2021: Comparison of learning analytics and educational data mining: A topic modeling approach. [Paper]
- 2020: Educational data mining and learning analytics: An updated survey. [Paper]
- 2019: Educational data mining and learning analytics for 21st century higher education: A review and synthesis. [Paper]
- 2022: A review on data fusion in multimodal learning analytics and educational data mining. [Paper]
- 2022: Multimodal educational data fusion for students' mental health detection. [Paper]
- 2021: Review on publicly available datasets for educational data mining. [Paper]
- 2020: Multimodal data fusion in learning analytics: A systematic review. [Paper]
- 2019: Building pipelines for educational data using AI and multimodal analytics: A “grey‐box” approach. [Paper]
- 2023: Prediction of student academic performance based on their emotional wellbeing and interaction on various e-learning platforms. [Paper]
- 2022: Educational data mining: prediction of students' academic performance using machine learning algorithms. [Paper]
- 2022: Educational Data Mining for Student Performance Prediction: A Systematic Literature Review (2015-2021).[Paper]
- 2022: Educational data mining to predict students' academic performance: A survey study. [Paper]
- 2021: Contributions of machine learning models towards student academic performance prediction: a systematic review. [Paper]
- 2021: A systematic literature review of student'performance prediction using machine learning techniques. [Paper]
- 2021: Educational Data Mining Techniques for Student Performance Prediction: Method Review and Comparison Analysis. [Paper]
- 2020: Predicting student performance using data mining and learning analytics techniques: A systematic literature review. [Paper]
- 2019: Students' academic performance and dropout predictions: A review. [Paper]
- 2024: Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization. [Paper]
- 2023: Learning analytics on student engagement to enhance students' learning performance: A systematic review. [Paper]
- 2023: Dynamic interaction between student learning behaviour and learning environment: Meta-analysis of student engagement and its influencing factors. [Paper]
- 2022: Predicting students' performance in e-learning using learning process and behaviour data. [Paper]
- 2022: Learning analytics in online learning environment: A systematic review on the focuses and the types of student-related analytics data. [Paper]
- 2021: Impact of learner's characteristics and learning behaviour on learning performance during a fully online course. [Paper]
- 2021: Fostering student engagement with motivating teaching: An observation study of teacher and student behaviours. [Paper]
- 2021: [HTML] Synthesis of student engagement with digital technologies: a systematic review of the literature. [Paper]
- 2020: Utilizing Student Time Series Behaviour in Learning Management Systems for Early Prediction of Course Performance. [Paper]
- 2024: Anomaly detection in the course evaluation process: a learning analytics–based approach. [Paper]
- 2023: Anomaly Detection in the Course Evaluation Process. [Paper]
- 2023: Context-aware analysis of group submissions for group anomaly detection and performance prediction. [Paper]
- 2022: Educational Anomaly Analytics: Features, Methods, and Challenges. [Paper]
- 2022: Data Processing Model of Students' Evaluation of Teaching Based on Outlier Detection. [Paper]
- 2022: An introduction to statistical techniques used for detecting anomaly in test results. [Paper]
- 2021: Early Detecting the At-risk Students in Online Courses Based on Their Behavior Sequences. [Paper]
- 2021: Anomaly detection for early warning in object-oriented programming course. [Paper]
- 2021: Unsupervised anomaly detection with distillated teacher-student network ensemble. [Paper]
- 2020: Improving affect detection in game-based learning with multimodal data fusion. [Paper]
- Student Performance Dataset: 649 samples, 30 features, focusing on estimating student end-of-term scores. [Link]
- User Knowledge Modeling Dataset: 403 samples, 5 features, suitable for classification and clustering of learners' knowledge levels. [Link]
- Educational Process Mining Dataset: 230,318 instances, 13 features, supports forecasting, classification, and clustering. [Link]
- Open University Learning Analytics Dataset (OULAD): Data for seven modules, used for identifying at-risk students and predicting engagement. [Link]
- Student Academics Performance: 300 records, 24 features, primarily used for predicting student performance. [Link]
- KEEL Data: Datasets for knowledge discovery compatible with KEEL software. [Link]
- MOOC Lectures Dataset: Word embeddings and topic vectors from 12,032 Coursera video lectures. [Link]
- Flip Teaching in Physics Lab Data: Performance data of 1,233 engineering students in Physics and Electricity courses. [Link]
- Outcome-Based Education (OBE) Dataset: 34.65 million entries assessing outcome-based education. [Link]
- Influencing Factors of Teacher Burnout: Survey responses from 876 teachers on self-concept, efficacy, and burnout. [Link]
- Academic Performance Evolution Data: Performance data of 12,411 engineering students across 44 features. [Link]
- HarvardX Person-Course: 338,223 entries with 20 features for analyzing user progress and predicting course outcomes. [Link]
- MOOC-Ed Network Dataset: Designed to examine dropout rates and self-regulated learning behaviors in MOOCs. [Link]
- CAMEO Dataset: Information on student activities in MITx and HarvardX courses, used to study cheating through multiple accounts. [Link]
- Canvas Network Open Courses: 325,000 records with 25 features detailing activities in 238 courses, often used for clustering tasks. [Link]
- Video Game Learning Analytics: Data from 331 students in preschool settings, focusing on phonological awareness and literacy. [Link]
- Interdisciplinary Student Dataset: 807 observations with 29 variables, covering demographic data of 543 undergraduates and 246 graduates. [Link]
- The KDD Cup: Launched in 2010, this competition focuses on predicting student performance using tutoring system logs. The 2015 edition centered on forecasting dropouts in XuetangX MOOCs. [Link]
- The NAEP Competition: The 2017 edition used deidentified click-stream data from middle school students using ASSISTments, while the 2019 edition focused on predicting student activities based on early test data. [Link]
- EdNet Dataset: Collected by Santa, an AI tutoring service in Korea, this hierarchical dataset includes four sub-datasets focused on logged actions from 780,000 users. [Link]
- Riiid AIEd Challenge 2020: A Kaggle competition dataset with 418 lectures, 170 questions, and actions from 393,656 users, aimed at developing knowledge tracing models. [Link]
- DataShop@CMU: Repository with 40 datasets offering secure storage and analytical tools for learning science research. [Link]
- NUS Multisensor Presentation: Time series, video, and audio data from 51 individuals, used for feedback on oral presentation skills. [Link]
- Learn Moodle August 2016: An anonymized dataset capturing user activities from the Learn Moodle MOOC, including badges, course completions, and event logs. [Link]
- MUTLA Dataset: Multimodal data for analyzing teacher-student interactions, including brainwave data and webcam footage. [Link]
- Junyi Academy Dataset: Student interaction data with the Junyi Academy platform, detailing study durations and response accuracy. [Link]
- International Conference on Educational Data Mining (EDM)
- Artificial Intelligence in Education (AIED)
- International Conference on Learning Analytics & Knowledge
- ACM Conference on Learning @ Scale
- ACM Special Interest Group on Computer Science Education
- IEEE International Conference on Advanced Learning Technologies
- IEEE CAI Workshop on AI for Education
- KDD Workshop on AI for Education
- KDD Tutorial on Multimodal Educational Data Mining in K-12 Education
- AAAI Workshop on Artificial Intelligence for Education
- IJCAI Workshop on Artificial Intelligence for Education
- NeurIPS Workshop on Machine Learning for Education
- ICML Workshop on Machine Learning for Education
- Journal of Educational Data Mining
- Journal of Learning Analytics
- IEEE Transactions on Learning Technologies
- International Journal of Artificial Intelligence in Education
- Computers & Education
- Educational Research Review
- Journal of Computer Assisted Learning
- British Journal of Educational Technology
- International Journal of Technology Enhanced Learning