Skip to main content Skip to navigation

Class-attention Video Transformer for Engagement Intensity Prediction

Project Overview

The document explores the application of generative AI, specifically the Class Attention in Video Transformer (CavT), in enhancing student engagement prediction in online education. It addresses the challenges faced in remote learning environments, particularly the difficulty in gauging student engagement due to the absence of visual cues typical of in-person classrooms. The CavT employs a transformer model to analyze video data, incorporating an innovative sampling technique known as Binary Order Representatives Sampling (BorS) to enhance prediction accuracy. Findings indicate that the combination of CavT and BorS significantly surpasses the performance of existing methods in predicting engagement across various datasets. Overall, this research highlights the potential of generative AI technologies to improve the assessment of student engagement in remote learning contexts, ultimately contributing to more effective educational strategies.

Key Applications

Class Attention in Video Transformer (CavT) with Binary Order Representatives Sampling (BorS)

Context: Online education engagement prediction for students in remote learning environments

Implementation: Implemented as an end-to-end model using video frames for engagement intensity prediction, leveraging deep learning techniques.

Outcomes: Achieved state-of-the-art performance in engagement prediction on the EmotiW-EP and DAiSEE datasets, with lower mean square error (MSE) than previous methods.

Challenges: Class imbalance in training datasets and difficulty in extracting meaningful engagement features from video frames.

Implementation Barriers

Technical Barrier

Class imbalance in engagement intensity levels across datasets impacts predictive accuracy.

Proposed Solutions: Utilizing techniques like Binary Order Representatives Sampling to generate heterogeneous video sequences and improve model training.

Practical Barrier

Difficulty in assessing engagement in varied online learning environments compared to physical classrooms.

Proposed Solutions: Developing more robust engagement detection mechanisms using computer vision and machine learning.

Project Team

Xusheng Ai

Researcher

Victor S. Sheng

Researcher

Chunhua Li

Researcher

Zhiming Cui

Researcher

Contact Information

For information about the paper, please contact the authors.

Authors: Xusheng Ai, Victor S. Sheng, Chunhua Li, Zhiming Cui

Source Publication: View Original PaperLink opens in a new window

Project Contact: Dr. Jianhua Yang

LLM Model Version: gpt-4o-mini-2024-07-18

Analysis Provider: Openai

Let us know you agree to cookies