Abstract:Grading SQL queries can be a time-consuming, tedious and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have their own limitations. This paper describes our novel approach to automating the process of grading SQL queries. Unlike previous approaches, we employ a unique convolutional neural network architecture that employs a parameter-sharing approach for different machine learning tasks that enables the architecture to induce different knowledge representations of the data to increase its potential for understanding SQL statements.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenge of automated SQL query scoring. Specifically, the authors focus on the following key issues: 1. **Possibility of multiple correct answers**: SQL queries can be written in multiple different ways to achieve the same functionality. For example, the simple query "List the names of professors who teach the 'Introduction to Programming' course" can be written in multiple ways, including non - nested queries, nested EXISTS queries, nested IN queries, etc. As the complexity of the query increases, the number of possible correct answers will also increase significantly. 2. **Consistency of partial scoring**: For incompletely correct queries, how to consistently assign partial scores is a challenge. Due to the diversity of SQL queries, it becomes very difficult to ensure the consistency of the scoring criteria. 3. **Tediousness of scoring**: As the number of SQL queries submitted by students increases, manual scoring becomes very time - consuming and error - prone. Teachers need to spend a lot of time checking and scoring students' SQL queries one by one. To solve these problems, the authors propose an automated SQL query scoring system based on the self - attention mechanism and convolutional neural network (CNN). The uniqueness of this system lies in its use of the parameter - sharing method, which enables the model to learn from different tasks and induce different data representations, thereby better understanding SQL statements. ### Formula Representation When describing the model architecture, some formulas and mathematical expressions are involved. Here are the formula representations of several key parts: 1. **Convolutional self - attention layer**: - The convolutional encoding layer is used to model the query \( Q \) and the value \( V \): \[ Q = W_Q \cdot E, \quad V = W_V \cdot E \] where \( E \) is the input embedding vector, and \( W_Q \) and \( W_V \) are weight matrices. - The self - attention mechanism is calculated by dot - product similarity: \[ \text{Attention}(Q, K, V) = \text{softmax}\left(\frac{QK^T}{\sqrt{d_k}}\right)V \] where \( d_k \) is the dimension of the key vector. 2. **Pooling strategy**: - Use global average pooling to reduce the dimension of the problem: \[ \text{GlobalAvgPool}(X) = \frac{1}{n} \sum_{i = 1}^{n} X_i \] Calculate the average value of each channel of the feature map \( X \). 3. **Loss function**: - Binary cross - entropy loss is used to train the model: \[ L = -\frac{1}{N} \sum_{i = 1}^{N} \left[ y_i \log(p_i) + (1 - y_i) \log(1 - p_i) \right] \] where \( y_i \) is the true label, \( p_i \) is the predicted probability, and \( N \) is the number of samples. Through these methods, this system aims to improve the automation degree of SQL query scoring, reduce the time and error rate of manual scoring, and ensure the consistency and accuracy of scoring.

An Automated SQL Query Grading System Using An Attention-Based Convolutional Neural Network

Edit Based Grading of SQL Queries

Attention-based Recurrent Convolutional Neural Network for Automatic Essay Scoring.

Automated Content Grading Using Machine Learning

Automatic Grading Tool for Jupyter Notebooks in Artificial Intelligence Courses

Automated Scoring of Graphical Open-Ended Responses Using Artificial Neural Networks

Automatic short answer grading and feedback using text mining methods

Comment Text Grading for Chinese Graduate Academic Dissertation Using Attention Convolutional Neural Networks

CatSQL: Towards Real World Natural Language to SQL Applications.

An automated essay evaluation system using natural language processing and sentiment analysi

SimGrade: Using Code Similarity Measures for More Accurate Human Grading

Automatic Short Answer Grading via Multiway Attention Networks

Beyond human subjectivity and error: a novel AI grading system

Survey on Automated Short Answer Grading with Deep Learning: from Word Embeddings to Transformers

RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

AI-assisted Automated Short Answer Grading of Handwritten University Level Mathematics Exams

Automated Assessment of Multimodal Answer Sheets in the STEM domain

A survey on deep learning approaches for text-to-SQL

Automatic Short Math Answer Grading via In-context Meta-learning

Intelligent Question Answering System by Deep Convolutional Neural Network in Finance and Economics Teaching

Automatic short answer grading by encoding student responses via a graph convolutional network