Prediction of waterborne freight activity with Automatic Identification System using Machine learning
Sanjeev Bhurtyal, Hieu Bui, Sarah Hernandez, Sandra Eksioglu, Magdalena Asborno, Kenneth N. Mitchell, Marin Kress
DOI: https://doi.org/10.1016/j.cie.2024.110757
IF: 7.18
2024-12-01
Computers & Industrial Engineering
Abstract: This paper addresses latency issues related to publicly available port-level commodity tonnage reports. To predict commodity tonnage at the port level, near-real-time vessel tracking data are combined with historical Waterborne Commerce Statistics (WCS) in a machine learning model. Currently, commodity throughput is derived from WCS data, which are released publicly approximately two years after collection. This latency presents a challenge for short-term planning and other operational uses. To reduce latency, this study leverages near-real-time vessel tracking data from the Automatic Identification System (AIS) data set. Long Short-Term Memory (LSTM), Temporal Convolutional Network (TCN), and Temporal Fusion Transformer (TFT) machine learning models are developed using features extracted from AIS and the historical WCS data. The output of each model is a prediction of the quarterly volume of commodities (in tons) at the port terminals for four quarters into the future. Two types of models are developed: (i) uncategorized, a single model trained on all port terminals; and (ii) categorized, four models (one per dominant vessel type at the port terminal, i.e., cargo, tanker, tug/tow, and mixed). The uncategorized model outperformed the categorized model based on the Mean Absolute Percentage Error (MAPE). The uncategorized LSTM model has the highest accuracy among all model types. Results show that the model has higher accuracy for port terminals that handle a specific type of vessel than for port terminals that handle more than one vessel type. Six of seven commodity groups have a MAPE of less than 30% under the uncategorized LSTM model framework. The application of the model enables port authorities and stakeholders to make short-term capacity expansion and infrastructure investment decisions based on commodity volume.
computer science, interdisciplinary applications; engineering, industrial
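
The abstract compares models by Mean Absolute Percentage Error (MAPE). As a reminder of what that metric measures, here is a minimal sketch of the standard MAPE formula. The function name `mape`, the choice to skip zero-valued actuals, and the tonnage figures are illustrative assumptions, not taken from the paper.

```python
def mape(actual, predicted):
    """Standard Mean Absolute Percentage Error, returned in percent.

    Assumption: observations whose actual value is zero are skipped,
    because the percentage error is undefined there. The paper may
    handle this case differently.
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    pairs = [(a, p) for a, p in zip(actual, predicted) if a != 0]
    if not pairs:
        raise ValueError("no non-zero actual values to evaluate")
    return 100.0 * sum(abs((a - p) / a) for a, p in pairs) / len(pairs)


# Hypothetical quarterly tonnage at one terminal over four quarters.
actual_tons = [100.0, 200.0, 400.0, 500.0]
predicted_tons = [110.0, 180.0, 400.0, 450.0]

# Per-quarter errors are 10%, 10%, 0%, and 10%, so MAPE is about 7.5%.
print(f"MAPE: {mape(actual_tons, predicted_tons):.1f}%")
```

Under this definition, the abstract's "MAPE of less than 30%" means that the quarterly tonnage forecasts for a commodity group are off by less than 30% of the observed tonnage on average.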