Classification of GLM Flashes Using Random Forests
Jacquelyn Ringhausen,Phillip Bitzer,William Koshak,John Mecikalski
DOI: https://doi.org/10.1029/2021ea001861
IF: 3.68
2021-11-01
Earth and Space Science
Abstract:Abstract [The Geostationary Lightning Mapper (GLM) detects total lightning continuously from space, and does not distinguish intra‐cloud (IC) from cloud‐to‐ground (CG) lightning. This research focuses on differentiating CG and IC lightning detected by GLM using a random forests (RF) model. GLM flash and group characteristics are implemented into the RF model and used to predict flash type. The most important flash characteristic for distinguishing flash type is the maximum group area. Other features with high feature importance include time‐of‐day, elongation, propagation, footprint, slope, maximum distance between groups and events, and mean energy. Skill scores showcase the model's ability to distinguish flash type with moderate skill, with 81% probability of detection, 71% percent correct, 36% false alarm rate, 36% false alarm ratio, and 56% critical success index. These scores improve further when study area is limited to CONUS and the Southeastern United States. These results can be used to aid in future climatological analysis of flash type.]Plain Language Summary The Geostationary Lightning Mapper (GLM) is a satellite‐based lightning sensor that allows for continuous detection of light emitted from the cloud tops due to lightning. GLM does not distinguish if the lightning is connecting to ground (CG) or remaining in the cloud (IC). In order to distinguish CG and IC flashes, this research uses a machine learning method called random forests (RF). The RF model attempts to classify lightning flashes based on their size, duration, and intensity. The most important flash characteristics for distinguishing flash type are the features related to the areal size of the lightning and the time of day the lightning occurs. Overall, moderate success is shown when attempting to divide total lightning into CG and IC over the 2018 period. This information can be used by researchers to improve the use of GLM in the study of different storm types as well as aiding in wildfire forecasting.Key Points Although there is substantial overlap in the distributions for cloud‐to‐ground (CG) and intra‐cloud (IC) flash characteristics, the mean values differ Most important features for flash distinction are maximum group area, time‐of‐day, elongation, propagation, footprint, slope, and max group/event distance Results show skill in the random forests model's ability to distinguish CG and IC flashes in the Geostationary Lightning Mapper data
geosciences, multidisciplinary,astronomy & astrophysics