Revealing Chemical Trends: Insights from Data-Driven Visualization and Patent Analysis in Exposomics Research

Dagny Aurich,Emma L Schymanski,Flavio de Jesus Matias,Paul A Thiessen,Jun Pang
DOI: https://doi.org/10.1021/acs.estlett.4c00560
2024-08-30
Abstract:Understanding historical chemical usage is crucial for assessing current and past impacts on human health and the environment and for informing future regulatory decisions. However, past monitoring data are often limited in scope and number of chemicals, while suitable sample types are not always available for remeasurement. Data-driven cheminformatics methods for patent and literature data offer several opportunities to fill this gap. The chemical stripes were developed as an interactive, open source tool for visualizing patent and literature trends over time, inspired by the global warming and biodiversity stripes. This paper details the underlying code and data sets behind the visualization, with a major focus on the patent data sourced from PubChem, including patent origins, uses, and countries. Overall trends and specific examples are investigated in greater detail to explore both the promise and caveats that such data offer in assessing the trends and patterns of chemical patents over time and across different geographic regions. Despite a number of potential artifacts associated with patent data extraction, the integration of cheminformatics, statistical analysis, and data visualization tools can help generate valuable insights that can both illuminate the chemical past and potentially serve toward an early warning system for the future.
What problem does this paper attempt to address?