AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning

John Bosco Mugeni,Steven Lynden,Toshiyuki Amagasa,Akiyoshi Matono

DOI: https://doi.org/10.1145/3589462.3589498

2023-05-30

Abstract:Entity Matching (EM) involves identifying different data representations referring to the same entity from multiple data sources and is typically formulated as a binary classification problem. It is a challenging problem in data integration due to the heterogeneity of data representations. State-of-the-art solutions have adopted NLP techniques based on pre-trained language models (PrLMs) via the fine-tuning paradigm, however, sequential fine-tuning of overparameterized PrLMs can lead to catastrophic forgetting, especially in low-resource scenarios. In this study, we propose a parameter-efficient paradigm for fine-tuning PrLMs based on adapters, small neural networks encapsulated between layers of a PrLM, by optimizing only the adapter and classifier weights while the PrLMs parameters are frozen. Adapter-based methods have been successfully applied to multilingual speech problems achieving promising results, however, the effectiveness of these methods when applied to EM is not yet well understood, particularly for generalized EM with heterogeneous data. Furthermore, we explore using (i) pre-trained adapters and (ii) invertible adapters to capture token-level language representations and demonstrate their benefits for transfer learning on the generalized EM benchmark. Our results show that our solution achieves comparable or superior performance to full-scale PrLM fine-tuning and prompt-tuning baselines while utilizing a significantly smaller computational footprint $\approx 13\%$ of the PrLM parameters.

Computation and Language,Databases

What problem does this paper attempt to address?

The paper aims to address several key issues in Entity Matching (EM). Specifically: 1. **Data Representation Heterogeneity**: In the real world, data from different sources come in various formats (such as structured, semi-structured, or unstructured data), making it difficult for traditional entity matching methods to handle. 2. **Catastrophic Forgetting**: Pre-trained Language Models (PrLMs) tend to forget previously learned knowledge during fine-tuning, especially in resource-limited scenarios. 3. **High Storage Costs**: Full fine-tuning for each new task leads to significant storage overhead, as a complete model checkpoint needs to be saved for each task. To address these issues, the authors propose an adapter-based method called AdapterEM. This method leverages small neural networks (i.e., adapters) to optimize specific parts of the pre-trained language model instead of fine-tuning the entire model. This approach not only reduces computational resource consumption but also effectively mitigates the problem of catastrophic forgetting and performs well in multiple benchmark tests. Experimental results show that under both low-resource and fully-resourced settings, this method outperforms or matches full fine-tuning in most tasks while significantly reducing computational overhead.

AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning

ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks

Efficient Test Time Adapter Ensembling for Low-resource Language Varieties

LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation

A New Adapter Tuning of Large Language Model for Chinese Medical Named Entity Recognition

MerA: Merging Pretrained Adapters for Few-Shot Learning

Parameter-Efficient Fine-Tuning With Adapters

Lightweight Adapter Tuning for Multilingual Speech Translation

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass

SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters

Leveraging Pretrained Language Models for Enhanced Entity Matching: A Comprehensive Study of Fine-Tuning and Prompt Learning Paradigms

VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks

Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning

Adaptable Adapters

Adaptive Adapters: an Efficient Way to Incorporate BERT into Neural Machine Translation

Experience Adapter: Adapting Pre-trained Language Models for Continual Task Planning.

Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models

Exploiting Adapters for Cross-Lingual Low-Resource Speech Recognition

MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning

FE-Adapter: Adapting Image-based Emotion Classifiers to Videos