Towards Better Entity Linking with Multi-View Enhanced Distillation

  • Yi Liu ,
  • Yuan Tian ,
  • Xinlong Wang ,
  • Yanan Cao ,
  • Fang Fang ,
  • Wen Zhang ,
  • Haizhen Huang ,
  • Denvy Deng ,
  • Qi Zhang

ACL 2023


Dense retrieval is widely used for entity linking to retrieve entities from large-scale knowledge bases. Mainstream techniques are based on a dual-encoder framework, which encodes mentions and entities independently and scores their relevance with coarse interaction metrics. This makes it difficult to explicitly model the multiple mention-relevant parts within an entity that are needed to match divergent mentions. Aiming at learning entity representations that can match divergent mentions, this paper proposes a Multi-View Enhanced Distillation (MVD) framework, which can effectively transfer knowledge of multiple fine-grained, mention-relevant parts within entities from cross-encoders to dual-encoders. Each entity is split into multiple views so that irrelevant information is not over-squashed into the mention-relevant view. We further design cross-alignment and self-alignment mechanisms for this framework to facilitate fine-grained knowledge distillation from the teacher model to the student model. Meanwhile, we reserve a global view that embeds the entity as a whole to prevent the dispersal of uniform information. Experiments show our method achieves state-of-the-art performance on several entity linking benchmarks. Our code is available at https://github.com/Noen61/MVD.
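To make the multi-view idea concrete, below is a minimal sketch in PyTorch of how a dual-encoder might score a mention against an entity's local views plus a global view, together with a standard soft-label distillation loss for transferring cross-encoder candidate scores. This is not the authors' released implementation: the function names, tensor shapes, max-over-views pooling, and KL-based loss are illustrative assumptions; see the linked repository for the actual MVD code.

```python
# Minimal sketch of multi-view scoring and score distillation for a
# dual-encoder retriever. Assumes embeddings were produced elsewhere
# (e.g., by BERT-style mention/entity encoders); all names are hypothetical.
import torch
import torch.nn.functional as F


def multi_view_score(mention_emb: torch.Tensor,
                     view_embs: torch.Tensor,
                     global_emb: torch.Tensor) -> torch.Tensor:
    """Score one mention against one entity.

    mention_emb: (d,)            embedding of the mention in context
    view_embs:   (num_views, d)  embeddings of the entity's local views,
                                 e.g., one per sentence of its description
    global_emb:  (d,)            embedding of the whole entity description
    """
    # Dot-product relevance of the mention to every local view.
    local_scores = view_embs @ mention_emb          # (num_views,)
    # Take the best-matching local view, so information irrelevant to this
    # mention is ignored rather than squashed into a single entity vector.
    best_local = local_scores.max()
    # The global view retains entity-level information that no single
    # local view carries on its own.
    global_score = global_emb @ mention_emb
    return torch.maximum(best_local, global_score)


def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      tau: float = 1.0) -> torch.Tensor:
    """Soft-label distillation over a shared candidate list.

    student_logits: (num_candidates,) dual-encoder scores
    teacher_logits: (num_candidates,) cross-encoder scores
    """
    # KL divergence between the teacher's and student's distributions over
    # candidate entities; a generic distillation objective, not MVD's exact
    # cross-/self-alignment losses.
    s = F.log_softmax(student_logits / tau, dim=-1)
    t = F.softmax(teacher_logits / tau, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * tau * tau
```

In this sketch, pooling with a max over views is what lets different mentions of the same entity align with different parts of its description, while the global view acts as a fallback when no single local view dominates.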
