搜广推&NLP00-NLP/IR中顶会文章整理(每周更新一次)

本文转载自查看原文 2021-05-06 12:24 860 自然语言处理(NLP)/ 搜广推

paper repository

字体颜色设置: 我是蓝色 Markdown进阶（更改字体、颜色、大小，设置文字背景色，调整图片大小设置居中)
DBLP Conference:
SIGIR: DBLP WWW: DBLP WSDM: DBLP CIKM: DBLP KDD: DBLP

ACL: DBLP EMNLP: DBLP

ICLR: DBLP AAAI: DBLP IJCAI: DBLP NIPS: DBLP

信息检索/数据挖掘/数据库: CCF推荐列表

信息检索/数据挖掘/自然语言处理/人工智能近几年流程表

说明: 🔨：投稿阶段-->💻:审稿中 -->★:列表公开-->🌙:已经整理

	2019	2020	2021
SIGIR	★	🌙	💻 noti:4.14
CIKM	★	★	🔨 ddl:3.24 crd:8.23
WWW	★	★	🌙
WSDM	★	★	🌙
KDD	★	★	💻 noti:5.17
ACL	★	★	💻
EMNLP	★	★	🔨
ICLR	★	★	★
AAAI	★	★	★
IJCAI	★	★
NIPS	★	★	★
TKDE
TOIS
AIRS
ECIR

🐴:正在看 📱:笔记 🏠:重点 ☁️:看过了,啥玩意

Pretrained Langeuage Model

Github 资源

huggingface/transformers: https://github.com/huggingface/transformers
Transformers API: https://huggingface.co/transformers/
tomohideshibata/BERT-related-papers:https://github.com/tomohideshibata/BERT-related-papers
thu/GNNPapers:https://github.com/thunlp/GNNPapers
Jiakui/awesome-bert:https://github.com/Jiakui/awesome-bert
xieyuqiang/GNN4NLP-Papers: https://github.com/IndexFziQ/GNN4NLP-Papers
THU/OpenMatch: https://github.com/thunlp/OpenMatch
THU/PLMPapers: https://github.com/thunlp/PLMpapers

综述类

[IR 排序模型综述] A Deep Look into Neural Ranking Models for Information Retrieval [2019, guojiafeng] arXiv
[NLP NERZ综述] A Survey on Deep Learning for Named Entity Recognition [2020-03] arXiv
[BERT 进展] A Primer in BERTology: What we know about how BERT works
Matching Algorithms: Fundamentals, Applications and Challenges [2021-03]arXiv
Information retrieval: a view from the Chinese IR community [chengxueqi, 2021]PDF
Retrieving and Reading : A Comprehensive Survey on Open-domain Question Answering arXiv
Pretrained Transformers for Text Ranking: BERT and Beyond arXiv

2021

未标注会议

Dynamic Model for Query-Document Expansion towards Improving Retrieval Relevance PDF ☁️20210414
A Neural Passage Model for Ad-hoc Document Retrieval PDF
An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking arXiv
Mitigating the Position Bias of Transformer Models in Passage Re-Ranking arXiv [ecir2021]
Neural document expansion for ad-hoc information retrieval arXiv [亚马逊] ☁️20200414
PARADE: Passage Representation Aggregation for Document Reranking arXiv
Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval [PhD thesis]arXiv
MIMICS: A Large-Scale Data Collection for Search Clarification arXiv
Unbiased Learning to Rank via Propensity Ratio Scoring arXiv
Conformer-Kernel with Query Term Independence for Document Retrieval arXiv
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits arXiv
Longformer for MS MARCO Document Re-ranking Task arXiv
ORCAS: 18 Million Clicked Query-Document Pairs for Analyzing Search arXiv
Composite Re-Ranking for Efficient Document Search with BERT arXiv
Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval arXiv
Match-Ignition: Plugging PageRank into Transformer for Long-form Text Matching [chengxueqi] arXiv
ERNIE-DOC: The Retrospective Long-Document Modeling Transformer arXiv
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models arXiv
A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models arXiv
ABNIRML: Analyzing the Behavior of Neural IR Models arXiv
Assessing the Benefits of Model Ensembles in Neural Re-Ranking for Passage Retrieval [ECIR short paper] arXiv
Distilling Dense Representations for Ranking using Tightly-Coupled Teachers [Jimmy Lin] arXiv
Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness arXiv [naacl2021]
A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models arXiv
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation arXiv
Investigating the case of weak baselines in Ad-hoc Retrieval and Question Answering arXiv
An Attention-based Deep Relevance Model for Few-shot Document Filtering
Topic-enhanced knowledge-aware retrieval model for diverse relevance estimation pdf
A Pseudo-relevance feedback framework combining relevance matching and semantic matching for information retrieval PDF
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering arXiv
CoRT: Complementary Rankings from Transformers arXiv
On the Calibration and Uncertainty of Neural Learning to Rank Models arXiv
Keyword-Attentive Deep Semantic Matching arXiv
ABNIRML: Analyzing the Behavior of Neural IR Models [AI2] arXiv
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks arXiv
Less is More: Pre-training a Strong Siamese Encoder Using a Weak Decoder [MS] arXiv
Learning To Retrieve: How to Train a Dense Retrieval Model Effectively and Efficiently
OpenMatch: An Open-Source Package for Information Retrieval arXiv
An end-to-end pseudo relevance feedback framework for neural document retrieval [2019]

SIGIR2021 审稿中，4.14 出结果官网

长文:151/720, 21%, 短文: 145/526, 27%
1.Optimizing Dense Retrieval Model Training with Hard Negatives [MA SHAO PING]
2.Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
3.Dual-View Distilled BERT for Sentence Embedding arXiv
4.AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search arXiv
5.Wizard of Search Engine: Access to Information Through Conversations with Search Engines
6.Self-Supervised Contrastive Learning for Code Retrieval and Summarization via Semantic-Preserving Transformations
7.Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models [****]
8.B-PROP: Bootstrapped Pre-training with Representative Words Prediction for Ad-hoc Retrieval [****]
9.Learning to Rank for Mathematical Formula Retrieval
10.Investigating User Behavior in Legal Case Retrieval
11.Few-Shot Conversational Dense Retrieval
12.Heterogeneous Attention Network for Effective and Efficient Cross-modal Retrieval
13.PAN: Prototype-based Adaptive Network for Robust Cross-modal Retrieval
14.Adapting Interactional Observation Embedding for Counterfactual Learning to Rank
15.MS MARCO: Benchmarking Ranking Models in the Large-Data Regime
16.KeyBLD: Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval
17.Decoupling Representation and Regressor for Long-Tailed Information Cascade Prediction
18.On the Orthogonality of Bias and Effectiveness in Ad hoc Retrieval
19.Entity Retrieval Using Fine-Grained Entity Aspects
20.Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval
21.Conversational vs Traditional: Comparing Search Behavior and Outcome in Legal Case Retrieval
22.Learning Robust Dense Retrieval Models from Incomplete Relevance Labels
23.Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index
24.APRF-Net: Attentive Pseudo-Relevance Feedback Network for Query Categorization
25.Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search
26.Revisiting the Tag Relevance Prediction Problem
27.Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness
28.Learning a Fine-Grained Review-based Transformer Model for Personalized Product Search
29.Do Affective Cues Validate Behavioural Metrics for Search?
30.Modeling Intent Graph for Search Result Diversification [wenjirong]
31.Group based Personalized Search by Integrating Search Behaviour and Friend Network [wenjirogn]
32.When Fair Ranking Meets Uncertain Inference arXiv [***]
33.Intra-Document Cascading: Learning to Select Passages for Neural Document Ranking
34.IDRQA: Iterative Document Reranking for Open-domain Multi-hop Question Answering
35.Improving Bi-encoder Document Ranking Models with Two Rankers and Multi-teacher Distillation
36.SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking
37.Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard
38.Towards a Better Tradeoff between Effectiveness and Efficiency in Pre-Ranking: A Learnable Feature Selection based Approach
39.Improving Transformer-Kernel Ranking Model Using Conformer and Query Term Independence
40.A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models
41.Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation for BERT Rankers
42.Learning Passage Impacts for Inverted Indexes(https://arxiv.org/abs/2104.12016) [下载]
43.Text-to-Text Multi-view Learning for Passage Re-ranking
44.[AutoDebias: Learning to Debias for Recommendation] [2021-07-02]
1. [Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation]
1. [Long-Tail Hashing]
47.[Looking at CTR Prediction Again: Is Attention All You Need?]
1. [Allowing for The Grounded Use of Temporal Difference Learning in Large Ranking Models via Sub State Updates ]
49.[How Do Biased Search Result Rankings Affect User Attitudes on Debated Topics?]
50 [Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots]
1. [ScaleFreeCTR: MixCache-based Distributed Training System for CTR Models with Huge Embedding Table]
52 [Enhanced Doubly Robust Learning for Debiasing Post-Click Conversion Rate Estimation]
53 [Improving Video Retrieval by Adaptive Margin]
54 [Intra-Document Cascading: Learning to Select Passages for Neural Document Ranking]
55 [Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling]
56 [A Graph-Enhanced Click Model for Web Search]
57 [Adapting Interactional Observation Embedding for Counterfactual Learning to Rank]
58 [Tracing Knowledge State with Individual Cognition and Acquisition Estimation]
59 [A General Method For Automatic Discovery of Powerful Interactions In Click-Through Rate Prediction]
60 [Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness]
61 [Learning Graph Meta Embeddings for Cold-Start Ads in Click-Through Rate Prediction]
62 [Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation of BERT Rankers]
63 [Wizard of Search Engine: Access to Information Through Conversations with Search Engines]
64 Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue
65 AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search
66 Few-Shot Conversational Dense Retrieval
67 Optimizing Dense Retrieval Model Training with Hard Negatives
68 IDRQA: Iterative Document Reranking for Open-domain Multi-hop Question Answering
69 Answer Complex Questions: Path Ranker Is All You Need
70 On the Orthogonality of Bias and Effectiveness in Ad hoc Retrieval
71 Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search
72 Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval
73 Dual-View Distilled BERT for Sentence Embedding
74 GemNN: Gating-enhanced Multi-task Neural Networks with Feature Interaction Learning for CTR Prediction
75 Deep Position-wise Interaction Network For CTR Prediction
76 Deep User Match Network for Click-Through Rate Prediction
77 Revisiting the Tag Relevance Prediction Problem
78 Dual Unbiased Recommender Learning for Implicit Feedback
79 Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural Networks for CTR Prediction
80 KeyBLD: Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval
81 Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard
82 Cross-Batch Negative Sampling for Training Two-Tower Recommenders
83 RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction
84 A Test Collection for Ad-hoc Dataset Retrieval
85 WWW3E8: 259,000 Relevance Labels for Studying the Effect of Document Presentation Order for Relevance Assessors

CIKM2021 投稿中,8.23 CRD 官网

WWW2021

Robust Generalization and Safe Query-Specialization in Counterfactual Learning to Rank AMiner
A Linguistic Study on Relevance Modeling in Information Retrieval [chengxuqi & guojiafeng]AMiner
Maximizing Marginal Fairness for Dynamic Learning to Rank [qingao ai] AMiner
An Alternative Cross Entropy Loss for Learning-to-Rank AMiner
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks AMiner
Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval AMiner
MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks AMiner
Event-Driven Query Expansion 。。。
Density-Ratio Based Personalised Ranking from Implicit Feedback AMiner
Relevance-guided Supervision for OpenQA with ColBERT arXiv
An Adversarial Imitation Click Model for Information Retrieval arXiv

WSDM2021:

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval [chengxueqi & guojiafeng] arXiv
Bias-Variance Decomposition for Ranking PDF
Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator that Effectively Utilizes Online Interventions
Pretrained Transformers for Text Ranking: BERT and Beyond PDF
Non-Clicks Mean Irrelevant? Propensity Ratio Scoring As a Correction PDF
Beyond Relevance: Trustworthy Answer Selection via Consensus Verification PDF
Beyond Probability Ranking Principle: Modeling the Dependencies among Documents PDF
QueryBlazer: Efficient Query Autocompletion Framework PDF

KDD2021: 5.17 录用结果官网

ACL2021:

EMNLP2021:

AAAI2021: 1692篇

笔记整理:

论文整理:

Learning to Truncate Ranked Lists for Information Retrieval [chengxueqi & guojiafeng]AMiner
A Graph-based Relevance Matching Model for Ad-hoc Retrieval AMiner
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching AMiner
Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching AMiner
Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks AMiner

IJCAI2021:

ICLR2021:

Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? AMiner
Individually Fair Rankings AMiner
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval AMiner

NIPS2021:

2020

SIGIR2020:

SIGIR2020: 官网 dblp aminer
SIGIR会议之文本表征、检索重排序、阅读理解论文整理
一、文本的表征

Convolutional Embedding for Edit Distance arXiv
Match^2: A Matching over Matching Model for Similar Question Identification arXiv AMiner
Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Document Similarity arXiv
Attending to Inter-sentential Features in Neural Text Classification pdf
Towards Differentially Private Text Representations arXiv
Expansion via Prediction of Importance with Contextualization arXiv
DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding [jiang daxin] arXiv [AMiner](DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding)
Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection arXiv
Symmetric Regularization based BERT for Pair-wise Semantic Reasoning arXiv

二、检索及重排序

Table Search Using a Deep Contextualized Language Model arXiv
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT arXiv
Finding the Best of Both Worlds: Faster and More Robust Top-k Document Retrieval PDF
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations arXiv
Training Curricula for Open Domain Answer Re-Ranking arXiv
SetRank: Learning a Permutation-Invariant Ranking Model for Information Retrieval arXiv
MarkedBERT: Integrating Traditional IR cues in pre-trained language models for passage retrieval pdf
Context-Aware Term Weighting For First-Stage Passage Retrieval PDF
Learning Term Discrimination arXiv
Improving Document-Level Text Retrieval using Local Attention in the Transformer-Kernel Pooling Model arXiv
Query-level Early Exit for Additive Learning-to-Rank Ensembles arXiv
Efficiency Implications of Term Re-Weighting for Passage Retrieval PDF
Contextual Re-Ranking with Behavior Aware Transformers [MS]arXiv AMiner
An analysis of BERT in document ranking [mashaoping] arXiv* AMiner

三、QA及阅读理解

Open-Retrieval Conversational Question Answering arXiv AMiner
Reranking for Efficient Transformer-based Answer Selection arXiv

四、其他

1. Automatic Generation of Topic Labels arXiv
CrossBERT with Triplet Neural Architecture for Entity Property Ranking arXiv
Few-Shot Generative Conversational Query Rewriting arXiv

五. 补充

Investigating reading behavior in Fine-grained Relevance Judgment [liuyiqun & mashaoping]AMiner
Query-level Early Exit for Additive Learning-to-Rank Ensembles AMiner
Sampling Bias Due to Near-Duplicates in Learning to Rank AMiner
Policy-Aware Unbiased Learning to Rank for Top-k Rankings AMiner
Accelerated Convergence for Counterfactual Learning to Rank AMiner
Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank AMiner
Re-ranking Answer Selection with Similarity Aggregation AMiner
Multi-grouping Robust Fair Ranking AMiner
Feature Transformation for Neural Ranking Models AMiner
SetRank: Learning a Permutation-Invariant Ranking Model for Information Retrieval [wenjirong & chengxueqi ] AMiner
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations AMiner
Training Curricula for Open Domain Answer Re-Ranking AMiner
A Deep Recurrent Survival Model for Unbiased Ranking [gaikun] AMiner
Studying Ranking-Incentivized Web Dynamics AMiner
Choppy: Cut Transformer For Ranked List Truncation AMiner
Unsupervised Semantic Hashing with Pairwise Reconstruction AMiner
Improving Contextual Language Models for Response Retrieval in Multi-Turn Conversation AMiner
Retrieving Potential Causes from a Query Event AMiner
A Study of Neural Matching Models for Cross-lingual IR AMiner
A Quantum Interference Inspired Neural Matching Model for Ad-hoc Retrieval AMiner
Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection AMiner
Soft Kernel-based Ranking on a Statistical Manifold AMiner
Analyzing and Learning from User Interactions for Search Clarification AMiner
Knowledge Enhanced Personalized Search [renjirong] AMiner
Topic Propagation in Conversational Search AMiner
Search Result Explanations Improve Efficiency and Trust AMiner
Encoding History with Context-aware Representation Learning for Personalized Search AMiner
Do Neural Ranking Models Intensify Gender Bias? AMiner
Query Resolution for Conversational Search with Limited Supervision AMiner
Local Self-Attention over Long Text for Efficient Document Retrieval arXiv
End-to-End Contextualized Document Indexing and Retrieval with Neural Networks PDF
Listwise Learning to Rank by Exploring Unique Ratings arXiv

CIKM2020:

Fine-Grained Relevance Annotations for Multi-Task Document Ranking and Question Answering arXiv
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching arXiv
Distant Supervision in BERT-based Adhoc Document Retrieval PDF
Learning Better Representations for Neural Information Retrieval with Graph Information PDF

WWW2020:

Selective Weak Supervision for Neural Information Retrieval arXiv
Context-Aware Document Term Weighting for Ad-Hoc Search PDF
Leveraging Passage-level Cumulative Gain for Document Ranking PDF
IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems
arXiv
HDCT:Context-Aware Document Term Weighting for Ad-Hoc Search PDF

WADM2020*

KDD2020:

1.Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding AMiner

2.Mining Implicit Relevance Feedback from User Behavior for Web Question Answering AMiner

FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents AMiner
MinSearch: An Efficient Algorithm for Similarity Search under Edit Distance AMiner
Embedding-based Retrieval in Facebook Search AMiner
Managing Diversity in Airbnb Search AMiner
Improving Deep Learning For Airbnb Search AMiner

ACL2020:

QUASE: Question-Answer Driven Sentence Encoding [ai2]arXiv
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection PDF
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport AMiner
Extractive Summarization as Text Matching AMiner
Neural Graph Matching Networks for Chinese Short Text Matching AMiner

EMNLP2020:

BERT-QE: Contextualized Query Expansion for Document Re-ranking [sunle]arXiv
Beyond [CLS] through Ranking by Generation AMiner
Modularized Transfomer based Ranking Framework AMiner
Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains [wenjirong] AMiner
A Bilingual Generative Transformer for Semantic Sentence Embedding AMiner
MultiCQA: Zero Shot Transfer of Self Supervised Text Matching Models on a Massive Scale AMiner
SLM: Learning a Discourse Language Representation with Sentence Unshuffling AMiner
Better Highlighting: Creating Sub Sentence Summary Highlights
KERMIT: Complementing Transformer Architectures with Encoders of Explicit Syntactic Interpretations AMiner
With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All Round Word Sense Disambiguation AMiner
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space [zhouming] AMiner
Cross Thought for Sentence Encoder Pre training AMiner
Long Document Ranking with Query-Directed Sparse Transformer arXiv
Ad-hoc Document Retrieval using Weak-Supervision with BERT and GPT2 AMiner
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling arXiv

AAAI2020:

IJCAI2020:

ICLR2020:

Pre-training Tasks for Embedding-based Large-scale Retrieval arXiv AMiner
Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring AMiner
Encoding word order in complex embeddings AMiner
Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models AMiner

NIPS2020:

2019

SIGIR2019:

CIKM2019:

IMN: Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots arXiv

WWW2019:

KDD2019:

Fairness-Aware Ranking in Search & Recommendation Systems with Application to LinkedIn Talent Search AMiner
MediaRank: Computational Ranking of Online News Sources AMiner
MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search [baidu] AMiner
A Memory-Efficient Sketch Method for Estimating High Similarities in Streaming Sets AMiner

ACL2019:

EMNLP2019:

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks arXiv
Bridging the Gap Between Relevance Matching and Semantic Matching for Short Text Similarity Modeling

AAAI2019:

DRCN: Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information arXiv

IJCAI2019:

ICLR2019:

NIPS2019:

other

Context-Aware Sentence/Passage Term Importance Estimation For First Stage Retrieval arXiv
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network [BaiDu ACL2018]

LTR

1.Document Ranking with a Pretrained Sequence-to-Sequence Model
2.[2020 NeurIPS] [51] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
3.Match-Ignition: Plugging PageRank into Transformer for Long-form Text Matching
4.Cross-Thought for Sentence Encoder Pre-training
5.On the Sentence Embeddings from Pre-trained Language Models
6.PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval
7.[2020 ACL-Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention
8.Integrating fine- and coarse-grained information for natural language understanding
9.2020 SIGIR】MarkedBERT: Integrating Traditional IR Cues in Pre-trained Language Models for Passage Retrieval
10.【2019 AAAI】Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
11.Towards Personalized and Semantic Retrieval: An End-to-End Solution for E-commerce Search via Embedding Learning
12.[2020 KDD] Embedding-based Retrieval in Facebook Search.
13.[ 2020 SIGIR ] ColBERT Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
14.[ 2020 ICLR ] Reformer: The efficient Transformer
15.[SIGIR 2020] Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
16.Adaptive Multi-Attention Network Incorporating Answer Information for Duplicate Question Detection
17.[2019 AAAI] [66] Semantic Sentence Matching with Densely-Connected Recurrent and Co-Attentive Information

日常补充

2021-04-23

1.Semantic Models for the First-stage Retrieval: A Comprehensive Review
2.A Replication Study of Dense Passage Retriever
4.Efficient Retrieval Optimized Multi-task Learning
5.Anytime Ranking on Document-Ordered Indexes
6.Co-BERT: A Context-Aware BERT Retrieval Model Incorporating Local and Query-specific Context [孙乐]
7.BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
8.ASBERT: Siamese and Triplet network embedding for open question answering
9.Contextualized Query Embeddings for Conversational Search
10.[
11.SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts
12.COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List [NAACL2021]
1. RoFormer: Enhanced Transformer with Rotary Position Embedding arXiv
14.Simple and Efficient ways to Improve REALM arXiv
15.SimCSE: Simple Contrastive Learning of Sentence Embeddings arXiv
16.Joint Passage Ranking for Diverse Multi-Answer Retrieval arXiv
17.Knowledge Neurons in Pretrained Transformers arXiv
18.Demystifying the Better Performance of Position Encoding Variants for Transformer arXiv
19.Chinese Sentences Similarity via Cross-Attention Based Siamese Network arXiv
20.Reference-based Weak Supervision for Answer Sentence Selection using Web Data arXiv
21.Context-Aware Interaction Network for Question Matching arXiv
22.A Graph-guided Multi-round Retrieval Method for Conversational Open-domain Question Answering arXiv
23.Ranking Structured Objects with Graph Neural Networks arXiv
24.Fast, Effective and Self-Supervised: Transforming Masked LanguageModels into Universal Lexical and Sentence Encoders arXiv
25.Towards Robust Neural Retrieval Models with Synthetic Pre-Training arXiv
26.Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models arXiv [NAACL2021]
27.Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering arXiv
28.Sentence Embeddings by Ensemble Distillation arXiv
29.Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard arXiv
30.Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures []

2021-04-27
1.Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation arXiv [THU]

2021-05-04
1.Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation
2. SmoothI: Smooth Rank Indicators for Differentiable IR Metrics
3.Transformers: "The End of History" for NLP?

2021-05-05
1.Pretrained Transformers for Text Ranking:
BERT and Beyond
2. Comparing Score Aggregation Approaches for Document Retrieval with Pretrained Transformers [ECIR2021]
3.A White Box Analysis of ColBERT [ECIR2021]
4.TREC Deep Learning Track: Reusable Test Collections in the Large Data Regime
5.

博客收藏

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 NLP以及CV顶会记录 Python爱好者社区历史文章列表（每周append更新一次） NLP/CL 顶会收录【NLP-00-3】BLEU计算文本相关性在搜广推三大场景中的应用 NLP（二十一）人物关系抽取的一次实战使用NLP从文章中自动提取关键字搜广推05-文本匹配入门一些 NLP中的mask的作用 NLP中的数据增强