paper repository
字体颜色设置: 我是蓝色 Markdown进阶(更改字体、颜色、大小,设置文字背景色,调整图片大小设置居中)
DBLP Conference:
SIGIR: DBLP WWW: DBLP WSDM: DBLP CIKM: DBLP KDD: DBLP
ICLR: DBLP AAAI: DBLP IJCAI: DBLP NIPS: DBLP
信息检索/数据挖掘/数据库: CCF推荐列表
信息检索/数据挖掘/自然语言处理/人工智能 近几年流程表
说明: 🔨:投稿阶段-->💻:审稿中 -->★:列表公开-->🌙:已经整理
2019 | 2020 | 2021 | 2022 | 2023 | |
---|---|---|---|---|---|
SIGIR | ★ | 🌙 | 💻 noti:4.14 |
||
CIKM | ★ | ★ | 🔨 ddl:3.24 crd:8.23 |
||
WWW | ★ | ★ | 🌙 | ||
WSDM | ★ | ★ | 🌙 | ||
KDD | ★ | ★ | 💻 noti:5.17 |
||
ACL | ★ | ★ | 💻 | ||
EMNLP | ★ | ★ | 🔨 | ||
ICLR | ★ | ★ | ★ | ||
AAAI | ★ | ★ | ★ | ||
IJCAI | ★ | ★ | |||
NIPS | ★ | ★ | ★ | ||
TKDE | |||||
TOIS | |||||
AIRS | |||||
ECIR |
🐴:正在看 📱:笔记 🏠:重点 ☁️:看过了,啥玩意
Pretrained Langeuage Model
Github 资源
- huggingface/transformers: https://github.com/huggingface/transformers
- Transformers API: https://huggingface.co/transformers/
- tomohideshibata/BERT-related-papers:https://github.com/tomohideshibata/BERT-related-papers
- thu/GNNPapers:https://github.com/thunlp/GNNPapers
- Jiakui/awesome-bert:https://github.com/Jiakui/awesome-bert
- xieyuqiang/GNN4NLP-Papers: https://github.com/IndexFziQ/GNN4NLP-Papers
- THU/OpenMatch: https://github.com/thunlp/OpenMatch
- THU/PLMPapers: https://github.com/thunlp/PLMpapers
综述类
-
[IR 排序模型综述] A Deep Look into Neural Ranking Models for Information Retrieval [2019, guojiafeng] arXiv
-
[NLP NERZ综述] A Survey on Deep Learning for Named Entity Recognition [2020-03] arXiv
-
[BERT 进展] A Primer in BERTology: What we know about how BERT works
-
Matching Algorithms: Fundamentals, Applications and Challenges [2021-03]arXiv
-
Information retrieval: a view from the Chinese IR community [chengxueqi, 2021]PDF
-
Retrieving and Reading : A Comprehensive Survey on Open-domain Question Answering arXiv
-
Pretrained Transformers for Text Ranking: BERT and Beyond arXiv
2021
未标注会议
-
Dynamic Model for Query-Document Expansion towards Improving Retrieval Relevance PDF ☁️20210414
-
A Neural Passage Model for Ad-hoc Document Retrieval PDF
-
An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking arXiv
-
Mitigating the Position Bias of Transformer Models in Passage Re-Ranking arXiv [ecir2021]
-
Neural document expansion for ad-hoc information retrieval arXiv [亚马逊] ☁️20200414
-
PARADE: Passage Representation Aggregation for Document Reranking arXiv
-
Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval [PhD thesis]arXiv
-
MIMICS: A Large-Scale Data Collection for Search Clarification arXiv
-
Unbiased Learning to Rank via Propensity Ratio Scoring arXiv
-
Conformer-Kernel with Query Term Independence for Document Retrieval arXiv
-
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits arXiv
-
Longformer for MS MARCO Document Re-ranking Task arXiv
-
ORCAS: 18 Million Clicked Query-Document Pairs for Analyzing Search arXiv
-
Composite Re-Ranking for Efficient Document Search with BERT arXiv
-
Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval arXiv
-
Match-Ignition: Plugging PageRank into Transformer for Long-form Text Matching [chengxueqi] arXiv
-
ERNIE-DOC: The Retrospective Long-Document Modeling Transformer arXiv
-
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models arXiv
-
A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models arXiv
-
ABNIRML: Analyzing the Behavior of Neural IR Models arXiv
-
Assessing the Benefits of Model Ensembles in Neural Re-Ranking for Passage Retrieval [ECIR short paper] arXiv
-
Distilling Dense Representations for Ranking using Tightly-Coupled Teachers [Jimmy Lin] arXiv
-
Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness arXiv [naacl2021]
-
A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models arXiv
-
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation arXiv
-
Investigating the case of weak baselines in Ad-hoc Retrieval and Question Answering arXiv
-
An Attention-based Deep Relevance Model for Few-shot Document Filtering
-
Topic-enhanced knowledge-aware retrieval model for diverse relevance estimation pdf
-
A Pseudo-relevance feedback framework combining relevance matching and semantic matching for information retrieval PDF
-
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering arXiv
-
CoRT: Complementary Rankings from Transformers arXiv
-
On the Calibration and Uncertainty of Neural Learning to Rank Models arXiv
-
Keyword-Attentive Deep Semantic Matching arXiv
-
ABNIRML: Analyzing the Behavior of Neural IR Models [AI2] arXiv
-
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks arXiv
-
Less is More: Pre-training a Strong Siamese Encoder Using a Weak Decoder [MS] arXiv
-
Learning To Retrieve: How to Train a Dense Retrieval Model Effectively and Efficiently
-
OpenMatch: An Open-Source Package for Information Retrieval arXiv
-
An end-to-end pseudo relevance feedback framework for neural document retrieval [2019]
SIGIR2021 审稿中,4.14 出结果 官网
-
长文:151/720, 21%, 短文: 145/526, 27%
-
1.Optimizing Dense Retrieval Model Training with Hard Negatives [MA SHAO PING]
-
2.Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
-
3.Dual-View Distilled BERT for Sentence Embedding arXiv
-
4.AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search arXiv
-
5.Wizard of Search Engine: Access to Information Through Conversations with Search Engines
-
6.Self-Supervised Contrastive Learning for Code Retrieval and Summarization via Semantic-Preserving Transformations
-
7.Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models [****]
-
8.B-PROP: Bootstrapped Pre-training with Representative Words Prediction for Ad-hoc Retrieval [****]
-
9.Learning to Rank for Mathematical Formula Retrieval
-
10.Investigating User Behavior in Legal Case Retrieval
-
11.Few-Shot Conversational Dense Retrieval
-
12.Heterogeneous Attention Network for Effective and Efficient Cross-modal Retrieval
-
13.PAN: Prototype-based Adaptive Network for Robust Cross-modal Retrieval
-
14.Adapting Interactional Observation Embedding for Counterfactual Learning to Rank
-
15.MS MARCO: Benchmarking Ranking Models in the Large-Data Regime
-
16.KeyBLD: Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval
-
17.Decoupling Representation and Regressor for Long-Tailed Information Cascade Prediction
-
18.On the Orthogonality of Bias and Effectiveness in Ad hoc Retrieval
-
19.Entity Retrieval Using Fine-Grained Entity Aspects
-
20.Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval
-
21.Conversational vs Traditional: Comparing Search Behavior and Outcome in Legal Case Retrieval
-
22.Learning Robust Dense Retrieval Models from Incomplete Relevance Labels
-
23.Joint Learning of Deep Retrieval Model and Product Quantization based Embedding Index
-
24.APRF-Net: Attentive Pseudo-Relevance Feedback Network for Query Categorization
-
25.Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search
-
26.Revisiting the Tag Relevance Prediction Problem
-
27.Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness
-
28.Learning a Fine-Grained Review-based Transformer Model for Personalized Product Search
-
29.Do Affective Cues Validate Behavioural Metrics for Search?
-
30.Modeling Intent Graph for Search Result Diversification [wenjirong]
-
31.Group based Personalized Search by Integrating Search Behaviour and Friend Network [wenjirogn]
-
32.When Fair Ranking Meets Uncertain Inference arXiv [***]
-
33.Intra-Document Cascading: Learning to Select Passages for Neural Document Ranking
-
34.IDRQA: Iterative Document Reranking for Open-domain Multi-hop Question Answering
-
35.Improving Bi-encoder Document Ranking Models with Two Rankers and Multi-teacher Distillation
-
36.SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking
-
37.Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard
-
38.Towards a Better Tradeoff between Effectiveness and Efficiency in Pre-Ranking: A Learnable Feature Selection based Approach
-
39.Improving Transformer-Kernel Ranking Model Using Conformer and Query Term Independence
-
40.A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models
-
42.Learning Passage Impacts for Inverted Indexes(https://arxiv.org/abs/2104.12016) [下载]
-
44.[AutoDebias: Learning to Debias for Recommendation] [2021-07-02]
-
- [Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation]
-
- [Long-Tail Hashing]
-
47.[Looking at CTR Prediction Again: Is Attention All You Need?]
-
- [Allowing for The Grounded Use of Temporal Difference Learning in Large Ranking Models via Sub State Updates ]
-
49.[How Do Biased Search Result Rankings Affect User Attitudes on Debated Topics?]
-
50 [Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots]
-
- [ScaleFreeCTR: MixCache-based Distributed Training System for CTR Models with Huge Embedding Table]
-
52 [Enhanced Doubly Robust Learning for Debiasing Post-Click Conversion Rate Estimation]
-
53 [Improving Video Retrieval by Adaptive Margin]
-
54 [Intra-Document Cascading: Learning to Select Passages for Neural Document Ranking]
-
55 [Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling]
-
56 [A Graph-Enhanced Click Model for Web Search]
-
57 [Adapting Interactional Observation Embedding for Counterfactual Learning to Rank]
-
58 [Tracing Knowledge State with Individual Cognition and Acquisition Estimation]
-
59 [A General Method For Automatic Discovery of Powerful Interactions In Click-Through Rate Prediction]
-
60 [Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness]
-
61 [Learning Graph Meta Embeddings for Cold-Start Ads in Click-Through Rate Prediction]
-
62 [Societal Biases in Retrieved Contents: Measurement Framework and Adversarial Mitigation of BERT Rankers]
-
63 [Wizard of Search Engine: Access to Information Through Conversations with Search Engines]
-
64 Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue
-
65 AdsGNN: Behavior-Graph Augmented Relevance Modeling in Sponsored Search
-
66 Few-Shot Conversational Dense Retrieval
-
67 Optimizing Dense Retrieval Model Training with Hard Negatives
-
68 IDRQA: Iterative Document Reranking for Open-domain Multi-hop Question Answering
-
69 Answer Complex Questions: Path Ranker Is All You Need
-
70 On the Orthogonality of Bias and Effectiveness in Ad hoc Retrieval
-
71 Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search
-
72 Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval
-
73 Dual-View Distilled BERT for Sentence Embedding
-
74 GemNN: Gating-enhanced Multi-task Neural Networks with Feature Interaction Learning for CTR Prediction
-
75 Deep Position-wise Interaction Network For CTR Prediction
-
76 Deep User Match Network for Click-Through Rate Prediction
-
77 Revisiting the Tag Relevance Prediction Problem
-
78 Dual Unbiased Recommender Learning for Implicit Feedback
-
79 Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural Networks for CTR Prediction
-
80 KeyBLD: Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval
-
81 Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard
-
82 Cross-Batch Negative Sampling for Training Two-Tower Recommenders
-
83 RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction
-
84 A Test Collection for Ad-hoc Dataset Retrieval
-
85 WWW3E8: 259,000 Relevance Labels for Studying the Effect of Document Presentation Order for Relevance Assessors
CIKM2021 投稿中,8.23 CRD 官网
WWW2021
-
Robust Generalization and Safe Query-Specialization in Counterfactual Learning to Rank AMiner
-
A Linguistic Study on Relevance Modeling in Information Retrieval [chengxuqi & guojiafeng]AMiner
-
Maximizing Marginal Fairness for Dynamic Learning to Rank [qingao ai] AMiner
-
An Alternative Cross Entropy Loss for Learning-to-Rank AMiner
-
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks AMiner
-
Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval AMiner
-
MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks AMiner
-
Event-Driven Query Expansion 。。。
-
Density-Ratio Based Personalised Ranking from Implicit Feedback AMiner
-
Relevance-guided Supervision for OpenQA with ColBERT arXiv
-
An Adversarial Imitation Click Model for Information Retrieval arXiv
WSDM2021:
-
PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval [chengxueqi & guojiafeng] arXiv
-
Bias-Variance Decomposition for Ranking PDF
-
Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator that Effectively Utilizes Online Interventions
-
Pretrained Transformers for Text Ranking: BERT and Beyond PDF
-
Non-Clicks Mean Irrelevant? Propensity Ratio Scoring As a Correction PDF
-
Beyond Relevance: Trustworthy Answer Selection via Consensus Verification PDF
-
Beyond Probability Ranking Principle: Modeling the Dependencies among Documents PDF
-
QueryBlazer: Efficient Query Autocompletion Framework PDF
KDD2021: 5.17 录用结果 官网
ACL2021:
EMNLP2021:
AAAI2021: 1692篇
笔记整理:
论文整理:
-
Learning to Truncate Ranked Lists for Information Retrieval [chengxueqi & guojiafeng]AMiner
-
A Graph-based Relevance Matching Model for Ad-hoc Retrieval AMiner
-
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching AMiner
-
Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching AMiner
-
Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks AMiner
IJCAI2021:
ICLR2021:
-
Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? AMiner
-
Individually Fair Rankings AMiner
-
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval AMiner
NIPS2021:
2020
SIGIR2020:
SIGIR2020: 官网 dblp aminer
SIGIR会议之文本表征、检索重排序、阅读理解论文整理
一、文本的表征
- Convolutional Embedding for Edit Distance arXiv
- Match^2: A Matching over Matching Model for Similar Question Identification arXiv AMiner
- Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Document Similarity arXiv
- Attending to Inter-sentential Features in Neural Text Classification pdf
- Towards Differentially Private Text Representations arXiv
- Expansion via Prediction of Importance with Contextualization arXiv
- DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding [jiang daxin] arXiv [AMiner](DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding)
- Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection arXiv
- Symmetric Regularization based BERT for Pair-wise Semantic Reasoning arXiv
二、检索及重排序
- Table Search Using a Deep Contextualized Language Model arXiv
- ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT arXiv
- Finding the Best of Both Worlds: Faster and More Robust Top-k Document Retrieval PDF
- Efficient Document Re-Ranking for Transformers by Precomputing Term Representations arXiv
- Training Curricula for Open Domain Answer Re-Ranking arXiv
- SetRank: Learning a Permutation-Invariant Ranking Model for Information Retrieval arXiv
- MarkedBERT: Integrating Traditional IR cues in pre-trained language models for passage retrieval pdf
- Context-Aware Term Weighting For First-Stage Passage Retrieval PDF
- Learning Term Discrimination arXiv
- Improving Document-Level Text Retrieval using Local Attention in the Transformer-Kernel Pooling Model arXiv
- Query-level Early Exit for Additive Learning-to-Rank Ensembles arXiv
- Efficiency Implications of Term Re-Weighting for Passage Retrieval PDF
- Contextual Re-Ranking with Behavior Aware Transformers [MS]arXiv AMiner
- An analysis of BERT in document ranking [mashaoping] arXiv* AMiner
三、QA及阅读理解
- Open-Retrieval Conversational Question Answering arXiv AMiner
- Reranking for Efficient Transformer-based Answer Selection arXiv
四、其他
-
- Automatic Generation of Topic Labels arXiv
- CrossBERT with Triplet Neural Architecture for Entity Property Ranking arXiv
- Few-Shot Generative Conversational Query Rewriting arXiv
五. 补充
-
Investigating reading behavior in Fine-grained Relevance Judgment [liuyiqun & mashaoping]AMiner
-
Query-level Early Exit for Additive Learning-to-Rank Ensembles AMiner
-
Sampling Bias Due to Near-Duplicates in Learning to Rank AMiner
-
Policy-Aware Unbiased Learning to Rank for Top-k Rankings AMiner
-
Accelerated Convergence for Counterfactual Learning to Rank AMiner
-
Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank AMiner
-
Re-ranking Answer Selection with Similarity Aggregation AMiner
-
Multi-grouping Robust Fair Ranking AMiner
-
Feature Transformation for Neural Ranking Models AMiner
-
SetRank: Learning a Permutation-Invariant Ranking Model for Information Retrieval [wenjirong & chengxueqi ] AMiner
-
Efficient Document Re-Ranking for Transformers by Precomputing Term Representations AMiner
-
Training Curricula for Open Domain Answer Re-Ranking AMiner
-
A Deep Recurrent Survival Model for Unbiased Ranking [gaikun] AMiner
-
Studying Ranking-Incentivized Web Dynamics AMiner
-
Choppy: Cut Transformer For Ranked List Truncation AMiner
-
Unsupervised Semantic Hashing with Pairwise Reconstruction AMiner
-
Improving Contextual Language Models for Response Retrieval in Multi-Turn Conversation AMiner
-
Retrieving Potential Causes from a Query Event AMiner
-
A Study of Neural Matching Models for Cross-lingual IR AMiner
-
A Quantum Interference Inspired Neural Matching Model for Ad-hoc Retrieval AMiner
-
Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection AMiner
-
Soft Kernel-based Ranking on a Statistical Manifold AMiner
-
Analyzing and Learning from User Interactions for Search Clarification AMiner
-
Knowledge Enhanced Personalized Search [renjirong] AMiner
-
Topic Propagation in Conversational Search AMiner
-
Search Result Explanations Improve Efficiency and Trust AMiner
-
Encoding History with Context-aware Representation Learning for Personalized Search AMiner
-
Do Neural Ranking Models Intensify Gender Bias? AMiner
-
Query Resolution for Conversational Search with Limited Supervision AMiner
-
Local Self-Attention over Long Text for Efficient Document Retrieval arXiv
-
End-to-End Contextualized Document Indexing and Retrieval with Neural Networks PDF
-
Listwise Learning to Rank by Exploring Unique Ratings arXiv
CIKM2020:
-
Fine-Grained Relevance Annotations for Multi-Task Document Ranking and Question Answering arXiv
-
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching arXiv
-
Distant Supervision in BERT-based Adhoc Document Retrieval PDF
-
Learning Better Representations for Neural Information Retrieval with Graph Information PDF
WWW2020:
-
Selective Weak Supervision for Neural Information Retrieval arXiv
-
Context-Aware Document Term Weighting for Ad-Hoc Search PDF
-
Leveraging Passage-level Cumulative Gain for Document Ranking PDF
-
IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems
arXiv -
HDCT:Context-Aware Document Term Weighting for Ad-Hoc Search PDF
WADM2020*
KDD2020:
1.Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding AMiner
2.Mining Implicit Relevance Feedback from User Behavior for Web Question Answering AMiner
-
FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents AMiner
-
MinSearch: An Efficient Algorithm for Similarity Search under Edit Distance AMiner
-
Embedding-based Retrieval in Facebook Search AMiner
-
Managing Diversity in Airbnb Search AMiner
-
Improving Deep Learning For Airbnb Search AMiner
ACL2020:
-
QUASE: Question-Answer Driven Sentence Encoding [ai2]arXiv
-
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection PDF
-
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport AMiner
-
Extractive Summarization as Text Matching AMiner
-
Neural Graph Matching Networks for Chinese Short Text Matching AMiner
EMNLP2020:
-
BERT-QE: Contextualized Query Expansion for Document Re-ranking [sunle]arXiv
-
Beyond [CLS] through Ranking by Generation AMiner
-
Modularized Transfomer based Ranking Framework AMiner
-
Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains [wenjirong] AMiner
-
A Bilingual Generative Transformer for Semantic Sentence Embedding AMiner
-
MultiCQA: Zero Shot Transfer of Self Supervised Text Matching Models on a Massive Scale AMiner
-
SLM: Learning a Discourse Language Representation with Sentence Unshuffling AMiner
-
Better Highlighting: Creating Sub Sentence Summary Highlights
-
KERMIT: Complementing Transformer Architectures with Encoders of Explicit Syntactic Interpretations AMiner
-
With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All Round Word Sense Disambiguation AMiner
-
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space [zhouming] AMiner
-
Cross Thought for Sentence Encoder Pre training AMiner
-
Long Document Ranking with Query-Directed Sparse Transformer arXiv
-
Ad-hoc Document Retrieval using Weak-Supervision with BERT and GPT2 AMiner
-
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling arXiv
AAAI2020:
IJCAI2020:
ICLR2020:
-
Pre-training Tasks for Embedding-based Large-scale Retrieval arXiv AMiner
-
Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring AMiner
-
Encoding word order in complex embeddings AMiner
-
Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models AMiner
NIPS2020:
2019
SIGIR2019:
CIKM2019:
- IMN: Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots arXiv
WWW2019:
KDD2019:
-
Fairness-Aware Ranking in Search & Recommendation Systems with Application to LinkedIn Talent Search AMiner
-
MediaRank: Computational Ranking of Online News Sources AMiner
-
MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search [baidu] AMiner
-
A Memory-Efficient Sketch Method for Estimating High Similarities in Streaming Sets AMiner
ACL2019:
EMNLP2019:
-
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks arXiv
-
Bridging the Gap Between Relevance Matching and Semantic Matching for Short Text Similarity Modeling
AAAI2019:
- DRCN: Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information arXiv
IJCAI2019:
ICLR2019:
NIPS2019:
other
-
Context-Aware Sentence/Passage Term Importance Estimation For First Stage Retrieval arXiv
-
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network [BaiDu ACL2018]
LTR
- 1.Document Ranking with a Pretrained Sequence-to-Sequence Model
- 2.[2020 NeurIPS] [51] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- 3.Match-Ignition: Plugging PageRank into Transformer for Long-form Text Matching
- 4.Cross-Thought for Sentence Encoder Pre-training
- 5.On the Sentence Embeddings from Pre-trained Language Models
- 6.PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval
- 7.[2020 ACL-Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention
- 8.Integrating fine- and coarse-grained information for natural language understanding
- 9.2020 SIGIR】MarkedBERT: Integrating Traditional IR Cues in Pre-trained Language Models for Passage Retrieval
- 10.【2019 AAAI】Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
- 11.Towards Personalized and Semantic Retrieval: An End-to-End Solution for E-commerce Search via Embedding Learning
- 12.[2020 KDD] Embedding-based Retrieval in Facebook Search.
- 13.[ 2020 SIGIR ] ColBERT Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
- 14.[ 2020 ICLR ] Reformer: The efficient Transformer
- 15.[SIGIR 2020] Efficient Document Re-Ranking for Transformers by Precomputing Term Representations
- 16.Adaptive Multi-Attention Network Incorporating Answer Information for Duplicate Question Detection
- 17.[2019 AAAI] [66] Semantic Sentence Matching with Densely-Connected Recurrent and Co-Attentive Information
日常补充
2021-04-23
- 1.Semantic Models for the First-stage Retrieval: A Comprehensive Review
- 2.A Replication Study of Dense Passage Retriever
-
- 4.Efficient Retrieval Optimized Multi-task Learning
- 5.Anytime Ranking on Document-Ordered Indexes
- 6.Co-BERT: A Context-Aware BERT Retrieval Model Incorporating Local and Query-specific Context [孙乐]
- 7.BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
- 8.ASBERT: Siamese and Triplet network embedding for open question answering
- 9.Contextualized Query Embeddings for Conversational Search
- 10.[
- 11.SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts
- 12.COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List [NAACL2021]
-
- RoFormer: Enhanced Transformer with Rotary Position Embedding arXiv
- 14.Simple and Efficient ways to Improve REALM arXiv
- 15.SimCSE: Simple Contrastive Learning of Sentence Embeddings arXiv
- 16.Joint Passage Ranking for Diverse Multi-Answer Retrieval arXiv
- 17.Knowledge Neurons in Pretrained Transformers arXiv
- 18.Demystifying the Better Performance of Position Encoding Variants for Transformer arXiv
- 19.Chinese Sentences Similarity via Cross-Attention Based Siamese Network arXiv
- 20.Reference-based Weak Supervision for Answer Sentence Selection using Web Data arXiv
- 21.Context-Aware Interaction Network for Question Matching arXiv
- 22.A Graph-guided Multi-round Retrieval Method for Conversational Open-domain Question Answering arXiv
- 23.Ranking Structured Objects with Graph Neural Networks arXiv
- 24.Fast, Effective and Self-Supervised: Transforming Masked LanguageModels into Universal Lexical and Sentence Encoders arXiv
- 25.Towards Robust Neural Retrieval Models with Synthetic Pre-Training arXiv
- 26.Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models arXiv [NAACL2021]
- 27.Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering arXiv
- 28.Sentence Embeddings by Ensemble Distillation arXiv
- 29.Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard arXiv
- 30.Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures []
2021-04-27
1.Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation arXiv [THU]
2021-05-04
1.Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation
2. SmoothI: Smooth Rank Indicators for Differentiable IR Metrics
3.Transformers: "The End of History" for NLP?
2021-05-05
1.Pretrained Transformers for Text Ranking:
BERT and Beyond
2. Comparing Score Aggregation Approaches for Document Retrieval with Pretrained Transformers [ECIR2021]
3.A White Box Analysis of ColBERT [ECIR2021]
4.TREC Deep Learning Track: Reusable Test Collections in the Large Data Regime
5.