Vision-and-Language Transformer Dialogue Diffusion PLM Hallucination Retrieval
2024
- [EMNLP2023] Enhancing Chat Language Models by Scaling High-quality Instructional Conversations » 17 Apr 2024
- [ICLR2024] #INSTAG: INSTRUCTION TAGGING FOR ANALYZING SUPERVISED FINE-TUNING OF LARGE LANGUAGE MODELS » 15 Apr 2024
- [Arxiv 2404]HyperCLOVA X Technical Report » 05 Apr 2024
- [Arxiv 2402] REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering » 29 Mar 2024
- [TACL2021] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies » 20 Mar 2024
- [ACL2023] ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models » 18 Mar 2024
- [Arxiv 2401] Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts for Open-Domain QA? » 13 Mar 2024
- [EMNLP2023] IfQA: A Dataset for Open-domain Question Answeringunder Counterfactual Presuppositions » 11 Mar 2024
- [EMNLP2023] SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts » 08 Mar 2024
- [ICLR2024] SELF-RAG: LEARNING TO RETRIEVE, GENERATE, AND CRITIQUE THROUGH SELF-REFLECTION » 19 Feb 2024
- [Arxiv 2312] NoMIRACL: Knowing When You Don’t Know for Robust Multilingual Retrieval-Augmented Generation » 02 Feb 2024
- [EMNLP2023] EXPLORE-INSTRUCT: Enhancing Domain-Specific Instruction Coverage through Active Exploration » 31 Jan 2024
- [EMNLP2023] Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs » 29 Jan 2024
- [EMNLP2023] Active Retrieval Augmented Generation » 12 Jan 2024
2023
- A Survey of Large Language Models (4) » 24 Dec 2023
- A Survey of Large Language Models (3) » 24 Dec 2023
- A Survey of Large Language Models (2) » 24 Dec 2023
- A Survey of Large Language Models (1) » 24 Dec 2023
- [ICML2023] A Watermark for Large Language Models » 17 Dec 2023