Paper Reviews

[EMNLP2023] Enhancing Chat Language Models by Scaling High-quality Instructional Conversations » 17 Apr 2024
[ICLR2024] #INSTAG: INSTRUCTION TAGGING FOR ANALYZING SUPERVISED FINE-TUNING OF LARGE LANGUAGE MODELS » 15 Apr 2024
[Arxiv 2404]HyperCLOVA X Technical Report » 05 Apr 2024
[ICLR2024] DP-OPT: MAKE LARGE LANGUAGE MODEL YOUR PRIVACY-PRESERVING PROMPT ENGINEER » 03 Apr 2024
[ICLR2024] LOFTQ: LORA-FINE-TUNING-AWARE QUANTIZATION FOR LARGE LANGUAGE MODELS » 01 Apr 2024
[Arxiv 2402] REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering » 29 Mar 2024
[EMNLP2023] EtiCor: Corpus for Analyzing LLMs for Etiquettes » 27 Mar 2024
[EMNLP2023] Uncertainty Guided Global Memory Improves Multi-Hop Question » 25 Mar 2024
[EMNLP2021 best paper] Visually Grounded Reasoning across Languages and Cultures » 22 Mar 2024
[TACL2021] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies » 20 Mar 2024
[ACL2023] ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models » 18 Mar 2024
[Arxiv 2305] Trusting Your Evidence: Hallucinate Less with Context-aware Decoding » 15 Mar 2024
[Arxiv 2401] Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts for Open-Domain QA? » 13 Mar 2024
[EMNLP2023] IfQA: A Dataset for Open-domain Question Answeringunder Counterfactual Presuppositions » 11 Mar 2024
[EMNLP2023] SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts » 08 Mar 2024
[EMNLP2023] Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings » 06 Mar 2024
[EMNLP2023] PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue » 04 Mar 2024
[ACL2023] A Synthetic Data Generation Framework for Grounded Dialogues » 28 Feb 2024
[EMNLP2023] CLAIR: Evaluating Image Captions with Large Language Models » 26 Feb 2024
[EMNLP2023] TaskDiff: A Similarity Metric for Task-Oriented Conversations » 23 Feb 2024
[EMNLP 2023] Copyright Violations and Large Language Models » 21 Feb 2024
[ICLR2024] SELF-RAG: LEARNING TO RETRIEVE, GENERATE, AND CRITIQUE THROUGH SELF-REFLECTION » 19 Feb 2024
[EMNLP 2023] Poisoning Retrieval Corpora in Injecting Adversarial Passages » 16 Feb 2024
[Arxiv 2307] Evaluating the Ripple Effects of Knowledge Editing in Language Models » 14 Feb 2024
[ICML2022] HyperPrompt: Prompt-based Task-Conditioning of Transformers » 05 Feb 2024
[Arxiv 2312] NoMIRACL: Knowing When You Don’t Know for Robust Multilingual Retrieval-Augmented Generation » 02 Feb 2024
[EMNLP2023] EXPLORE-INSTRUCT: Enhancing Domain-Specific Instruction Coverage through Active Exploration » 31 Jan 2024
[EMNLP2023] Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs » 29 Jan 2024
[ACL2023] SELF-INSTRUCT: Aligning Lnaugage Models with Self-Generated Insructions » 26 Jan 2024
[EMNLP2023] Retrieval-Generation Alignment for End-to-End Task-Oriented Diaogue System » 24 Jan 2024
[EMNLP2023] Active Retrieval Augmented Generation » 12 Jan 2024
[NeurIPS2023] Meta-in-context learning in large language models » 10 Jan 2024
[EMNLP2023] HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models » 08 Jan 2024
[ICML2023] QASA: Advanced Question Answering on Scientific Articles » 02 Jan 2024
A Survey of Large Language Models (4) » 24 Dec 2023
A Survey of Large Language Models (3) » 24 Dec 2023
A Survey of Large Language Models (2) » 24 Dec 2023
A Survey of Large Language Models (1) » 24 Dec 2023
[ICML2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning » 17 Dec 2023
[ICML2023] Large Language Models Struggle to Learn Long-Tail Knowledge » 17 Dec 2023
[ICML2023] A Watermark for Large Language Models » 17 Dec 2023
[ACL2023] Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions » 11 Sep 2023
[ACL2023] FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue » 19 Aug 2023
[ACL2022] An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue Generation » 28 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen image Encoders and Large Language Models » 27 Feb 2023
[NAACL2022] Database Search Results Disambiguation for Task-Oriented Dialog Systems » 21 Dec 2022
[ICML2022] Data Determinces Distributional Robustness in Contrastive Language-Image Pre-training (CLIP) » 14 Nov 2022
[ICML2022] NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework » 13 Nov 2022
[ICML2022] Describing Differences between Text Distributions with Natural Language » 12 Nov 2022
[ICML2022] What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? » 11 Nov 2022
[CVPR 2022 Tutorial] Denoising Diffusion-based Generative Modeling: Foundations and Applications(1) » 05 Nov 2022
[ICML2022] Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts » 05 Nov 2022
[ICML2022] Dialog Inpainting: Turning Documents into Dialogs » 01 Nov 2022
[ICML2022] VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix » 31 Oct 2022
[BEIT-3] Image as a Foreign Language: BEIT Pretraining for All Vision and Vision-Language Tasks » 06 Sep 2022

Yongil Kim

Paper Reviews