Yongil Kim
LG AI Research
Contents
About Me
My Projects
Paper Reviews
Paper Reviews
[EMNLP2023] Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
»
17 Apr 2024
[ICLR2024] #INSTAG: INSTRUCTION TAGGING FOR ANALYZING SUPERVISED FINE-TUNING OF LARGE LANGUAGE MODELS
»
15 Apr 2024
[Arxiv 2404]HyperCLOVA X Technical Report
»
05 Apr 2024
[ICLR2024] DP-OPT: MAKE LARGE LANGUAGE MODEL YOUR PRIVACY-PRESERVING PROMPT ENGINEER
»
03 Apr 2024
[ICLR2024] LOFTQ: LORA-FINE-TUNING-AWARE QUANTIZATION FOR LARGE LANGUAGE MODELS
»
01 Apr 2024
[Arxiv 2402] REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
»
29 Mar 2024
[EMNLP2023] EtiCor: Corpus for Analyzing LLMs for Etiquettes
»
27 Mar 2024
[EMNLP2023] Uncertainty Guided Global Memory Improves Multi-Hop Question
»
25 Mar 2024
[EMNLP2021 best paper] Visually Grounded Reasoning across Languages and Cultures
»
22 Mar 2024
[TACL2021] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
»
20 Mar 2024
[ACL2023] ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models
»
18 Mar 2024
[Arxiv 2305] Trusting Your Evidence: Hallucinate Less with Context-aware Decoding
»
15 Mar 2024
[Arxiv 2401] Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts for Open-Domain QA?
»
13 Mar 2024
[EMNLP2023] IfQA: A Dataset for Open-domain Question Answeringunder Counterfactual Presuppositions
»
11 Mar 2024
[EMNLP2023] SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts
»
08 Mar 2024
[EMNLP2023] Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings
»
06 Mar 2024
[EMNLP2023] PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue
»
04 Mar 2024
[ACL2023] A Synthetic Data Generation Framework for Grounded Dialogues
»
28 Feb 2024
[EMNLP2023] CLAIR: Evaluating Image Captions with Large Language Models
»
26 Feb 2024
[EMNLP2023] TaskDiff: A Similarity Metric for Task-Oriented Conversations
»
23 Feb 2024
[EMNLP 2023] Copyright Violations and Large Language Models
»
21 Feb 2024
[ICLR2024] SELF-RAG: LEARNING TO RETRIEVE, GENERATE, AND CRITIQUE THROUGH SELF-REFLECTION
»
19 Feb 2024
[EMNLP 2023] Poisoning Retrieval Corpora in Injecting Adversarial Passages
»
16 Feb 2024
[Arxiv 2307] Evaluating the Ripple Effects of Knowledge Editing in Language Models
»
14 Feb 2024
[ICML2022] HyperPrompt: Prompt-based Task-Conditioning of Transformers
»
05 Feb 2024
[Arxiv 2312] NoMIRACL: Knowing When You Don’t Know for Robust Multilingual Retrieval-Augmented Generation
»
02 Feb 2024
[EMNLP2023] EXPLORE-INSTRUCT: Enhancing Domain-Specific Instruction Coverage through Active Exploration
»
31 Jan 2024
[EMNLP2023] Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
»
29 Jan 2024
[ACL2023] SELF-INSTRUCT: Aligning Lnaugage Models with Self-Generated Insructions
»
26 Jan 2024
[EMNLP2023] Retrieval-Generation Alignment for End-to-End Task-Oriented Diaogue System
»
24 Jan 2024
[EMNLP2023] Active Retrieval Augmented Generation
»
12 Jan 2024
[NeurIPS2023] Meta-in-context learning in large language models
»
10 Jan 2024
[EMNLP2023] HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
»
08 Jan 2024
[ICML2023] QASA: Advanced Question Answering on Scientific Articles
»
02 Jan 2024
A Survey of Large Language Models (4)
»
24 Dec 2023
A Survey of Large Language Models (3)
»
24 Dec 2023
A Survey of Large Language Models (2)
»
24 Dec 2023
A Survey of Large Language Models (1)
»
24 Dec 2023
[ICML2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
»
17 Dec 2023
[ICML2023] Large Language Models Struggle to Learn Long-Tail Knowledge
»
17 Dec 2023
[ICML2023] A Watermark for Large Language Models
»
17 Dec 2023
[ACL2023] Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
»
11 Sep 2023
[ACL2023] FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue
»
19 Aug 2023
[ACL2022] An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue Generation
»
28 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen image Encoders and Large Language Models
»
27 Feb 2023
[NAACL2022] Database Search Results Disambiguation for Task-Oriented Dialog Systems
»
21 Dec 2022
[ICML2022] Data Determinces Distributional Robustness in Contrastive Language-Image Pre-training (CLIP)
»
14 Nov 2022
[ICML2022] NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
»
13 Nov 2022
[ICML2022] Describing Differences between Text Distributions with Natural Language
»
12 Nov 2022
[ICML2022] What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
»
11 Nov 2022
[CVPR 2022 Tutorial] Denoising Diffusion-based Generative Modeling: Foundations and Applications(1)
»
05 Nov 2022
[ICML2022] Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
»
05 Nov 2022
[ICML2022] Dialog Inpainting: Turning Documents into Dialogs
»
01 Nov 2022
[ICML2022] VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
»
31 Oct 2022
[BEIT-3] Image as a Foreign Language: BEIT Pretraining for All Vision and Vision-Language Tasks
»
06 Sep 2022