Yongil Kim

LG AI Research

Articles by category: LLM

Vision-and-Language Transformer Dialogue Diffusion PLM Hallucination Retrieval

2024

[EMNLP2023] Enhancing Chat Language Models by Scaling High-quality Instructional Conversations » 17 Apr 2024

[ICLR2024] #INSTAG: INSTRUCTION TAGGING FOR ANALYZING SUPERVISED FINE-TUNING OF LARGE LANGUAGE MODELS » 15 Apr 2024

[Arxiv 2404]HyperCLOVA X Technical Report » 05 Apr 2024

[ICLR2024] DP-OPT: MAKE LARGE LANGUAGE MODEL YOUR PRIVACY-PRESERVING PROMPT ENGINEER » 03 Apr 2024

[ICLR2024] LOFTQ: LORA-FINE-TUNING-AWARE QUANTIZATION FOR LARGE LANGUAGE MODELS » 01 Apr 2024

[Arxiv 2402] REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering » 29 Mar 2024

[EMNLP2023] EtiCor: Corpus for Analyzing LLMs for Etiquettes » 27 Mar 2024

[EMNLP2023] Uncertainty Guided Global Memory Improves Multi-Hop Question » 25 Mar 2024

[TACL2021] Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies » 20 Mar 2024

[ACL2023] ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models » 18 Mar 2024

[Arxiv 2305] Trusting Your Evidence: Hallucinate Less with Context-aware Decoding » 15 Mar 2024

[Arxiv 2401] Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts for Open-Domain QA? » 13 Mar 2024

[EMNLP2023] IfQA: A Dataset for Open-domain Question Answeringunder Counterfactual Presuppositions » 11 Mar 2024

[EMNLP2023] SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts » 08 Mar 2024

[EMNLP2023] CLAIR: Evaluating Image Captions with Large Language Models » 26 Feb 2024

[EMNLP 2023] Copyright Violations and Large Language Models » 21 Feb 2024

[ICLR2024] SELF-RAG: LEARNING TO RETRIEVE, GENERATE, AND CRITIQUE THROUGH SELF-REFLECTION » 19 Feb 2024

[EMNLP 2023] Poisoning Retrieval Corpora in Injecting Adversarial Passages » 16 Feb 2024

[Arxiv 2307] Evaluating the Ripple Effects of Knowledge Editing in Language Models » 14 Feb 2024

[ICML2022] HyperPrompt: Prompt-based Task-Conditioning of Transformers » 05 Feb 2024

[Arxiv 2312] NoMIRACL: Knowing When You Don’t Know for Robust Multilingual Retrieval-Augmented Generation » 02 Feb 2024

[EMNLP2023] EXPLORE-INSTRUCT: Enhancing Domain-Specific Instruction Coverage through Active Exploration » 31 Jan 2024

[EMNLP2023] Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs » 29 Jan 2024

[ACL2023] SELF-INSTRUCT: Aligning Lnaugage Models with Self-Generated Insructions » 26 Jan 2024

[EMNLP2023] Active Retrieval Augmented Generation » 12 Jan 2024

[NeurIPS2023] Meta-in-context learning in large language models » 10 Jan 2024

[EMNLP2023] HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models » 08 Jan 2024

[ICML2023] QASA: Advanced Question Answering on Scientific Articles » 02 Jan 2024

2023

A Survey of Large Language Models (4) » 24 Dec 2023

A Survey of Large Language Models (3) » 24 Dec 2023

A Survey of Large Language Models (2) » 24 Dec 2023

A Survey of Large Language Models (1) » 24 Dec 2023

[ICML2023] Large Language Models Struggle to Learn Long-Tail Knowledge » 17 Dec 2023

[ICML2023] A Watermark for Large Language Models » 17 Dec 2023

[ACL2023] Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions » 11 Sep 2023

2022

[ICML2022] Describing Differences between Text Distributions with Natural Language » 12 Nov 2022

[ICML2022] What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? » 11 Nov 2022