LIVIVO - The Search Portal for Life Sciences

Search results

Results 1 - 5 of 5

  1. Book ; Online: Natural Language to Code Translation with Execution

    Shi, Freda / Fried, Daniel / Ghazvininejad, Marjan / Zettlemoyer, Luke / Wang, Sida I.

    2022  

    Abstract: Generative models of code, pretrained on large corpora of programs, have shown great success in translating natural language to code (Chen et al., 2021; Austin et al., 2021; Li et al., 2022, inter alia). While these models do not explicitly incorporate program semantics (i.e., execution results) during training, they are able to generate correct solutions for many problems. However, choosing a single correct program from a generated set for each problem remains challenging. In this work, we introduce execution result-based minimum Bayes risk decoding (MBR-EXEC) for program selection and show that it improves the few-shot performance of pretrained code models on natural-language-to-code tasks. We select output programs from a generated candidate set by marginalizing over program implementations that share the same semantics. Because exact equivalence is intractable, we execute each program on a small number of test inputs to approximate semantic equivalence. Across datasets, execution or simulated execution significantly outperforms the methods that do not involve program semantics. We find that MBR-EXEC consistently improves over all execution-unaware selection methods, suggesting it as an effective approach for natural language to code translation. We open-source our code at github.com/facebookresearch/mbr-exec and data at dl.fbaipublicfiles.com/mbr-exec/mbr-exec-release.zip

    Comment: EMNLP 2022
    Keywords Computer Science - Computation and Language ; Computer Science - Software Engineering
    Subject code 005 ; 004
    Publishing date 2022-04-25
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)
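
    The MBR-EXEC selection step described in the abstract is compact enough to sketch. The sketch below assumes each sampled candidate defines a solve() function and that agreement on a few test inputs approximates semantic equivalence; run_program() and the toy candidates are illustrative stand-ins, and the authors' released code linked above is the authoritative implementation.

    # Minimal sketch of execution-based MBR selection (MBR-EXEC).
    def run_program(program: str, test_input):
        """Execute one candidate on one input; crashes become a sentinel."""
        namespace = {}
        try:
            exec(program, namespace)  # assumes the candidate defines solve()
            return repr(namespace["solve"](test_input))
        except Exception:
            return "<error>"  # crashing candidates group with each other, not with working ones

    def mbr_exec(candidates: list[str], test_inputs: list) -> str:
        """Pick the candidate whose execution results agree with the most
        other candidates, marginalizing over semantically equivalent programs."""
        signatures = [
            tuple(run_program(p, x) for x in test_inputs) for p in candidates
        ]
        scores = [sum(sig == other for other in signatures) for sig in signatures]
        return candidates[max(range(len(candidates)), key=scores.__getitem__)]

    # Toy usage: two of the three candidates are semantically equivalent,
    # so a doubling program wins over the squaring outlier.
    candidates = [
        "def solve(x):\n    return x * 2",
        "def solve(x):\n    return x + x",
        "def solve(x):\n    return x ** 2",
    ]
    print(mbr_exec(candidates, test_inputs=[1, 2, 3]))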

  2. Book ; Online: Large Language Models Can Be Easily Distracted by Irrelevant Context

    Shi, Freda / Chen, Xinyun / Misra, Kanishka / Scales, Nathan / Dohan, David / Chi, Ed / Schärli, Nathanael / Zhou, Denny

    2023  

    Abstract: Large language models have achieved impressive performance on various natural language processing tasks. However, so far they have been evaluated primarily on benchmarks where all information in the input context is relevant for solving the task. In this work, we investigate the distractibility of large language models, i.e., how model problem-solving accuracy is affected by irrelevant context. In particular, we introduce Grade-School Math with Irrelevant Context (GSM-IC), an arithmetic reasoning dataset with irrelevant information in the problem description. We use this benchmark to measure the distractibility of cutting-edge prompting techniques for large language models, and find that model performance drops dramatically when irrelevant information is included. We also identify several approaches for mitigating this deficiency, such as decoding with self-consistency and adding an instruction to the prompt that tells the language model to ignore the irrelevant information.

    Comment: Published in ICML 2023
    Keywords Computer Science - Computation and Language ; Computer Science - Artificial Intelligence
    Subject code 004
    Publishing date 2023-01-31
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)
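
    Both mitigations named at the end of the abstract can be prototyped in a few lines. The sketch below pairs an explicit ignore-the-irrelevant-context instruction with self-consistency, i.e. majority voting over sampled reasoning paths; sample_completions stands in for any LLM sampling call, and the answer extraction is deliberately naive, so all names here are assumptions rather than the paper's code.

    import re
    from collections import Counter

    IGNORE_INSTRUCTION = (
        "Solve the grade-school math problem. The problem description may "
        "contain irrelevant information; feel free to ignore it.\n\n"
    )

    def extract_answer(completion: str):
        """Naively take the last number in a completion as its final answer."""
        numbers = re.findall(r"-?\d+(?:\.\d+)?", completion)
        return numbers[-1] if numbers else None

    def self_consistent_answer(problem: str, sample_completions, n: int = 10):
        """Sample n reasoning paths and return the majority-vote answer."""
        prompt = IGNORE_INSTRUCTION + problem
        answers = [extract_answer(sample_completions(prompt)) for _ in range(n)]
        answers = [a for a in answers if a is not None]
        return Counter(answers).most_common(1)[0][0] if answers else None

    # Hypothetical usage:
    # answer = self_consistent_answer(gsm_ic_problem, sample_completions=my_llm_sampler)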

  3. Book ; Online: Audio-Visual Neural Syntax Acquisition

    Lai, Cheng-I Jeff / Shi, Freda / Peng, Puyuan / Kim, Yoon / Gimpel, Kevin / Chang, Shiyu / Chuang, Yung-Sung / Bhati, Saurabhchand / Cox, David / Harwath, David / Zhang, Yang / Livescu, Karen / Glass, James

    2023  

    Abstract: We study phrase structure induction from visually-grounded speech. The core idea is to first segment the speech waveform into sequences of word segments, and subsequently induce phrase structure using the inferred segment-level continuous representations. We present the Audio-Visual Neural Syntax Learner (AV-NSL) that learns phrase structure by listening to audio and looking at images, without ever being exposed to text. By training on paired images and spoken captions, AV-NSL exhibits the capability to infer meaningful phrase structures that are comparable to those derived by naturally-supervised text parsers, for both English and German. Our findings extend prior work in unsupervised language acquisition from speech and grounded grammar induction, and present one approach to bridge the gap between the two topics.
    Keywords Computer Science - Computation and Language ; Computer Science - Machine Learning ; Computer Science - Sound ; Electrical Engineering and Systems Science - Audio and Speech Processing
    Subject code 430
    Publishing date 2023-10-11
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)
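
    The second stage of the pipeline described in the abstract, inducing phrase structure over segment-level continuous representations, can be illustrated with a generic toy parser. The greedy merging below is a stand-in chosen for brevity, not the authors' AV-NSL model, and the segment embeddings here are random.

    import numpy as np

    def cosine(u, v):
        """Cosine similarity between two 1-D vectors."""
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-8))

    def greedy_parse(segments, embeddings):
        """Build a binary tree by repeatedly merging the most similar
        adjacent pair of segment representations into one constituent."""
        trees = list(segments)
        vecs = [np.asarray(e, dtype=float) for e in embeddings]
        while len(trees) > 1:
            i = max(range(len(vecs) - 1), key=lambda k: cosine(vecs[k], vecs[k + 1]))
            trees[i : i + 2] = [(trees[i], trees[i + 1])]
            vecs[i : i + 2] = [(vecs[i] + vecs[i + 1]) / 2]  # merged representation
        return trees[0]

    # Toy usage with made-up word segments and random 8-dimensional embeddings.
    rng = np.random.default_rng(0)
    words = ["the", "dog", "chased", "a", "cat"]
    print(greedy_parse(words, rng.normal(size=(5, 8))))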

  4. Book ; Online: Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Zhang, Yue / Li, Yafu / Cui, Leyang / Cai, Deng / Liu, Lemao / Fu, Tingchen / Huang, Xinting / Zhao, Enbo / Zhang, Yu / Chen, Yulong / Wang, Longyue / Luu, Anh Tuan / Bi, Wei / Shi, Freda / Shi, Shuming

    2023  

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to the reliability of LLMs in real-world scenarios. In this paper, we survey recent efforts on the detection, explanation, and mitigation of hallucination, with an emphasis on the unique challenges posed by LLMs. We present taxonomies of the LLM hallucination phenomena and evaluation benchmarks, analyze existing approaches aiming at mitigating LLM hallucination, and discuss potential directions for future research.

    Comment: work in progress; 32 pages
    Keywords Computer Science - Computation and Language ; Computer Science - Artificial Intelligence ; Computer Science - Computers and Society ; Computer Science - Machine Learning
    Subject code 501
    Publishing date 2023-09-03
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

  5. Book ; Online: InCoder: A Generative Model for Code Infilling and Synthesis

    Fried, Daniel / Aghajanyan, Armen / Lin, Jessy / Wang, Sida / Wallace, Eric / Shi, Freda / Zhong, Ruiqi / Yih, Wen-tau / Zettlemoyer, Luke / Lewis, Mike

    2022  

    Abstract: Code is seldom written in a single left-to-right pass and is instead repeatedly edited and refined. We introduce InCoder, a unified generative model that can perform program synthesis (via left-to-right generation) as well as editing (via infilling). InCoder is trained to generate code files from a large corpus of permissively licensed code, where regions of code have been randomly masked and moved to the end of each file, allowing code infilling with bidirectional context. Our model is the first generative model that is able to directly perform zero-shot code infilling, which we evaluate on challenging tasks such as type inference, comment generation, and variable re-naming. We find that the ability to condition on bidirectional context substantially improves performance on these tasks, while still performing comparably on standard program synthesis benchmarks in comparison to left-to-right only models pretrained at similar scale. The InCoder models and code are publicly released at https://sites.google.com/view/incoder-code-models

    Comment: ICLR 2023. v3: camera-ready that includes PLBART and OpenAI baselines
    Keywords Computer Science - Software Engineering ; Computer Science - Computation and Language ; Computer Science - Machine Learning
    Subject code 005
    Publishing date 2022-04-12
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)
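
    The causal-masking trick the abstract describes, replacing a region with a sentinel and moving it to the end of the file so a left-to-right model sees bidirectional context, reduces to prompt construction. The sentinel spellings below follow the released InCoder models but should be treated as assumptions, as should the commented-out generate call; the project site linked in the abstract documents the real usage.

    def make_training_example(document: str, start: int, end: int, k: int = 0) -> str:
        """Move document[start:end] to the end of the file behind mask sentinel k."""
        masked_span = document[start:end]
        return (
            document[:start]
            + f"<|mask:{k}|>"
            + document[end:]
            + f"<|mask:{k}|>"
            + masked_span
            + "<|endofmask|>"
        )

    def make_infilling_prompt(prefix: str, suffix: str) -> str:
        """At inference time, everything the model generates after the second
        sentinel is the infill for the span between prefix and suffix."""
        return prefix + "<|mask:0|>" + suffix + "<|mask:0|>"

    # Toy usage: infill a function body given its signature and the code after it.
    prompt = make_infilling_prompt(
        prefix="def add(a, b):\n    ",
        suffix="\n\nprint(add(1, 2))\n",
    )
    # completion = model.generate(prompt)  # hypothetical model call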
