Deductive Verification with Natural Programs: Case Studies

8 Sept 2024

Explore detailed examples of deductive verification with a Natural Program-based approach, highlighting successful error detection and areas of improvement.

Essential Prompts for Reasoning Chain Verification and Natural Program Generation

8 Sept 2024

Explore a comprehensive list of prompts for verifying and generating reasoning chains.

Deductive Verification of Chain-of-Thought Reasoning: More Details on Answer Extraction

8 Sept 2024

Discover a detailed process for extracting final answers from language models, including pattern recognition and regular expression techniques.

Understanding the Impact of Deductive Verification on Final Answer Accuracy

8 Sept 2024

Understand why improvements in deductive verification accuracy don't always lead to better final answer correctness, with a focus on the GSM8K dataset.

How Fine-Tuning Impacts Deductive Verification in Vicuna Models

8 Sept 2024

Discover how fine-tuning Vicuna models boosts their deductive verification accuracy, and see why they still trail behind GPT-3.5 in performance.

A New Framework for Trustworthy AI Deductive Reasoning

8 Sept 2024

Discover how the Natural Program framework revolutionizes AI reasoning by enhancing accuracy with innovative verification and voting strategies.

When Deductive Reasoning Fails: Contextual Ambiguities in AI Models

8 Sept 2024

The limitations of Natural Program-based deductive verification highlight AI’s struggles with contextual ambiguities.

How Natural Program Improves Deductive Reasoning Across Diverse Datasets

8 Sept 2024

This paper evaluates the effectiveness of the Natural Program-based deductive reasoning process, showcasing improvements in reasoning rigor and reliability.

Deductively Verifiable Chain-of-Thought Reasoning

8 Sept 2024

Discover how Natural Program and deductive verification enhance AI reasoning accuracy and trust by validating every step with unanimity-plurality voting.