Verifiable Geometry Problem Solving: Solver-Driven Autoformalization and Theorem Proposing

wpnews.pro

cd /news/artificial-intelligence/verifiable-geometry-problem-solving-… · home › topics › artificial-intelligence › article

[ARTICLE · art-42943] src=arxiv.org ↗ pub=2026-06-29T04:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

Verifiable Geometry Problem Solving: Solver-Driven Autoformalization and Theorem Proposing

Researchers propose SD-GPS, a solver-driven framework for geometry problem solving that integrates autoformalization and theorem proposing. The framework uses a symbolic solver as an execution oracle to improve formalization and overcome deductive impasses, outperforming existing methods on Geometry3K and PGPS9K benchmarks.

read1 min views1 publishedJun 29, 2026

arXiv:2606.27926v1 Announce Type: new Abstract: Geometry Problem Solving have increasingly adopt the neuro-symbolic paradigm, combining neural intuition with symbolic rigor. However, current frameworks suffer from severe bottlenecks in two core stages: autoformalization, which treats multimodal translation as a static task decoupled from downstream solver compatibility, and theorem prediction, where solvers frequently hit a deductive impasse due to fixed rule libraries. To address these, we propose SD-GPS, a solver-driven framework that treats the symbolic solver as an execution oracle throughout both formalization and deduction. First, Solver-Driven Autoformalization unifies supervised formal-language adaptation and solvability-guided reinforcement learning into a single module built on QwenVL3-2B, making executability the central training signal. Second, Verified Theorem Proposing introduces an impasse-aware agent that proposes local auxiliary lemmas from current proof states, ensuring soundness by filtering all proposals through symbolic verification. Empirical evaluations on Geometry3K and PGPS9K demonstrate that SD-GPS consistently outperforms existing MLLM, neural, and neuro-symbolic methods across standard completion, multiple-choice, and cross-modal reference regimes, proving that closing the loop between multimodal perception and symbolic execution significantly improves geometric reasoning, offering profound insights into how neural agents can be grounded by formal systems to achieve verifiable problem-solving capabilities.

source & further reading

arxiv.org — original article

~/api · this article 200

$curl api.wpnews.pro/v1/news/verifiable-geometry-prob…

Read original on arxiv.org → arxiv.org/abs/2606.27926

mentioned entities

SD-GPS

QwenVL3-2B

Geometry3K

PGPS9K

metadata

slugverifiable-geometry-problem-solving-solver-driven-autoformalization-and-theorem

topic#artificial-intelligence

secondary3 topics

sentimentpositive

canonicalarxiv.org

navigation

← prevv0.5.6

next →Media Buying Briefing: The holdc…

── more in #artificial-intelligence 4 stories · sorted by recency

arxiv.org · 29 Jun · #artificial-intelligence

EntMTP: Accelerating LLM Inference with Entropy Guided Multi Token Prediction

arxiv.org · 29 Jun · #artificial-intelligence

Ko-WideSearch: A Korean Breadth-Search Benchmark for Exhaustive Set Enumeration by Web Agents

arxiv.org · 29 Jun · #artificial-intelligence

DMV-Bench: Diagnosing Long-Horizon Multimodal Agents' Visual Memory with Incidental Cue Injection

arxiv.org · 29 Jun · #artificial-intelligence

Large Language Model Teaches Visual Students: Cross-Modality Transfer of Fine-Grained Conceptual Knowledge

── more on @sd-gps 3 stories trending now

wpnews · 28 May · #ai-startups

[AINews] Cognition raises $1B in $26B Series D

wpnews · 5 Jun · #ai-agents

Miasma Worm Targets AI Coding Agents via GitHub Repos

wpnews · 28 Jun · #ai-agents

OpenCode v1.17: Session Snapshots Undo Your AI Agent

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required