Results 331 to 340 of about 2,849,722 (390)
Some of the next articles are maybe not open access.
2023
Abstract Suppose Bob says, “I want to have reasons for what I do.” The skeptic replies, “But why do you want to have reasons?” Bob responds, “It’s just a brute fact about me. I want to have reasons for what I do.” The skeptic pounces: “But then you’re unreasonable.
openaire +1 more source
Abstract Suppose Bob says, “I want to have reasons for what I do.” The skeptic replies, “But why do you want to have reasons?” Bob responds, “It’s just a brute fact about me. I want to have reasons for what I do.” The skeptic pounces: “But then you’re unreasonable.
openaire +1 more source
arXiv.org
Recent advancements in long chain-of-thought (CoT) reasoning, particularly through the Group Relative Policy Optimization algorithm used by DeepSeek-R1, have led to significant interest in the potential of Reinforcement Learning with Verifiable Rewards ...
Xumeng Wen +11 more
semanticscholar +1 more source
Recent advancements in long chain-of-thought (CoT) reasoning, particularly through the Group Relative Policy Optimization algorithm used by DeepSeek-R1, have led to significant interest in the potential of Reinforcement Learning with Verifiable Rewards ...
Xumeng Wen +11 more
semanticscholar +1 more source
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Trans. Mach. Learn. Res.This work revisits the dominant supervised fine-tuning (SFT) then reinforcement learning (RL) paradigm for training Large Vision-Language Models (LVLMs), and reveals a key finding: SFT can significantly undermine subsequent RL by inducing ``pseudo ...
Guiming Hardy Chen +7 more
semanticscholar +1 more source
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
arXiv.orgRecent advancements in reasoning with large language models (RLLMs), such as OpenAI-O1 and DeepSeek-R1, have demonstrated their impressive capabilities in complex domains like mathematics and coding.
Qiguang Chen +9 more
semanticscholar +1 more source
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
arXiv.orgEnhancing reasoning in Large Multimodal Models (LMMs) faces unique challenges from the complex interplay between visual perception and logical reasoning, particularly in compact 3B-parameter architectures where architectural constraints limit reasoning ...
Yi Peng +9 more
semanticscholar +1 more source
2001
Abstract Grice begins this chapter by discussing to what extent the notion of variable rationality can be derived from that of flat rationality, and thus from the concept of a rational being alone. He then draws a distinction between ‘explanatory’ (motivating) and ‘justificatory’ (normative) reasons, as well as ‘personal’ reasons that ...
openaire +1 more source
Abstract Grice begins this chapter by discussing to what extent the notion of variable rationality can be derived from that of flat rationality, and thus from the concept of a rational being alone. He then draws a distinction between ‘explanatory’ (motivating) and ‘justificatory’ (normative) reasons, as well as ‘personal’ reasons that ...
openaire +1 more source
Giving reasons and given reasons
2021Abstract Derek Parfit, as a leader of the ‘reasons-first’ movement, says that the concept of a reason is fundamental and indefinable. But his concept of a reason differs from most philosophers’. Most philosophers take a reason to be a fact, whereas Parfit says that reasons are given by facts, not that they are facts.
openaire +1 more source
2001
Abstract Grice deals with the nature of practical and non‐practical (‘alethic’) reasoning, particularly what may be called imperfect reasoning. It consists of ‘misreasoning’ (the misapplication of good principles of reasoning), ‘incomplete reasoning’ (formally invalid but nonetheless correct inferences due to a missing premise that is ...
openaire +1 more source
Abstract Grice deals with the nature of practical and non‐practical (‘alethic’) reasoning, particularly what may be called imperfect reasoning. It consists of ‘misreasoning’ (the misapplication of good principles of reasoning), ‘incomplete reasoning’ (formally invalid but nonetheless correct inferences due to a missing premise that is ...
openaire +1 more source
Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset
arXiv.orgRecent advancements in Large Multimodal Models (LMMs) have shown promising results in mathematical reasoning within visual contexts, with models approaching human-level performance on existing benchmarks such as MathVista. However, we observe significant
Ke Wang +5 more
semanticscholar +1 more source
Symposium, 1999
Katherine Mo"ison charges that in my book, Back to Reality, ffailed to make my case for the adoption of a modest realism in postmodem (na"ative) therapy, because f failed to establish the motive behind that movement's adoption of antirealism. Infact, in Back to Reality, I put forth several reasons for therapists of all stripes to favor a modest realism
openaire +1 more source
Katherine Mo"ison charges that in my book, Back to Reality, ffailed to make my case for the adoption of a modest realism in postmodem (na"ative) therapy, because f failed to establish the motive behind that movement's adoption of antirealism. Infact, in Back to Reality, I put forth several reasons for therapists of all stripes to favor a modest realism
openaire +1 more source

