Measuring and mitigating debugging effectiveness decay in code language models [PDF]
The effectiveness of AI debugging follows a predictable exponential decay pattern; most models lose 60-80% of their debugging capability within just 2-3 attempts, despite iterative debugging being a critical capability for practical code generation ...
Muntasir Adnan, Carlos C. N. Kuhn
doaj +2 more sources
Symbolic debugging with Gillian [PDF]
Software debugging for concrete execution enjoys a mature suite of tools, but debugging symbolic execution is still in its infancy. It carries unique challenges, as a single state can lead to multiple branches representing different sets of conditions ...
Gardner, Philippa +2 more
core +1 more source
Monitoring Data in R with the lumberjack Package
Monitoring data while it is processed and transformed can yield detailed insight into the dynamics of a (running) production system. The lumberjack package is a lightweight package allowing users to follow how an R object is transformed as it is ...
Mark P. J. van der Loo
doaj +1 more source
Using Modified Diffusion Models for Reliability Estimation of Open Source Software
Software development is a highly unpredictable process, and ensuring software quality and reliability before releasing it to the market is crucial. One of the common practices during software development is the reuse of code.
Kuan-Ju Chen, Chin-Yu Huang
doaj +1 more source
Comprehending Finite Maps for Algorithmic Debugging of Higher-Order Functional Programs [PDF]
Algorithmic debuggers for higher-order functional languages have to display functional values. Originally functional values had been represented as partial applications of function and constructor symbols, but a recent approach represents functional ...
Olaf Chitil +3 more
core +1 more source
Performance Analysis of Maximal Risk Evaluation Formulas for Spectrum-Based Fault Localization
The effectiveness analysis of risk evaluation formulas has become a significant research area in spectrum-based fault localization (SBFL). The risk evaluation formula is designed and widely used to evaluate the likelihood of a program spectrum to be ...
Tingting Wu +5 more
doaj +1 more source
A Systematic Mapping of the Proposition of Benchmarks in the Software Testing and Debugging Domain
Software testing and debugging are standard practices of software quality assurance since they enable the identification and correction of failures. Benchmarks have been used in that context as a group of programs to support the comparison of different ...
Deuslirio da Silva-Junior +4 more
doaj +1 more source
The MADAG Strategy for Fault Location Techniques
Spectrum-based fault localization (SBFL), which utilizes spectrum information of test cases to calculate the suspiciousness of each statement in a program, can reduce developers’ effort.
Shih-DA Wu, Jung-Hua Lo
doaj +1 more source
Virtual Machine Based Debugging for occam-pi [PDF]
While we strive to create robust language constructs and design patterns which prevent the introduction of faults during software development, an inevitable element of human error still remains.
Ritson, Carl G. +3 more
core +1 more source
Algorithmic Debugging of Real-World Haskell Programs: Deriving Dependencies from the Cost Centre Stack [PDF]
Existing algorithmic debuggers for Haskell require a transformation of all modules in a program, even libraries that the user does not want to debug and which may use language features not supported by the debugger. This is a pity, because a promising ap-
Faddegon, Maarten +3 more
core +1 more source

