Is Open Access

Fault Localization Using Textual Similarities

Preprint

Author(s): Zachary P. Fry, Westley Weimer

Publication date Created: 12 November 2012

Abstract

Maintenance is a dominant component of software cost, and localizing reported defects is a significant component of maintenance. We propose a scalable approach that leverages the natural language present in both defect reports and source code to identify files that are potentially related to the defect in question. Our technique is language-independent and does not require test cases. The approach represents reports and code as separate structured documents and ranks source files based on a document similarity metric that leverages inter-document relationships. We evaluate the fault-localization accuracy of our method against both lightweight baseline techniques and reported results from state-of-the-art tools. In an empirical evaluation of 5345 historical defects from programs totaling 6.5 million lines of code, our approach reduced the number of files inspected per defect by over 91%. Additionally, we qualitatively and quantitatively examine the utility of the textual and surface features used by our approach.
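
To make the idea of ranking source files by textual similarity to a defect report concrete, here is a minimal sketch. It is not the authors' method: the abstract describes structured documents and a metric that uses inter-document relationships, neither of which is specified here. The sketch swaps in a plain TF-IDF cosine similarity over bag-of-words tokens, and the names `tokenize`, `rank_files`, the smoothed IDF formula, and the sample file contents are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphabetic tokens, breaking camelCase identifiers."""
    # Hypothetical preprocessing: "openConnection" becomes "open Connection".
    text = re.sub(r"([a-z])([A-Z])", r"\1 \2", text)
    return re.findall(r"[a-z]+", text.lower())


def rank_files(report, files):
    """Return (file name, score) pairs, highest TF-IDF cosine similarity first.

    `files` maps a file name to its source text. This is a generic
    information-retrieval baseline, assumed here for illustration only.
    """
    docs = {name: Counter(tokenize(body)) for name, body in files.items()}
    n = len(docs)

    # Document frequency: in how many files each term appears.
    df = Counter()
    for counts in docs.values():
        df.update(counts.keys())

    def weigh(counts):
        # Smoothed IDF (an assumption), so terms unseen in any file stay finite.
        return {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0)
                for t, c in counts.items()}

    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    query = weigh(Counter(tokenize(report)))
    scored = [(name, cosine(query, weigh(counts))) for name, counts in docs.items()]
    # sorted() is stable, so files with equal scores keep their input order.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up example data, not taken from the paper's evaluation.
files = {
    "net/HttpClient.java": "class HttpClient { void openConnection() { socket timeout retry } }",
    "ui/LoginDialog.java": "class LoginDialog { void showPasswordField() { button label } }",
    "util/StringUtils.java": "class StringUtils { String trim(String s) { return s } }",
}
report = "Connection timeout when the HTTP client retries a request"

ranked = rank_files(report, files)
for name, score in ranked:
    print(f"{score:.3f}  {name}")
```

In this toy input only `net/HttpClient.java` shares vocabulary with the report, so it is ranked first and a developer would inspect it before the other two files. Like the approach in the abstract, the sketch needs no test cases and makes no assumptions about the programming language beyond identifier spelling.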

Author and article information

arXiv ID: 1211.2858

SO-VID: 561ca899-7f36-4026-ba45-581a5513026b

License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Categories: cs.SE
