Empirical Validation of Object-Oriented Metrics on Open Source Software for Fault Prediction


Open source software systems are becoming increasingly important. Many companies invest in open source projects, and many also use such software in their own work. However, because open source software is often developed with a different management style than industrial software, the quality and reliability of the code need to be studied. Hence, the characteristics of the source code of these projects need to be measured to obtain more information about it. This paper describes how we calculated the object-oriented metrics given by Chidamber and Kemerer to illustrate how fault-proneness detection can be carried out on the source code of Mozilla, the open source Web and e-mail suite. We checked the values obtained against the number of bugs found in its bug database, Bugzilla, using regression and machine learning methods to validate the usefulness of these metrics for fault-proneness prediction. We also compared the metrics of several versions of Mozilla to see how the predicted fault-proneness of the software system changed during its development cycle.

IEEE Transactions on Software Engineering, 31(10):897–910


    author   = {Gyim{\'o}thy, Tibor and Ferenc, Rudolf and Siket, Istv{\'a}n},
    title    = {Empirical Validation of Object-Oriented Metrics on Open Source Software for Fault Prediction},
    journal  = {IEEE Transactions on Software Engineering},
    year     = {2005},
    volume   = {31},
    number   = {10},
    pages    = {897--910},
    month    = nov,
    issn     = {1939-3520},
    doi      = {10.1109/TSE.2005.112},
    keywords = {Fact extraction, metrics validation, reverse engineering, open source software, fault-proneness detection, Mozilla, Bugzilla, C++, compiler wrapping, Columbus},
    url      = {http://ieeexplore.ieee.org/document/1542070/},