scispace - formally typeset
Search or ask a question
Journal Article

A comparison of model building techniques to develop predictive equations for software metrics

About: This article is published in Information & Software Technology.The article was published on 1997-01-01 and is currently open access. It has received 24 citations till now. The article focuses on the topics: Software metric & Model building.
Citations
More filters
Journal ArticleDOI
TL;DR: A comparison of the prediction accuracy of three CBR techniques used to estimate the effort to develop Web hypermedia applications and to choose the one with the best estimates is presented.
Abstract: Software cost models and effort estimates help project managers allocate resources, control costs and schedule and improve current practices, leading to projects finished on time and within budget. In the context of Web development, these issues are also crucial, and very challenging given that Web projects have short schedules and very fluidic scope. In the context of Web engineering, few studies have compared the accuracy of different types of cost estimation techniques with emphasis placed on linear and stepwise regressions, and case-based reasoning (CBR). To date only one type of CBR technique has been employed in Web engineering. We believe results obtained from that study may have been biased, given that other CBR techniques can also be used for effort prediction. Consequently, the first objective of this study is to compare the prediction accuracy of three CBR techniques to estimate the effort to develop Web hypermedia applications and to choose the one with the best estimates. The second objective is to compare the prediction accuracy of the best CBR technique against two commonly used prediction models, namely stepwise regression and regression trees. One dataset was used in the estimation process and the results showed that the best predictions were obtained for stepwise regression.

223 citations


Cites background or methods from "A comparison of model building tech..."

  • ...A useful summary of these techniques is presented in Gray and MacDonell (1997b)....

    [...]

  • ...…have been made in software engineering between the three categories of prediction techniques aforementioned, based on their prediction power (Gray and MacDonell, 1997a, 1997b; Briand et al., 1999, 2000; Jeffery et al., 2000, 2001; Myrtveit and Stensrud, 1999; Shepperd et al., 1996; Shepperd…...

    [...]

  • ...An advantage of AM over ML and EJ is to allow users to see how a model derives its conclusions, an important factor for verification as well as theory building and understanding of the process being modeled (Gray and MacDonell, 1997b)....

    [...]

Journal ArticleDOI
TL;DR: The use of regression analysis to derive predictive equations for software metrics has recently been complemented by increasing numbers of studies using non-traditional methods, such as neural networks, fuzzy logic models, case-based reasoning systems, and regression trees.
Abstract: The use of regression analysis to derive predictive equations for software metrics has recently been complemented by increasing numbers of studies using non-traditional methods, such as neural networks, fuzzy logic models, case-based reasoning systems, and regression trees. There has also been an increasing level of sophistication in the regression-based techniques used, including robust regression methods, factor analysis, and more effective validation procedures. This paper examines the implications of using these methods and provides some recommendations as to when they may be appropriate. A comparison of the various techniques is also made in terms of their modelling capabilities with specific reference to software metrics.

201 citations

Journal ArticleDOI
TL;DR: Artificial neural network and stepwise regression based predictive models are investigated, aiming at offering alternative methods for those who do not believe in estimation models and indicate that these techniques are competitive with the APF, SLIM, and COCOMO methods.

123 citations


Cites background or methods from "A comparison of model building tech..."

  • ...Some research works have used artificial neural networks to produce more accurate resource estimates Gray and MacDonell, 1997; Witting and Finnie, 1997)....

    [...]

  • ...…models are considered synonyms), recent research works have attempted at determining the best approach based mostly on one or more accuracy measures (Gray and MacDonell, 1997; Briand et al., 2000; Jeffery et al., 2001; Shepeerd et al., 1996; Angelis and Stamelos, 2000; Finnie et al., 1997)....

    [...]

  • ..., 1994), regression trees (Selby and Porter, 1998), artificial neural networks (Srinivazan and Fisher, 1995), and case based reasoning (Shepeerd et al., 1996; Gray and MacDonell, 1997)....

    [...]

  • ...Given the availability of a variety of different predictive models (estimation models and predictive models are considered synonyms), recent research works have attempted at determining the best approach based mostly on one or more accuracy measures (Gray and MacDonell, 1997; Briand et al., 2000; Jeffery et al., 2001; Shepeerd et al., 1996; Angelis and Stamelos, 2000; Finnie et al., 1997)....

    [...]

  • ...According to Gray and MacDonell (1997), neural networks are the most common software estimation modelbuilding technique used as an alternative to mean least squares regression....

    [...]

Proceedings ArticleDOI
04 Jun 2002
TL;DR: This paper compares the prediction accuracy of three CBR techniques to estimate the effort to develop Web hypermedia applications against three commonly used prediction models, namely multiple linear regression, stepwise regression and regression trees.
Abstract: Several studies have compared the prediction accuracy of different types of techniques with emphasis placed on linear and stepwise regressions, and case-based reasoning (CBR). We believe the use of only one type of CBR technique may bias the results, as there are others that can also be used for effort prediction. This paper has two objectives. The first is to compare the prediction accuracy of three CBR techniques to estimate the effort to develop Web hypermedia applications. The second objective is to compare the prediction accuracy of the best CBR technique, according to our findings, against three commonly used prediction models, namely multiple linear regression, stepwise regression and regression trees. One dataset was used in the estimation process and the results showed that different measures of prediction accuracy gave different results. MMRE and MdMRE showed better prediction accuracy for multiple regression models whereas box plots showed better accuracy for CBR.

90 citations


Cites background or methods from "A comparison of model building tech..."

  • ...Several modelling techniques have been compared in the Software/Web engineering literature [10-27]....

    [...]

  • ...Several comparisons between the three categories of prediction techniques aforementioned have been made, based on their prediction power [10-27]....

    [...]

  • ...A useful summary of these techniques is presented in [10]....

    [...]

Proceedings ArticleDOI
21 Sep 1997
TL;DR: Software metrics are measurements of the software development process and product that can be used as variables (both dependent and independent) in models for project management and the use of alternative techniques, especially fuzzy logic, is investigated and some usage recommendations are made.
Abstract: Software metrics are measurements of the software development process and product that can be used as variables (both dependent and independent) in models for project management. The most common types of these models are those used for predicting the development effort for a software system based on size, complexity, developer characteristics, and other metrics. Despite the financial benefits from developing accurate and usable models, there are a number of problems that have not been overcome using the traditional techniques of formal and linear regression models. These include the nonlinearities and interactions inherent in complex real-world development processes, the lack of stationarity in such processes, over-commitment to precisely specified values, the small quantities of data often available, and the inability to use whatever knowledge is available where exact numerical values are unknown. The use of alternative techniques, especially fuzzy logic, is investigated and some usage recommendations are made.

87 citations