Investigating the Use of Chronological Splitting to Compare Software Cross-company and Single-company Effort Predictions: A Replicated Study

CONTEXT: Three previous studies have investigated the use of chronological split to compare cross- to single-company effort predictions, where all used the ISBSG dataset release 10. Therefore there is a need for these studies to be replicated using different datasets such that the patterns previously observed can be compared and contrasted, and a better understanding with regard to the use of chronological splitting can be reached. OBJECTIVE: The aim of this study is to replicate [17] using the same chronological splitting; however a different database - the Finnish dataset. METHOD: Chronological splitting was compared with two forms of cross-validation. The chronological splitting used was the project-by-project chronological split, in which a validation set contains a single project, and a regression model is built from scratch using as training set the set of projects completed before the validation project’s start date. We used 201 single-company projects and 593 cross-company projects from the Finnish dataset. RESULTS: Single-company models presented significantly better prediction than cross-company models. Chronological splitting provided significantly worse accuracy than leave-one and leave-two out cross-validations when based on single-company data; and provided similar accuracy when based on cross-company data. CONCLUSIONS: Results did not seem promising when using project-by-project splitting; however in a real scenario companies that use their own data can only apply some sort of chronological splitting when obtaining effort estimates for their new projects. Therefore we urge the use of chronological splitting in effort estimation studies such that more realistic results can be provided to inform industry.

Content

Author and article information

Contributors

Emilia Mendes

Chris Lokan

Conference

Publication date: April 2009

Publication date (Print): April 2009

Pages: 1-10

Affiliations

[1 ]Computer Science Department, The University of Auckland, Private Bag 92019, Auckland, New Zealand

[2 ]School of IT&EE, UNSW@ADFA, Canberra ACT 2600, Australia

Article

DOI: 10.14236/ewic/EASE2009.2

SO-VID: 54bf0942-5c90-492b-a830-6d42c5b14ab5

License:

This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Conference name: 13th International Conference on Evaluation and Assessment in Software Engineering (EASE)

Conference acronym: EASE

Conference number: 13

Conference location: Durham University, UK

Conference date: 20 - 21 April 2009

Conference sponsor: Electronic Workshops in Computing (eWiC)

Conference theme: Evaluation and Assessment in Software Engineering (EASE)

History

Product

1477-9358 BCS Learning & Development

Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/EASE2009.2

Self URI (journal page): https://ewic.bcs.org/

Celebrating 65 years of The Computer Journal - free-to-read perspectives - bcs.org/tcj65

Investigating the Use of Chronological Splitting to Compare Software Cross-company and Single-company Effort Predictions: A Replicated Study

Abstract