Assessing and generating test sets in terms of behavioural adequacy | Zendy

Fraser Gordon | Zendy; Walkinshaw Neil | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Assessing and generating test sets in terms of behavioural adequacy

Author(s) -

Fraser Gordon,

Walkinshaw Neil

Publication year - 2015

Publication title -

software testing, verification and reliability

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.216

H-Index - 49

eISSN - 1099-1689

pISSN - 0960-0833

DOI - 10.1002/stvr.1575

Subject(s) - computer science , set (abstract data type) , test (biology) , test suite , machine learning , reliability (semiconductor) , test set , artificial intelligence , task (project management) , test case , interim , data mining , programming language , history , paleontology , power (physics) , physics , regression analysis , management , archaeology , quantum mechanics , economics , biology

Summary Identifying a finite test set that adequately captures the essential behaviour of a program such that all faults are identified is a well‐established problem. This is traditionally addressed with syntactic adequacy metrics (e.g. branch coverage), but these can be impractical and may be misleading even if they are satisfied. One intuitive notion of adequacy, which has been discussed in theoretical terms over the past three decades, is the idea of behavioural coverage : If it is possible to infer an accurate model of a system from its test executions, then the test set can be deemed to be adequate. Despite its intuitive basis, it has remained almost entirely in the theoretical domain because inferred models have been expected to be exact (generally an infeasible task) and have not allowed for any pragmatic interim measures of adequacy to guide test set generation. This paper presents a practical approach to incorporate behavioural coverage. Our BESTEST approach (1) enables the use of machine learning algorithms to augment standard syntactic testing approaches and (2) shows how search‐based testing techniques can be applied to generate test sets with respect to this criterion. An empirical study on a selection of Java units demonstrates that test sets with higher behavioural coverage significantly outperform current baseline test criteria in terms of detected faults. © 2015 The Authors. Software Testing, Verification and Reliability published by John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore