0204: Coverage measurement

cs128

14 Sept 202106:01

Summary

TLDRThis lesson delves into test-driven development's coverage measurement, emphasizing the importance of thorough testing beyond just general cases to include edge cases. It introduces different levels of code coverage, from simple method execution (S0) to comprehensive pathway testing (C2), suggesting that while more tests are better, the real question is how thorough the testing is. The instructor highlights the balance between testing enough and over-testing, and the value of various coverage metrics in ensuring software correctness.

Takeaways

🔍 Coverage measurement is a key aspect of test-driven development to ensure the thoroughness of tests.
🛠️ Different levels of coverage provide varying degrees of confidence in the correctness of the code.
🤔 The question 'how much testing is enough?' is addressed by considering the balance between testing and shipping software.
📈 A quantitative evaluation can be made by comparing lines of test code to lines of code in the application.
📊 Having more tests than lines of code is not unreasonable, especially in production-level systems.
🧐 The focus should be on the thoroughness of testing rather than just the quantity of test code.
📋 Basic coverage measurements include S0 (every method is called) and S1 (every method is called from every call site).
🔄 C1 coverage involves executing every branch in both directions to test all possible pathways.
🌐 C2 coverage is about testing every possible combination of pathways through a method, which is challenging and controversial.
🚀 The lecture will continue with more on test-driven development in the next video.

Q & A

What is the primary focus of the lesson in this script?
-The lesson focuses on coverage measurement within the context of test-driven development, exploring different levels of coverage to evaluate how thoroughly code is being tested.
What is the basic idea behind code coverage in testing?
-Code coverage refers to the extent to which tests cover the code base. Higher coverage indicates that more parts of the code, including edge cases and different pathways, have been tested, which increases confidence in the correctness of the code.
Why is it not ideal to keep testing until the software is ready for release?
-Continuously testing until release may indicate a lack of methodical planning in the testing process, leading to ineffective or redundant testing. It's better to structure tests in a methodical manner for optimal coverage.
Why might measuring the number of test cases against lines of code be insufficient?
-Simply having more tests than lines of code does not necessarily indicate thoroughness, as the tests could be poorly written or fail to cover critical parts of the code. It's more important to consider the quality and scope of the tests.
What is S0 coverage in testing?
-S0 coverage indicates that every method in the code has been called at least once, though it doesn't guarantee that all possible code paths have been exercised or that edge cases are covered.
How does S1 coverage improve upon S0 coverage?
-S1 coverage ensures that every method is called from every possible call site, allowing for a more comprehensive evaluation of the function's behavior in different contexts, though it may still neglect fringe or edge cases.
What does C1 coverage ensure in unit testing?
-C1 coverage ensures that every statement in a method is executed at least once, including both branches of conditional statements. This provides a more thorough test than just calling the method or function.
What does C2 coverage aim to achieve in testing?
-C2 coverage aims to test every possible combination of paths through the code, requiring a large number of test inputs to ensure that every potential pathway is covered. However, it is complex and sometimes debated in terms of its value.
Why is C2 coverage considered difficult to achieve?
-C2 coverage is difficult because it requires testing every possible pathway through a function, which often involves a huge number of test cases. This makes it complex and resource-intensive, and there is debate about how valuable it is in practice.
What is the overall purpose of measuring coverage in test-driven development?
-The purpose of measuring coverage is to ensure that tests thoroughly exercise the code, increasing confidence in its correctness and stability. Higher coverage, especially when it includes edge cases and different pathways, leads to better software reliability.