CS 50 | Software Design and Implementation

Today we look at developing software specifications. A good specification leads to a clear understanding between the customer and the developer regarding what the software to be developed shall do. Getting this right is tricky in practice as both groups often have different assumptions about what various terms and procedures mean. Documenting these assumptions carefully and creating clear specifications can help ensure the developer creates something that is useful for the customer.

Also, take a look at this lecture extra that describes how to use the webpage module for Lab 4, 5, and 6.

Goals

To understand software specifications
To develop an implementation plan for the TSE Crawler

Activity

In the activity today, we aim to familiarize the usage of the webpage module. We will use it to fetch the web page of a URL and save it to an external file.

Methodology

In this unit, we introduce a simple software design methodology. It’s by no means the only methodology - but it’s straightforward and useful for CS50.

There are many techniques for the design and development of good code, including top-down or bottom-up design, divide and conquer (breaking the system down into smaller more understandable components), structured design (data-flow approach), and object-oriented design (modularity, abstraction, and information-hiding). For a quick survey of these and other techniques, see A Survey of Major Software Design Methodologies (author unknown).

Many of these techniques use similar approaches, and embrace fundamental concepts like abstraction, data representation, data flow, data structures, and top-down decomposition from requirements to structure.

It is unlikely that someone could give you 10 steps to follow and be assured of great system software. Every non-trivial project has its special cases, unique environments, or unexpected uses. It’s often best to begin development of a module, or a system, with small experiments - building a prototype and throwing it away - because you can learn (and make mistakes) building small prototype systems.

Pragmatic Programmer Tip:

Prototype to Learn. Prototyping is a learning experience. Its value lies not in the code you produce, but in the lessons you learn.

Clarity comes from the experience of working from requirements, through system design, implementation and testing, to integration and customer feedback on the requirements.

The following figure shows the software design methodology that we use in the design of the TinySearchEngine and the project.

Software system design methodology

Let’s step through the phases of software design, as shown in the figure above.

Procurement phase

The procurement phase of a project occurs when the project is in its early stages. It represents deep discussion between a customer and provider of software systems. As a software developer, you have to clearly understand and capture the customers’ needs. Often some of the requirements are specified separately by regulatory agencies, national or international standards bodies, or industry standards groups. These provide additional requirements that the customer expects to be included.

In our case, you are the provider and we (CS50 staff) are your customer.

Specification phase

After initial discussions with the customer, we next turn to developing specifications (specs) to describe the project to be delivered. This phase typically begins with a Requirements spec, moves on to a Design spec, and finishes with an Implementation spec. We examine each of these.

Requirements spec

Pragmatic Programmer Tip:

Don’t Gather Requirements – Dig for Them. Requirements rarely lie on the surface. They’re buried deep beneath layers of assumptions, misconceptions, and politics.

The system Requirements Spec captures all the requirements of the system that the customer wants built. Typically the provider and customer get into deep discussion of requirements and their cost. The requirements must be written down, and reviewed by both customer and provider, to be sure all are in agreement. Sometimes these documents are written in contractual (legal) language. If the customer gets a system that does not meet the spec, or the two parties disagree about whether the finished product meets the spec, lawyers may get involved. If a system is late, financial penalties may arise.

“The hardest part of design… is keeping features out.” – Anonymous

The system requirement spec may have a variety of requirements typically considered the shalls - such as, “the crawler shall only crawl webpages within the cs50 website”. These requirements include functional requirements, performance requirements, security requirements, and cost requirements.

A common challenge during this phase is that the customer either doesn’t know what he/she really wants or expresses it poorly (in some extreme cases the customer may not be able to provide you with the ultimate intended use of your system due to proprietary or security concerns). You must realize that the customer may have these difficulties and iterate with the customer until you both are in full agreement. One useful technique is to provide the customer with the system requirements specification (and sometimes later specs too) and then have the customer explain the spec to you. It is amazing how many misunderstandings and false assumptions come to light when the customer is doing the explaining.

The Requirements Spec may address many or all of the following issues:

functionality - what should the system do?
performance - goals for speed, size, energy efficiency, etc.
compliance - with federal/state law or institutional policy
compatibility - with standards or with existing systems
security - against a specific threat model under certain trust assumptions
cost - goals for cost, if system operation incurs costs
timeline - when will various part of the system be completed? what are the deadlines?
hardware/software - what hardware or software must be purchased or provisioned?
personnel - who will work on this project?

A new concern of system development is the issue of the services-oriented model referred to as the “cloud” (Software As A Service, Infrastructure As A Service, Platform As A Service, etc.). The decision of whether to develop a specific system running in a traditional manner or to build a cloud-based solution should be made early, as it will affect many of the later stages of the development process. Some would argue about where it needs to fit in the methodology, but we feel that the sooner you (and the customer) know where this system is headed, the better.

Pragmatic Programmer Tip:

Make quality a requirements issue. Involve your users in determining the project’s real quality requirements.

Although the customer may make some assumptions on this point, it’s in your best interests to make it a priority. Remember the “broken window theory”:

Pragmatic Programmer Tip:

Don’t Live with Broken Windows. Fix bad designs, wrong decisions, and poor code when you see them.

Design spec

In this phase, you translate the requirements into a full system-design specification. The Design Spec is the result of studying the system requirements and applying the art of design (the magic) with a design team. This design specification shows how the complete system is broken up into specific subsystems, and all of the requirements are mapped to those subsystems. The Design spec for a system, subsystem, or module includes:

User interface
Inputs and outputs
Functional decomposition into modules
Dataflow through modules
Pseudo code (plain English-like language) for logic/algorithmic flow
Major data structures
Testing plan

To this last point:

Pragmatic Programmer Tip:

Design to Test. Start thinking about testing before you write a line of code.

The Design Specification is independent of your choice of language, operating system, and hardware. In principle, it could be implemented in any language from Java to micro-code and run on anything from a Cray supercomputer to a toaster.

Implementation spec

The Implementation Spec represents a further refinement and decomposition of the system. It is language, operating system, and hardware dependent (sometimes, the language abstracts the OS or HW out of the equation). The implementation spec includes many or all of these topics:

Detailed pseudo code for each of the objects/components/functions,
Definition of detailed APIs, interfaces, function prototypes and their parameters,
Data structures (e.g., struct names and members),
Security and privacy properties,
Error handling and recovery,
Resource management,
Persistent storage (files, database, etc).
Testing plan

Implementation phase

In this phase, we turn the Design Spec into an Implementation Spec, then code up the modules, unit-test each module, integrate the modules and test them as an integrated sub-system and then system.

Coding

Coding is often the fun part of the software development cycle - but not usually the largest amount of time. As a software developer in industry, you might spend only about 20% of your time coding (perhaps a lot more if you’re in a startup). The rest of the time will be dealing with the other phases of the methodology, particularly, the last few: testing, integration, fixing problems with the product and meetings with your team and with your customers.

Goals during coding:

Correctness: The program is correct (i.e., does it work) and error free; that is, does its behavior match the functional requirements in the specifications.
Clarity: The code is easy to read, well commented, and uses good variable and function names. In essence, is it easy to use, understand, and maintain

Clarity makes sure that the code is easy to understand by people with a range of skills, and across a variety of machine architectures and operating systems. [Kernighan & Pike]
Simplicity: The code is as simple as possible, but no simpler.

Simplicity keeps the program short and manageable. [Kernighan & Pike]
Generality: The program can easily adapt to change.

Generality means the code can work well in a broad range of situations and is tolerant of new environments (or can be easily made to do so). [Kernighan & Pike]

Testing phase

Pragmatic Programmer Tip:

Test your software, or your users will.

Testing is a critical part of the whole process of any development effort, whether you’re building bridges or software. Unit testing of modules in isolation, and integration testing as modules are assembled into sub-systems and, ultimately, the whole system, result in better, safer, more reliable code.

The ultimate goal of testing is to exercise all paths through the code. Of course, with most applications this may prove to be a daunting task. Most of the time the code will execute a small set of the branches in the module. So when special conditions occur and newly executed code paths fail, it can be really hard to find those problems in large, complex pieces of code.

The better organized and modularized your code is, the easier it will be to understand, test, and maintain - even by you!

Testing normally involves the following tests:

unit: does a new unit behave as expected when run alone?
integration: do the pieces of the system work together as specified?
regression: did a change break something elsewhere in the system?
fuzz: does the system handle unexpected input properly?
acceptance: does the customer agree the system works as designed?

Next class we will take a closer look at each of these.

Feedback phase

The design team sits down with its customer and demonstrates its implementation as development progresses. The customer and the team review the original requirement spec and check each requirement for completion. The customer provides feedback on what the development team has built.

Beware scope creep! — Pierson

In the TSE and project we emphasize understanding the requirements of the system we want to build, writing good design and implementation specs before coding. In CS50 we put special weight on the coding principles of simplicity, clarity, and generality.