SDD16 — Evaluation and Code Quality

Learning intentions

Understand why evaluation is the final stage of the iterative development process
Describe, identify, and exemplify all four SQA evaluation criteria for a software solution
Apply evaluation criteria to code examples and explain specific improvements

Success criteria

I can define fitness for purpose and explain how to check a program meets its requirements
I can define robustness and identify whether a program handles invalid input correctly
I can define efficient use of coding constructs and give a specific code example
I can define readability and identify all four sub-criteria: commentary, meaningful identifiers, indentation, and white space
I can evaluate a given program against all four criteria and justify my judgement with evidence from the code

Key vocabulary

evaluation

Judging how well a completed software solution meets the requirements set out during analysis. The SQA specifies four criteria: fitness for purpose, robustness, efficient use of coding constructs, and readability.

fitness for purpose

The program does everything specified in the functional requirements. Every input, process, and output listed during analysis must work correctly.

robustness

The program handles invalid, unexpected, or out-of-range input without crashing. Typically achieved through input validation.

efficient use of coding constructs

The program uses the most appropriate structure for each task. For example: ELSE IF instead of repeated IFs; a loop instead of repeated lines; a predefined function instead of writing the same code multiple times.

readability

Another programmer can understand the code without needing to ask questions. Achieved through internal comments, meaningful identifiers, consistent indentation, and white space.

meaningful identifier

A variable, array, or function name that describes its purpose. totalScore is meaningful; x is not.

The four evaluation criteria

The SQA specification requires you to evaluate a software solution against exactly four criteria. For each one, you must be able to describe what it means, identify whether a program has it, and exemplify it with specific evidence from code.

1. Fitness for purpose

Does the program do what it was designed to do? To evaluate this, go back to the functional requirements from the analysis stage and check each one:

Does every input listed work correctly?
Does every process produce the right result?
Does every output appear as specified?

A program is fit for purpose if it meets all requirements. If it meets most but not all, it is partially fit for purpose — and you should identify which requirement is not met.
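
One way to make this check concrete is to turn each functional requirement into a test case with a known expected result. The sketch below assumes the grade boundaries used in the grade example in these notes (70 and 50). The function name calculate_grade and the list of test cases are invented for illustration and are not part of the SQA specification.

```python
def calculate_grade(score):
    """Return the grade for a score, using the 70 / 50 boundaries."""
    if score >= 70:
        return "A"
    elif score >= 50:
        return "B"
    else:
        return "Fail"


# Each pair is (input score, expected grade) based on the requirements.
test_cases = [(85, "A"), (70, "A"), (69, "B"), (50, "B"), (49, "Fail")]

# Record every score where the program does not give the expected grade.
unmet = []
for score, expected in test_cases:
    if calculate_grade(score) != expected:
        unmet.append(score)

if len(unmet) == 0:
    print("All requirements met: fit for purpose")
else:
    print("Requirement not met for scores:", unmet)
```

If the unmet list is empty, every listed requirement has been checked and met. Any score left in the list points to the requirement you should name in your evaluation.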

2. Robustness

Does the program handle invalid input without crashing? A robust program:

Validates input before processing it
Displays a meaningful error message when input is invalid
Gives the user the chance to try again
Does not crash or produce garbage output when unexpected data is entered

At N5 level, robustness is primarily achieved through the input validation standard algorithm (SDD14b). A program that uses int(input(...)) without any validation is not robust — it will crash if the user types a letter.

3. Efficient use of coding constructs

Does the program use the right tool for each job? The SQA is specifically looking for situations where a less efficient approach was replaced with a more efficient one. The most common examples at N5 are:

Situation	Inefficient	Efficient
Multiple conditions	Three separate IF statements (all three are always evaluated)	IF / ELSE IF / ELSE (stops at the first match)
Repeated code	Same block of code copied 10 times	A loop that runs 10 times
Common calculation	Writing the same formula in multiple places	A predefined function called wherever needed
Array traversal	Accessing array[0], array[1], array[2]… individually	A loop with an index variable

Key point for the exam: ELSE IF is more efficient than multiple IF statements because when one condition is true, the rest are skipped. With separate IFs, the computer evaluates every single one.
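
The array traversal row in the table above can be illustrated with a short sketch. It totals the same five marks twice, once by accessing each element individually and once with a loop. The array name marks and its values are made up for this example.

```python
marks = [12, 15, 9, 18, 11]  # example data, invented for this sketch

# Inefficient: every element is accessed individually.
total_long = marks[0] + marks[1] + marks[2] + marks[3] + marks[4]

# Efficient: a loop with an index variable visits every element.
total_loop = 0
for index in range(len(marks)):
    total_loop = total_loop + marks[index]

print(total_long, total_loop)
```

Both approaches give the same total (65 here), but only the loop version keeps working unchanged if the array grows to 30 marks.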

4. Readability

The SQA specifies four sub-criteria for readability. All four can be examined:

Sub-criterion	What it means	Example
Internal commentary	Comments in the code that explain what sections do. In Python, lines starting with `#`.	`# Calculate the average score`
Meaningful identifiers	Variable, array, and function names that describe their purpose.	`totalScore` not `x`
Indentation	Code inside loops, IF statements, and functions is consistently indented (4 spaces in Python). Makes the structure visible.	Lines inside `for` indented by 4 spaces
White space	Blank lines between sections of code. Makes the code easier to scan.	Blank line between variable declarations and the main loop

Before and after — applying the criteria

Readability — poor vs improved

✕ Poor readability

x=0
for i in range(5):
  n=int(input("?"))
  x=x+n
print(x/5)

✓ Good readability

# Collect 5 scores and calculate the average
total = 0

for question in range(1, 6):
    score = int(input("Enter score for Q" + str(question) + ": "))
    total = total + score

average = total / 5
print("Average score:", average)

Improvements: comment added; meaningful identifiers (total, score, average, question); consistent 4-space indentation; blank line before the loop and before the output.

Efficient use — separate IFs vs ELSE IF

✕ Inefficient — 3 IFs always evaluated

if score >= 70:
    grade = "A"
if score >= 50 and score < 70:
    grade = "B"
if score < 50:
    grade = "Fail"

✓ Efficient — stops at first match

if score >= 70:
    grade = "A"
elif score >= 50:
    grade = "B"
else:
    grade = "Fail"

If score is 80, the first condition is True. With separate IFs, conditions 2 and 3 are still evaluated. With ELSE IF (elif), Python skips the remaining conditions and carries on with the code after the IF structure, which is more efficient.

Robustness — no validation vs validated

✕ Not robust — crashes on invalid input

age = int(input("Enter your age: "))
# If user types "hello", this crashes
# with ValueError: invalid literal

✓ Robust — validates range

age = int(input("Enter your age (0-120): "))
while age < 0 or age > 120:
    print("Invalid. Enter age 0-120.")
    age = int(input("Try again: "))

Note: the improved version validates the range but still assumes the user enters a whole number. Handling non-numeric input as well is usually done with exception handling, which is beyond N5 scope.
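
For reference only, since it goes beyond N5: a minimal sketch of how exception handling could deal with non-numeric input as well as the range check. The function name read_age and its read parameter (which lets the function be tested without a keyboard) are assumptions made for this example.

```python
def read_age(read=input):
    """Keep asking until the user enters a whole number from 0 to 120."""
    while True:
        text = read("Enter your age (0-120): ")
        try:
            age = int(text)
        except ValueError:
            # int() could not convert the text, e.g. "hello"
            print("Invalid. Please enter a whole number.")
            continue
        if 0 <= age <= 120:
            return age
        # The input was a number, but outside the allowed range.
        print("Invalid. Enter age 0-120.")
```

Calling age = read_age() would then replace the four lines of the range-only version. The loop only ends when the input is both numeric and in range.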

How to write an exam evaluation

In the exam, you may be shown a program and asked to "evaluate this solution" or asked about a specific criterion. Always:

Name the criterion — use the SQA term exactly.
State your judgement — is it good, poor, or could be improved?
Give specific evidence from the code — quote a line number, variable name, or specific construct.

Example answer — fitness for purpose

"The program is fit for purpose. The functional requirements stated that the program should accept a score, calculate the grade, and display it. All three requirements are met: line 3 receives the score, lines 5–10 calculate the grade using ELSE IF, and line 12 displays the result."

Example answer — readability

"The readability could be improved. The variable name x on line 1 is not meaningful — it should be renamed totalScore to make its purpose clear. There are also no internal comments to explain what each section of the program does."

Example answer — efficient use

"The program makes efficient use of coding constructs. Lines 6–10 use an ELSE IF chain rather than three separate IF statements. This is more efficient because once the first matching condition is found, the remaining conditions are not evaluated."

Now you try

Evaluate the program below against all four criteria. Give specific evidence from the code for each one.

t = 0
for i in range(10):
    n = int(input("number: "))
    t = t + n
print(t)

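Fitness for purpose: This depends on the functional requirements. The program collects 10 numbers and outputs their total, so it is fit for purpose if that is all that was required. If the requirement was to display the total and the average, it would be only partially fit for purpose, because the average is never calculated or displayed.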

Robustness: Not robust. There is no input validation: if the user enters a non-numeric value, the program will crash with a ValueError. Adding a WHILE loop to check that the input is within range would improve robustness at N5 level, although it would not stop the crash on non-numeric input.

Efficient use of coding constructs: Efficient. A FOR loop is used to collect 10 numbers rather than repeating the input line 10 times. The accumulator pattern (t = t + n) is the correct construct for a running total.

Readability: Poor. Variable names t, i, and n are not meaningful — they should be total, counter, and number. There are no internal comments. No white space separates the loop from the output statement.

Common mistakes

Giving a vague evaluation without evidence. Saying "the program is readable" is worth 0 marks. You must say why — e.g. "the variable totalScore is a meaningful identifier" or "there are comments on lines 2 and 7 explaining each section."

Confusing robustness with fitness for purpose. Fitness for purpose is about meeting requirements. Robustness is specifically about handling invalid input. A program can be fit for purpose but not robust.

Missing sub-criteria for readability. The SQA recognises four: internal commentary, meaningful identifiers, indentation, and white space. An answer that only mentions comments leaves out the other three and is likely to lose marks.

Saying ELSE IF is "shorter" rather than "more efficient". The SQA mark scheme looks for the word efficient and the explanation that remaining conditions are not evaluated once a match is found.

Exam tip

Evaluation questions are among the highest-value questions in the SDD section. They are often worth 3–4 marks and require you to name, judge, and justify for each criterion. If the question says "evaluate the solution", cover all four criteria. If it says "evaluate the readability", cover all four readability sub-criteria.

The phrase the SQA uses for efficient use is: "efficient use of coding constructs". Use these exact words in your answer, not "good code" or "no repetition."

Teacher notes

Suggested timing: 65 minutes in total, or 50 minutes if Q9–Q10 are set as homework. Warm up 7 min; vocab + notes 15 min; before/after examples 10 min; now you try 5 min; task set Q1–Q8 13 min; PyCharm Q9–Q10 15 min (or set as homework).

Common pupil confusion: evaluation vs testing. Testing checks whether the program produces correct outputs for given inputs. Evaluation makes a broader judgement about the quality of the solution. They are related but distinct — a program can pass all tests and still have poor readability or inefficient code.

Efficient use: the key teaching point. Write both versions of the grade checker on the board side by side. Ask pupils: "If score is 85, how many comparisons does version A do? How many does version B (ELSE IF) do?" Version A: 4 (one in the first IF, two in the second, one in the third). Version B: 1. This makes the efficiency difference concrete.
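
If you want to demonstrate the count rather than assert it, the sketch below wraps each comparison in a small helper that adds one to a counter. The helper check and the two function names are hypothetical, written only for this demonstration, and the use of global goes beyond what pupils need at N5.

```python
comparisons = 0


def check(result):
    """Count one comparison and pass its result straight through."""
    global comparisons
    comparisons = comparisons + 1
    return result


def grade_separate_ifs(score):
    # Version A: three separate IF statements.
    grade = ""
    if check(score >= 70):
        grade = "A"
    if check(score >= 50) and check(score < 70):
        grade = "B"
    if check(score < 50):
        grade = "Fail"
    return grade


def grade_else_if(score):
    # Version B: one IF / ELSE IF / ELSE chain.
    if check(score >= 70):
        grade = "A"
    elif check(score >= 50):
        grade = "B"
    else:
        grade = "Fail"
    return grade


comparisons = 0
grade_separate_ifs(85)
count_a = comparisons

comparisons = 0
grade_else_if(85)
count_b = comparisons

print("Version A:", count_a, "Version B:", count_b)
```

For a score of 85 this prints Version A: 4 Version B: 1. Pupils can change the score to 60 or 40 and predict the counts before running it.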

Readability task: Give pupils a printout of a real piece of pupil code (anonymised) with poor readability. Ask them to annotate it — identify which sub-criteria are missing and rewrite it with improvements. This is more engaging than working from a textbook example.

SQA mark scheme language. Pupils must use "efficient use of coding constructs" — not "efficient code" or "no repeated lines". Similarly for robustness — must reference "invalid input" not just "errors".