Speaking Strategy
CELPIP Speaking Task 3 — Sample Answers
Last updated: June 2026
Task 3 (Picture Description) gives you 30 seconds to prepare and 60 seconds to speak. You describe a scene and offer an interpretation. Below are three sample responses at different CLB levels, with a scoring breakdown for each.
What the grader is looking for in Task 3
- →Description: identify what is happening (who, what, where)
- →Spatial organisation: use foreground / middle ground / background or left / right / centre
- →Specific detail: mention colours, numbers, approximate ages, weather, clothing
- →Interpretation: end with what the scene suggests (e.g., 'this appears to be a weekend')
- →Cohesion: use transitions — 'moving to', 'in the background', 'overall'
Sample answers by CLB level
“In this picture, I can see a park. There are some people sitting on benches and some children are playing. The weather looks nice. There is green grass and some trees. Some people are walking their dogs. The park looks busy because it is a weekend or holiday. I think people are enjoying their day off.”
Analysis: Passes the task requirement — describes a scene and offers an interpretation. Missing: spatial organisation, specific details, complex vocabulary. Grammar is simple but mostly correct.
“The image shows a busy urban park on what appears to be a sunny afternoon. In the foreground, a group of children — probably between five and ten years old — are playing near a fountain, and several of them appear to be splashing water. Moving to the middle ground, I can see two adults sitting on a wooden bench, and one of them is reading a book while the other appears to be on a phone call. In the background, there are several high-rise buildings visible beyond a row of mature trees, which suggests this park is located in a city centre. The overall atmosphere is relaxed and lively, and the scene gives the impression of a community space that residents use regularly during their leisure time.”
Analysis: Strong Content and Organisation. Uses spatial markers (foreground, middle ground, background), specific vocabulary (fountain, mature trees, high-rise), and closes with an interpretation. Grammar: mostly complex, controlled sentences.
“The photograph captures a vibrant urban park scene bathed in warm afternoon light. Dominating the foreground is a shallow circular fountain surrounded by young children who are gleefully splashing water — their colourful summer clothing suggesting this is a weekend outing. To the left of the fountain, a couple sits on a slatted wooden bench beneath the canopy of an old oak tree; one person is absorbed in a paperback while the other leans forward, apparently engaged in conversation on their phone. The background reveals a juxtaposition that is quintessentially modern: a dense row of mature trees providing a natural screen, beyond which three or four glass-and-steel high-rises punctuate the skyline. The composition as a whole conveys a sense of civic harmony — people of different ages and backgrounds sharing a single well-maintained public space, likely on a Saturday or Sunday given the relaxed, unhurried energy of the scene.”
Analysis: Near-perfect scoring. Rich, precise vocabulary (gleefully, canopy, juxtaposition, quintessentially, civic harmony). Complex grammar throughout. Spatial organisation clear. Interpretation integrated naturally, not bolted on.
How to use the 30-second preparation time
Spatial vocabulary to use
Location
- — in the foreground
- — in the background
- — in the middle ground
- — to the left / right
- — in the centre
- — at the top / bottom
Movement through scene
- — moving to
- — shifting our attention to
- — further back
- — closer to
- — beyond
- — adjacent to
Interpretation
- — this appears to be
- — the scene suggests
- — I believe this was taken
- — the overall atmosphere
- — the impression is
- — this gives the sense of
Frequently asked questions
How long is CELPIP Speaking Task 3?
Task 3 (Picture Description) gives you 30 seconds of preparation time and 60 seconds to speak. The 30-second prep window is when you should scan the image, decide your spatial structure, and pick your interpretation phrase.
What does the grader look for in CELPIP Speaking Task 3?
Graders evaluate four criteria: Content (what you describe), Coherence and Cohesion (how well your description flows), Vocabulary (specificity and range), and Delivery (pronunciation and fluency). Content and Vocabulary carry the most weight for Task 3.
Do I need to use foreground/background language in CELPIP Task 3?
Spatial language is not mandatory, but it signals CLB 9+ organisation to graders. Phrases like 'in the foreground', 'moving to the middle ground', and 'in the background' show organised description — a key Coherence marker. Without spatial structure, a description typically scores CLB 7 or below.
What is the best way to end a CELPIP Speaking Task 3 answer?
End with an interpretation — a sentence that says what the scene suggests. Examples: 'The overall atmosphere suggests this is a busy weekend afternoon' or 'This appears to be a community event given the variety of people present.' This moves you from description to inference, which is required for CLB 9.
Can I make up details in CELPIP Speaking Task 3?
Yes — graders do not penalise reasonable inference about an image. If you cannot read a sign clearly, you can say 'what appears to be a coffee shop'. Making reasonable, stated inferences actually demonstrates higher-level language ability than only describing what is directly visible.