If you have ever watched a film or TV programme, or gone to an opera or theatre production, with either captions (called subtitles in the UK) for improved accessibility or subtitles (also called subtitles in the UK) for language interpretation, you will have seen a visual language that, when followed, helps users get more from the content.

These are conventions that professional subtitlers and captioners follow as best practice because they help with reading comprehension. They have been researched and tested, and put into practice, by broadcasters in particular, for over 40 years.

This guide should be applicable to TV or Film open captioning, Translation Subtitles, YouTube Subtitles, Burned-in Captions on social media videos and Video Game Captions.

Some of these are fairly obvious, but there are others that convey subtleties that can unlock meaning.

One thing that needs to be said is that the presentation of captions and translation subtitles should always follow the same rules. They have different purposes but the function is the same, reading the spoken word.
They both relay narrative and dialogue, and the end users include people with or without access to sound, who might also have vision or cognitive impairments. There are people who will say differently, but they only consider translation subtitles as being accessed by a standard-issue hearing audience. As soon as you consider that disabled people also watch foreign-language films and TV, you realise that the editorial accessibility conventions are just as relevant.

The following is a user-centric guide to the editorial conventions of an accessible caption or subtitle experience.

If you are a content producer then the main 16 considerations that can be universally applied on any VOD social media platform are on this downloadable cheat sheet. There are more links to resources at the end of the article.

1. Different Colours.
When text appears in a different colour, this denotes a different speaker. Colours for individual speakers should persist. If possible, a colour associated with a character could be used, like red for Iron Man, blue for Captain America or green for the Hulk, as this helps cognitively.

2. A Hyphen at the Start of a Line.
Where the platform doesn’t support different colours, hyphens are used instead to tell us the speaker has changed.

3. Combined Colours and Hyphens.
This is when there is either redundancy built into the design so it can be used on multiple platforms, or the content producer is ensuring this also works for colour blind users.
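
As a rough illustration of conventions 1 to 3, the sketch below gives each speaker a persistent colour and adds a hyphen whenever the speaker changes. The function name, the palette and the rule of only marking the line where the change happens are assumptions made for this example, not a published standard.

```python
# Hypothetical palette; a real production would choose its own colours.
PALETTE = ["white", "yellow", "cyan", "green"]


def style_dialogue(lines):
    """Return (colour, text) pairs for a list of (speaker, text) pairs.

    Each speaker keeps the same colour throughout, and a leading hyphen
    marks every line where the speaker differs from the previous line.
    """
    colours = {}
    styled = []
    previous = None
    for speaker, text in lines:
        # Assign a colour the first time a speaker appears, then reuse it.
        if speaker not in colours:
            colours[speaker] = PALETTE[len(colours) % len(PALETTE)]
        # Redundant hyphen so the change is visible without colour.
        prefix = "- " if previous is not None and speaker != previous else ""
        styled.append((colours[speaker], prefix + text))
        previous = speaker
    return styled


dialogue = [
    ("Anna", "Where were you?"),
    ("Ben", "At work."),
    ("Ben", "All night."),
    ("Anna", "Really?"),
]
for colour, text in style_dialogue(dialogue):
    print(f"[{colour}] {text}")
```

Because the hyphen and the colour carry the same information, the output still works on a platform that drops the colour, or for a viewer who cannot distinguish the colours.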

4. Single Quotation Marks.
This means that the person speaking is either a voiceover artist, a narrator or on the phone.

5. Named Speakers.
This can appear at seemingly odd times, but it does get used by producers when there are scenes where there are too many speakers for colours or hyphens alone.
This can also be used the first time someone speaks to introduce their caption colour, especially if the first time we encounter them is a particularly busy scene with lots of characters in the dialogue.

6. Double Quotation Marks.
When there are double quotation marks, this tells us that the voice is coming from a radio or loudspeaker. This could also be a synthetic voice.

7. Double Quotation Marks after No Quotation Marks.
This simply means that one speaker is quoting another.

8. Arrows.
When there is an arrow at the start or end of a line, it tells us that the speaker is off the screen in the direction the arrow is pointing.

9. Speech Delivery in Brackets.
When there is a description of the delivery of speech in brackets at the start of the line, this can mean that their speech has changed and/or how they are speaking is important to the narrative. For instance, this could mean someone is injured, intoxicated or under stress.

10. Hyphens In-between Letters.
When there are repeated letters with hyphens this tells us that the speaker is stammering.
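
A minimal sketch of the stammering convention, assuming that only the first letter of the word is repeated; the function name and the default number of repeats are made up for this example.

```python
def stammer(word, repeats=2):
    """Repeat the first letter of a word, joined by hyphens."""
    if not word:
        return word
    # e.g. ["b", "b", "but"] joined with hyphens
    return "-".join([word[0]] * repeats + [word])


print(stammer("but"))
print(stammer("please", repeats=3))
```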

11. Declared Accent.
When a person or character’s accent is relevant to the story being told or gives valuable context to a character, it is pointed out the first time they speak.

12. Apostrophes Instead of Silent Letters.
If a character’s accent means they either soften or miss out letters, like accents with a glottal stop, then these letters are replaced with an apostrophe. This helps convey more richness in the characterisation.

13. Bracketed Words
If a character whispers some lines, this is conveyed in the caption using brackets. It is also OK to precede the whispered words with an explanation, but that uses up more caption real estate.

14. Question and Exclamation Marks in Brackets
When you see either (!) or (?) at the end of the line, this tells you that the speaker is being sarcastic. The exclamation mark tells you that it is a sarcastic statement whilst the question mark is a sarcastic question.
Like whispering, there can be an indicator at the beginning of the line that says SARCASTIC: but this uses up more real estate and also does not differentiate between a question and a statement.
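
The sarcasm markers can be sketched as a small helper. This is an illustration only: the function name is hypothetical, and the choice to replace the closing punctuation with the marker (rather than add the marker after it) is an assumption about placement.

```python
def mark_sarcasm(line):
    """Append (?) to a sarcastic question and (!) to a sarcastic statement."""
    text = line.rstrip()
    # A line ending in a question mark is treated as a sarcastic question.
    marker = "(?)" if text.endswith("?") else "(!)"
    # Assumption: the marker takes the place of the closing punctuation.
    return text.rstrip("?!.") + marker


print(mark_sarcasm("Oh, great."))
print(mark_sarcasm("You don't say?"))
```

Compared with a SARCASTIC: label, the marker costs three characters and keeps the distinction between a question and a statement.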

15. Question Mark followed by Exclamation Mark.
The addition of ?! at the end of a line indicates that the words have been delivered with an incredulous tone, because the speaker is unable or unwilling to believe in something. Like sarcasm, knowing this can completely change the meaning.

Full caps with an exclamation mark simply tells us that the speaker is shouting or screaming. Both can be indicated using the words SHOUTS: or SCREAMS: but these take up a lot of space and add an additional word to read, so full caps and an exclamation mark save space.

If words are displayed in full caps, it can also mean they are describing a noise and are not speech.
The addition of a chevron or arrow can also tell us what direction the sound came from if that is important information.

18. Rrrarrrgghh!
Sometimes sound effect words can be substituted for or added to descriptions of sounds. These words are not onomatopoeias but more like the effect words used in comic books. This is the content producer telling us important contextual information and trying to bring the sound to life.
LIONS ROARING is more factual and would be used if identifying the source of the noise is obvious to someone who can hear it, whereas <Rrrarrrgghh! tries to imitate the sound, bringing the roar to life, but does not reveal what is making the sound.

19. A Line Starting With Two Dots
If a line begins with two dots, this is because it is in response to speech that is unheard by viewers who have access to the sound. This could be a situation like a character listening and responding to someone on the phone whom we can’t hear.

20. Some ALL CAPS words in the middle of a sentence.
If a single word or words in a sentence are displayed as all caps this is because the speaker is stressing those words in their delivery.

The only exception is if the word ‘I’ is stressed, it is then often displayed in a different colour.

21. Three Dots in the Middle of a Word or Sentence
The use of three dots in the middle of a sentence shows that the speaker has paused, which could be important because it could show us they are considering something, realising something or changing their mind.

22. Three Dots at the End of an Unfinished Sentence
This tells us that speech has trailed off. If the trailed-off speech was a question, an exclamation or delivered with disbelief, then the three dots are followed immediately with ?, ! or ?!

23. If the speaker comes back to the sentence after a pause, then the rest of the sentence is preceded by two dots.
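
The dot conventions in 21 to 23 can be summarised in a short sketch. The helper names are hypothetical, and the spacing around the dots is an assumption for the purpose of the example.

```python
def pause(before, after):
    """Three dots mid-sentence: the speaker paused, then carried on."""
    return f"{before}... {after}"


def trail_off(text, ending=""):
    """Three dots at the end: speech trailed off.

    `ending` may be "?", "!" or "?!" if the unfinished sentence was a
    question, an exclamation or delivered with disbelief.
    """
    if ending not in ("", "?", "!", "?!"):
        raise ValueError("ending must be '', '?', '!' or '?!'")
    # Drop any existing closing punctuation before adding the dots.
    return f"{text.rstrip('.?! ')}...{ending}"


def resume(text):
    """Two dots at the start: the speaker returns to an earlier sentence."""
    return f"..{text}"


print(pause("I was going to", "never mind"))
print(trail_off("But I thought", "?"))
print(resume("you had already left."))
```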

24. Music Information
There are lots of reasons why music is used. Sometimes it helps create atmosphere, other times a particular piece or its lyrics can help with storytelling, and sometimes it’s just there as audio decoration.

There are different ways a content producer can give context to the use of music.

If knowing what a piece of music is is important, there can be an informational caption.

If it is important to know the style and delivery of the piece then this can be in an ALL CAPS description.

If these are combined, then descriptors and information are combined also.

If the music is incidental but the atmosphere it creates is important, then this is in an ALL CAPS description.

When characters, artists or crowds sing and hearing the words is important, the lyrics can be presented in two ways: topped and tailed with hashes or with musical notes.

If the song is interrupted, the heard lyrics end with three dots, and if the lyrics start with three dots that tells us that this is not the beginning of the song.
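
The lyric conventions above could be sketched as follows. The function and parameter names are hypothetical, and the single space between the marker and the lyrics is an assumption.

```python
def caption_lyrics(lyrics, marker="#", from_start=True, interrupted=False):
    """Top and tail sung lyrics with a marker such as a hash or a note.

    Leading dots show we joined the song part-way through; trailing
    dots show the song was interrupted.
    """
    text = lyrics.strip()
    if not from_start:
        text = "..." + text
    if interrupted:
        text = text + "..."
    return f"{marker} {text} {marker}"


print(caption_lyrics("Happy birthday to you"))
print(caption_lyrics("to you", marker="♪", from_start=False, interrupted=True))
```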

Compound Use.
Lots of things like expression and context are conveyed using punctuation and symbols rather than described in words, because this allows more information to be delivered more efficiently.
Both the number of words displayed and how long they are displayed for have their limitations, so understanding the subtitle and caption visual language gives all users the opportunity to access richer information.

If you would like to know more about subtitling and captioning, please check out these resources:

BBC Subtitling (Closed Caption) Guidelines
BBC R&D 360 Video and VR Captions Display Research
BBC R&D Subtitles and Closed Caption Quality Research
How Big Should Closed Captions and Subtitles Be?
How TV Subtitles and Closed Captions are Produced
How to Create Subtitles and Closed Captions
The History of Access Services at the BBC
W3C Closed Caption Guidelines
YouTube Guide to Captioning


