Style Guide for Transcribing and Editing

Overview

This style guide will teach you the basics of how to transcribe for CastingWords. You should also see the grading guidelines and example transcript to get a better feel for what we want. However you should never transcribe without reading through this style guide and following what it says.

Lastly, although we try to document and explain what to do in any situation we can't cover everything, just use your head and do what makes sense.

Goal

To produce an accurate and readable transcript.

This means do not transcribe verbatim (unless specifically requested) but be accurate. Leave out false starts but transcribe exactly what the speaker says. Do not paraphrase.

NEVER include editorial comments, id numbers, titles, or anything else in the transcript that is not in the audio.

Related: NEVER post completed work, audio, or any sort of identifiable information about our clients anywhere else on the web. Discretion and confidentiality are part of doing this work. Most of this falls under copyright violations and the like but it bears repeating.

Other Helpful Pages

What If Style Guide - This style guide goes into deeper detail on issues in transcribing. Following it will get you high grades (and high bonuses).

Required Guidelines

Your transcription HIT will be rejected if you fail to follow these guidelines.

Speaker Labeling

Speaker name and colon start each new speaker. If there is a timestamp it goes before the speech, after the colon.

Label speakers with their first and last name the first time they appear. Label them with just the first name after that.

If you do not know a speaker's name use a descriptive name such as "Man:", "Woman 3:", "Host:", "Interviewer:", etc. The more descriptive the better. Do not use "Male:", "Female:" or non-descriptive labels such as "Speaker 1:".

Only relabel the speaker when the speaker changes or after a block non-verbal (A non-verbal that has a blank line above it and below it).

Example (with timestamps)

Jerry Coleman: [00:00] This is the beginning of the transcription so I am labeled with my full name.

John Gale: [00:31] Right, but after the first time you will be labeled with only your first name right?

Jerry: [00:57] Correct. As you can see I am labeled with just my first name now and will be labeled the same throughout the transcript.

Man 1: [01:14] Hi, I'm a new speaker and no one ever mentions my name so the transcriber should simply give me a descriptive name. In this case the only information that can be gathered on me is that I'm male so I'm "Man:" if there were 2 unidentified men I'd be "Man 2:"

John: [02:01] Ah, cool.

More details and examples

Formatting

The rules are simple but important. Thanks for your attention.

More detailed with examples

Special Tags

[xx] - [unintelligible], [inaudible]

Transcribers should mark unintelligible and inaudible words and phrases with "[xx]". Marking parts with "[xx]" will only hurt your grade if the grader can understand what was said where the "[xx]" is. But we prefer you use [xx] when in doubt and prefer it to guessing at what was said.

Note: Editors must remove all "[xx]" tags from the transcript. If they cannot figure out what is said they must mark it as [indecipherable 0:00] or [inaudible 0:00] and label it with a timestamp. Transcribers should not timestamp [xx] because we split up the transcript into smaller parts so the timestamps won't be accurate.

Example (Initial Transcript)

John: I was going out to the [xx] and a pack of buffalo were on the opposite hillside.

Example 2 (Final Edit) John: I was going out to the [inaudible 01:10] and a pack of buffalo were on the opposite hillside.

[sp]

Use "[sp]" if you are unsure of the spelling of a term or name. You should not mark any words that can easily be looked up with [sp] and remember to always use spellcheck on your transcript before you submit it.

Note: Editors must remove all "[sp]" tags after double-checking and researching (if necessary) the term/name.

Example

John: I went out to Wichita [sp] to see an old friend.

[?]

If you are pretty sure of what was said but not completely sure you should mark it with "[?]" this will tell an editor that they should closely review the section of text that it comes after.

Note: Editors must remove all "[?]" tags after they have reviewed and correct the text that the tag marked.

Example

White: I went out to a baseball game [?] at midnight.

Non-verbals

Anything that occurs on the recording but isn't represented by what is said should be marked in brackets.

Everything in brackets should be lowercase unless it is a proper noun.

Examples

Preferred Guidelines (Follow these to get a high grade)

Consistency

Spell things the same way each time, even if it's incorrect. It's easier for an editor to correct one mistake repeated nine times rather than nine unique mistakes.

Clean up the transcript (But don't paraphrase!)

Don't Transcribe Filler Words

This means that you should “translate” speech into writing by leaving out filler words like “um”, “uh”, “like”, and “kind of”, while still keeping the personality and meaning of the original utterance. When they begin a sentence, words like “And”, “So” and “But” are usually considered filler words.

If the filler words contribute to the meaning and tone of the speaker, leave them there. However, in almost all cases, they are a distraction in written language and should be left out.

We suggest you look over our grading examples to understand the balance between cleaning up what is said while still being accurate.

Clean up Verbal Stumbling but Retain the Speaker's Tone

It's OK to occasionally clean up grammar but the transcript must be an accurate representation of what was said.

Start Each Line with a Capital Letter

The exception being when there is an ellipsis before the first word where a speaker starts mid-sentence (explained below, under "Specific Punctuation Do's and Don’ts").

Spell Out One through Nine but use Numerals over Nine

So use nine not 9. But if the number is over nine use numerals such as 10. See the What If Style Guide for more details.

Punctuation

Use proper punctuation. This means periods, commas, ellipses, capitalization of proper nouns, everything just like your English teacher taught you.

Specific Punctuation Do's and Don’ts

Semicolons ";" are too formal for the conversational style of transcriptions and should be used only sparingly.

Colons are only to be used for speaker labels, e.g., “Man 1:”, for times of day, e.g., “2:45 PM”, timestamps, e.g., “[00:57]” or “[indecipherable 03:41]”, or Bible (or similar) verses, e.g., “Ecclesiastes 3:1-8”. There is a time to use a colons, and a time to not.

Like semicolons, colons are not conversational, but the main reason to not use them (except for the above examples) is that any colon that has a space after it shows up as a speaker label and will have to be fixed before we deliver the transcription.

Hyphens, "-", em dashes, " -- " (or ""), ellipses, "..." and commas all have specific uses, please use them correctly. A hyphen is only for joining words, numbers and other symbols. It is not an all-purpose substitute for em dashes, ellipses or commas. The proper use of hyphens is explained further down the page.

Use an em dash to place -- special emphasis -- on a word or phrase. It’s like increasing the volume of a guitar during a solo to make it stand out.

Use an ellipsis... to mark a break, such as a speaker breaking off mid-sentence..., ...starting mid-sentence. If the audio cuts off mark it as [cuts off] (see non-verbals).

Important! Style Change for Ellipsis Usage

We have decided we prefer no spaces between dots because sometimes the line breaks and they get split up. For a similar reason there shouldn’t be a space between an ellipsis and the word before or after it. Also we prefer not to use the "Smart" ellipses that Word (as well as some other applications) provides.

Don't Transcribe Radio Commercial Breaks

There is no need to transcribe radio commercial breaks unless specifically requested; simply mark the break as [radio break].

Use written form for URLs and spelled-out names

Even if the speaker spells out the name for clarity use the written version the first time in a paragraph. This is for visual clarity - people should see the written name. However, if it becomes a point of conversation it should be clear what the point of the conversation is. Most of the time this rule meets these goals.

Examples

"His name is John Smythe, again S-M-Y-T-H-E" not "His name is J-O-H-N S-M-Y-T-H-E, again S-M-Y-T-H-E."
"Go to mturk.com" not "Go to M-T-U-R-K dot C-O-M."

Use Word Wrap

Do not break lines at 80 columns; just keep typing and allow the software to do the line wraps. Separate paragraphs with a blank line, not with a tab. If you are not using word wrap the transcript will become broken and awkward when you shrink the window instead of nicely adjusting to the new size.

Keep Paragraphs Short

Keeping paragraphs down to 500 characters or 4 or 5 lines is ideal. Sometimes this can't be done, but monster paragraphs will affect scores, as they affect readability.

Submit just the transcript.

We don't need comments, titles, or things other than the transcription in the assignment box. This is what the comments box is for, putting them in the transcript box just adds work for the editors and thus will negatively affect your grade.

Spelling

Editors: check the spelling of proper names against resources such as the podcast's website or using Google.

Preferred Spellings

Incorrect Correct
Okay OK
mhm/mm-hm mm-hmm
alright all right
Email/e-mail email
Web site website
internet Internet
& and
% percent
colour color

More Guidelines

If you're reading this far, we love you!

Abbreviations

Dnt use abbrvs b/c thr diff 2 read & ur grd wl b lo. Spell out the words.

Hyphenation

Use standard rules (dictionary or style manual) for hyphenation.
The trend is toward fewer hyphens, so omit them when the meaning remains clear. Be consistent.

Some common hyphenation guidelines:

"its" versus "it's"

"It’s" always means "it is" or "it has." If you can change the "its" in your sentence to "his", leave out the apostrophe.

"your" versus "you're"

"You're" is always a contraction of "you are." If you can substitute "you are" in the sentence, then keep the apostrophe. If it doesn't make sense use "your" instead.

“were” versus “we’re”

“We’re” is always a contraction of “we are”. If you can substitute "we are" in the sentence, then keep the apostrophe. If it doesn't make sense, use "were" instead.

“lets” versus “let’s”

“Let’s” is always a contraction of “let us”. If you can substitute "let us" in the sentence, then keep the apostrophe. If it doesn't make sense, use "lets" instead.