This style guide will teach you the basics of how to transcribe for CastingWords. You should also see the grading guidelines and example transcript to get a better feel for what we want. However you should never transcribe without reading through this style guide and following what it says.
Lastly, although we try to document and explain what to do in any situation we can't cover everything, just use your head and do what makes sense.
To produce an accurate and readable transcript.
This means do not transcribe verbatim (unless specifically requested) but be accurate. Leave out false starts but transcribe exactly what the speaker says. Do not paraphrase.
NEVER include editorial comments, id numbers, titles, or anything else in the transcript that is not in the audio.
Related: NEVER post completed work, audio, or any sort of identifiable information about our clients anywhere else on the web. Discretion and confidentiality are part of doing this work. Most of this falls under copyright violations and the like but it bears repeating.
What If Style Guide - This style guide goes into deeper detail on issues in transcribing. Following it will get you high grades (and high bonuses).
Speaker name and colon start each new speaker. If there is a timestamp it goes before the speech, after the colon.
Label speakers with their first and last name the first time they appear. Label them with just the first name after that.
If you do not know a speaker's name use a descriptive name such as "Man:", "Woman 3:", "Host:", "Interviewer:", etc. The more descriptive the better. Do not use "Male:", "Female:" or non-descriptive labels such as "Speaker 1:".
Only relabel the speaker when the speaker changes or after a block non-verbal (A non-verbal that has a blank line above it and below it).
Jerry Coleman: [00:00] This is the beginning of the transcription so I am labeled with my full name.
John Gale: [00:31] Right, but after the first time you will be labeled with only your first name right?
Jerry: [00:57] Correct. As you can see I am labeled with just my first name now and will be labeled the same throughout the transcript.
Man 1: [01:14] Hi, I'm a new speaker and no one ever mentions my name so the transcriber should simply give me a descriptive name. In this case the only information that can be gathered on me is that I'm male so I'm "Man:" if there were 2 unidentified men I'd be "Man 2:"
John: [02:01] Ah, cool.
The rules are simple but important. Thanks for your attention.
Transcribers should mark unintelligible and inaudible words and phrases with "[xx]". Marking parts with "[xx]" will only hurt your grade if the grader can understand what was said where the "[xx]" is. But we prefer you use [xx] when in doubt and prefer it to guessing at what was said.
Note: Editors must remove all "[xx]" tags from the transcript. If they cannot figure out what is said they must mark it as [indecipherable 0:00] or [inaudible 0:00] and label it with a timestamp. Transcribers should not timestamp [xx] because we split up the transcript into smaller parts so the timestamps won't be accurate.
John: I was going out to the [xx] and a pack of buffalo were on the opposite hillside.
Example 2 (Final Edit) John: I was going out to the [inaudible 01:10] and a pack of buffalo were on the opposite hillside.
Use "[sp]" if you are unsure of the spelling of a term or name. You should not mark any words that can easily be looked up with [sp] and remember to always use spellcheck on your transcript before you submit it.
Note: Editors must remove all "[sp]" tags after double-checking and researching (if necessary) the term/name.
John: I went out to Wichita [sp] to see an old friend.
If you are pretty sure of what was said but not completely sure you should mark it with "[?]" this will tell an editor that they should closely review the section of text that it comes after.
Note: Editors must remove all "[?]" tags after they have reviewed and correct the text that the tag marked.
White: I went out to a baseball game [?] at midnight.
Anything that occurs on the recording but isn't represented by what is said should be marked in brackets.
Everything in brackets should be lowercase unless it is a proper noun.
Spell things the same way each time, even if it's incorrect. It's easier for an editor to correct one mistake repeated nine times rather than nine unique mistakes.
This means that you should “translate” speech into writing by leaving out filler words like “um”, “uh”, “like”, and “kind of”, while still keeping the personality and meaning of the original utterance. When they begin a sentence, words like “And”, “So” and “But” are usually considered filler words.
If the filler words contribute to the meaning and tone of the speaker, leave them there. However, in almost all cases, they are a distraction in written language and should be left out.
We suggest you look over our grading examples to understand the balance between cleaning up what is said while still being accurate.
It's OK to occasionally clean up grammar but the transcript must be an accurate representation of what was said.
So use nine not 9. But if the number is over nine use numerals such as 10. See the What If Style Guide for more details.
Use proper punctuation. This means periods, commas, ellipses, capitalization of proper nouns, everything just like your English teacher taught you.
We have decided we prefer no spaces between dots because sometimes the line breaks and they get split up. For a similar reason there shouldn’t be a space between an ellipsis and the word before or after it. Also we prefer not to use the "Smart" ellipses that Word (as well as some other applications) provides.
There is no need to transcribe radio commercial breaks unless specifically requested; simply mark the break as [radio break].
Even if the speaker spells out the name for clarity use the written version the first time in a paragraph. This is for visual clarity - people should see the written name. However, if it becomes a point of conversation it should be clear what the point of the conversation is. Most of the time this rule meets these goals.
"His name is John Smythe, again S-M-Y-T-H-E" not "His name is J-O-H-N S-M-Y-T-H-E, again S-M-Y-T-H-E."
"Go to mturk.com" not "Go to M-T-U-R-K dot C-O-M."
Do not break lines at 80 columns; just keep typing and allow the software to do the line wraps. Separate paragraphs with a blank line, not with a tab. If you are not using word wrap the transcript will become broken and awkward when you shrink the window instead of nicely adjusting to the new size.
Keeping paragraphs down to 500 characters or 4 or 5 lines is ideal. Sometimes this can't be done, but monster paragraphs will affect scores, as they affect readability.
We don't need comments, titles, or things other than the transcription in the assignment box. This is what the comments box is for, putting them in the transcript box just adds work for the editors and thus will negatively affect your grade.
| Incorrect | Correct |
| Okay | OK |
| mhm/mm-hm | mm-hmm |
| alright | all right |
| Email/e-mail | |
| Web site | website |
| internet | Internet |
| & | and |
| % | percent |
| colour | color |
Dnt use abbrvs b/c thr diff 2 read & ur grd wl b lo. Spell out the words.
Use standard rules (dictionary or style manual) for hyphenation.
The trend is toward fewer hyphens, so omit them when the meaning remains clear. Be consistent.
Some common hyphenation guidelines:
"It’s" always means "it is" or "it has." If you can change the "its" in your sentence to "his", leave out the apostrophe.
"You're" is always a contraction of "you are." If you can substitute "you are" in the sentence, then keep the apostrophe. If it doesn't make sense use "your" instead.