Spoken Cues To Deception
Spoken Cues To Deception
Spoken Cues To Deception
CS 4706
What is Deception?
Defining Deception
• Duration features
– Phone / Vowel / Syllable Durations
– Normalized by Phone/Vowel Means, Speaker
• Speaking rate features (vowels/time)
• Pause features (cf Benus et al ‘06)
– Speech to pause ratio, number of long pauses
– Maximum pause length
• Energy features (RMS energy)
• Pitch features
– Pitch stylization (Sonmez et al. ‘98)
– Model of F0 to estimate speaker range
– Pitch ranges, slopes, locations of interest
• Spectral tilt features
Lexical Features
• Presence and # of filled pauses • Presence of hedges
• Is this a question? A question • Complexity: syls/words
following a question
• Presence of pronouns (by • Number of repeated words
person, case and number) • Punctuation type
• A specific denial? • Length of unit (in sec and
• Presence and # of cue phrases words)
• Presence of self repairs
• # words/unit length
• Presence of contractions
• Presence of positive/negative • # of laughs
emotion words • # of audible breaths
• Verb tense
• # of other speaker noise
• Presence of ‘yes’, ‘no’, ‘not’,
negative contractions • # of mispronounced words
• Presence of ‘absolutely’, ‘really’ • # of unintelligible words
Subject-Dependent Features: Calibrating
Truthful Behavior
• % units with cue phrases
• % units with filled pauses
• % units with laughter
• Ratio lies with filled pauses/truths with filled
pauses
• Ratio lies with cue phrases/truths with filled
pauses
• Ratio lies with laughter / truths with laughter
• Gender
Columbia University– SRI/ICSI – University of Colorado
Deception Corpus: An Example Segment
SEGMENT TYPE
Breath Group
LABEL
Obtained LIE
from subject
pedal presses.
ACOUSTIC FEATURES
max_corrected_pitch 5.7 pitch_change_last_word -11.5
mean_corrected_pitch 5.3 normalized_mean_energy 0.2
Produced
pitch_change_1st_word -6.7 unintelligible_words 0.0
Produced using automaticall
ASR output y
and other using lexical
LEXICAL FEATURES
acoustic transcription
analyses has_filled_pause YES negative_emotion_word NO .
positive_emotion_word YES contains_pronoun_i YES
uses_past_tense NO verbs_in_gerund YES
PREDICTION
LIE
CSC Corpus: Results
• Classification via Ripper rule induction, randomized 5-fold
xval)
– Slash Units / Local Lies — Baseline 60.2%
• Lexical & acoustic: 62.8 %; + subject dependent:
66.4%
– Phrases / Local Lies — Baseline 59.9%
• Lexical & acoustic 61.1%; + subject dependent:
67.1%
• Other findings
– Positive emotion words deception (LIWC)
– Pleasantness deception (DAL)
– Filled pauses truth
– Some pitch correlations — varies with subject
But…How Well Do Humans Do?
By Interviewee
Personality Measure: NEO-FFI