Skip to content

Latest commit

 

History

History

stat_analysis

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Statistical analysis of the sample data

Analysis based on Lead feature

Do men or women dominate lead roles in Hollywood movies?

numbers

normal

Has gender balance in lead roles changed over time?

fraction

amount

Grossing by lead gender

grossing

STATISTICAL ANALYSIS OF THE TRAINING DATA


female leads: .......................................... 254

male leads: ............................................ 785

total samples: ........................................ 1039


APPROXIMATION TO NORMAL DISTRIBUTION


mean [μ]: ............................................ 0.244

standard deviation [σ]: ............................... 0.43

P(gender ∈ [0.5,1]): ................................. 0.276


GROSSING BY GENDER


female mean [μ]: ..................................... 98.74

female strd deviation [σ]: .......................... 138.13

male mean [μ]: ...................................... 115.16

male strd deviation [σ]: ............................ 155.79


Analysis based on lead words

Do men or women dominate speaking roles in Hollywood movies?

bin-roles

Has gender balance in speaking roles changed over time?

roles-years

Grossing over gender and words


female roles: ......................................... 3644

male roles: ........................................... 8070

total roles: ......................................... 11714


APPROXIMATION TO NORMAL DISTRIBUTION


mean [μ]: ........................................... -0.378

standard deviation [σ]: .............................. 0.926

P(gender ∈ [0,1]): ................................... 0.342


Sum of words:

words-gross-sing

words-gross-male

Fraction of total words:

words-gross-frac

words-gross-frac-male