Here are some preliminary keyword searches of the HackOH5 dataset. I ran these before our hackathon to give the team ideas and seed our brainstorming session. Time will be at a premium during the hackathon, so we need to come as prepared as possible.
As you can see, I picked several general ideas, along with specific terms related to each idea that we could search the corpus for.
JUSTICE
justice – 14,422
protest* – 22,340 (includes protestant and protest)
protest – 10,947
riot – 13,910
SOCIAL
race – 56,232
black – 79,591 (not necessarily race)
african-american – 0
“ negro” – 10,293
“latin*” – 29,766
“Hispan*” – 1,076 (incl hispanic, hispano)
chican – 231 (incl chicano, chicana)
asian – 8,142
chinam – 257 (incl chinamen, chinaman)
oriental – 2,832
jap – 504 (excl japanese, japan)
jap* – 20,108 (incl japanese, japan)
“ Nip ” – 0 (excl nippon, nip* but could be verb to nip)
“ Hun ” – 933
GENDER
“feminis*” – 4,771 (incl feminist, feminism)
suffrage* – 1,315 (incl suffrage, suffragette)
“ gender*” – 4,206 (excl engender, etc)
SEX/DRUGS/ROCK & ROLL
drug – 29,987
sex –
rock –
RELIGION
protestant – 6
catholic – 5,366
bible – 11,629
holy – 6,764
divin* – 13,471 (incl divine, divinity)
sacred – 4,739
Jesus – 7,746
God – 40,416
jew* – 35,873 (includes jew, jews, jewish)
muslim – 2,089
islam* – 1,860 (includes islam, islamic, etc)

ECONOMY/JOBS

econom* – 47,324
job – 53,563
career – 26,045
interview – 22,440
market – 17,377
money – 60,287
dollar – 27,183
POLITICS
vote – 40,250
election – 50,565

war – 607,649

INTERNATIONAL
europe – 27,540
asia – 16,441
latin america – 3,294
africa – 30,044
vietnam – 10,811
president – 218,626
LOCAL/CAMPUS
professor –
class –
campus – 193,087
restaurant – 11,038
police – 21,098
community – 77,232
local – 8
mayor –
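
If anyone wants to rerun or extend these counts during the hackathon, here is a minimal Python sketch of how the two kinds of search in the lists above (plain terms and trailing-* wildcards) might be reproduced. It is not the tool I used for the numbers above. The function names term_to_regex and count_hits are made up for illustration, the sample sentences are invented, and the matching rules are assumptions: a trailing * matches any word starting with the stem, and anything else must match as a whole word or phrase. It also counts every occurrence, which may not be how the original counts were tallied.

```python
import re

def term_to_regex(term):
    """Translate a search term into a case-insensitive regex.

    Assumed conventions (hypothetical, not the original tool's rules):
      - a trailing "*" matches any word that starts with the stem
      - otherwise the term must match as a whole word or phrase
    """
    term = term.strip()
    if term.endswith("*"):
        body = re.escape(term[:-1]) + r"\w*"
    else:
        body = re.escape(term)
    return re.compile(r"\b" + body + r"\b", re.IGNORECASE)

def count_hits(documents, term):
    """Count the total number of matches of `term` across all documents."""
    pattern = term_to_regex(term)
    return sum(len(pattern.findall(doc)) for doc in documents)

# Invented sample text standing in for the real corpus.
sample = [
    "The protest drew Protestant and Catholic students alike.",
    "Protesters marched to campus; the protest ended peacefully.",
]

for term in ["protest", "protest*", "latin america"]:
    print(term, count_hits(sample, term))
```

Requiring word boundaries is what keeps a plain search for protest from also picking up protestant, which is the same distinction the padded-space searches above are trying to make.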