A Brief Introduction to Data Mining Projects in the Humanities

Document Type

Article

Publication Date

4-2012

Publication Source

Bulletin of the American Society for Information Science & Technology

Volume Number

38

Issue Number

4

First Page

20

Last Page

23

Publisher

Wiley

ISSN

1931-6550

Abstract

Data mining offers the capability to view data in a new light, discovering associations and patterns not appreciated before. For the humanities domain, it exemplifies the interdisciplinary efforts of digital humanities. The technique provides answers and prompts further questions from new discoveries. Part of knowledge discovery in databases, data mining involves identifying relevant n-grams, classifying and reclassifying results, modeling the interdependence of variables and clustering results into meaningful subgroups. From designing research questions to determining how best to display and communicate results, the process requires collaboration between information professionals and humanities scholars. A selection of data mining projects illustrates how the technique is being applied for humanities research. Tools for data mining are readily available online, through simple web interfaces or for download and customization for optimal results.

Share

COinS