Andrew S. Gordon

The Commonsense Activity Resource Page

Cataloging the activities that everyone knows about

1. Overview

As a graduate student at Northwestern University, I decided to catalog the full breadth of activities that everyone knows about. These commonsense activities included all of the contexts of human behavior that we all have expectations about, such as eating in a restaurant, flying on a passenger airplane, and casting a vote at a polling station. Theoretically, these activities have been described using the term "scripts", but early research in this area was never able to identify more than a handful of examples. My aim was to identify activities on an extremely large scale, and the 768 activity representations in the resulting catalog remains the largest collection in existence today.

Rather than attempting to construct traditional formal representations of activities, I authored these representations using simple frame structure. Each activity frame consisted of a short English discriptor, and contained references to the places where the activity took place, the people that participated in the activity, the physical things that were involved, the events or actions that comprised the activity, and the miscellaneous abstract concepts that were related to the activity in some way. Each reference to place, people, thing, event, or idea in these representations was a term from a controlled vocabulary, the Library of Congress Thesaurus for Graphic Materials. The breadth of the LCTGM is extremely broad, having been developed over the course of decades of indexing millions of photographs. The success of this knowledge representation effort is largely due to the methodology that was employed. In short, I analyzed each term in the LCTGM to identify the activities in which the term should be thought of as a component. For each candidate activity that was postulated, the LCTGM was consulted to determine whether enough of the remaining concepts of the activity were captured by other controlled terms. In cases where the LCTGM was rich in component concepts, these candidate activities were added to the collection, and otherwise they were discarded. The effect of this approach was to capitalize on the breadth of the LCTGM to make the knowledge representation effort tractiable while still achieving a very broad scale.

A comparison of this collection of activities to other large-scale knowledge representation efforts (CYC, WordNet, ThoughtTreasure) was done by Erik Meuller, and is described in the following report:

Mueller, Erik T. (1999) A database and lexicon for ThoughtTreasure. Available at cogprints.org.

2. Publications

Several publications are available that describe the collection of activities and the applications in which it was used. The JASIST article in 2001 is the definitive reference to the work, whereas the full list of activities can be found in my 1999 Ph.D. dissertation.

3. Downloads

Provided here are several resources that may be useful to researchers interested in large-scale representations of commonsense activities. The activity representations that are made available here remain (C) copyright 1999 by Andrew S. Gordon, as they were first published in their entirety in my doctoral dissertation. All rights are reserved. If you have an interest in incorporating these representations into your work, please contact me directly.

4. Applications

The potential applications of this collection of commonsense activities are numerous. To date, two types of applications have been explored.