-
Notifications
You must be signed in to change notification settings - Fork 7
Data
DiMe is centered around loggers uploading events, represented by the
Event
class and its subclasses. For example an event could be:
- the user has looked at a PDF document
- the user's pulse is so-and-so
Some events contain linked objects, such as the document that was
looked at. These objects, which can appear in several events, e.g.
when the user closes the same document are represented by the
InformationElement
class and its subclasses.
The image below illustrates several events being uploaded over time. Some refer to information elements, many events may naturally refer to the same information element, e.g. opening and closing the same document.
An event is uploaded to DiMe encoded as a JSON object using the uploading API.
The DiMe data model is implemented as a set of Java classes in the
source directory: src/main/java/fi/hiit/dime/data
. You
can also view the class hierarchy from the JavaDoc page
(requires a HIIT username that is member of the reknow group).
The actual event uploaded to DiMe needs to be one of the concrete
subclasses of the Event
class, which represent different types of
events. To see the full list of currently implemented events check the
JavaDoc page. New classes can be added as needed.
All event subclasses are ultimately documented in the JavaDoc page, but below some of the most important ones are described together with the most important fields needed. Almost all fields can be left empty if they do not make sense for the particular application.
All events should preferrably have these.
-
actor
: the program that produced the event, e.g. "Firefox". -
origin
: typically the host name of the computer where the event was generated. -
type
: detailed type of event, using the Semantic Desktop ontology, see: http://www.semanticdesktop.org/ontologies/2010/01/25/nuao -
start
: time stamp when the event was started. -
end
: time stamp when the event ended - DiMe can fill this if duration was supplied. -
duration
: duration of event in seconds - DiMe can fill this if end time was supplied.
A search event, i.e. the user doing a text query. In addition to the common fields, a search event has:
-
query
: the text of the query.
A desktop event, such as opening a document in the computer graphical environment.
-
targettedResource
: the InformationElement object that is targetted by this event.
An event representing an explicit feedback by the user, e.g. ranking a document as relevant.
-
targettedResource
: the InformationElement object that is targetted by this event. -
relatedEvent
: a related event, e.g the SearchEvent that introduced the document that we are giving feedback to. -
value
: the feedback value, e.g. the relevance of the document.
Many events refer to information elements, such as the document that
was opened. The actual information elements uploaded to DiMe
(typically as part of the event upload) are represented by the
subclasses of the InformationElement
class.
All information element subclasses are ultimately documented in the JavaDoc page, but below some of the most important ones are described together with the most important fields needed. Almost all fields can be left empty if they do not make sense for the particular application.
-
uri
: URI of the information element, e.g. path on computer or web URL. -
plainTextContent
: plain text content of the information element. This is indexed for text search. -
isStoredAs
: form of storage according to the Semantic Desktop ontology: http://www.semanticdesktop.org/ontologies/2007/03/22/nfo -
type
: detailed data type according to the Semantic Desktop ontology: http://www.semanticdesktop.org/ontologies/2007/03/22/nfo
-
mimeType
: mime type of the document, see: https://en.wikipedia.org/wiki/MIME#Content-Type -
title
: the title of the document.
Below is an example of a DesktopEvent
with its corresponding
Document
. This example represents a user having accessed a web page,
i.e. the document is the web page in this case, and the event
specifies when the web page was accessed.
The example is given below in JSON, which is the data format used for uploading to and downloading from DiMe. The fields and their meaning are explained below in the comments.
{
// this corresponds to the Java event class
"@type": "DesktopEvent",
// the program that produced the event, here the web browser
"actor": "Firefox",
// a unique id, if left empty DiMe will generate a random id
"id": "f9654c54d7f38acfe179b04de8c0554ea1d6481b",
// typically the host name of the computer where the event was generated
"origin": "hp8x-15.cs.helsinki.fi",
// time stamp when the event was started
"start": "2015-08-11T12:56:53Z",
// type using the Semantic Desktop ontology
"type": "http://www.semanticdesktop.org/ontologies/2010/01/25/nuao#UsageEvent"
// the contained InformationElement
"targettedResource": {
// the Java class
"@type": "Document",
// a unique id, if left empty DiMe will generate a random id
"id": "d74fa5afb9c04e148fc75a640348f8648c17812b",
//
"isStoredAs": "http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#RemoteDataObject",
// mime type, optional
"mimeType": "text/html",
// the plain text, used for search
"plainTextContent": "The revolution has begun...",
// title of the document
"title": "Revolution of Knowledge Work | Revolution of Knowledge Work",
// type using the Semantic Desktop ontology
"type": "http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#Document",
// the URI of the web page or document
"uri": "http://www.reknow.fi/"
},
}
If you need a new data class, it should be implemented as subclass of the existing ones - please discuss it first e.g. in
- the #dime channel on Slack, or
- open an issue in the issue tracker, or
- send an email to [email protected].