What exactly are classifiers in XES?

hpl002 · December 2020

Currently reading up on the published XES standard, but having a hard time grasping what trace and event classifiers are. The document is not very detailed when it comes to exactly how they are implements and i suspect this is by design.

The exact wording in the specification being "The identity of the event shall be derived from the actual values of the attributes with these keys".

Derived how?

Would i be correct in assuming that these classifiers are typically just concatenated values, given that a event E1 and E2 pertaining to trace T1 should both share at least some part of the identity produced by the classifier?

Here is a very reduced example from http://www.processmining.org/event_logs_and_models_used_in_book with the classifiers in bold.

<?xml version="1.0" encoding="utf-8"?>

</global>

</global>

<classifier name="Activity" keys="Activity" />

<classifier name="activity classifier" keys="Activity" />

<trace>

<event>

</event>

</trace>

</log>

This log originates from the process mining book, but does not conform to the current XES spec. I have not checked compatibility with any prior version of the spec and assuming that they have not introduced any breaking changes..

Image: https://www.win.tue.nl/promforum/uploads/editor/82/qq0qhxc0pnp3.png

I would greatly appreciate any and all input that could help me resolve this confusion. I it might become clearer if i get to have a look at a up to date XES log that also conforms to the current specification.

hverbeek · December 2020

Hi,

The logs in the book may not conform to the IEEE XES standard, as they predate the IEEE XES standard.

An event classifier is just a list of attribute keys, like "concept:name lifecycle:transition". Discovery algorithm are encouraged to use event classifiers instead of plain attributes. The log can then inform the discovery algorithm which combinations of attributes make sense as the activity name. If a discovery algorithm would use the example event classifier, typical values for the activity names could be "A+complete" and "B+start" (assuming "+" is used to glue the values of concept:name and lifecycle:transition together).

A trace classifier is also just a list of attribute keys, but its use is different. The IEEE XES standard allows for events that are children of the log. Usually, events are children of a trace. For such 'trace-less' events, the trace classifier can be used to group these vents into traces. If the value of the trace classifier for two events is the same, then they belong to the same trace.

Kind regards,

Eric.

What exactly are classifiers in XES?

Comments

Howdy, Stranger!

Categories

In this Discussion