Markov trace clustering using event logs generated by myself
Hello everyone,
I built a Petri net using PIPE and generated some event logs, then imported them into ProM and ran the Markov trace clustering plug-in. However, the event log is too simple to give a useful clustering dimension, because it only has two attributes, as follows.
If I want to cluster traces by trace attributes, for example putting cases 1, 3, 5 in one cluster and cases 2, 4, 6, 7 in another, how should I change my event log? Should I add some attributes?
Thank you in advance.
Comments
-
I have asked my colleague to answer your question. Please wait for a while.
Guangming Li
PhD student in Process Mining at Eindhoven University of Technology
-
Thank you very much! I will be waiting for the response.
-
Dear Yahui,
I understand you want to cluster the traces of an event log you created yourself. Without any data attributes, the only dimension left to cluster on is of course the control-flow dimension. Currently, this tool primarily uses the data attributes; it does not take control-flow ordering into account, only the frequency or occurrence of activity names. If you want, you can create your own dimensions by extending the tool. If you don't want to extend the tool or to cluster on control-flow, you have to add attributes at the case or event level.
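To illustrate what "only frequency or occurrence of activity names" means, here is a minimal Python sketch. It is not the plug-in's code; the function name `activity_profile` and the example traces are hypothetical. It shows that two traces with the same activities in a different order end up with the same profile.

```python
from collections import Counter


def activity_profile(trace, activities, binary=False):
    """Return one vector entry per activity name.

    binary=False: how often each activity occurs in the trace (frequency).
    binary=True:  1 if the activity occurs at all, else 0 (occurrence).
    """
    counts = Counter(trace)
    if binary:
        return [1 if counts[a] > 0 else 0 for a in activities]
    return [counts[a] for a in activities]


# Hypothetical traces: same activities, different ordering.
trace_1 = ["register", "check", "pay", "check"]
trace_2 = ["check", "register", "check", "pay"]

# Fixed activity order so both vectors use the same columns.
activities = sorted(set(trace_1) | set(trace_2))

profile_1 = activity_profile(trace_1, activities)
profile_2 = activity_profile(trace_2, activities)
```

Because both profiles are identical, such a dimension cannot separate these two traces; only an extra attribute (or a custom dimension) could.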
Note that the one involving clustering over time is not the one you should use for the purpose of your question. Also, you cannot set the number of output clusters in the Markov clustering algorithm. The whole point of this algorithm is that it tells you how many clusters there are.
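As a rough illustration of why the number of clusters is an output rather than an input, here is a simplified, generic Markov clustering (MCL) sketch in plain Python. It is an assumption-laden toy, not the plug-in's implementation: the function names, the `inflation` and `iterations` defaults, and the similarity matrix are all hypothetical, and clusters are read off with a simple "strongest attractor per column" rule.

```python
def normalize_columns(m):
    """Scale every column so that it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] for j in range(n)] for i in range(n)]


def mat_mul(a, b):
    """Plain square matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(similarity, inflation=2.0, iterations=50):
    """Simplified MCL: alternate expansion and inflation, then group nodes."""
    n = len(similarity)
    # Add self-loops so that no column is empty.
    m = [[similarity[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    m = normalize_columns(m)
    for _ in range(iterations):
        m = mat_mul(m, m)  # expansion: flow spreads along longer paths
        # inflation: strong flow gets stronger, weak flow fades
        m = normalize_columns([[x ** inflation for x in row] for row in m])
    # Simplification: assign each node (column) to its strongest attractor (row).
    clusters = {}
    for j in range(n):
        attractor = max(range(n), key=lambda i: m[i][j])
        clusters.setdefault(attractor, []).append(j)
    return sorted(clusters.values())


# Hypothetical trace similarities: traces 0-2 resemble each other,
# traces 3-4 resemble each other, and the two groups share nothing.
similarity = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 1, 0],
]

clusters = markov_cluster(similarity)
```

Note that no cluster count is passed in; in this sketch only `inflation` influences how fine-grained the result is.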
Note also that I've already answered several of your and other people's threads on how to use the plug-in before, for example in:
- https://www.win.tue.nl/promforum/discussion/908/how-to-use-traceclustering-plugin-export-sublogs
- https://www.win.tue.nl/promforum/discussion/701/how-can-i-incorporate-context-info-to-the-profile-vector-in-trace-clustering-using-mcl-clustering
- https://www.win.tue.nl/promforum/discussion/comment/2563
- https://www.win.tue.nl/promforum/discussion/comment/2150
Kind regards,
Bart
Bart Hompes - Eindhoven University of Technology
-
Dear Bart,
Thank you again for your response. I will add an attribute to my CSV event log, setting it to "a" for some cases, "b" for some, and "c" or something similar for others. Then I will try your plug-in again with this attribute as the clustering dimension. If it works, I will tell you the result. I also understand that, to get a better clustering result, the number of clusters cannot be set in advance but is generated by the algorithm itself.
Kind regards,
Tangyahui
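For anyone who wants to do the same, adding such a case-level attribute to a CSV log could look roughly like the Python sketch below. The column names `case`, `activity`, and `group`, the sample rows, and the case-to-value mapping are hypothetical; adapt them to your own log.

```python
import csv
import io

# Hypothetical CSV event log with only a case id and an activity column.
RAW_LOG = """case,activity
1,A
1,B
2,A
2,C
3,A
3,B
"""

# Hypothetical mapping from case id to the value of the new attribute.
CASE_GROUP = {"1": "a", "2": "b", "3": "a"}


def add_case_attribute(csv_text, case_column, attribute_name,
                       values_by_case, default="unknown"):
    """Append a column holding the same value for every event of a case."""
    reader = csv.DictReader(io.StringIO(csv_text))
    fieldnames = list(reader.fieldnames) + [attribute_name]
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # Every event inherits the value assigned to its case.
        row[attribute_name] = values_by_case.get(row[case_column], default)
        writer.writerow(row)
    return out.getvalue()


enriched_log = add_case_attribute(RAW_LOG, "case", "group", CASE_GROUP)
print(enriched_log)
```

The enriched CSV can then be converted and imported as before, with the new column available as an attribute to cluster on.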
-
Dear Bart,
By adding an attribute to my CSV file and setting different values for different traces, I got the clusters I wanted.
Thank you again for your response. I know you are very busy, and your responses are very helpful to me.
Kind regards,
TangYahui