Interpreting the Heuristic Miner Fitness Value
I am running ProM 6.6 with the "HeuristicsMiner" package. When I mine a model, a fitness value is displayed in the lower left-hand corner of the visualization. However, I don't know how to interpret this value, and the academic publications I've found don't shed any light on it.
Hence, my question is: How do I interpret the fitness value that comes out of the HeuristicsMiner?
Comments
-
@JBuijs: I've been watching your MOOC at FutureLearn, which covers the heuristics miner. However, it doesn't seem to cover this particular issue. Do you know how to interpret the fitness statistic?
-
Dear Aron, I looked through the Heuristic Miner code and found that apparently the following class is responsible for calculating the displayed fitness value:
org.processmining.plugins.heuristicsnet.miner.heuristics.miner.fitness.ImprovedContinuousSemantics
The JavaDoc says:
/**
* Calculates the fitness of <code>HeuristicsNet</code> objects
* in a population based on the ratio of activities
* in a log that could be parsed (or replayed) without
* problems. The ratio contains a punishment component based
* on the amount of problems encountered during the log replay
* <i>and</i> the amount of traces with parsing problems. In a
* nutshell, this punishment factor benefits the
* <code>HeuristicsNet</code> objects that have fewer problems
* scattered in fewer traces.
* <p> Note that the parsing semantics of this fitness measure
* is a continuous one (i.e., the log replay does not stop
* when problems are encountered).
*
* @author Ana Karla Alves de Medeiros
*
*/
I believe the corresponding academic publication is this one: http://is.ieis.tue.nl/staff/aweijters/WP166.pdf
The CPM measure described there seems to match the description in the JavaDoc.
I hope this helps.
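To make the JavaDoc description a bit more concrete, here is a minimal sketch of what such a measure could look like. This is an assumption on my side and not the ProM code: the class FitnessSketch, the method fitness, all parameter names, and the exact shape of the punishment term are hypothetical and only follow the wording "ratio of parsed activities, minus a punishment that depends on the number of problems and on the number of traces with problems".

```java
/** Hypothetical sketch of a punished parsing ratio; NOT the ProM implementation. */
final class FitnessSketch {

    /**
     * Assumed form: (parsedActivities - punishment) / totalActivities.
     * The punishment grows with the number of replay problems and with the
     * number of traces in which those problems occur.
     */
    static double fitness(int totalActivities, int parsedActivities,
                          int problems, int totalTraces, int tracesWithProblems) {
        // Fewer problem-free traces give a smaller divisor, hence a larger punishment.
        double punishment = (double) problems / (totalTraces - tracesWithProblems + 1);
        return (parsedActivities - punishment) / totalActivities;
    }

    public static void main(String[] args) {
        // 100 activities in 10 traces, 95 parsed, 5 problems confined to 1 trace.
        System.out.println(fitness(100, 95, 5, 10, 1));
        // Same 5 problems, but scattered over 5 traces: slightly lower value.
        System.out.println(fitness(100, 95, 5, 10, 5));
        // Many problems in every trace: the punishment exceeds the parsed count.
        System.out.println(fitness(100, 60, 80, 10, 10));
    }
}
```

In this sketch, a net whose problems are concentrated in few traces scores higher than one with the same number of problems scattered over many traces. Nothing in this assumed form clamps the result at zero, so the value drops below zero once the punishment is larger than the number of parsed activities. Whether the actual class behaves the same way would have to be checked in the source.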
-
@fmannhardt -- thanks, this is somewhat helpful. However, I am getting some negative fitness values, and I don't quite see how that can happen. Do you see how this could be possible?
-
Sorry, I did not have the time to look into this. Maybe the PhD thesis of Seppe vanden Broucke may help you:
http://seppe.net/seppe/Seppe vanden Broucke PhD.pdf
He wrote a whole section about shortcomings of the heuristic miner.
Btw:
Soon, there might be a new heuristic miner with some new features. Stay tuned.
-
@fmannhardt: thanks!