PhD position on Stream Mining for Real Time Compliance Checking

In the context of the EU H2020 project BPR4GDPR (Business Process Re-engineering and functional toolkit for GDPR compliance), a PhD position is open at the Process Analytycs (PA) group in TU/e’s Department for Mathematics and Computer Science in the domain of Stream Process Mining.

Position PhD-student
Department(s) Department of Mathematics & Computer Science
Institutes Data Science Center Eindhoven
FTE 1,0
Date off 31/07/2018
Reference number V32.3350

The broader scope of the BPR4GDPR project

In the last two decades the focus on process-orientation (e.g., process-aware information systems or BPM systems) has increased, while, with the incredible growth of event data (cf. “Big Data”), it has become possible to use process mining, i.e., a posteriori analysis technique exploiting the information recorded in event logs, to discover models and check the conformance of existing ones. Indeed, most organisations have very limited knowledge about the reality happening throughout their day-to-day operation; process mining focuses on this kind of problem, with a view to assessing the organisational reality and reduce the gap between what is supposed to happen and what actually happens. The key facets of process mining are discovery, monitoring and improvement of real processes by extracting knowledge from the organisation’s available data. Previous research has pointed large discrepancies between the idealized model and the process in reality. Moreover, process mining has shown that different models are possible for different and particular views on the process at hand.

The goal of BPR4GDPR is to support the implementation of a Privacy-Aware Process Mining Framework, seeking to meet requirements related to: transparency, being able to discover and integrate interpretable business procedures into a process model, i.e., to generate process models reflecting, as precisely as possible, an organisation’s current modus operandi; compliance, automatically identifying “business rules” for different perspectives; and accountability, spotting non-conformant executions. While checking the conformance between a process model and events in reality, two main concepts should be considered: real-time data and concept drift.

Additionally, advanced stream mining techniques will be implemented within the framework of this project to support real-time, single-passage mining of sensitive data such that businesses can fulfill the new requirements of GDPR (General Data Protection Regulation).

Privacy-Aware Stream Mining

For the position, the PhD candidate is expected to work on stream process and data mining and will be supervised by dr.ing. Marwan Hassani and dr.ir. Boudewijn van Dongen. In the domains of today’s evolving, IoT-based organisations, business rules (forming the de jure model) can easily get less fulfilled over the time. This is observed through deviations of the real process (reflecting the de facto model) as collected from the event data from the de jure model. Such deviations can be either intended or accidental. In both cases, one is interested in a real-time detection of these concept drifts and in an online reporting of their severity. In a similar setting, new requirements may arise, such that the process model discovered from the underlying event data will be outdated and in need to be continuously updated/adapted.

Furthermore, to respect the requirements of EU’s GDPR (put into effect from the end of May 2018), businesses are not allowed to store user sensitive data unless clearly authorized by end users. Even in cases when an authorization is obtained, users will always keep the right of their profile data “to be forgotten”.

Position

In the light of the above description, both (i) stream process mining and (ii) stream data mining solutions should be provided during the period of this position. The former should define concept drift detection and stream process mining novel techniques that update the statistics about the data-related behaviours. They should establish patterns to efficiently integrate recent information to improve the model with continuous update of discovered models and to report concept drifts from a given reference model or from the previously discovered models. The latter should advance efficient, single-passage privacy-aware stream data mining approaches that are able to extract useful insights of the data on the fly without compromising any privacy regulations.

The Analytics for Information Systems (AIS) group provides its long running expertise and experience across all challenges of BPM, stream data mining and process mining.

Requirements

We are looking for a candidate that meets the following requirements:

  • a solid background in Computer Science, Data Science, Machine Learning, Mathematics or Information Systems (demonstrated by a relevant Master);
  • has a strong background in at least one of the following domains: data mining, machine learning, process mining, data privacy, and/or databases (in particular data modeling and query construction);
  • has a strong interest in streaming data mining and data science research;
  • has the ability to realize research ideas in terms of prototype software, so software development skills are needed.
  • is highly motivated, rigorous, and disciplined when developing algorithms and software according to high quality standards;
  • good communication skills in English, both in speaking and in writing (candidates from non-Dutch or non-English speaking countries should be prepared to prove their English language skills);
  • possesses good communication capabilities and be an efficient team worker.

A PhD candidate is expected to:

  • perform scientific research in the domain described
  • collaborate with other researchers in this project and be ready to perform project-related business travels
  • present results at (international) conferences
  • publish results in scientific journals
  • participate in activities of the group and department
  • assist in teaching undergraduate/graduate courses
  • participate in doctoral training on relevant topics

Conditions of employment

We offer:

  • A full-time temporary appointment for a period of 4 years, with an intermediate evaluation after 9 months;
  • A gross salary of € (2.222) per month in the first year increasing up to € (2.840) in the fourth year;
  • Strong collaboration ties with several research groups in Europe and world-wide
  • Healthy travel funding for presenting your work at the leading conferences, for project meetings with multiple European partners and for visiting research.
  • Support for your personal development and career planning including courses, summer schools, conference visits etc.;
  • A broad package of fringe benefits (e.g. excellent technical infrastructure, child daycare, excellent sports facilities, extra holiday allowance [8%, May], and end-of-year bonus [8.3%, December]).

More information

Application

The application should consist of the following parts:

  • Cover letter explaining your motivation and qualifications for the position (the letter should also show an understanding of process mining and the work done within AIS, see websites such as www.processmining.org and the book “Process Mining: Discovery, Conformance and Enhancement of Business Processes”);
  • Detailed Curriculum Vitae;
  • A copy or a link to your MSc thesis. If you have not completed it yet, please explain your current situation;
  • List of courses taken at the Bachelor and Master level including marks;
  • List of publications and software artefacts developed (if applicable);
  • Names of at least three referees.

The selection process with start in June 2018 and will continue until the position gets filled. The position is fully funded and immediately available. The successful candidates are expected to start ASAP.

Leave a Reply