| Literature DB >> 32501316 |
Dana Pessach, Gonen Singer, Dan Avrahami, Hila Chalutz Ben-Gal, Erez Shmueli, Irad Ben-Gal.
Abstract
In this paper, we propose a comprehensive analytics framework that can serve as a decision support tool for HR recruiters in real-world settings in order to improve hiring and placement decisions. The proposed framework follows two main phases: a local prediction scheme for recruitments' success at the level of a single job placement, and a mathematical model that provides a global recruitment optimization scheme for the organization, taking into account multilevel considerations. In the first phase, a key property of the proposed prediction approach is the interpretability of the machine learning (ML) model, which in this case is obtained by applying the Variable-Order Bayesian Network (VOBN) model to the recruitment data. Specifically, we used a uniquely large dataset that contains recruitment records of hundreds of thousands of employees over a decade and represents a wide range of heterogeneous populations. Our analysis shows that the VOBN model can provide both high accuracy and interpretability insights to HR professionals. Moreover, we show that using the interpretable VOBN can lead to unexpected and sometimes counter-intuitive insights that might otherwise be overlooked by recruiters who rely on conventional methods. We demonstrate that it is feasible to predict the successful placement of a candidate in a specific position at a pre-hire stage and utilize predictions to devise a global optimization model. Our results show that in comparison to actual recruitment decisions, the devised framework is capable of providing a balanced recruitment plan while improving both diversity and recruitment success rates, despite the inherent trade-off between the two.
Keywords: Explainable artificial intelligence; Human resource analytics; Interpretable AI; Machine learning; Mathematical programming; Recruitment
Year: 2020 PMID: 32501316 PMCID: PMC7252110 DOI: 10.1016/j.dss.2020.113290
Source DB: PubMed Journal: Decis Support Syst ISSN: 0167-9236 Impact factor: 5.795
Fig. 1 Literature review based on the functional dimension.
Formulation notations.
| Input parameters |
|---|
| The set of candidates |
| The set of open positions (jobs) |
| Equals 1 if a candidate is assigned to a given position, and 0 otherwise |
| Probability that a candidate will succeed in a given position |
| Number of open jobs in a position |
| Value of a successful recruitment to a position |
| Set of candidate class types |
| Equals 1 if a candidate belongs to a given class, and 0 otherwise |
| Required proportion of workers of a given class |
| A parameter that balances the accuracy and demand objectives |
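To make the table concrete, the following is a minimal sketch of an assignment model built from these parameters. The symbols used here ($x_{ij}$ for the assignment indicator, $p_{ij}$ for the success probability, $d_j$ for the open jobs, $v_j$ for the position value, $a_{ik}$ for class membership, $PR_k$ for the required proportion, and $\lambda$ for the balancing parameter) are illustrative stand-ins for the rows above; the exact objectives and constraints are those of Formulation 1 and Formulation 2 in the paper, which may differ from this sketch.

```latex
\begin{align*}
\max_{x}\quad & \sum_{i}\sum_{j} p_{ij}\, v_{j}\, x_{ij}
              \;-\; \lambda \sum_{j}\Big(d_{j} - \sum_{i} x_{ij}\Big)
              && \text{(recruitment value vs.\ unmet demand)}\\
\text{s.t.}\quad
  & \sum_{j} x_{ij} \le 1 \quad \forall i
              && \text{(a candidate fills at most one position)}\\
  & \sum_{i} x_{ij} \le d_{j} \quad \forall j
              && \text{(no more hires than open jobs)}\\
  & \sum_{i} a_{ik}\, x_{ij} \ge PR_{k} \sum_{i} x_{ij} \quad \forall j,\, k
              && \text{(required class proportions, i.e.\ diversity)}\\
  & x_{ij} \in \{0,1\} \quad \forall i,\, j
\end{align*}
```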
Dimensions addressed by Formulation 1 and Formulation 2.
| Dimension view | Demand | Accuracy | Diversity |
|---|---|---|---|
| Position | √ | √ | |
| Organization - total value | √ | √ | √ |
| Organization - balance across business units | √ | √ | |
Fig. 2 Predicted probabilities of success of assigning sixteen candidates of two types of populations to four positions. The entries are color-coded by the success probability values, green - high probability, red - low probability. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 3 Assignment of candidates to positions by four different solutions. For example, solution 1 (marked in red) suggests the following: i) recruiting 4 candidates to position 1409; ii) recruiting 6 candidates to position 1509; iii) recruiting 6 candidates to position 379 (note that none of them are of type 1); and iv) not recruiting any of the candidates to position 40. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Illustrative example results. Entropy is used as a suitable measure for diversity in the case of more than two candidate types.
| Solution # | Description | Demand | | Diversity | | Average success probability |
|---|---|---|---|---|---|---|
| | | # of completely unassigned positions | | Minimum proportion of type 1 population | Average position entropy | |
| 1 | – | 1 | 6 | 0 | 0.413 | 0.756 |
| 2 | – | 0 | 2 | 0 | 0.703 | 0.711 |
| 3 | – | 0 | 2 | 0.25 | 0.858 | 0.701 |
| 4 | – | 0 | 3 | 0.33 | 0.939 | 0.667 |
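Since each solution is scored by average position entropy (diversity) and average success probability (accuracy), the following is a minimal sketch of how such scores can be computed for a given assignment. The data structures and function names are illustrative and are not the authors' code.

```python
import math

def position_entropy(type_counts):
    """Shannon entropy (base 2) of the candidate-type mix assigned to one position."""
    total = sum(type_counts)
    if total == 0:
        return 0.0
    probs = [c / total for c in type_counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)

def score_assignment(assignment, success_prob, candidate_type, n_types):
    """Average position entropy (diversity) and average success probability (accuracy).
    assignment: dict position -> list of candidate ids
    success_prob: dict (candidate, position) -> predicted success probability
    candidate_type: dict candidate -> type index in range(n_types)"""
    entropies, probs = [], []
    for pos, candidates in assignment.items():
        counts = [0] * n_types
        for c in candidates:
            counts[candidate_type[c]] += 1
            probs.append(success_prob[(c, pos)])
        entropies.append(position_entropy(counts))
    return sum(entropies) / len(entropies), sum(probs) / len(probs)
```

With two candidate types, the per-position entropy ranges from 0 (a single type) to 1 (a 50/50 mix), so a higher average position entropy corresponds to a more balanced assignment, consistent with the rising diversity and falling success probability across solutions 1 to 4.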
Feature summary (after data preparation procedures).
| Feature cluster | Lifestyle | Family | Interview and test scores | Special interview scores | Education | Position | Nationality | Language | Residence | Culture | Background record | Gender | Age |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # of Features | 62 | 30 | 29 | 14 | 7 | 5 | 5 | 5 | 2 | 2 | 1 | 1 | 1 |
| Avg. rank by GBM importance | 12 | 11 | 2 | 9 | 3 | 1 | 7 | 8 | 5 | 6 | 4 | 10 | 13 |
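As an illustration of how the last row of the feature summary could be produced, here is a minimal sketch that averages per-feature importance ranks within each feature cluster and then ranks the clusters. It assumes a fitted scikit-learn-style model exposing `feature_importances_`; the aggregation rule is one plausible reading of "Avg. rank by GBM importance", not necessarily the authors' exact computation.

```python
import numpy as np
from scipy.stats import rankdata

def cluster_importance_ranks(importances, feature_cluster):
    """importances: per-feature importances (e.g. model.feature_importances_).
    feature_cluster: list assigning each feature to a cluster name.
    Returns each cluster's rank (1 = most important), based on the average
    per-feature importance rank within the cluster."""
    feat_ranks = rankdata(-np.asarray(importances), method="average")  # 1 = most important feature
    clusters = sorted(set(feature_cluster))
    avg_rank = {c: feat_ranks[[fc == c for fc in feature_cluster]].mean() for c in clusters}
    order = rankdata([avg_rank[c] for c in clusters], method="ordinal")  # 1 = best average rank
    return dict(zip(clusters, order.astype(int)))
```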
Target definitions by HR department.
| Employment status | Completed expected time in position (Position dependent) | Reason for leaving | HR term | Target feature label |
|---|---|---|---|---|
| Left the organization | Yes | “Natural” reasons | Turnover | Successful recruitment |
| Left the organization | Yes | Negative reasons | Turnover | Unsuccessful recruitment |
| Left the organization | No | Negative reasons | Turnover | Unsuccessful recruitment |
| Employed in the organization | Yes | – | Retention | Successful recruitment |
| Employed in the organization | No | – | – | – |
| Employed in the organization | No | Promotion or job enrichment | Position change - promotion | Successful recruitment |
| Employed in the organization | No | Negative reasons | Position change - demotion | Unsuccessful recruitment |
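The table above is essentially a labeling rule. Below is a minimal sketch of one way to encode it; the field values, string labels, and the `None` return for the unresolved row are illustrative, not taken from the paper.

```python
def label_recruitment(status, completed_expected_time, reason):
    """Map HR record fields to the binary target used for training.
    status: 'left' or 'employed'; reason: category as in the table (illustrative strings)."""
    if status == "left":
        # Leaving for "natural" reasons after the expected time in position counts as success.
        if completed_expected_time and reason == "natural":
            return "successful"
        return "unsuccessful"
    # Still employed in the organization.
    if completed_expected_time:
        return "successful"
    if reason in ("promotion", "job enrichment"):
        return "successful"
    if reason == "negative":
        return "unsuccessful"
    return None  # too early to tell; excluded from the training set
```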
Evaluation of models.
| Model | Explainability/interpretability | AUC over all test samples | Number of positions with AUC > 0.7 | Average rank over all positions (1 = highest) |
|---|---|---|---|---|
| GBM (gradient boosting) | No | 0.730 | 174 | 3.07 |
| RF (random forest) | No | 0.719 | 164 | 3.39 |
| VOBN (variable-order Bayesian networks) | Yes | 0.705 | 145 | 3.78 |
| LR (logistic regression) | Partial | 0.700 | 129 | 4.30 |
| SVM (support vector machine) | No | 0.697 | 100 | 5.16 |
| C4.5 (J48) | Yes | 0.682 | 103 | 5.37 |
| CHAID | Yes | 0.681 | 105 | 5.12 |
| Naive Bayes | Yes | 0.677 | 80 | 5.81 |
| CART | Yes | 0.644 | 7 | 7.92 |
Note that both the RF and GBM models and their implementations are generally robust to noisy and high-dimensional datasets, since they base their decisions on multiple permutations of the dataset (see [56,66,67,68,69]). For the logistic regression and decision tree models, we implemented a feature-selection preprocessing step using information gain analysis (see [70]). For the SVM model, we used the built-in model as implemented in [71], which can deal with high dimensionality by testing different subsets of the data. In the VOBN model, there is a built-in preprocessing procedure that uses mutual information to identify the high-impact features (see Appendix A for further details).
We consider interpretable and non-interpretable models based on the classification presented in [72].
These results show the AUC for each position in the organization. The AUC scores were calculated over all the candidates that were recruited and placed in specific positions.
Out of 456 positions.
For each position, the compared algorithms were ranked by the AUC score; the values in this column represent the average rank of each algorithm over all positions. A lower rank implies a better average AUC score.
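A minimal sketch of how the per-position AUC and the average rank per algorithm could be computed, assuming scikit-learn and SciPy are available and that labels, scores, and position ids are NumPy arrays of equal length. The function names are illustrative and this is not the authors' evaluation code.

```python
import numpy as np
from scipy.stats import rankdata
from sklearn.metrics import roc_auc_score

def per_position_auc(y_true, scores, positions):
    """AUC per position for one model; positions holds the position id of each sample."""
    aucs = {}
    for pos in np.unique(positions):
        mask = positions == pos
        # AUC is undefined if a position has only one class among its hires.
        if len(np.unique(y_true[mask])) == 2:
            aucs[pos] = roc_auc_score(y_true[mask], scores[mask])
    return aucs

def average_ranks(model_scores, y_true, positions):
    """model_scores: dict model name -> predicted probabilities (same sample order).
    Returns the average rank of each model over positions (1 = best AUC)."""
    per_model = {m: per_position_auc(y_true, s, positions) for m, s in model_scores.items()}
    common = set.intersection(*(set(a) for a in per_model.values()))
    models = list(per_model)
    ranks = {m: [] for m in models}
    for pos in common:
        aucs = np.array([per_model[m][pos] for m in models])
        pos_ranks = rankdata(-aucs, method="average")  # higher AUC -> lower rank number
        for m, r in zip(models, pos_ranks):
            ranks[m].append(r)
    return {m: float(np.mean(r)) for m, r in ranks.items()}
```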
Fig. 4 High analytical score effect on administrator position dropout rate over various subpopulations.
Fig. 5 The effect of competencies on the position dropout rate for all positions and for a specific field support position.
Fig. 6 The relationship between poor-skill levels and position dropout rates for male candidates in different business units.
Fig. 7 The potential effect of the candidates' cultural background (A or B) on the dropout rate for all positions and for a specific administrative office position.
Fig. 8 The effect of the oral language score on turnover changes for specific subpopulations of candidates.
Results for a yearly plan.
| Solution | Method | Diversity requirement (PR) | Accuracy: average predicted success probability | Accuracy: standard deviation over the mean probabilities of all positions | Diversity: minimum proportion of type 1 population | Diversity: average position entropy | Diversity: mean difference in accuracy between candidate groups |
|---|---|---|---|---|---|---|---|
| 1.1 | Actual selection | – | 0.7087 | 0.1602 | 0 | 0.718 | 9.67% |
| 2.1 | 0 | 0.7654 | 0.1434 | 0.0057 | 0.619 | 7.43% | |
| 2.2 | 0.1 | 0.7653 | 0.1431 | 0.1003 | 0.653 | 7.33% | |
| 2.3 | 0.2 | 0.7648 | 0.1427 | 0.2 | 0.700 | 6.07% | |
| 2.4 | 0.3 | 0.7638 | 0.1422 | 0.3005 | 0.780 | 4.86% | |
| 2.5 | 0.4 | 0.7623 | 0.1414 | 0.4 | 0.810 | 3.69% | |
| 2.6 | 0.5 | 0.7607 | 0.1407 | 0.5 | 0.864 | 2.33% | |
Fig. 9 Pareto efficiency for a yearly plan of the real-world scenario.
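As an illustration of the Pareto view in Fig. 9, here is a minimal sketch of how Pareto-efficient solutions can be identified from the (average position entropy, average predicted success probability) pairs reported in the yearly-plan table above. The helper function and dictionary are illustrative.

```python
def pareto_front(points):
    """Return the points not dominated by any other point; each point is
    (diversity, accuracy), and larger is better on both axes."""
    return [p for p in points
            if not any(q != p and q[0] >= p[0] and q[1] >= p[1] for q in points)]

# (average position entropy, average predicted success probability) per solution,
# copied from the yearly-plan table above.
solutions = {
    "1.1 actual": (0.718, 0.7087),
    "2.1": (0.619, 0.7654), "2.2": (0.653, 0.7653), "2.3": (0.700, 0.7648),
    "2.4": (0.780, 0.7638), "2.5": (0.810, 0.7623), "2.6": (0.864, 0.7607),
}
front = pareto_front(list(solutions.values()))
efficient = {name for name, p in solutions.items() if p in front}
```

With these numbers, solutions 2.1 to 2.6 are all Pareto-efficient (each trades some success probability for more diversity), while the actual selection 1.1 is dominated, for example by solution 2.4, which is better on both axes.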
Notations.
| Notation | Description |
|---|---|
| L | Depth of the complete and balanced tree |
| R | Minimal frequency of samples per leaf required for statistical significance |
| s | Pattern, defined by the series of variables of the parent node |
| sb | Pattern defining the descendant leaf, given by the series of variables of the parent node |
| x | The value of the target variable; in our case, x ∈ X = {turnoverFalse, turnoverTrue} |
| X | Finite set of values of the target variable, X = {turnoverFalse, turnoverTrue} |
| – | Number of samples with the value x for a given pattern |
| – | The (ideal) code length difference between the descendant node sb and its parent node s |
| – | The estimated conditional probability for getting the value x given the descendant-leaf pattern sb |
| – | The estimated conditional probability for getting the symbol x given the parent pattern s |
| d | The size of the finite set X |
| C | The pruning constant, tuned to process requirements (with a default value) |
| t | The pattern size of an examined node (depth of leaf). |
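For context, the code length difference referred to in the table is, in variable-order / context-tree models of this kind, commonly computed as below. This is a sketch of the standard criterion rather than a verbatim reproduction of the paper's appendix, and the symbols $n(x \mid \cdot)$, $\hat{P}(x \mid \cdot)$, and $\Delta$ are written out here only to connect the rows of the table.

```latex
\Delta(sb) \;=\; \sum_{x \in X} n(x \mid sb)\,
\log\frac{\hat{P}(x \mid sb)}{\hat{P}(x \mid s)}
```

The descendant leaf sb is then kept only if this difference exceeds a threshold that grows with the pruning constant C, the alphabet size d, and the sample size, and if the leaf meets the minimal frequency R; otherwise it is pruned and the shorter parent pattern s is used.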