Gladden, Matthew E. “Factory Workers’ Daily Performance & Attrition: Data with rich causal relationships, for testing machine-learning approaches.” Dataset published on Kaggle.com, July 23, 2022.
This dataset can be downloaded from Kaggle.com.
Summary. This synthetic dataset was produced with version 0.3.15 of Synaptans WorkforceSim for distribution on Kaggle. It contains 18 months’ worth of daily performance and attrition data (411,948 observations) for a factory whose organizational structure comprises 508 workers. Due to employee turnover, a total of 687 persons appear in the dataset. The dataset’s observations cover both regular daily events (like workers’ attendance and daily level of Efficacy) and special one-time events (like accidents, an employee’s termination, or the onboarding of a new employee). A unique feature of the dataset is diverse causal relationships “hidden” within the data that are waiting to be uncovered through machine learning.