Text this: A two-step learning artificial neural network for solving an imbalanced dataset problem in semiconductor manufacturing