The bacterial count of
Pseudomonas spp. in the unit of log CFU was defined as the main objective function considering the entire dataset categorised into numerical and categorical values for each record ID. The parameters “time”, “temperature”, “NaCl concentration”, “water activity”, and “CO
2 concentration” are numerical data. The microbial counts (log CFU/g) at 0 h were determined as the initial count of
Pseudomonas spp. for each record ID. To separate initial counts from others, data belonging to a time of 0 (h) were coded as 0, and other data were coded as 1. Through this process, the information on the initial count of
Pseudomonas spp. was also converted to numerical data. The parameters, vacuum condition (yes/no), food category (beef, pork, poultry, and culture medium), and food name (minced beef, pork, raw meat lombo, turkey, brain heart infusion broth “BHIB”, and several kinds of tryptic soy broth “TSB”), are categorical data and were kept as is. These variables were not transformed into numerical values, and they were directly used for predictions to avoid the possibility that the machine learning algorithms can create bias in the encoded variables by assuming that higher numbers are more important. The pre-processing steps were performed using
Matlab 8.3.0.532 (R2014a) software (MathWorks Inc., Natick, MA, USA).
Tarlak F, & Yücel Ö. (2023). Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods. Life, 13(7), 1430.