- Look into validation set and epochs - Look into other ways of data splitting other than random split - Check inbuilt sklearn datasplit function as well