The entire pipeline is now complete: the program can collect data for all routes and train against all stops. Currently it only considers lat and lng, which is not enough to make a good prediction. Will probably continue improving the prediction, focusing on route 21.
Currently route 21 with stop 14907 is trained: http://hestia.nighsoft.net/septa_next_bus/predict.php?lat=39.955372&lng=-75.2000890&routeid=21&stopid=14907
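
For anyone who wants to query other coordinates, here is a rough sketch of how the request URL could be put together in Python. The parameter names (lat, lng, routeid, stopid) come straight from the link above; the helper name build_predict_url is made up for illustration, and the sketch only builds the URL. It does not assume anything about what the endpoint returns.

```python
from urllib.parse import urlencode

# Endpoint taken from the link above.
PREDICT_URL = "http://hestia.nighsoft.net/septa_next_bus/predict.php"


def build_predict_url(lat, lng, route_id, stop_id):
    """Hypothetical helper: build a predict.php URL from its four query parameters."""
    # Coordinates are passed as strings so their digits are preserved exactly.
    params = {
        "lat": lat,
        "lng": lng,
        "routeid": route_id,
        "stopid": stop_id,
    }
    return f"{PREDICT_URL}?{urlencode(params)}"


# Rebuild the example link for route 21, stop 14907.
url = build_predict_url("39.955372", "-75.2000890", 21, 14907)
print(url)
```

Other route and stop ids can be passed the same way, but only route 21 with stop 14907 is trained at the moment.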