Exemplary embodiments of the present disclosure are directed towards a method for parallel activity scheduling for large data sets. The method comprises receiving input variables from a respective project by a project scheduling problem library (PSPLIB) data module, and finding the relevant relational project data by a multi-project relationship module so that the data can be processed efficiently, the search space being minimized based on a probability of correlation between the project activities.
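
The search-space reduction described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the function name `related_pairs`, the dictionary of pairwise correlation probabilities, the activity identifiers, and the 0.5 threshold are all hypothetical, chosen only to show how keeping the activity pairs that are likely to be correlated shrinks the set of candidates a scheduler has to consider.

```python
from itertools import combinations


def related_pairs(activities, correlation, threshold=0.5):
    """Return activity pairs whose correlation probability meets the threshold.

    `correlation` maps an (activity, activity) tuple, ordered alphabetically,
    to an assumed probability; pairs missing from the map are treated as
    uncorrelated (probability 0.0). All names and values are illustrative.
    """
    kept = []
    for a, b in combinations(sorted(activities), 2):
        p = correlation.get((a, b), 0.0)
        if p >= threshold:
            kept.append((a, b, p))
    return kept


# Hypothetical activities drawn from two projects, A and B.
activities = ["A1", "A2", "B1", "B2"]
correlation = {
    ("A1", "A2"): 0.6,
    ("A1", "B1"): 0.9,
    ("A2", "B2"): 0.2,
}

pairs = related_pairs(activities, correlation)
total = len(list(combinations(activities, 2)))
print(f"kept {len(pairs)} of {total} candidate pairs")
```

In this toy input, four activities give six candidate pairs, and only the two pairs at or above the assumed threshold remain for further processing.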