The fundamental notions behind diabetes mellitus and its types, causes, diagnosis, and significance in the prediction of DM. This technical work gives a short introduction to Data Mining and Machine Learning techniques for the prediction of diabetes mellitus. The different methodologies, techniques, and methods involved in predicting diabetes disease. Additionally, current advances in machine learning were highlighted, which have had a substantial influence on the identification and treatment of diabetes. The entire automatic disease prediction system utilizes data collection, noise removal, feature extraction, selection, and classification techniques. The described steps are examined in different authors perceptively to get the better knowledge about particular steps. The class imbalance problem on medical data and applied sampling technique is evaluated with different iteration which denotes the number of sampling process. Adaptive sampling technique is adopted for balancing the class in medical dataset. This improves the performance of the gradient boosting classifier that provides 97.78% accuracy.