Design and Performance Evaluation of Efficient Clustering Algorithms for Big Data Applications
Feb 05, 2025
About this article
Published Online: Feb 05, 2025
Received: Sep 22, 2024
Accepted: Dec 31, 2024
DOI: https://doi.org/10.2478/amns-2025-0056
© 2025 Ping Dai et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Execution time comparison on seven data sets
Data set | Size (MB) | K-means (s) | ParCLARA (s) | Par2PK-Means (s) | Ours (s) |
---|---|---|---|---|---|
Taxi trajectory | 80 | 78 | 658 | 432 | 310 |
Taxi trajectory | 150 | 100 | 755 | 480 | 375 |
Taxi trajectory | 310 | 190 | 892 | 578 | 400 |
Taxi trajectory | 650 | 1120 | 1175 | 830 | 615 |
Taxi trajectory | 1280 | 2365 | 1665 | 1350 | 1025 |
Taxi trajectory | 2560 | - | 2735 | 2412 | 1875 |
Iris | 80 | 30 | 610 | 300 | 210 |
Iris | 150 | 55 | 690 | 375 | 275 |
Iris | 310 | 120 | 815 | 450 | 300 |
Iris | 650 | 680 | 1035 | 640 | 405 |
Iris | 1280 | 1750 | 1200 | 1001 | 765 |
Iris | 2560 | - | 2455 | 2015 | 1420 |
Haberman’s survival | 80 | 50 | 576 | 312 | 295 |
Haberman’s survival | 150 | 65 | 677 | 400 | 310 |
Haberman’s survival | 310 | 125 | 834 | 480 | 375 |
Haberman’s survival | 650 | 715 | 981 | 675 | 425 |
Haberman’s survival | 1280 | 2020 | 1325 | 1250 | 975 |
Haberman’s survival | 2560 | - | 2445 | 2200 | 1725 |
Ecoli | 80 | 35 | 620 | 350 | 281 |
Ecoli | 150 | 60 | 715 | 400 | 326 |
Ecoli | 310 | 125 | 842 | 455 | 370 |
Ecoli | 650 | 755 | 1123 | 670 | 485 |
Ecoli | 1280 | 1925 | 1635 | 1230 | 765 |
Ecoli | 2560 | - | 2578 | 2210 | 1420 |
Hayes-roth | 80 | 40 | 567 | 312 | 275 |
Hayes-roth | 150 | 55 | 640 | 390 | 340 |
Hayes-roth | 310 | 120 | 725 | 470 | 385 |
Hayes-roth | 650 | 725 | 962 | 650 | 435 |
Hayes-roth | 1280 | 1985 | 1475 | 1200 | 735 |
Hayes-roth | 2560 | - | 2430 | 2100 | 1577 |
Lenses | 80 | 30 | 620 | 300 | 295 |
Lenses | 150 | 62 | 705 | 385 | 325 |
Lenses | 310 | 135 | 925 | 450 | 375 |
Lenses | 650 | 720 | 1052 | 540 | 430 |
Lenses | 1280 | 1890 | 1345 | 1080 | 815 |
Lenses | 2560 | - | 2375 | 2085 | 1475 |
Wine | 80 | 55 | 625 | 360 | 310 |
Wine | 150 | 76 | 700 | 410 | 355 |
Wine | 310 | 125 | 834 | 460 | 400 |
Wine | 650 | 732 | 1000 | 600 | 425 |
Wine | 1280 | 2200 | 1345 | 1240 | 845 |
Wine | 2560 | - | 2564 | 2250 | 1625 |
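
One way to read the execution-time table is as a ratio of each baseline's time to the time of the proposed method ("Ours") at the same data size. The sketch below does this for the Taxi trajectory rows only, with the values copied from the table. The names `SIZES_MB`, `TIMES`, and `time_ratio` are illustrative and do not come from the article, and `None` stands in for the K-means entry reported as "-" at 2560 MB.

```python
# Execution times in seconds for the Taxi trajectory data set, copied from the table.
# None marks the 2560 MB run, where the table reports "-" for K-means.
SIZES_MB = [80, 150, 310, 650, 1280, 2560]
TIMES = {
    "K-means":      [78, 100, 190, 1120, 2365, None],
    "ParCLARA":     [658, 755, 892, 1175, 1665, 2735],
    "Par2PK-Means": [432, 480, 578, 830, 1350, 2412],
    "Ours":         [310, 375, 400, 615, 1025, 1875],
}


def time_ratio(baseline, size_mb):
    """Return (baseline time) / ("Ours" time) at one size, or None if not reported."""
    i = SIZES_MB.index(size_mb)
    base = TIMES[baseline][i]
    if base is None:
        return None
    return base / TIMES["Ours"][i]


for name in ("K-means", "ParCLARA", "Par2PK-Means"):
    ratios = [time_ratio(name, size) for size in SIZES_MB]
    # A ratio above 1 means "Ours" finished faster than the baseline at that size.
    print(name, ["-" if r is None else f"{r:.2f}" for r in ratios])
```

A ratio above 1 means the proposed method was faster at that size. For these rows, K-means is faster up to 310 MB (for example 78 s against 310 s at 80 MB) and slower from 650 MB onward (2365 s against 1025 s at 1280 MB), while both parallel baselines are slower than "Ours" at every listed size.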
Data structure
Name | Type | Remark |
---|---|---|
Id_Num | String | Id Number |
Create_Time | Date | Create (Upload) Time |
Lng | Double | Longitude |
Lat | Double | Latitude |
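
The record layout above can be sketched as a small Python type, presumably describing one point of the taxi trajectory data. This is a minimal illustration, not the authors' code: the class name `TrajectoryPoint`, the helper `parse_point`, the comma-separated input layout, the timestamp format, and the sample record are all assumptions. Only the four field names and their types come from the table.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class TrajectoryPoint:
    id_num: str            # Id_Num: ID number (String)
    create_time: datetime  # Create_Time: create (upload) time (Date)
    lng: float             # Lng: longitude (Double)
    lat: float             # Lat: latitude (Double)


def parse_point(line: str) -> TrajectoryPoint:
    """Parse one record written as Id_Num,Create_Time,Lng,Lat.

    The comma-separated layout and the timestamp format are assumptions
    made for this sketch; the article does not specify them.
    """
    id_num, created, lng, lat = line.strip().split(",")
    point = TrajectoryPoint(
        id_num=id_num,
        create_time=datetime.strptime(created, "%Y-%m-%d %H:%M:%S"),
        lng=float(lng),
        lat=float(lat),
    )
    # Reject coordinates outside the valid longitude/latitude ranges.
    if not (-180.0 <= point.lng <= 180.0 and -90.0 <= point.lat <= 90.0):
        raise ValueError(f"coordinates out of range: {line!r}")
    return point


# Made-up sample record, used only to show the call.
p = parse_point("T0001,2020-01-01 08:30:00,116.40,39.90")
print(p)
```

Keeping the record immutable (`frozen=True`) is a design choice for this sketch: clustering code typically reads points many times and never mutates them, so an accidental write is more likely a bug than an intended update.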