Design and Performance Evaluation of Efficient Clustering Algorithms for Big Data Applications
, et
05 févr. 2025
À propos de cet article
Publié en ligne: 05 févr. 2025
Reçu: 22 sept. 2024
Accepté: 31 déc. 2024
DOI: https://doi.org/10.2478/amns-2025-0056
Mots clés
© 2025 Ping Dai et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Execution time comparison on seven data sets
Data sets | Size(MB) | Execution time(s) | |||
---|---|---|---|---|---|
K-means | ParCLARA | Par2PK-Means | Ours | ||
Taxi trajectory | 80 | 78 | 658 | 432 | 310 |
150 | 100 | 755 | 480 | 375 | |
310 | 190 | 892 | 578 | 400 | |
650 | 1120 | 1175 | 830 | 615 | |
1280 | 2365 | 1665 | 1350 | 1025 | |
2560 | - | 2735 | 2412 | 1875 | |
Iris | 80 | 30 | 610 | 300 | 210 |
150 | 55 | 690 | 375 | 275 | |
310 | 120 | 815 | 450 | 300 | |
650 | 680 | 1035 | 640 | 405 | |
1280 | 1750 | 1200 | 1001 | 765 | |
2560 | - | 2455 | 2015 | 1420 | |
Haberman’s survival | 80 | 50 | 576 | 312 | 295 |
150 | 65 | 677 | 400 | 310 | |
310 | 125 | 834 | 480 | 375 | |
650 | 715 | 981 | 675 | 425 | |
1280 | 2020 | 1325 | 1250 | 975 | |
2560 | - | 2445 | 2200 | 1725 | |
Ecoli | 80 | 35 | 620 | 350 | 281 |
150 | 60 | 715 | 400 | 326 | |
310 | 125 | 842 | 455 | 370 | |
650 | 755 | 1123 | 670 | 485 | |
1280 | 1925 | 1635 | 1230 | 765 | |
2560 | - | 2578 | 2210 | 1420 | |
Hayes-roth | 80 | 40 | 567 | 312 | 275 |
150 | 55 | 640 | 390 | 340 | |
310 | 120 | 725 | 470 | 385 | |
650 | 725 | 962 | 650 | 435 | |
1280 | 1985 | 1475 | 1200 | 735 | |
2560 | - | 2430 | 2100 | 1577 | |
Lenses | 80 | 30 | 620 | 300 | 295 |
150 | 62 | 705 | 385 | 325 | |
310 | 135 | 925 | 450 | 375 | |
650 | 720 | 1052 | 540 | 430 | |
1280 | 1890 | 1345 | 1080 | 815 | |
2560 | - | 2375 | 2085 | 1475 | |
Wine | 80 | 55 | 625 | 360 | 310 |
150 | 76 | 700 | 410 | 355 | |
310 | 125 | 834 | 460 | 400 | |
650 | 732 | 1000 | 600 | 425 | |
1280 | 2200 | 1345 | 1240 | 845 | |
2560 | - | 2564 | 2250 | 1625 |
Data structure
Name | Type | Remark |
---|---|---|
Id_Num | String | Id Number |
Create_Time | Date | Create (Upload) Time |
Lng | Double | Longitude |
Lat | Double | Latitude |