Design and Performance Evaluation of Efficient Clustering Algorithms for Big Data Applications
05 Feb 2025
About this article
Published online: 05 Feb 2025
Received: 22 Sep 2024
Accepted: 31 Dec 2024
DOI: https://doi.org/10.2478/amns-2025-0056
© 2025 Ping Dai et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.

Execution time comparison on seven data sets
Data set | Size (MB) | K-means (s) | ParCLARA (s) | Par2PK-Means (s) | Ours (s) |
---|---|---|---|---|---|
Taxi trajectory | 80 | 78 | 658 | 432 | 310 |
Taxi trajectory | 150 | 100 | 755 | 480 | 375 |
Taxi trajectory | 310 | 190 | 892 | 578 | 400 |
Taxi trajectory | 650 | 1120 | 1175 | 830 | 615 |
Taxi trajectory | 1280 | 2365 | 1665 | 1350 | 1025 |
Taxi trajectory | 2560 | - | 2735 | 2412 | 1875 |
Iris | 80 | 30 | 610 | 300 | 210 |
Iris | 150 | 55 | 690 | 375 | 275 |
Iris | 310 | 120 | 815 | 450 | 300 |
Iris | 650 | 680 | 1035 | 640 | 405 |
Iris | 1280 | 1750 | 1200 | 1001 | 765 |
Iris | 2560 | - | 2455 | 2015 | 1420 |
Haberman’s survival | 80 | 50 | 576 | 312 | 295 |
Haberman’s survival | 150 | 65 | 677 | 400 | 310 |
Haberman’s survival | 310 | 125 | 834 | 480 | 375 |
Haberman’s survival | 650 | 715 | 981 | 675 | 425 |
Haberman’s survival | 1280 | 2020 | 1325 | 1250 | 975 |
Haberman’s survival | 2560 | - | 2445 | 2200 | 1725 |
Ecoli | 80 | 35 | 620 | 350 | 281 |
Ecoli | 150 | 60 | 715 | 400 | 326 |
Ecoli | 310 | 125 | 842 | 455 | 370 |
Ecoli | 650 | 755 | 1123 | 670 | 485 |
Ecoli | 1280 | 1925 | 1635 | 1230 | 765 |
Ecoli | 2560 | - | 2578 | 2210 | 1420 |
Hayes-roth | 80 | 40 | 567 | 312 | 275 |
Hayes-roth | 150 | 55 | 640 | 390 | 340 |
Hayes-roth | 310 | 120 | 725 | 470 | 385 |
Hayes-roth | 650 | 725 | 962 | 650 | 435 |
Hayes-roth | 1280 | 1985 | 1475 | 1200 | 735 |
Hayes-roth | 2560 | - | 2430 | 2100 | 1577 |
Lenses | 80 | 30 | 620 | 300 | 295 |
Lenses | 150 | 62 | 705 | 385 | 325 |
Lenses | 310 | 135 | 925 | 450 | 375 |
Lenses | 650 | 720 | 1052 | 540 | 430 |
Lenses | 1280 | 1890 | 1345 | 1080 | 815 |
Lenses | 2560 | - | 2375 | 2085 | 1475 |
Wine | 80 | 55 | 625 | 360 | 310 |
Wine | 150 | 76 | 700 | 410 | 355 |
Wine | 310 | 125 | 834 | 460 | 400 |
Wine | 650 | 732 | 1000 | 600 | 425 |
Wine | 1280 | 2200 | 1345 | 1240 | 845 |
Wine | 2560 | - | 2564 | 2250 | 1625 |
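
To make the comparison easier to read, the following minimal sketch computes how much execution time the proposed method ("Ours") saves relative to Par2PK-Means on the Taxi trajectory data set. The numbers are copied from the table above; the function and variable names are illustrative assumptions and do not come from the article.

```python
# Execution times in seconds for the Taxi trajectory data set,
# copied from the execution time table (one entry per data size).
SIZES_MB = [80, 150, 310, 650, 1280, 2560]
PAR2PK_MEANS_S = [432, 480, 578, 830, 1350, 2412]
OURS_S = [310, 375, 400, 615, 1025, 1875]


def relative_reduction(baseline: float, candidate: float) -> float:
    """Return the fraction of the baseline time saved by the candidate."""
    return (baseline - candidate) / baseline


def reductions(baseline: list, candidate: list) -> list:
    """Apply relative_reduction pairwise to two equally long lists."""
    return [relative_reduction(b, c) for b, c in zip(baseline, candidate)]


if __name__ == "__main__":
    for size, saved in zip(SIZES_MB, reductions(PAR2PK_MEANS_S, OURS_S)):
        print(f"{size:>5} MB: {saved:.1%} less time than Par2PK-Means")
```

The same helper can be applied to any other pair of columns, with the caveat that the K-means column has no reported value at 2560 MB and so cannot be compared at that size.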
Data structure
Name | Type | Remark |
---|---|---|
Id_Num | String | ID number |
Create_Time | Date | Creation (upload) time |
Lng | Double | Longitude |
Lat | Double | Latitude |
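
As an illustration of the record layout in the table above, here is a minimal Python sketch of one trajectory point. The field names mirror the table; the class name, the parsing helper, the ISO 8601 timestamp format, the coordinate range checks, and the sample values are all assumptions made for this example and are not taken from the article.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class TrajectoryPoint:
    """One record with the four fields listed in the data structure table."""
    id_num: str            # Id_Num: ID number
    create_time: datetime  # Create_Time: creation (upload) time
    lng: float             # Lng: longitude in degrees
    lat: float             # Lat: latitude in degrees

    def __post_init__(self):
        # Assumed sanity checks: reject coordinates outside the valid ranges.
        if not -180.0 <= self.lng <= 180.0:
            raise ValueError(f"longitude out of range: {self.lng}")
        if not -90.0 <= self.lat <= 90.0:
            raise ValueError(f"latitude out of range: {self.lat}")


def parse_point(row: dict) -> TrajectoryPoint:
    """Build a TrajectoryPoint from a raw row keyed by the table's field names.

    Assumes Create_Time is an ISO 8601 string; the actual storage format
    is not specified in the source.
    """
    return TrajectoryPoint(
        id_num=str(row["Id_Num"]),
        create_time=datetime.fromisoformat(row["Create_Time"]),
        lng=float(row["Lng"]),
        lat=float(row["Lat"]),
    )


if __name__ == "__main__":
    # Hypothetical sample row, for illustration only.
    sample = {
        "Id_Num": "T0001",
        "Create_Time": "2024-09-22T08:30:00",
        "Lng": "116.397",
        "Lat": "39.908",
    }
    print(parse_point(sample))
```

Only the longitude and latitude fields would feed a distance-based clustering step; the ID and timestamp serve to group and order the points.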