Crash data reporting systems in fourteen Arab countries: challenges and improvement


  • Zahira Abounoas University-Saint Joseph (USJ), Faculty of engineering (ESIB), Beirut Author
  • Wassim Raphael University-Saint Joseph (USJ), Faculty of engineering (ESIB), Beirut Author
  • Yarob Badr University-Saint Joseph (USJ), Faculty of engineering (ESIB), Beirut, Lebanon United Nations Economic & Social Commission for West Asia (ESCWA), Beirut, Lebanon Author
  • Rafic Faddoul University-Saint Joseph (USJ), Faculty of engineering (ESIB), Beirut Author
  • Anne Guillaume Renault, LAB Laboratory for Accident Analysis, Paris Author



road accidents, road safety, information system, reporting system, variables selection, classification model


Traffic crash fatalities and serious injuries still represent a big burden for most Arab countries because the actual policies, strategies, and interventions are based on poorly collected data. Through this paper, we assessed the crash data reporting systems in Fourteen Arab countries via a survey conducted to identify the fundamental dysfunctions at the management and data collection levels. Then, to address some of the dataset problems, we had applied data mining technics to select a minimum of variables (crash, vehicle, and road user) that should be collected for a better understanding of crash circumstances. For this raison, three methods of selection (correlation, information gain, and gain ratio) and seven classifiers (naive Bayes, nearest neighbour, random forest, random tree, J48, reduced error pruning tree, and bagging) were tested and compared to identify the variables that affect significantly the crashes severity. Decision trees family of classifiers showed the best performance based on the analysis of the area under the curve. The explanatory variables obtained from the data mining process were combined with other descriptive variables to maintain traceability. As a result, we produced hybrid lists of variables for the crash, vehicle, and road user, each contains 25 variables. Finally, in order to propose a cost-effective solution to switch from manual to electronic data collection, we got inspired by a tool used to track animals to create and customize a unified e-form for handheld devices, in order to ensure easy entering of the harmonized data for the entire region based on our selected lists of variables. The tool verified the countries requirements especially by enabling data collection and transfer with and without the internet, and by allowing data analysis thought its built-in Geographic Information System (GIS) capabilities.


Abellán, J., & Castellano, J. G. (2017). A comparative study on base classifiers in ensemble methods for credit scoring. Expert Systems with Applications, 73, 1-10.

Almatawah, D. J. (2014). Towards Improving Crash Data Management System in Gulf Countries. 4(9), 6.

Atnafu, B., & Kaur, G. (2017). Analysis and Predict the Nature of Road Traffic Accident Using Data Mining Techniques in Maharashtra, India. 4(10), 10.

Beasley, R. E. (2020). Essential ASP.NET Web Forms Development: Full Stack Programming with C#, SQL, Ajax, and JavaScript. Apress.

Belgiu, M., & Drăguţ, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24-31.

Bellis, E. (2015). FARS Analytical User’s Manual. 592.

Bhondave, R., Kalbhor, M., Shinde, S., & Rajeswari, K. (2014). Improvement of Expectation Maximization Clustering using Select Attribute. 1.3, 500-503.

Bliss, T., & Breen, J. (2009). Country Guidelines for the Conduct of Road Safety Management Capacity Reviews and the Specification of Lead Agency Reforms, Investment Strategies and Safe System Projects. 329.

Castro, Y., & Kim, Y. J. (2016). Data mining on road safety: Factor assessment on vehicle accidents using classification models. International Journal of Crashworthiness, 21(2), 104-111.

Cuartas, K., Anzola, J., & Tarazona, G. (2015). Classification Methodology of Research Topics Based In Decision Trees: J48 And Randomtree. 8, 19413-19424.

CyberTracker. (2020). CyberTracker GPS Field Data Collection System-Home.

Dababneh, A., Fouad, R. H., & Majeed, A. J. H. (2018). Assessment of Occupational Safety and Health Performance Indicators for Jordan. 8.

Dadashova, B., Arenas-Ramírez, B., Mira-McWilliams, J., & Aparicio-Izquierdo, F. (2016). Methodological development for selection of significant predictors explaining fatal road accidents. Accident Analysis & Prevention, 90, 82-94.

Howard, C., & Linder, A. (2014). Review of Swedish experiences concerning analysis of people injured in traffic accidents.

Hussain, S., Abdulaziz Dahan, N., Ba-Alwi, F. M., & Ribata, N. (2018). Educational Data Mining and Analysis of Students’ Academic Performance Using WEKA. Indonesian Journal of Electrical Engineering and Computer Science, 9(2), 447.

Jha, A. N., Tiwari, G., & Chatterjee, N. (2020). Road Accidents in EU, USA and India: A critical analysis of Data Collection Framework. In P. K. Kapur, O. Singh, S. K. Khatri, & A. K. Verma (Eds.), Strategic System Assurance and Business Analytics (pp. 419-443). Springer Singapore.

Kalmegh, S. (2015). Analysis of WEKA Data Mining Algorithm REPTree, Simple Cart and RandomTree for Classification of Indian News. 2(2), 9.

Kang, K., & Michalak, J. (2018). Enhanced version of AdaBoostM1 with J48 Tree learning method. 4.

Khallaf, R. I., & Yasseen, A. Y. (2016). Improvement of Road Safety Within the Oil and Gas Industry and its Effect on the Community-Case Study. SPE International Conference and Exhibition on Health, Safety, Security, Environment, and Social Responsibility. SPE International Conference and Exhibition on Health, Safety, Security, Environment, and Social Responsibility, Stavanger, Norway.

Kumar, K., & Singh, J. (2016). Network Intrusion Detection with Feature Selection Techniques using Machine-Learning Algorithms. International Journal of Computer Applications, 150(12), 1-13.

Laaraj, N., Boutahari, S., & Jawab, F. (2018). The Road Safety Information Systems Appropriate to the Systemic Approach: The Case of Morocco. 2018 IEEE 5th International Congress on Information Science and Technology (CiSt), 74-79.

Liebenberg, L. (1999). Rhino Tracking with the CyberTracker Field Computer. 27, 59-61.

Mehmood, A., Taber, N., Bachani, A. M., Gupta, S., Paichadze, N., & Hyder, A. A. (2019). Paper Versus Digital Data Collection for Road Safety Risk Factors: Reliability Comparative Analysis From Three Cities in Low- and Middle-Income Countries. Journal of Medical Internet Research, 21(5), e13222.

Montella, A., Chiaradonna, S., Criscuolo, G., & De Martino, S. (2017). Perspectives of a web-based software to improve crash data quality and reliability in Italy. 2017 5th IEEE International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), 451-456.

O’Donnell, C., O’Malley, M., Lynch, D., & Mullins, E. (2019). WESPAS Cruise Report.

Patil, M. D., & Sane, D. S. S. (2014). Effective Classification after Dimension Reduction: A Comparative Study. 4(7), 4.

Ramya, S., Reshma, Sk., Manogna, V. D., Saroja, Y. S., & Gandhi, G. S. (2019). Accident Severity Prediction Using Data Mining Methods. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 528-536.

Sarraj, Y. (2016). Developing Road Accidents Recording System in Palestine. 30, 188-204.

Singh, A., N., M., & Lakshmiganthan, R. (2017). Impact of Different Data Types on Classifier Performance of Random Forest, Naïve Bayes, and K-Nearest Neighbors Algorithms. International Journal of Advanced Computer Science and Applications, 8(12).

Spanu, V., & McCall, M. K. (2013). Eliciting Local Spatial Knowledge for Community-Based Disaster Risk Management: Working with Cybertracker in Georgian Caucasus. International Journal of E-Planning Research (IJEPR), 2(2), 45-59.

SWOV. (2016). Data_sources.pdf.

Tharwat, A. (2018). Classification assessment methods. Applied Computing and Informatics.

Theofilatos, A., Chen, C., & Antoniou, C. (2019). Comparing Machine Learning and Deep Learning Methods for Real-Time Crash Prediction. Transportation Research Record: Journal of the Transportation Research Board, 2673(8), 169–178.

Traffic Accident Statistics as of 1438 H - Traffic Accident Statistics as of 1438 H.xls-Saudi Open Data. (2020).

Trafikverket Swedish Traffic administration. (2019). Data Collection [Text]. Trafikverket.

UK department of transport. (2013). Reported Road Casualties in Great Britain: Guide to the statistics and data sources.

Vinutha, H. P., & Poornima, B. (2018). An Ensemble Classifier Approach on Different Feature Selection Methods for Intrusion Detection. In V. Bhateja, B. L. Nguyen, N. G. Nguyen, S. C. Satapathy, & D.-N. Le (Eds.), Information Systems Design and Intelligent Applications (Vol. 672, pp. 442–451). Springer Singapore.

Wang, R., & Tang, K. (2009). Feature Selection for Maximizing the Area Under the ROC Curve. 2009 IEEE International Conference on Data Mining Workshops, 400-405.

Wang, S., Li, D., Petrick, N., Sahiner, B., Linguraru, M. G., & Summers, R. M. (2015). Optimizing area under the ROC curve using semi-supervised learning. Pattern Recognition, 48(1), 276–287.

Wilson, J. (2018). Universal screening with automated essay scoring: Evaluating classification accuracy in grades 3 and 4. Journal of School Psychology, 68, 19-37.

World Health Organization. (2013). Global status report on road safety 2013 supporting a decade of action. World Health Organization.

World Health Organization, issuing body, & ProQuest (Firm). (2018). Global status report on road safety 2018.

Zhang, H., Rangrej, J., Rais, S., Hillmer, M., Rudzicz, F., & Malikov, K. (2019). Categorizing Emails Using Machine Learning with Textual Features. In M.-J. Meurs & F. Rudzicz (Eds.), Advances in Artificial Intelligence (Vol. 11489, pp. 3-15). Springer International Publishing.

Żukowska, J. (2015). Regional implementation of a road safety observatory in Poland. Archives of Transport, 36(4), 77-85,






Original articles

How to Cite

Abounoas, Z., Raphael, W., Badr, Y., Faddoul, R., & Guillaume, A. (2020). Crash data reporting systems in fourteen Arab countries: challenges and improvement. Archives of Transport, 56(4), 73-88.


Most read articles by the same author(s)

1 2 3 4 5 6 7 8 9 10 > >> 

Similar Articles

1-10 of 418

You may also start an advanced similarity search for this article.