11
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media that have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of smart societies. One such major sector is road transportation, which is the backbone of modern economies and costs globally 1.25 million deaths and 50 million human injuries annually. The cutting-edge on big data-enabled social media analytics for transportation-related studies is limited. This paper brings a range of technologies together to detect road traffic-related events using big data and distributed machine learning. The most specific contribution of this research is an automatic labelling method for machine learning-based traffic-related event detection from Twitter data in the Arabic language. The proposed method has been implemented in a software tool called Iktishaf+ (an Arabic word meaning discovery) that is able to detect traffic events automatically from tweets in the Arabic language using distributed machine learning over Apache Spark. The tool is built using nine components and a range of technologies including Apache Spark, Parquet, and MongoDB. Iktishaf+ uses a light stemmer for the Arabic language developed by us. We also use in this work a location extractor developed by us that allows us to extract and visualize spatio-temporal information about the detected events. The specific data used in this work comprises 33.5 million tweets collected from Saudi Arabia using the Twitter API. Using support vector machines, naïve Bayes, and logistic regression-based classifiers, we are able to detect and validate several real events in Saudi Arabia without prior knowledge, including a fire in Jeddah, rains in Makkah, and an accident in Riyadh. The findings show the effectiveness of Twitter media in detecting important events with no prior knowledge about them.

          Related collections

          Most cited references89

          • Record: found
          • Abstract: not found
          • Article: not found

          Sentiment Analysis and Opinion Mining

          Bing Liu (2012)
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The role of big data in smart city

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A Survey of Techniques for Event Detection in Twitter

                Bookmark

                Author and article information

                Contributors
                Role: Academic Editor
                Journal
                Sensors (Basel)
                Sensors (Basel)
                sensors
                Sensors (Basel, Switzerland)
                MDPI
                1424-8220
                24 April 2021
                May 2021
                : 21
                : 9
                : 2993
                Affiliations
                [1 ]Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia; EAlomari0011@ 123456stu.kau.edu.sa (E.A.); IAKatib@ 123456kau.edu.sa (I.K.); aaalbeshri@ 123456kau.edu.sa (A.A.)
                [2 ]School of Architecture and Built Environment, Queensland University of Technology, 2 George Street, Brisbane 4000, QLD, Australia; tan.yigitcanlar@ 123456qut.edu.au
                [3 ]School of Technology, Federal University of Santa Catarina, Campus Universitario, Trindade, Florianópolis 88040-900, SC, Brazil
                [4 ]High Performance Computing Center, King Abdulaziz University, Jeddah 21589, Saudi Arabia
                Author notes
                [* ]Correspondence: RMehmood@ 123456kau.edu.sa
                Author information
                https://orcid.org/0000-0003-3796-0294
                https://orcid.org/0000-0001-7262-7118
                https://orcid.org/0000-0002-4997-5322
                Article
                sensors-21-02993
                10.3390/s21092993
                8123223
                33923247
                1e524609-3ce8-4413-9246-c5a0489fd794
                © 2021 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( https://creativecommons.org/licenses/by/4.0/).

                History
                : 19 March 2021
                : 21 April 2021
                Categories
                Article

                Biomedical engineering
                smart cities,big data,event detection,road traffic,distributed machine learning,automatic labeling,social media,data analytics,social media analytics,arabic tweets

                Comments

                Comment on this article