شناسایی و حذف داده‌های پرت و تکراری درشبکه‌های حسگربی‌سیم با‌استفاده از تکنیک‌های طبقه‌بندی (مطالعه موردی: داده‌های هواشناسی کاشان)

نوع مقاله : مقاله پژوهشی

نویسندگان

1 استادیار، گروه کامپیوتر و فناوری اطلاعات، واحد کاشان ، دانشگاه آزاد اسلامی، کاشان، ایران

2 کارشناسی ارشد، گروه کامپیوتر و فناوری اطلاعات، واحد کاشان ، دانشگاه آزاد اسلامی، کاشان، ایران

چکیده

شبکه­های حسگربی­سیم نسل جدیدی از شبکه­ها هستند که درواقع از تعداد زیادی گره­های حسگر پراکنده در محیط تشکیل شده­اند و هرکدام به­طورخودمختار و با همکاری سایرگره­ها هدف خاصی را دنبال می­کنند. ازآن­جایی­که انتقال داده به­عنوان یکی ازمهم­ترین عملیات مصرف انرژی درشبکه­های حسگر بی­سیم می­باشد، بنابراین کاهش میزان انتقال داده در این شبکه­ها منجربه کاهش مصرف انرژی و درنتیجه طول­عمر بیش­تر این شبکه­ها می­گردد. روش­هایی که برای کاهش میزان انتقال داده در شبکه­حسگربی­سیم باهدف ذخیره انرژی، اغلب استفاده می­شود شامل تجمع داده­ها ، پالایش و پیش­بینی داده­ها می­باشد. ازسوی دیگر، استفاده از پروتکل­های­مسیریابی مناسب می­تواننددرحفظانرژیمصرفی گره­هانقشمؤثریداشته­باشند. دراین پژوهش سعی شده از طریق شناسایی داده­های پرت توسط برخی از تکنیک­های طبقه­بندی و پروتکل­های مسیریابی سلسله­مراتبی ، انرژی مصرفی درکل شبکه کاهش یابد. این آزمایشات روی داده­های حسگرهای ایستگاه خودکار هواشناسی کاشان انجام شد. نتایج نشان­می­دهد که با به­کارگرفتن تکنیک ماشین بردارپشتیبان با کرنل چندجمله­ای از درجه­ی 3 و با ضریب تنظیم­گر 10 درشبکه­ی حسگر پیشنهادی می­توان حدود 92%  داده­ها را درمدت 2 ثانیه به­درستی شناسایی نموده و بنابراین با فیلتر کردن داده های پرت و داده های تکراری و ارسال داده جدید و صحیح ، به صرفه­جویی انرژی و درنتیجه افزایش طول­عمر شبکه کمک نمود.

کلیدواژه‌ها


عنوان مقاله [English]

Detection and removal of outliers and redundant data in wireless sensor networks by classification technigues (Case Study: Weather data of Kashan)

نویسندگان [English]

  • Mehdi Esmaeli 1
  • Azimeh Sharif 2
چکیده [English]

Abstract
Wireless sensor networks are new generation of networks which consists of a large number of sensor nodes distributed in environment and each follows special porpuse independently with cooperation of other nodes. where as data transmission is one of the most important operation in wireless sensor networks, so reduction of data in these networks causes reduction in energy consumption and accordingly increase networks’ lifetime. The frequently used methods to reduce the amount of data transmissions in WSNs for power saving include the data aggregation , and the data prediction and filtering. On the other hand, usage of suitable routing protocols has important role in protection of nods, energy consumption.
In this research it was tried to reduce energy consumption in whole of the network by some of classification techniques and hierarchical routing protocols. these experiences was done on sensor data of automatic weather station of kashan . The results show by appling support vector machine technique with polynomial kernel of 3 degree, and with adjustment coefficient of 10 in the proposed sensor network, accurately about %92 of data is detected in 2 second, and then by filtering outliers and redundant data as well as sending new and correct data, helped to save energy and increase of network lifetime.

Introduction
Wireless sensor networks consist of a large number of sensor nodes distributed in environment randomly.so location of these nodes isn,t determined or clear that cause to leave them in dangerous or inaccessible places. on the other hand , every sensor nodes operate independently and also are able of cooperating with other sensor nodes to achieve the purpose. in wireless sensor networks , the energy issue and optimum usage is one of the most important challenges encounter with these networks , because wireless sensor nodes are usually feed with battery and network,s lifetime depends on battery lifetime. differend methods have been proposed for wireless sensor networks that the most importants are: decrease of data transmission,usage of routing protocols, clustering in wireless sensor networks .a suitable method and solution for encountering with energy depends on variety of network and usage.
Data transmission in wireless sensor networks has been identified as one of the most energy consumption operations [1],as well battery lifetime decrease considerably by data transmission in sensor node, one way to reduce data transmission ,is outlier,s detection, in order to prevent their transmission in network.
In wireless sensor networks,data that digressed from normal patterns,have been detected as outlier and they are important because they show significant abnormalcy. after detecting outliers depending on system decision, outliers can be removed or saved in sensor memory to evaluate more and totally prevent their transmission in network which decrease energy consumption in the whole network. on the other hand , if distance of sensor nodes is small when an event occurs in a region , it will be sense by several nodes and they send the same massage. also in many monitoring applications, wireless sensor network like aerology that sensors sense data from environment sequentially, since environmental data like temperature and other parameters change slowly it may be similar sequential senses of one node and cause sending redundant data , that in addition to aggregation in network, causes great decrease of energy. so by preventing creation of redundant data of filtering can decrease energy consumption in the whole network .
Purpose of this research , is usage of common classification techniques and acquiring optimal values for its parameters to form a model to detect outlier with high speed and accuracy. in this research was tried to reduce energy consumption in the whole network by detecting sensed outliers and redundant data by sensors and preventing their transmission in wireless sensor network that use backboneformation for multi-hop transmission.
In second part history of research will be reviewed. in third part program and necessary algorithms will be checked. in forth part proposed structure of wireless sensor network will be introduced. in fifth and sixth parts evaluation and results will be mentioned as well as next activities in order .

Review of research history
Recently , most of wireless sensor networks in real world were used to collecting valuable raw data. in analizing these data the main step is recognization and remove of abnormal data ( outlier , noise , …..) in [2] outline of ways of outliers, detection .
SVM is one of the machine learning techniques. Separating normal data from abnormal data has been introduced in [3] by SVM technique .
Outlier detection techniques based on one-class support vector machine use spatial and temporal correlations between sensor data for identifying outliers , but this technique for large scale training samples , take more spatial and temporal overhead to process and optimize .
In [4] a technique was defined based on bayesian for detection of local outliers in data flow sensor. according to what was mentioned in [5] , we can use combination of some techniques for outlier,s detection. for example, combination of k-nearest neighbor and SVM methods proposed for outliers detection that use k-nearest method for reducing scale of training sample. this method can shorten training time and optimize time however for data in large scale, k-nearest neighbor has considerable temporal and spatial consumption.
or in [6] combination of SVM and fisher discriminant ratio (FDR) proposed for redundant data detection that at first distributed clustering is applied on sensor data, then by FDR (fisher discriminant ratio) there is boundaries for discriminant between clustering data and scattered data in clusters. the clustered data are regarded as redundant data and will be removed. thus the number of data samples for training SVM is greatly reduced however classification accuracies in this method is a little lower than the traditional incremental SVM training methods.
in [7] proposed combinational technique of k-nearest neighbor and clustering to calculate interesting events for outliers detection. in [8] three techinques for outlier detection including a machine learning technique,a principal component analysis - based methodology and an univariate statistics-based approach were studied in wireless sensor networks.

کلیدواژه‌ها [English]

  • outlier detection
  • Wireless Sensor Network
  • Classification Techniques
  • Data mining
  • reduction of energy consumption
  1.  

    1. Asgarnezhad, R., Akbari Torkestani, j.)2011(. A survey on backbone formation algorithms for wireless sensor networks(a new classification).  Telecommunication Networks and Applications Conference (ATNAC), 2011 Australasian, 9-11 Nov. 2011.  (In Farsi )
    2. Akyildiz, I.F., Su, W., Sankarasubramaniam, Y., Cyirci, E .)2002(. Wireless sensor networks: a    survey. Computer Networks,2002- Elsevier, Vol 38,  15 March. 2002, pp. 393–422.
    3. Akkaya, K., Younis, M.)2005(. A survey on routing protocols for wireless sensor networks. Ad Hoc Networks- Elsevier, Vol 3,  May 2005, pp. 325–349.
    4. Bastam , M . )2008(. Data collection on wireless sensor networks in order to maximize your lifetime. Master's thesis, Department of Computer Engineering and Information Technology, Amirkabir University of Technology. (In Farsi )
    5. Bandyopadhyay, S., Coyle, E.J.)2003(. An energy efficient hierarchical clustering algorithm for wireless sensor networks. INFOCOM 2003. Twenty-Second Annual Joint Conference of the IEEE Computer and Communications. IEEE Societies, 30 March-3 April. 2003.
    6. Bamnett, V., Lewis, T.)1994(. Outliers in statistical data. Journal of the Royal Statistical Society. Series A (General),Vol. 141, No. 4 , pp. 1-2. 
    7. Corke , P., Wark, T., Jurdak, R., Hu, W., Valencia, P., Moore, D.)2010(. Environmental wireless sensor networks. Proceedings of the IEEE,Vol98, Nov. 2010 ,pp. 1903 – 1917.  
    8. Devika,  R., Santhi,  B., Sivasubramanian, T. )2013(. Survey on routing protocol in wireless sensor network. International Journal of Engineering and Technology, Vol 5, No 1, Feb-Mar 2013. Pp.350-356. 
    9. Esmaeili ,M .)2016(. Data warehouses and data mining. Tehran: Cultural and Art Institute of Dybagaran , pp. 416.  (In Farsi )

    10. Esmaeili ,M .)2013(. Training of step by step data mining with Rapied Miner. Tehran : Atynegar publishing, pp. 134. (In Farsi )

    11. Fawzy, A., Mokhtar, H.M.O., Hegazy, O.)2013(. Outlier detection and classification in wireless sensor networks. Egyptian Informatics Journal- Elsevier, Vol 14, July 2013, pp. 157–164.

    12. Gil,  P., Martins, H., Januário, F.)2016(. Detection and accommodation of outliers in wireless sensor networks within a multi-agent framework. Applied Soft Computing- Elsevier, Vol 42, May 2016, pp.204–214.

    13. Gupta, G., Younis, M.)2003(. Fault-tolerant clustering of wireless sensor networks. Wireless Communications and Networking, 2003. WCNC 2003. 2003 IEEE, 16-20 March .2003.  

    14. Heinzelman,W.R.,Chandrakasan, A., Balakrishnan, H.)2000(. Energy-efficien communication protocol for wireless microsensor networks. System Sciences, 2000. Proceedings of the 33rd Annual Hawaii International Conference on, 7-7 Jan. 2000.

    15. Hansen, E., Neander, J., Nolin, M., Bjorkman, M.)2006(. Efficient cluster formation for sensor networks. The Online Publication …, 2006 .

    16. Janakiram ,D., Mallikarjuna, A., Reddy, V., Kumar, P.)2006(. Outlier detection in wireless sensor networks using bayesian belief networks. Communication System Software and Middleware, 2006. Comsware 2006. First International Conference on, 8-12 Jan. 2006.

    17. Kiss, F.)2003(. Credit scoring processes from a knowledge management perspective. Periodica Polytechnica. Social and Management Sciences 11.1 , 2003,pp. 95-110.  

    18. Li, M .)2014(. Data filtering and distortion-prevention for power saving in wireless sensor networks.Communication­Problem-Solving (ICCP), 2014 IEEE International Conference on Beijing, 5-7 Dec. 2014 , pp. 103-106.

    19. Mhatre, V., Rosenberg, C.)2004(. Design guidelines for wireless sensor networks: communication, clustering and aggregation. Ad Hoc Networks- Elsevier, Vol 2, January 2004, pp. 45–63.  

    20. Pantazis, N.A., Nikolidakis, S.A., Vergados, D.D.)2013(. Energy-efficient routing protocols in wireless sensornetworks: a survey.IEEE Communications Surveys & Tutorials,Vol15,  Second Quarter 2013 ,pp. 551 – 591. 

    21. Raghuwanshi, S., Mishra, A.)2003(. A self-adaptive clustering based algorithm for increased  energy-efficiency and scalability in wireless sensor networks. Vehicular Technology Conference, 2003. VTC 2003-Fall. 2003 IEEE 58th, 6-9 Oct. 2003. 

    22. Rezaei, Z., Akbari Torkestani, J.)2012(. An Energy-efficient mcds-based routing algorithm for wireless sensor networks: learning automata approach. Przegląd Elektrotechniczny (Electrical Review),2012.  (In Farsi )

    23. Shukla, D.S, Pandey, A.C , Kulhari, A. )2014( . Outlier detection: a survey on techniques of wsns   involving event and error based outliers . International Conference on Innovative Applications of Computational Intelligence on Power, Energy and Controls with their Impact on Humanity (CIPECH14). 28 & 29 November 2014,pp. 113 – 116.

    24. Shahid, N., Naqvi, I.H. , Qaisar, S.B.)2013(. One-class support vector machines: analysis of outlier detection for wireless sensor networks in harsh environments. Artificial Intelligence Review , vol 43, no. 4: 515-563.

    25. Shen ,X., Li, Z., Jiang, Z., Zhan, Y. )2013(. Distributed svm classification with redundant data  removing . 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber,Physical and Social Computing , Beijing, 20-23 Aug , pp. 866-870.

    26. Sepasi zahmati , A. )2006(. Design, simulation and evaluation of a new protocol to improve network routing of  wireless sensor networks. Master's thesis, Department of Electrical Engineering, University of Science and Technology. (In Farsi )

    27. Sudevalayam, S., Kulkarni, P.)2011(. Energy harvesting sensor nodes: survey and implications. IEEE Communications Surveys & Tutorials , Vol13, Third Quarter 2011,pp. 443 – 461.  

    28. Tsai, C.W., Lai, C.F., Vasilakos,  A.V.)2014(. Future internet of things: open issues and challenges. Wireless Networks-Springer,Vol 20, November 2014,  pp. 2201–2217.

    29. Vy ,V.D., An, T.H., Tu ,B.D., Nghe, V.D., Du, H.Q.)2012(. Data reduction algorithms for wireless sensor networks in environment monitoring and warning applications. Computational Intelligence, Modelling and Simulation (CIMSiM), 2012 Fourth International Conference on Kuantan, 25-27 Sept , pp. 416-421. 

    30. Wang, L.C., Wang, C.W.)2004(. A cross-layer design of clustering architecture for wireless sensor networks. Networking, Sensing and Control, 2004 IEEE International Conference on, 21-23 March. 2004.

    31. Xu, S. , Hu, C. , Wang, L. , Zhang, G .)2012(. Support vector machines based on k nearest neighbor algorithm for outlier detection in wsns . Wireless Communications, Networking and Mobile Computing (WiCOM), 2012 8th International Conference on Shanghai , 21-23 Sept , pp.1-4.

    32. Younis, O., Fahmy, S.)2004(. Distributed clustering in ad-hoc sensor networks: a hybrid, energy-efficient approach. INFOCOM 2004. Twenty-third AnnualJoint Conference of the IEEE Computer and Communications Societies, 7-11 March .2004.  

    33. Zöller, S., Vollmer, C., Wachtel ,M., Steinmetz, R. , Reinhardt, A.)2013(. Data filtering for wireless sensor networks using forecasting and value of information. Local Computer Networks (LCN), 2013 IEEE 38th Conference on, 21-24 Oct. 2013.  

    34. Zhang Y, Meratnia N, Havinga P.)2010(. Outlier detection techniques for wireless sensor networks: a survey.  IEEE Communications Surveys & Tutorials , Vol12,  Second Quarter 2010 ,pp. 159 – 170