Вы находитесь на странице: 1из 41

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/299534723

Garbage Bins Tracking and Geotagging using Predictive Analysis on a web and
mobile platform

Thesis · February 2017


DOI: 10.13140/RG.2.2.17273.47203

CITATIONS READS
0 285

2 authors:

Jonathan Chavez Javellana Guilmarie Joyce Doce Adriatico


Ateneo de Davao University Ateneo de Davao University
3 PUBLICATIONS   0 CITATIONS    1 PUBLICATION   0 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Geotagging View project

All content following this page was uploaded by Jonathan Chavez Javellana on 09 March 2018.

The user has requested enhancement of the downloaded file.


2
Garbage Bin Tracking and Geotagging using
Predictive Analytics in Bins Allocation on a Web
and Mobile Platform
ADRIATICO, GUILMARIE JOYCE D., ATENEO de Davao University
JAVELLANA, JONATHAN C., Ateneo de Davao University

The problem of identifying the waste collection is the lacking data structure. Some residents
in the community may throw in nearby corners, any place that fits. In this study, an application for
garbage tracking and collection is created to track and collect data on various garbage bins in a
barangay. Predicting the number of bins as the measure for volume of waste in a location point is
important in providing an efficient and improved solid waste collection service. This insight is all
about how many bins will be allocated in each pick up point. The researchers propose predictive
analytics approach to extract useful knowledge, and establish trends. The approach used is Simple
Regression technique. Number of filled bins is the target variable, and date, location, type of waste
are used for prediction. Geotagging helps identify the exact coordinates for the pickup point. This
establish a trend for the bins per type in each pick up point and identifying the point that needs to
be collected first. The result of the linear regression equation can be used directly by the Barangay
Committee on Waste and to predict the bins quickly and to make data-driven decisions.
General Terms: Geotagging, Waste Collection, Predictive Analytics
Additional Key Words and Phrases: Waste Management, Data Mining, Linear Regression

1. INTRODUCTION

1.1 Background of the Study

Waste Management is more than just putting waste to its proper place. Waste is a discarded
resource, a rejected object that is found both on our bins and in trucks.
Proper waste segregation and collection is essential for public awareness. According to a
research done by ADB, (2004) in Metro Manila alone, their waste generates about 6,700 tons per
day and around P 3.54 billion is spent annually for its collection and disposal. There is so much data
about waste management ‘hidden’ in the web: reports and scientific publications, and through
government and local units (Waste Atlas, 2013).
Waste Collection is the collection and transport of waste to the place of treatment or
discharge by municipal services or similar institutions or by public or private corporations,
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:2

specialized enterprises or general government. Collection of municipal waste may be selective, that
is to say, carried out for a specific type of product, or undifferentiated, in other words, covering all
kinds of waste at the same time (OECD, 2016). There are different types of waste being collected.
Effective waste management is a primary concern and would yield proper approaches and
improvements with regards to the collection. Another concern that the waste collection currently
face is the lack of garbage data documentation and identification. This plays an important aspect of
an efficient and smart waste management system.

In developing countries such as the United Kingdom, These types are color-coded that its
citizens would be able to know, they call it green, blue and black services. Food and garden wastes
are labeled green. Mixed dry recycling are labeled blue. Lastly, the non-recyclables are labeled as
black.

In the past three decades, Singapore, a well-developed country, incorporates recycling, collection
and disposal. Their government created a program in boosting recycling rates. Waste sorting and
recycling was introduced for residences. A collection system was introduced, and schools, offices,
shopping malls, and industry, were brought under the program. The NEA appoints public waste
collectors through open tenders by geographical sectors, locations. By the end of 2005, fifty-six
percent (56%) of all Singapore households had taken part (Yep, 2016). The amount of waste rose
from 1260 tonnes per day in 1970 to 8402 tonnes per day in 2015 which is based on the initiatives
and regulation of solid waste collection in Singapore (National Environment Agency of Singapore,
2016).

Municipal Solid Waste is collected in several ways: House-to-House, Community Bins,


Curbside Pick-up, Self-delivered and Contracted or Delegated Service. Most common practice is
the House-to-House and Community Bins. Waste collectors visit each individual house to collect
the garbage. The community bins are placed at fixed points in a neighborhood or locality. Residents
bring their waste to those bins. ‘Separation’ can be a misnomer as waste is not actually separated
but rather is placed out for collection in separate containers without first being mixed together. In
the Philippines, there are steps and measures including in the Republic Act 9003 or the Ecological
2:3 • Adriatico, Javellana

Solid Waste Management Act which shows the proper implementations and guidelines in keeping
the country’s solid waste organized and properly disposed.

1.2 Problem Statement


Waste Collection is still in a relative state of infancy here in the Philippines, with little
research having conducted in many of its areas to date. In particular, the tasks, identifying necessary
information such as the amount of waste collected and the successful tagging with the location
points are yet to give attention.
(1) How can this study be applicable in garbage/solid waste collection?
(2) How can the project application be efficient in bin data tracking?
(3) How can the project application be useful in capturing location points?
(4) What is the purpose of predictive analytics applying it from the garbage data collected?

1.3 Objectives
This project aims to create and provide a waste collection tracking analysis and geotagging in a
local barangay within Davao City. These are the following objectives of the study:
(1) To be able to determine the applicability of the study.
(2) To be able to determine the efficiency in tracking bin data.
(3) To be able to determine the usefulness of capturing location data point.
(4) To be able to determine the purpose of applying and approach of predictive analytics.

1.4 Significance of the Study


Having someone to monitor the collection is an important aspect of waste management. This
study will aid in the efficient tracking of bins within the various pick up points. It sheds insights on
the personnel to be able to identify the trend or pattern within a month or a week that will be utilized
for smart decision-making that benefits the community to have efficient trash management.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:4

1.5 Scope and Limitations


The application aims to determine the total amount of biodegradable, non-biodegradable and
recyclable trash in a particular barangay monthly. It focuses on the number of garbage bins collected
in the particular pick-up points for a certain period of time (Mon-Sat). And further provide
information to the barangay for future improvement in the garbage collection.

2. REVIEW OF RELATED LITERATURE

2.1 Related Works

2.1.1 LeanPath Food Waste Tracking System


It is an integrated tracking system that aids in identifying, tracking, recording
specified food waste item found in establishments. This is an optimal initiative to help
reduce food waste in the service sector. Its significance has become more important over
time. While there remain different levels of sophistication concerning the steps individual
restaurants can implement to control waste (such as menu modification, cooking to order,
or portion control), technology exists to help simplify the task of tracking food waste.

Lean Path provides individuals within the larger commercial food industry with the
most advanced tools to conduct waste audits, which enable restaurants to actually see the
amount of food waste that amasses during the course of the day. Lean Path focuses on waste
control through an industrial software system which tracks all pre-consumer waste such as
spoilage, expiration and overproduction.

2.1.2 Waste Atlas (2012)


Waste Atlas is a web and mobile open-access map that visualizes solid waste management
data from all over the world. Atlas aims to provide free and easy access to waste management
data and relevant documents for comparison and benchmarking purposes between countries,
cities and waste management systems. In that way, Atlas outlines a global view of waste
2:5 • Adriatico, Javellana

management and serves every citizen’s right to knowledge and reliable information regarding
the way waste is managed. With the combination of scientific knowledge, special data mining
software and multiple data resources, the data published in Atlas becomes more and more
reliable. And like the Wikipedia project, the more the organizations and scientists involved,
the better data quality will be achieved.

2.1.3 Rotterdam’s Waste Collection Efficiency Case Study


The municipality of Rotterdam set Enevo, a leading innovator in smart waste management
and its Dutch partner TWS (Total Waste Systems) in Netherlands a target to increase the
efficiency of collections by at least 20 percent. Enevo’s wireless sensors was installed in 150
paper and cardboard collection containers in South Rotterdam.

The project extension will double the number of monitored waste containers and increase
the use of Enevo Smart Plan software for optimized collection route planning of underground
paper and cardboard waste containers in the city.

The sensors continuously monitor the fill level within the containers and were linked to the
waste department’s project office via cloud servers. Technology to monitor fill levels in waste
containers is well understood, however key to our requirements was a dynamic route planning
system. Enevo offers one of the only systems that reliably offers both capabilities.
To be able to make the most of the recycling and reuse opportunity, we need to make sure
that waste collection is convenient for citizens. That means optimizing the waste value chain,
using Enevo’s analytical model, from the number and placement of receptacles, to scheduling
collections ensuring there are spaces in the containers.
Enevo is a forward-thinking company with a vision to transform the financial, environmental
and social impact of waste.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:6

The Internet of Things, big data and analytics offer the promise of better business decision
making. Utilizing this technology from Enevo, waste management and recycling by the
municipality of Rotterdam is achieving tangible results through more efficient collections and
route planning (Enevo, 2016).
2.1.4 Use of Data Mining Techniques to Determine and Predict Length of Stay of
Cardiac Patients (Hachesu, et al: 2013)
Predicting the length of stay (LOS) of patients in a hospital is important in providing them

with better services and higher satisfaction, as well as helping the hospital management plan

and managing hospital resources as meticulously as possible. We propose applying data mining

techniques to extract useful knowledge and draw an accurate model to predict the LOS of heart

patients.

Data were collected from patients with coronary artery disease (CAD). The patient records

of 4,948 patients who had suffered CAD were included in the analysis. The techniques used

are classification with three algorithms, namely, decision tree, support vector machines (SVM),

and artificial neural network (ANN). LOS is the target variable, and 36 input variables are used

for prediction. A confusion matrix was obtained to calculate sensitivity, specificity, and

accuracy.

Methods

1. Patient Population

The cohort consisted of hospitalized patients during the study period, which started on July

18, 2006 and ended on December 30, 2011. We identified 4,948 patients who were admitted

to the Academic and Educational Hospital of Rajaei Cardiovascular Medical & Research
2:7 • Adriatico, Javellana

Center in Tehran, Iran with heart disease-related diagnoses. Only CAD data were included in

the study (n = 3,512). Significant CAD was defined as at least one point of 50% or greater

diameter stenosis in at least one coronary artery vessel.

2. Data Set

The data sets were stored in a database management system of Microsoft structured query

language (MS-SQL) database. We extracted and constructed a new data set for LOS of CAD.

However, 246 patients were removed from the analyses, because patient records data, such as

identification, were unavailable from the data set. Thus, 3,266 patients were included in the

final data set for further analysis. Table 1 shows features with acceptable class and values. The

data set contained 36 attributes. Finally, we organized the data set into two groups, including

categorical and numerical features. We categorized data values and derived new fields from

existing data in the following features: ejection fraction, diastolic blood pressure, systolic blood

pressure, smoking, triglyceride, low-density lipoprotein, high-density lipoprotein, hemoglobin,

serum cholesterol, and fasting blood sugar. These features were changed to categorical

attributes for better analysis and to obtain good results.

3. Data Pre-processing

Data cleansing and preprocessing are essential to have optimal results [27]. Therefore, we

performed the following cleansing and preprocessing: repeated records, fields with spelling
Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:8

errors, additional tokens, other irregularities, and irrelevancies were deleted. The next step of

pre-processing was the handling of patient records with missing and outlier data.

4. Dealing with Missing Values

The hospital data set had many features with missing values. Several replacement strategies

were adopted to fill the missing values. First, if a feature was encountered in more than 50%

of records with missing values, that characteristic was determined not to be an effective feature

in the analysis. As a result, such a feature, such as weight or job, was removed. Second, if a

feature was encountered in less than 12% of records with missing values, the mean values of

records were replaced instead of missing values in the numeric features. For example,

creatinine showed 11.2% missing data.

5. Attribute Coding

Data was coded by some valid resources, such as heart disease associations and the

Wikipedia Website.

6. Training and Test Data Sets

After cleaning and preprocessing, 2,064 completed records were extracted and obtained for

data mining tasks. Separating the data into training and testing sets is an important part of

evaluating data mining models. We partitioned the data set into a training set and a testing set;

80% of the data (1,643 records) was used for training, and 20% of the data (421 records) was
2:9 • Adriatico, Javellana

used for testing. The training set was used to adjust the parameters of the models, and the

testing set was used to evaluate its predictive ability.

2.1.5 Singapore LPWAN Smart Waste Management System

This system includes battery operated sensors to detect if a bin is filled, and communicate
this information to an Application server on the Internet through a wireless infrastructure base
on LPWAN technology which supports wide area wireless coverage in the ISM frequency band
and at very low power.
The use of LPWAN (Low Powered Wide Area Network) technology enables the design of
sensor nodes that can be battery operated for a long duration of time. Compared to current
solutions that use public WAN data services such as 3G/4G, which are paid services, LPWAN
technologies allow system operators to implement low-cost wireless infrastructure to support
free wireless data usage since it is operating in free ISM bands.
The use of sensors that monitor the status of waste bins and transmitting this data to an
application server will allow efficient route planning in the collection of waste.
This technology benefits to the waste management industry in terms of its waste bin remote
sensors for an in-depth load and depth sensing. In the productivity improvement by cutting
down unnecessary trips made to empty collection points; in the context of a city waste
management system, this will also enable change in collection route made by rubbish trucks
and hence carbon emission. The server consists of 2 parts, web portal, and web service. The
web portal provides a manage tools for administrators to configure the application. The web
service provides APIs for communication with receiver gateways. The server is designed to be
generic so that it can be easily customized for other IoT (Internet of Things) related applications.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:10

3 METHODOLOGY

3.1 Operational Framework

Figure 1 - Concept

3.2 Planning and Analysis

An android application starts for the day’s collection of bins depending on the pickup location
and the type of bin to be targeted. The data shall be sent over through a 4G or WAN connection
proceeding to the server (javthesis.info). The intention of this study is to predict the amount of
bins within a month. However to create the regression model, the data gathered is limited within
the fifteen day period that was utilized in order to present a general trending of the amount of
waste covering the month of January. In that way, it would present a sample of the report
whether or not the waste level bins has changed.

3.3 System Design

The following presents the overall flow of the web and mobile platforms. This design is

similar to a concept of a need for portability and flexibility and in Data Input using an

interactive User Interface through a Mobile Application. This gives an inspiration to provide

a seamless data gathering experience. No more lacking data structure. Through this system, it

is building one.
2:11 • Adriatico, Javellana

Figure 2 - Web structure


flow

Figure 3 -Mobile Structure


flow

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:12

User Interface Design

In this part, these are the following user interface design interface of the system.

Figure 4 – Web UI

Figure 5 – Mobile UI

3.4 System Implementation

The system will be implemented on the barangay community level where it is suitable and
fit for the basic context in Waste Collection Service. The end users of the system are the Volunteer
Waste Personnel whose role is to observe and log the daily collection in the pick-up point and the
Administrator-in charge of the Waste committee whose role is to monitor the personnel’s activity,
inspect and set the locations, and observe the bin trend. It is relevant to know that if the data gathered
can be transformed into a knowledge, and that would be useful and helpful in determining the
2:13 • Adriatico, Javellana

limitations and actions to prevent mistreatment of waste. In this case, the volume of waste is
measured through the number of bins.

4 THEORETICAL FRAMEWORK

4.1 Waste Management


Waste Management is the precise name for the collection, transportation, disposal or recycling
and monitoring of waste. This term is assigned to the material, waste material that is produced
through human being activity. This material is managed to avoid its adverse effect over human
health and environment. Waste Classification is its sub process wherein each waste is categorized
into types, and waste inventory refers to the overall waste data collected on a certain period. Solid
Waste are the result of consumption and the root and source of the waste management cycle.
Effective waste management includes five levels of priorities: prevention, reduction, recycling,
treatment and disposal. In the course of the evolution of solid waste management, six fundamental
elements have been identified: waste generation, storage, collection, transfer and transport,
processing, treatment and recovery, and disposal.

4.2 Barangay Del Monte Island Garden City of Samal Waste Management Ordinace
(2011-24)
One of the purposes of the ordinance is to guide, control and regulate the generation of solid
wastes within the Barangay and promote an orderly and sanitary system for the same. It is based on
the implementation of the provisions of Republic Act No. 9003 which mandates proper segregation
and collection of household wastes. This ordinance just like any other barangay ordinance has this
standard during collection and disposal.
In Section 10 it states, all purok organization must have a designated strategic
collection/drop-off points to facilitate the hauling of segregated solid waste materials of the

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:14

barangay. Segregated bins are brought to the points at the date and time of collection. A personnel
is assigned to every collection points to ensure delivery of solid household wastes.
The barangay must assign sanitary crews to check delivered solid waste materials and handle
its transfer.
At a point, there shall be a suitable container being brought up. There shall be a separate
container for each type of waste from all sources that in the case of bulky waste, it will suffice that
the same be collected and placed in a separate and designated area (Section 9)
The container/bin depending on its use shall be properly marked or identified for on-site
collection as follows:
a.) Biodegradables/Compostables or “Malata”
b.) Recyclables or “Di nabubulok na maari pang pakinabangan” and
c.) Residuals (Non-Biodegradable) & Special Wastes or “Di-nabubulok na ‘di na
mapapakinabangan”

4.3 Ecological Solid Waste Management of 2003 (Republic Act 9003)


It is an Ordinance from the Philippines which serves as a guide for solid waste management.
This ordinance also promotes national research and development programs for improved solid waste
management and resource conservation techniques, more effective institutional arrangement and
indigenous and improved methods of waste reduction, collection, separation and recovery.
From its definition, Waste Collection shall refer to the act of removing solid waste from the
source or from a communal storage point (Section 3).
Part of the components of the Ordinance Solid Waste Management Plan is collection and
transfer. To define the coverage of the solid waste collection area in every barangay. The barangay
shall be responsible for ensuring that a 100% collection efficiency from residential, commercial,
industrial and agricultural sources where necessary within its area of coverage, is achieved. The
following concerns are stated (Section 17):
(1) Availability and provision of properly designed containers or receptacles (bins) in
selected collection points for the temporary storage of solid waste while awaiting
collection and transfer to processing sites or to final disposal sites;
2:15 • Adriatico, Javellana

(2) Segregation of different types


(3) Hauling and transfer of solid waste from source or collection points to processing sites
(4) Issuance and enforcement of ordinances to effectively implement a collection system in
the barangay
(5) Provision of properly trained officers and workers to handle solid waste disposal

4.4 Predictive Analytics


Predictive analytics is a branch of advanced analytics and data mining which is used to make
predictions about unknown future events. It uses many techniques such as data mining, statistics,
modeling, machine learning, and artificial intelligence to analyze current data to make predictions.
It is used to bring together the management, information technology, and modeling business process.
The patterns found in historical and transactional data can be used to identify risks and opportunities
for future transactions. Predictive analytics models capture relationships among many factors to
assess risk with a particular set of conditions to assign a score or weightage.
Every business has a treasure trove of data, from customer and transaction information to
manufacturing and shipping statistics. The key is figuring out how to use past data to better the
business’ future (Predictive Analytics World, 2016)

Predictive analytics is the technology that learns from data to make predictions about what
each individual will do (Seigel, 2013).
The process involving it are the following:
1. Define Project
2. Data Collection
3. Data Analysis
4. Statistics
5. Modeling

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:16

6. Deployment
7. Model Monitoring
The study focuses on the Statistics and the Data Analysis component which it gives us the
key factor in identifying the key results on the frequency of food orders that is transacted.
The popularity of predictive analytics with businesses has led to other types of organizations
using the software. For example, healthcare firms are using predictive analytics to predict how
certain drugs and therapies will be received by patients, and help doctors better detect early warning
signs for life threatening diseases and illnesses.
Linear Regression Analysis
Linear regression is a statistical method that allows us to summarize and study relationships
between two continuous variables, part of regression modeling which the model specification is that
the dependent variable y is a linear combination of the parameters (but need not to be linear in the
independent variables). For example, in simple linear regression for modeling n data points there is
one independent variable x and two parameters b0 and b1 which forms a straight line:

Figure 5 – Simple Linear Regression Equation


In multiple linear regression, there are several independent variables or functions of
independent variables. Adding a term in xi2 to the preceding regression gives this which forms a
parabola:

Figure 6 – Multiple Linear Regression Equation


The steps to create a regression model into your dataset (Rosario, 2015)
Given a dataset,
Find mean and standard deviations of x and y values
∑ 𝑥𝑖 ∑ 𝑥𝑖 ∑(𝑥𝑖 −𝑥̅ )2 ∑(𝑦𝑖 −𝑦̅) 2
① 𝑥̅ = , 𝑦̅ = , 𝑠𝑥 = √ , 𝑠𝑦 = √ 𝑛−1
𝑛 𝑛 𝑛−1

Calculate the r, correlation coefficient


2:17 • Adriatico, Javellana

∑(𝑥𝑖− 𝑥̅ )(𝑦𝑖− 𝑦̅) ∑ 𝑧𝑥 𝑧𝑦


② 𝑟𝑥𝑦 = ∑(𝑥𝑖−𝑥̅ )2
= 𝑛−1

Calculate the regression coefficient and intercept


𝑠𝑦 ∑(𝑥𝑖− 𝑥̅ )(𝑦𝑖−𝑦̅)
③ Given r、b1 = 𝑟 𝑠𝑥 = ∑(𝑥𝑖−𝑥̅ )2
𝑡ℎ𝑒𝑛 𝑏0 = 𝑦̅ − 𝑏1 𝑥̅

Write out the linear model using these estimates


④ ŷ = 𝑏0 + 𝑏1 𝑥 which is a regression equation of the trend line
⑤ Compute R-squared (R2) which is just, well, r is raised to the power of 2!
⑥ Determine whether or not the model and/or regression coefficient is statistically
significant.
Using a confidence interval, 𝒃𝟏 ± 𝒕 ∗ 𝑺𝑬(𝜷𝟏 ) since H0: b1=0, we check to see if 0 is
in the confidence interval. If it is, fail to reject H0, x has no effect on y. Otherwise, reject
H0, x does have effect on y. Or using a hypothesis test.
⑦ If you are asked about prediction error, compute s or the standard error.

̂𝒊 )𝟐
∑(𝒚𝒊 −𝒚
𝐬= √ ̂𝒊 𝒊𝒔 𝒕𝒉𝒆 𝒓𝒆𝒔𝒊𝒅𝒖𝒂𝒍 or the error of a particular value of y
; 𝒘𝒉𝒆𝒓𝒆 𝒚𝒊 − 𝒚
𝒏−𝟏

from a value of x.

Regression-JS
Regression.js is a javascript library containing a collection of least squares fitting methods
for finding a trend in a set of data. It currently contains methods for linear, exponential, logarithmic,
power and polynomial trends.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:18

Google Maps and Google Maps API

Originally a C++ program designed by two Danish brothers Lars and Jens Rasmussen at the
Sydney-based company Where 2 Technologies, Google Maps was acquired by Google, Inc. in 2004
and transformed into a web application (LeMay, 2005). The application is based on a close variant
of the Mercator projection but the coordinates of the features on Google Maps are GPS coordinates
based on the WGS 84 datum (Map Types, 2014).
Google launched Google Maps API in 2005 in order to allow web developers to integrate Google
Maps into their websites (Taylor, 2005). It has since become the most heavily used web application
development API on the Internet, used in the development of over 1,000,000 web sites
(ProgrammableWeb, 2013). At the time of writing it is a free service and doesn’t require
advertisements, but the terms of use state that Google reserves the right to display ads in the future.
It is also free for some commercial usage, provided that the site on which it is being used is publicly
accessible, does not charge for access, and is not generating more than 25,000 map requests a day
(Google Maps/Google Earth API’s Terms of Service, 2014).
The workflow involved in embedding Google Maps inside a web page involves the following six
steps (Getting Started with Javascript, 2014):
1. Declare the application as HTML5 using the <!DOCTYPE html> declaration.
2. Include the Maps API Javascript using a script tag.
3. Create a div element named “map-canvas” to hold the Map.
4. Create a Javascript object literal to hold a number of map properties.
2:19 • Adriatico, Javellana

5. Create a JavaScript “map” object, passing it the div element and the map options
6. Use an event listener to load the map after the page has loaded.

JavaScript and APIs

An interpreted computer programming language that come as a part of most web browsers,
the implementation which allows client-side scripts to interact with the user, control the browser,
communicate asynchronously, and alter the document content that is displayed (Javascript
Introduction, 2014). This language offers some flexibility in where the program code can reside,
and the simplest approach is to place the code directly on the webpage itself. This approach has the
additional benefit of allowing everyone with access to the webpage the ability to view the source
code through the use of the “View Source” function on the browser.
Web Services

One of the many ways to provide communications between data providers and data
consumers is through the use of web services. The World Wide Web Consortium (W3C) has this
definition: “a software system designed to support interoperable machine-to-machine interaction
over a network. It has an interface described in a machine-processable format (specifically the Web
Service Definition Language (WSDL) (Wen, 2014).
D3.js Data Visualization

Data-driven documents (D3) is a JS library for direct manipulation of a native representation


(Bostock et al., 2011). Main purpose of D3 is to create highly interactive and responsive data
visualizations by allowing animation. D3 Javascript library makes use of the SVG, HTML5 and
CSS standards to create visualizations for modern web browsers (Guldamlasioglu, 2015).
It was first developed by Stanford graduate student Mike Bostock in 2011, along with his
advisors Jeffery Heer and Vadim Ogievetsky. The library is a freely available extension of
Javascript and the basic idea behind D3 was to provide a way to join data with elements on a web
page and then manipulate the elements based on that data.
Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:20

Also important is to note that D3.js is there to help tell a story. To design with data is to
design data in mind. D3.js offers a lot of different methods in working with data. There is a list
below taken from the libraries API reference pages on gitHub (Meskanen-Kundu, 2015).
 Behaviors – reusable interaction behavior
 Core – for example selections, transitions, data, localization, colors
 Geography – project spherical coordinates, latitude and longitude
 Geometry – utilities for 2D geometry, such as Voronoi diagrams and quadtrees
 Layouts – derives secondary data
 Scales — converts between data and visual encodings
 SVG — utilities for creating Scalable Vector Graphics
 Time — parse or format times that can compute calendar intervals

There is information also about namespaces, importing data in different formats (json and CSV),
math that can be done on data and even tutorials (Bostock, 2014).
Android

Android is a mobile operating system developed by Google. It is used by several


Smartphones such as the Motorola Droid, the Samsung Galaxy, and Google’s own Nexus One.
Android also is an operating system based on the open Linux kernel. Unlike the iPhone OS, Android
is open source, meaning developers can modify and customize the OS for each phone. Therefore,
different Android-based phones may have different graphical user interfaces even though they use
the same OS. The Android OS was originally created by Android, Inc. Which was bought by Google
in 2005. Google teamed up with other companies to form the Open Handset Alliance (OHA), which
has become responsible for the continued development of the Android OS (techopedia, 2017).
Android program are written in Java and run through Google’s “Dalvik” virtual machine,
which is optimized for mobile device (mayanneb, 2012).
o The Android OS features include:
o Integrated browser, based on the open source WebKit engine
o Optimized 2D and 3D graphics, multimedia and GSM connectivity
2:21 • Adriatico, Javellana

o Bluetooth, EDGE, 3G, Wi-Fi


o SQLite
o Camera
o GPS
o Compass
o Accelerometer

Software developers who want to create application for the Android OS can download the
Android Software Development Kit for a specific version. Android Studio, Eclipse are few of the
common IDEs used in developing Android applications.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:22

Location Strategy: Tagging user created content with a location


Obtaining user location from a mobile device can be complicated. There are several reasons
why a location reading (regardless of the source) can contain errors and be inaccurate.
A model of this interaction might happen with respect to location services is visualized.

Timeline representing the window in which the user location is obtained and listening stops when
the user consumes the current location (From:Android Developers)

Flow for obtaining user location (Android Developers, 2017)


Here’s the typical flow of procedures for obtaining the user location:
1. Start application
2. Start listening for updates from desired location providers.
3. Maintain a “current best estimate” of location by filtering out new, but less accurate fixes
4. Stop listening for location updates
5. Take advantage of the last best location estimate
2:23 • Adriatico, Javellana

5 RESULTS AND DISCUSSION


Process involving web and mobile platform
The study presents the process lies within the context of the application in both web and
mobile platforms.
1. In the mobile application, a tagging module is present, which allows the personnel to tag
the pickup point where the garbage bins will be collected.
2. The personnel can add/track garbage bins by indicating the status of bins (number) which
is seen full or ready for collection in each waste type. Done according to the pickup point
selected.
3. Apply the regression model and estimate its parameters
4. Once submitted, the data will be placed onto the structure. This consists of 4 predictor
variables (location, bins, date, waste_type) and 1 flag variable (is_collected).
5. On the web, a status report page presents the graphical summary of the collection. Also,
trends will appear through the use of a linear regression model to determine the estimated
bins in the following week.

Process involving bins prediction


The study having included predictive analytics, is important to notify the user of the bin trends. It
also presents the use of web services including client-side scripting technologies like Javascript,
thus finding out that there are custom libraries and functions on regression. The process of a
leastSquares function implemented using Javascript is presented. (see Appendix C)
Having the parameters xSeries and ySeries which are both data map arrays,
① The data is being utilized to find the mean using the variables xBar and yBar. reduceSumFunc
is a function that adds previous and next values of the series.
var xBar = xSeries.reduce(reduceSumFunc) * 1.0 / xSeries.length;
var yBar = ySeries.reduce(reduceSumFunc) * 1.0 / ySeries.length;

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:24

② After that, the data will be able to find the sum of the squares in each series.
var ssXX = xSeries.map(function(d) { return Math.pow(d - xBar, 2); })
.reduce(reduceSumFunc);
var ssYY = ySeries.map(function(d) { return Math.pow(d - yBar, 2); })
.reduce(reduceSumFunc);
③ Find the sum of the product of series (XY)
var ssXY = xSeries.map(function(d, i) { return (d - xBar) * (ySeries[i] -
yBar); }).reduce(reduceSumFunc);
④ Find the slope
var slope = ssXY / ssXX;
⑤ Find the intercept
var intercept = yBar - (xBar * slope);
⑥ Find r2 – correlation
var rSquare = Math.pow(ssXY, 2) / (ssXX * ssYY);
The function returns slope, intercept and the r-squared.

Results
The proponents conducted an onsite usability test using the Android application of the
project on a mobile device. Different locations were tagged, and a form was presented to the
participant. The testing session captured each participant’s task activity and comment towards the
application. A total of eight participants are involved in this usability test to ensure stable results.
Each individual session lasted approximately 15 minutes.
The following presents the tasks:
1. Opening the application
2. Entering the location name
3. Detecting and tagging of Location Point
4. Select Location Point
5. Identify coordinates
6. Connect to the web server (javthesis.info)
2:25 • Adriatico, Javellana

7. Send bin information

Task Completion Rates

Participant Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7

1 √ √ √ √ √ √ √

2 √ √ √ √ √ √ √

3 √ √ √ √ √ √ √

4 √ √ √ √ √ - -

5 √ √ √ √ √ - -

6 √ √ √ √ √ √ √

7 √ √ √ √ √ - -

8 √ √ √ √ √ √ √

Success 8 8 8 8 8 5 5

Completion 100% 100% 100% 100% 100% 62.5% 62.5%


Rates

*Completion Rate = [number of success/number of participants]

All participants successfully completed Tasks 1 (Opening the application), 2 (Entering the
location name), 3 (GPS Detection and Tagging of Location Point), 4 (Selection of Location
Point) and 5 (Identifying the coordinates). Around three out of eight participants were not able
to complete Tasks 6 (Connect to the web server) and 7 (Send bin information) having a
completion rate of 62.5%.

After the completion of each task, participants evaluated the system application through the
following factors:
1. Usability
2. Learnability (User interaction)

The 5-point rating scale ranged from 1 (Strongly disagree) to 5 (Strongly agree). The statements are
the following

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:26

1. I think that I would like to use this system frequently


2. I found the system unnecessarily complex
3. I thought the system was easy to use
4. I think that I would need the support of a technical person to
be able to use this system
5. I found the various functions in this system were well
integrated
6. I thought there was too much inconsistency in this system
7. I would imagine that most people would learn to use this
system very quickly
8. I found the system very cumbersome to use
9. I felt very confident using the system
10. I needed to learn a lot of things before I could get going
with this system

The following presents the score contributions of the whole usability test

Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 SUS SCORE


p1 4 3 4 0 1 3 4 3 4 3 72.5
p2 3 3 4 2 3 4 2 4 4 2 77.5
p3 4 0 4 2 3 3 3 1 4 3 67.5
p4 3 4 4 3 4 3 4 1 3 1 75
p5 3 2 3 1 2 1 3 2 3 1 52.5
p6 3 0 4 1 4 2 4 3 4 3 70
p7 3 3 3 3 3 2 2 3 2 2 65
p8 3 2 4 2 2 3 3 2 4 1 65
AVERAGE
68.125

Usability
This part covers the Systems Usability Statements 1, 2, 3, 5, 6, 7, 8, and 9 which represents
the applicability of using the application.
Statement 1: Use the system frequently

There are two participants agreed.

Statement 2: Found the system unnecessarily complex


2:27 • Adriatico, Javellana

There are two participants agreed.

Statement 3: System easy to use

There are six participants agreed.

Statement 5: Various functions are well integrated

There are two participants agreed.

Statement 7: Most people would learn to use the system very quickly

There are three participants agreed

Statement 8: Found the system very cumbersome to use

No respondents agreed.

Statement 9: Confident in using the system

There are five respondents agreed.

Overall the usability of the application has a positive response meaning the personnel will
be able to confidently use the system that is easy to use, functions are well integrated, learn very
quickly and would like to use it on a regular operation.

Learnability
This part covers the Systems Usability Statements 4 and 10 which represents the learnability
of the application
Statement 4: I think that I would need the support of a technical person
Out of the eight participants, three agreed, two disagreed and others are either.
Statement 10: I needed to learn a lot of things before I could get going with this system
Out of the eight participants, three agreed, three disagreed and others are either.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:28

Overall, the learnability of the application shows that a normal user could understand the system if
a manual or a support of a technical person is present.

The average scores of the test is 68.125 which falls under the marginal high acceptability range.
The following table shows the SUS Score of each participant and its corresponding acceptability
ranges, grade scale and adjective ratings.
Participant SUS Score Acceptability Grade Scale Adjective
Ranges Ratings
1 72.5 Acceptable C OK
2 77.5 Acceptable C Good
3 67.5 Marginal High D OK
4 75 Acceptable C Good
5 52.5 Marginal Low F OK
6 70 Acceptable C OK
7 65 Marginal High D OK
8 65 Marginal High D OK

Respondent Background
These are the following respondents that were able to participate in the usability test.
Participant Affiliation/Role Gender Age Educational Civil Status Length of the
Attainment job
(year/month)
1 CENRO Male 44 None Married 6 years
2 Barangay Male 30 College Level Single 4 years
Volunteer
Picker
3 Barangay Female 28 High School Single 1 year
Volunteer Level
Picker – 32A
4 Barangay Female 25 High School Single 1 year
Volunteer Level
Picker – 18B
5 Volunteer Male 34 College Level Single 2 years
2:29 • Adriatico, Javellana

6 Barangay Male 21 College Level Single 1 year


Member
7 Barangay Male 23 College Level Single 2 years
Volunteer
Picker – Malvar
8 Barangay Male 21 College Level Single 1 year
Volunteer
Picker – Mintal

REGRESSION TESTING
The predictive analytics component of the study is a linear regression analysis. Through this

model, it would generate a graph from the bins data. This includes the regression line (trendline)

and its equation and the correlation coefficient (r-squared) values. The extent of the data is about

nearly a month. That trend line will be the basis on the prediction of the number of waste bins on

each date for that pick up point and waste type.

In the succeeding months, the trend line will remain the same. Actual values may vary over

time. The linear regression line between number of bins (response variable) and date (predictive

variable) has produced a regression equation of Y = -0.21X + 8.51. The test presents a sample

dataset on BIR pick up point, under the Recyclables bin type.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:30

Graph result from the application:

Graph result from Excel:

Y
12
10
8
6
4
2
0
0 5 10 15 20 25 30

Results

Application Coefficient Intercept Standard Error R-squared


Excel -0.15824 7.916484 2.304558311 0.082051282

GoTrash -0.21 8.51 0.000400892 0.16


2:31 • Adriatico, Javellana

X Y Predicted Difference
Table 6 – 1 10 8.3 1.7
GoTrash 2 8 8.0857 -0.0857
Difference 3 10 7.8714 2.1286
Prediction Test 4 6 7.6571 -1.6571
Using the 5 9 7.4428 1.5572
formula
6 5 7.2285 -2.2285
y = -0.21x +
7 6 7.0142 -1.0142
8.51
8 4 6.7999 -2.7999
9 9 6.5856 2.4144
10 4 6.3713 -2.3713
11 3 6.157 -3.157
12 10 5.9427 4.0573
13 6 5.7284 0.2716
14 5 5.5141 -0.5141
15 7 5.2998 1.7002

X Y Predicted Difference
Table 7 – Excel 1 10 7.758244 2.241756
Difference 2 8 7.600004 0.399996
Prediction Test 3 10 7.441764 2.558236
Using the 4 6 7.283524 -1.283524
formula 5 9 7.125284 1.874716
y = -0.15824x
6 5 6.967044 -1.967044
+ 7.916
7 6 6.808804 -0.808804
8 4 6.650564 -2.650564
9 9 6.492324 2.507676
10 4 6.334084 -2.334084
11 3 6.175844 -3.175844
12 10 6.017604 3.982396
13 6 5.859364 0.140636
14 5 5.701124 -0.701124
15 7 5.542884 1.457116
Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:32

6 CONCLUSION AND RECOMMENDATION


This study can be applied in the context of solid waste management.

Garbage bins data tracking is made efficient by the utilization of geotagging and mobile data

connection to the server. Tagging a location point is useful based on the system usability test

that has proven that the system is in a marginal high acceptability range which includes that

component in terms of geolocation.

The approach used is Regression Technique in Predictive Analytics. The technique was able to

predict the number of bins with a positive variance. This further states that there’s a strong

correlation with the number of bins predicted compared with the actual value in the system.

Conclusion
This study has determined that it is possible to collect solid waste bins through this

application which serves as a helpful tool for an efficient and organized waste collection process.

With the possibility of adding tracking locations, noting the number of bins to be used for future

needs has been made seamless. The analytics function in the system is also possible in

determining the proper decision in bins allocation. In that way, there would be an established

data structure for the context in waste collection. The result of the linear regression equation can

be used directly by the Barangay Committee on Waste and to predict the bins quickly and to

make decisions for efficiency.


2:33 • Adriatico, Javellana

Recommendation
The next researchers would be able to explore the following recommendations of the

study:

- That predicted bins value can be associated as well with the number of people who would

dispose their garbage in a particular location point and identifying the kind of waste in

detail.

- That the application would be also tested in different barangays, considering the different

principles and guidelines that the system will be brought up and it keeps learning.

- That the web status report dashboard has filters and personnel can be able to set maximum

number of bins to be encoded.

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:34

7 APPENDICES
7.1 Appendix A – System Usability Test Form

USABILITY TEST FORM


Name:____________________________________
Age:____________ Gender___
Date:___________
Affiliation:________
Barangay:________________
Location Point:______________________________
How many years/months in the job?___________
Educational Background:__________
Civil Status:________
TASKS 3 2 1
working somewhat never
Opening the application
Enter location name
Tag a map location point
Select a location point in map
Identify current location coordinates
Connect to the internet
Send Bin Information

System Usability Scale 5 4 3 2 1


Strongly Agree Somewhat Disagree Strongly
Agree Disagree
I think that I would like to use this system
frequently
I found the system unnecessarily complex
I thought the system was easy to use
I think that I would need the support of a
technical person to be able to use this system
I found the various functions in this system
were well integrated.
I thought there was too much inconsistency in
this system.
I would imagine that most people would learn
to use this system very quickly.
I found the system very cumbersome to use.
7.2
I feltAppendix B – Screenshots
very confident using the system.
I needed to learn a lot of things before I could
Thegoing
get userwith
willthis
besystem.
redirected to the add location point.
2:35 • Adriatico, Javellana

7.2 Appendix B - Screenshots

Users will see this launcher User may assign a certain


activity. To proceed, click location name for the
the button below current tagged location
point detected by GPS.

User can click one of the


markers in the Map
Selection. Once selected,
then tap Use Location

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:36

Status Report Bin Trends Allocation (http://javthesis.info/thesisbasura/statusreport)


2:37 • Adriatico, Javellana

7.3 Appendix C – Source Code: Least Squares Regression Function


// returns slope, intercept and r-square of the line
function leastSquares(xSeries, ySeries) {
var reduceSumFunc = function(prev, cur) { return prev +
cur; };

var xBar = xSeries.reduce(reduceSumFunc) * 1.0 /


xSeries.length;
var yBar = ySeries.reduce(reduceSumFunc) * 1.0 /
ySeries.length;

var ssXX = xSeries.map(function(d) { return Math.pow(d -


xBar, 2); })
.reduce(reduceSumFunc);

var ssYY = ySeries.map(function(d) { return Math.pow(d -


yBar, 2); })
.reduce(reduceSumFunc);

var ssXY = xSeries.map(function(d, i) { return (d - xBar) *


(ySeries[i] - yBar); })
.reduce(reduceSumFunc);

var slope = ssXY / ssXX;


var intercept = yBar - (xBar * slope);
var rSquare = Math.pow(ssXY, 2) / (ssXX * ssYY);

return [slope, intercept, rSquare];


}

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:38

7.4 Appendix D - Usability Test Scaling

The System Usability Scale (SUS) is an inexpensive, yet effective tool for assessing the

usability of a product, including Web sites, cell phones, interactive voice response systems, TV

applications, and more. It provides an easy-to-understand score from 0 (negative) to 100 (positive).

As noted above, using a scoring system ranging from 0 to 100 often leads to researchers interpreting

SUS scores as percentages, which they are not.

The normative data collected by Bangor, Kortum, and Miller (2008) and Sauro (2011) provided

the basis for positioning SUS scores as percentiles, providing a more meaningful basis for

interpreting SUS scores. For example, Sauro commented: “A [SUS] score of a 70 is actually right

around the average score of 68, meaning it is at or around the 50th percentile” (Journal of Usability

Studies: 2013).
2:39 • Adriatico, Javellana

8 REFERENCES
Brooke, John. 2013. SUS: A Retrospective. Journal of Usability Studies. Vol 8, Issue 2. pp.29-40

Getting Started with JavaScript, 2014. [online] Google, Inc. https://google-


developers.appspot.com/maps/documentation/javascript/tutorial [Accessed 6 February 2014]

Google Maps/Google Earth APIs Terms of Service, 2014. [online]. Google, Inc. Available from:
https://developers.google.com/maps/terms?hl=en [Accessed 15 February 2014].

Guldamlasioglu, Selin (June 2015) Web-based Information Visualization using JavaScript, MSc.
Thesis. 57 pages. Retrieved from https://tampub.uta.fi/bitstream/handle/10024/97846/GRADU-
1438162264.pdf

Linear Regression [online] Wikipedia


https://en.wikipedia.org/wiki/Regression_analysis#Linear_regression [Accessed 27 January 2017]

Meskanen-Kundu, Laura (2015) Making Data Accessible: An Overview of Interactive Data


Visualization Using D3.js as Applied to a Scientific Dataset: Making a Static Visualization
Interactive. Retrieved from https://tampub.uta.fi/bitstream/handle/10024/97846/GRADU-
1438162264.pdf

Stadsbeheer Rotterdam Case Study (2016) Enevo offers a route to more efficient waste collection
in Rotterdam. Retrieved from https://cdn3.enevo.com/wp-
content/uploads/2016/03/29110608/Enevo_Case_Study_report_Rotterdam.pdf

UP Press: 2008 Solid Waste Management: Principles and Practices : an Introduction to the Basic
Functional Elements of Solid Waste Management, with Special Emphasis on the Needs of
Developing Countries
What is Predictive Analytics? - Predictive Analytics Times - predictive analytics & data science
news. (2013). Retrieved August 17, 2016, from
http://www.predictiveanalyticsworld.com/patimes/what-is-predictive-analytics/2217/

Waste Atlas 2013 Report. Collective work.


http://www.iswa.org/fileadmin/galleries/News/WASTE_ATLAS_2013_REPORT.pdf (Accessed
December 2016)

Adriatico, Javellana
Web and Mobile Garbage Tracking and Geotagging app for Predictive Analytics • 2:40

Wen, Michael. (July 2014) Exploring Spatial Analysis Capabilities in Google Maps Mashup using
Google Fusion Tables: A case study in Land Lease data retrieval, Master’s Thesis. Retrieved
from http://nwmissouri.edu/library/theses/2014/WenMichael.pdf

Location Strategies-Android Developers. Online Resource.


https://developer.android.com/guide/topics/location/strategies.html (Accessed 27 February 2017)

D3.js Web Articles. https://d3js.org/, https://www.dashingd3js.com/d3-examples/1-d3-and-


javascript-working-with-json, http://christopheviau.com/d3list/gallery.html

Tom-Alexander regression-js on Github. https://github.com/Tom-Alexander/regression-js

Zimring, Rathje. Encyclopedia of Consumption and Waste: The Social Science of Garbage. 2011.

View publication stats

Вам также может понравиться