Академический Документы
Профессиональный Документы
Культура Документы
Abstract
In this white paper, Impetus talks about the need for building Big Data technologies based social analytics platform for better business insight. The paper also focuses on why social media analytics is important in todays world and how 3-D data sourcesthat is, internal, external and social datacan be utilized to build a data warehouse based on Big Data technologies. Impetus also shares in this white paper, its recommended solution, and how Big Data technologies can be used to optimize costs and handle and exponential increases in data over time.
Table of Contents
Introduction .................................................................................................................................................. 2 The benefits of Social Analytics .................................................................................................................... 4 Data sources that facilitate Social Media Analytics ...................................................................................... 5 Technical tenets of Social Media Analytics ................................................................................................... 5 Using Big Data technologies to enable Social Media Analytics .................................................................... 7 Building a Big Data warehouse ..................................................................................................................... 8 A step-by-step approach to creating the Big Data EDW ............................................................................... 9 The Impetus solution .................................................................................................................................. 11 The iLaDaP high level architecture.......................................................................................................... 11 Summary ..................................................................................................................................................... 12
Introduction
Social Media Analytics is a discipline that helps organizations measure, assess and explain the performance of their social media initiatives. There are four stages of analyzing social media data, including the following: Step 1: collecting the data. This facilitates the compiling of reports and statistics that are to be shared with the management or the internal and external stakeholders. Step 2: measuring the data. This helps in Sentiment Analysis and gauging which products are well received in the marketplace. Step 3: analysis. Here, data is presented in a visual and interactive manner to the management, as well as the sales and marketing teams to provide better insights. Step 4: innovation. Based on the insights and analysis, there is a move towards innovation, where organizations determine the new products and ideas they are going to pursue, as a response to customer requirements. Innovation also helps unearth the cross sell or up sell opportunities that were not visible before. Social Analytics opens up a host of new opportunities and perspectives. Category-wise analysis of customer data for instance, enables their
2
demographic profiling and helps determine their usage patterns. Similarly, with Feature analysis, it is possible to figure out which forums, platforms or sources of data are more active as compared to others. Product Growth Analysis, which focuses on the data generated for a specific product, helps understand the response of users to that product. There is also a Recommendation Engine, which helps zero in on what is missing or lacking in a product range.
Finally, Social Analytics enables Third Party Analysis, which is purely focused on what the public social media platforms, such as Twitter, Facebook, MySpace, etc. have to say about the product.
Social Media Analytics effectively converges on-site, social media and third party data to extract useful information. Considering these factors, and the fact that it enables enterprises to leverage the colossal data that is continuously generated through social media interactions, Social Media Analytics should be made an integral part of the marketing and research strategies of enterprises.
Entity Extraction: Organizations can identify the concerns and issues that dissatisfied customers are struggling with through Entity Extraction. They can then take appropriate measures to ease the situation and retain customers on the verge of switching to other suppliers or vendors. Event Extraction enables companies to unearth the sequence of events leading up to customer defections, or why people moved on to other providers. Communications Graphs: Once organizations have all the data nicely sliced and diced, they can draw Communication Graphs. These graphs can help analyze and identify the top influencers, and active members in various groups. They can also help companies gain a better understanding of where the messages originate, and how they travel through the network. Knowing this, organizations can target the top influencers and most active members in the network, projecting a positive image of the brand or product in the community.
TweetUser FaceUser
USA, NY USA, GA
Twitter Facebook
For natural language processing, using a NLTK is a good Open Source option. Data preparation/Mashups can be accomplished by running Map-Reduce jobs over the collected data and massaging it. Apache Mahouts k-means algorithm can be used for clustering, while its Nave Bayesian algorithm can be used for classification/sentiment analysis using the comments and tweets from social media data sources and identifying patterns. The item-based similarity algorithm of Mahout can be used for collaborative filtering and recommendations. When the data is ready for analytical reporting and deep mining, Hive or Pig can be used.
Companies can come up with recommendations by running the data through Mahout. These recommendations can then be factored into future product design and rollouts. Step 3: This step is about using customer data to recommend new and related products. Once companies have data from their existing systems as well as social sources, they can prepare the mock customer data for Social ID mapping and run Item or User based recommendations on this data using Mahout. At this stage, it is possible to produce Analytical Reports on data generated by Mahout. This can be accomplished by generating reports using a traditional Reporting product or framework. The nicely sliced and diced reporting data can be dumped into a MySQL database or some other SQL database, with the help of Sqoop. This SQL database can be used to meet the regular downstream reporting requirements of organizations. This will enable them to use their existing investments in reporting tools as well as provide the drill down reports for use by the management and Sales and Marketing departments. Alongside social media, this Big Data Media Analytics platform can be used to address other large data analytics requirements. The platform can give companies a head start in putting together the pieces of their Big Data strategy and provide them with an asymmetric advantage over competition.
10
It is powered by Hadoop, and therefore, can linearly scale up to thousands of nodes using commodity hardware. This spells a significant cost advantage in the long run. iLaDaP also comes with a set of pre-canned and customized reports. Businesses that need to track down and take advantage of opportunities as they happen can use the Impetus platform to react to events. The iLaDaP is also capable of collecting data from a range of disparate sources. This unstructured data can be transformed and utilized for strategic business decisions. Furthermore, organizations can deploy the solution on-premise, as well as in a Cloud supported setup. iLaDaP can be seamlessly integrated with the current platforms of companies, without making any major changes.
11
Summary
Traditional Enterprise Data Warehouses do not have the ability to keep up with rapidly increasing social media data. The need of the hour is to effectively strategize and build a Big Data Analytics Platform to manage, store and derive insights from this digital data. Any single vendor technology may not be sufficient to undertake this task, and it is recommended that organizations go for Open Source options to build a Social Media Analytics Platform using Big Data technologies. The fact is that the success of a Big Data platform depends entire on the tools that are used. Organizations therefore, need to use discretion and select the most appropriate tools from the available options. Companies can also re-use existing EDW investments for their Big Data Analytics Platform.
About Impetus Impetus Technologies offers Product Engineering and Technology R&D services for software product development. With ongoing investments in research and application of emerging technology areas, innovative business models, and an agile approach, we partner with our client base comprising large scale ISVs and technology innovators to deliver cutting-edge software products. Our expertise spans the domains of Big Data, SaaS, Cloud Computing, Mobility Solutions, Test Engineering, Performance Engineering, and Social Media among others. Impetus Technologies, Inc. 5300 Stevens Creek Boulevard, Suite 450, San Jose, CA 95129, USA Tel: 408.252.7111 | Email: inquiry@impetus.com Regional Development Centers - INDIA: New Delhi Bangalore Indore Hyderabad To know more visit: www.impetus.com
Disclaimers
The information contained in this document is the proprietary and exclusive property of Impetus Technologies Inc. except as otherwise indicated. No part of this document, in whole or in part, may be reproduced, stored, transmitted, or used for design purposes without the prior written permission of Impetus Technologies Inc.
12