Вы находитесь на странице: 1из 17

Done by:

WHAT IS
YAHOO!
ANSWERS ?

A community-driven
question - and - answer
website that allows users
to both submit questions
to be answered and
answer questions asked
by other users.

2
FEATURES
 Allows any Yahoo! ID-registered user to
answer a pre-existing question or submit a
new question.
 All questions submitted to the site remain
actively open for four days, during which
any user may post an answer .
 Original poster (OP) can highlight a
specific response as the "Best Answer.“

3
 Users other than the OP are able to
select the "Best Answer" democratically
through an open voting process.

 Exceptionally helpful user contributions


are occasionally curated on the official
Yahoo! Answers Blog by the staff.

 Misuse of Yahoo! Answers is handled by


a user moderation system, where users
report posts that are in breach of
guidelines or the Terms of Service.
4
ARCHITECTURE OF
YAHOO!

5
DATA STORED:

 user profiles that record users


interests, education, hobbies
and etc.
 predefines interest categories
and subcategories.

6
BIG DATA
 "Big data" is a field that treats ways to analyze or
deal with data sets that are too large or complex
to be dealt with by traditional data-processing
application software.
 Data with many cases (rows) offer greater
statistical power, while data with higher
complexity (more attributes or columns) may lead
to a higher false discovery rate.
7
 Big data challenges :capturing
data, data storage, data
analysis, search, sharing,
transfer, visualization,
querying, updating, information
privacy and data source.

Three key concepts:


volume, variety, and velocity.

8
USER INTEREST ANALYZER

 utilizes user’s profile information and user


interactions to determine the interests of the
user in the predefined interest categories.
 User interest vector

9
QUESTION ANALYZER

▸ categorize a question into predefined


interest categories based on the topics of
the question.
▸ examine the tags and text of the question
and generates a token string.
▸ Compared to its Synset to determine the
categories where the question belongs.

10
QUESTION –USER MAPPER

▸ The question-user mapper algorithm is


called while asking or forwarding questions.

▸ Question-User Mapper identifies the


appropriate answerers for a given question.

▸ Involves the concept of data mining.

11
 Data mining, also called knowledge discovery in
databases, in computer science, the process of
discovering interesting and useful patterns and
relationships in large volumes of data.

 combines tools from statistics and artificial intelligence


(such as neural networks and machine learning) with
database management to analyze large digital
collections, known as data sets.
WHAT IS
DATA
MINING? 12
13
1.Classification:
 used to retrieve important and relevant information about data, and
metadata.
 helps to classify data in different classes.
2. Clustering:
 to identify data that are like each other.
 helps to understand the differences and similarities between the data.
3. Regression:
 identifying and analyzing the relationship between variables.
 used to identify the likelihood of a specific variable, given the
presence of other variables.
4. Association Rules:
 helps to find the association between two or more Items.
 discovers a hidden pattern in the data set.

14
5. Outer detection:
 observation of data items in the dataset which do not match an
expected pattern or expected behavior.
 used in a variety of domains, such as intrusion, detection, fraud or
fault detection, etc.
6. Sequential Patterns:
 discover or identify similar patterns or trends in transaction data for
certain period.
7. Prediction:
 analyzes past events or instances in a right sequence for predicting
a future event.

15
▸ Content can be inappropriate for children.
▸ Reporting feature is easily abused to
have quality answers removed.
▸ Moderation is inconsistent.

DISADVANTAG
ES
16
THANKS!
Any questions?

17

Вам также может понравиться