Вы находитесь на странице: 1из 7


Se min ar R e port On

Big Data: Issues and Challenges Moving


Submitted By

Miss. Sapana Pandit Ekade

Department of Computer Science &Engineering ,

Pratap Institute of Management &Technology, Washim.



Se min ar R e port On

Big Data: Issues and Challenges Moving


Submitted for partial fulfillment of requirement for the degree of

Computer Science & Engineering
Submitted By

Miss. Sapana Pandit Ekade

Under the Guidance of

Prof. Sachin Jadhav

Department of Computer Science & Engineering,

PratapInstitute of Management &Technology, Washim.


Pratap Institute of Management & Technology, Washim
Department of Computer Science &Engineering


The seminar titled Big Data: Issues a nd Challenges Moving

Forwa rdsubmitted by Miss. Sapana Pandit Ekade fulfillment of requirement for
the award of degree of Bachelor of Engineering in Computer Science And
Engineering by Sant Gadge Baba Amravati University , Amravati has been carried
out under my supervision at the computer science and engineering of Pratap Institute
of Management and Technology, Washim.

Prof. S. Jadhav
Prof. K. U. Chaware Dr.M.S.Patil
Head of Department Principal
PiMT, Washim PiMT,Washim


I have great pleasure and sense of satisfaction in presenting this report on Big
Data: Issues and Challenges Moving Forward for Seminar. This report
would not be possible without the help and support of gratitude to my seminar guide
Prof. S.Jadhav, for his instinct help and valuable guidance with a lot of
encouragement throughout this seminar work, right from selection of topic work up to
its completion. My sincere thanks to Head of the Department of Computer Science
Prof. K.Chaware, who continuously motivated and guided for completion of this
work. I am also thankful to our Prof. A.Raipurefor this report.

I also acknowledge the research work done by all researchers (world-wide)

whose efforts have directly or indirectly helped me accomplish this work.

Miss. Sapana Pandit Ekade

Exam Seat No:-


Big data refers to data volumes in the range of Exabytes (1018) and beyond.
Such volumes exceed the capacity of current on-line storage systems and processing
systems. Data, information, and knowledge are being created and collected at a rate
that is rapidly approaching the Exabyte/year range. But, its creation and aggregation
are accelerating and will approach the zettabyte/year range within a few years.
Volume is only one aspect of big data; other attributes are variety, velocity, value, and
complexity. Storage and data transport are technology issues, which seem to be
solvable in the near-term, but represent long-term challenges that require research and
new paradigms. We analyze the issues and challenges as we begin a collaborative
research program into methodologies for big data analysis and design.

Keywords :- Big data; Hadoop; Hadoop distributed file system ; Map reduce

Page No.
List of Figures
1 Introduction 1- 4
1.1 Importance of Big-data
1.2 Big-data Characteristics
1.3 Big-data- Where is it?
2 Literature Survey 5-6
3 Processing Big-data 7 - 13
3.1 Issues
3.1.1 Storage and Transport Issues
3.1.2 Management Issues
3.1.3 Processing Issues
3.2 Challenges
3.2.1 Data Input and Output Process
3.2.2 Quality versus Quantity
3.2.3 Data Growth versus Data Expansion
3.2.4 Speed versus Scale
3.2.5 Structured versus Unstructured Data
3.2.6 Data Ownership
3.2.7 Compliance and Security
4. Solution on Issues and Challenges of Big Data 14 - 17
4.1 Apache Hadoop Technology
4.1.1 Hadoop Distributed File System
4.1.2Apache Map Reduce
5 Discussion 18 - 19
6. Conclusion and Future work 20
References 21


1.2 Big Data Characteristics

1.3 Sources whos Generated Massive Amount of Data
3.2.6 Some Big Data Ownership Challenges
4.1 Big data is the problem and Hadoopis the solution. (With two main techniques)
4.1.1 Apache Hadoop distributed file system
4.1.2 Apache Map Reduce Technique
5.0 Communication between user to user inside the bigdata