Вы находитесь на странице: 1из 19

INFORMATION SEARCH AND ANALYSIS SKILLS

(ISAS)

TERADATA DATABASE

Created by : 1. AmiraLuthfita
2. Nadia RizkiIkhsan
3. SyarifahPrameswara
Class : 1NAP1
Semester : 1
Quarter : 2
Faculty : MilyunNimaShoumi,S.Kom, M.Kom

Continuing Education Program Center for Computing and Information Technology


Faculty of Engineering, University of Indonesia
2015
PREFACE

First of all, we would like to express our great thanks to our God, Allah SWT who have
given us His mercy and grace in this opportunity so we can finish our first project on time. And
thanks a lot to Mrs. MilyunNimaShoumi, S.Kom, M.Kom, the teacher who guide and help us to
make this assignment.

This assignment was prepared to be submitted as a college assignment Information


Technology themed “Relational Database Managing System”. We sincerely hope that this paper
can be useful in order to increase our knowledge and insight. We are also fully aware that in this
paper there is a shortage and far from perfect. Therefore, we hope the suggestion and critics for
the improvement of paper that we have made in the future, remember nothing is perfect without
constructive suggestion.

Hopefullythis simplepaperis understandablefor anyonewhoread itandreportsthathave been


preparedcan be usefulforourownas well asthosewhoread it.In the end, weapologize ifthere are
errorsof wordsthat are lesspleasing. Thank you.

Depok, November 2015

Authors

i
TABLE OF CONTENT
PREFACE.............................................................................................................................i
TABLE OF CONTENT.......................................................................................................ii
TABLE OF FIGURES.........................................................................................................iii
TABLE OF TABLES...........................................................................................................iv
CHAPTER I.........................................................................................................................1
INTRODUCTION................................................................................................................1
1.1 Background......................................................................................................................1
1.2 Writing Objective.............................................................................................................2
1.3 Problem Domain..............................................................................................................2
1.4 Writing Methodology.......................................................................................................2
1.5 Writing Framework..........................................................................................................2
CHAPTER II........................................................................................................................4
BASIC THEORY.................................................................................................................4
2.1 Relational Database.........................................................................................................5
2.2What is RDBMS?.............................................................................................................5
CHAPTER III......................................................................................................................7
PROBLEM & SYSTEM ANALYST.................................................................................7
3.1 What is Teradata?............................................................................................................7
3.2Teradata System Requirements.........................................................................................8
3.3Advantages and Disadvantages of Teradata.....................................................................9
3.4 Architecture of Teradata..................................................................................................10
3.5 Comparison......................................................................................................................12
CHAPTER IV.......................................................................................................................14
SUMMARY..........................................................................................................................14
4.1 Conclusion.......................................................................................................................14
4.2 Suggestion........................................................................................................................14
BIBLIOGRAPHY................................................................................................................15

ii
TABLE OF FIGURES

Figure 2.1 Relational Database (Ref:www.teach-ict.com)....................................................4


Figure 2.2 Structure of RDBMS (Ref : www.reuters.com ) ………………………………5
Figure 3.1 Logo of Teradata(Ref:www.best-online-training.com) ……………………….7
Figure 3.2 Teradata Basic Architecture (Ref: www.blog.usefulfreetips.com) …………….12

iii
TABLE OF TABLES

3.1 Teradata System Requirements (Ref: https://developer.teradata.com/)............................8


3.2Teradata Architecture (Ref: http://www.info.teradata.com)..............................................10
3.3Teradata Virtual Processor (Ref: http://www.info.teradata.com)......................................11
3.4TeradataComparison(Ref: http://db-engines.com/en/system)......................................12

iv
CHAPTER I
INTRODUCTION

1.1 Background
Following the technology progress in the areas of processors, computer memory, computer
storage and computer networks, the sizes, capabilities, and performance of databases and their
respective DBMSs have grown in orders of magnitude. The development of database technology
can be divided into three eras based on data model or structure: navigational, SQL/relational, and
post-relational. The relational model, first proposed in 1970 by Edgar F. Codd, departed from
this tradition by insisting that applications should search for data by content, rather than by
following links. The relational model employs sets of ledger-style tables, each used for a
different type of entity. Only in the mid-1980s did computing hardware become powerful enough
to allow the wide deployment of relational systems (DBMSs plus applications).
One kind of DBMS is Teradata's product which can be used for business analysis. Data
warehouses can track company data, such as sales, customer preferences, product placement,
etc.Teradata has a supplier diversity program that designates a minimum of 3 to 5% of spending
on minority, women, veteran, or small business vendors. In 2013, the Ethisphere Institute named
Teradata as one of the "World's Most Ethical Companies".

1.2 Writing Objective


The purpose of writing this article is to:
 To understand more about Database and RDBMS
 Give more knowledge and information about Teradata
 Tell about what is advantages and disadvantages of Teradata
 Comparison between Teradata and other DBMS

v
1.3 Problem Domain
This paper will discuss about:
 What is Teradata?
 Advantages and Disadvantages of Teradata
 How Teradata works

1.4 Methodology
The discussion above researched by collecting information from internet and IT professionals
both in the form of articles, journals, and the opinions of users in this subject.As for the technical
writing of scientific works of this author uses and refers to his previous reports made by seniors
in CCIT-FTUI, Depok.

1.5 Framework
To facilitate the preparation of this report, the systematics is used as follows:

1. CHAPTER I INTRODUCTION
This Chapter will discuss about The Background why thisarticle was made, Problem domain
or what will be discussed in this article, writing objective, writing methodology, and writing
structure.

2. CHAPTER II BASIC THEORY


This Chapter will disscuss about History About technology development and what is
Teradata in general
3. CHAPTER III ANALYSIS
This Chapterwill discuss about What is Teradata and TeradataFeatures, Advantage and
Disadvantage from security, nodes and it’s compability.

4. CHAPTER IV SUMMARY
This Chapter Give a Conclusion and Suggestion for the readers who read this assignment.

vi
CHAPTER II
BASIC THEORY

A database is an organized collection of data. It is the collection of schemes, tables,


queries, reports, views and other objects. The data is typically organized to model aspects of
reality in a way that supports processes requiring information, such as modelling the availability
of rooms in hotels in a way that supports finding a hotel with vacancies.
A database management system (DBMS) is a computer software application that interacts
with the user, other applications, and the database itself to capture and analyze data. A general-
purpose DBMS is designed to allow the definition, creation, querying, update, and administration
of databases. Database management systems are often classified according to the database model
that they support; the most popular database systems since the 1980s have all supported the
relational model as represented by the SQL language. Sometimes a DBMS is loosely referred to
as a 'database'.

2.1 Relational Database

Figure 2.1Relational Database


(Ref: http://www.teach-ict.com/gcse_new/databases/flatfile_relational/miniweb/pg2.htm)

vii
Databases have been a staple of business computing from the very beginning of the
digital era. In fact, the relational database was born in 1970 when E.F. Codd, a researcher at
IBM, wrote a paper outlining the process. Since then, relational databases have grown in
popularity to become the standard. With a relational database, you can quickly compare
information because of the arrangement of data in columns. The relational database model takes
advantage of this uniformity to build completely new tables out of required information from
existing tables. In other words, it uses the relationship of similar data to increase the speed and
versatility of the database. Figure 2.1 is an example of a Relational Database consists of 3 tables
related to each other.
The "relational" part of the name comes into play because of mathematical relations. A
typical relational database has anywhere from 10 to more than 1,000 tables. Each table contains a
column or columns that other tables can key on to gather information from that table. By storing
this information in another table, the database can create a single small table with the locations
that can then be used for a variety of purposes by other tables in the database. A typical large
database, like the one a big Web site, such as Amazon would have, will contain hundreds or
thousands of tables like this all used together to quickly find the exact information needed at any
given time. Relational databases are created using a special computer language, structured query
language (SQL), which is the standard for database interoperability.

2.2 What is RDBMS?

RDBMS stands for Relational Database Management System. RDBMS data is structured in
database tables, fields and records. Each RDBMS table consists of database table rows. Each
database table row consists of one or more database tablefields. 
RDBMS store the data into collection of tables, which might be related by common fields
(database table columns). Illustration of RDBMS can be seen in Figure 2.2

viii
Figure 2.2 Structure of RDBMS

RDBMS also provide relational operators to manipulate the data stored into the database tables.
Most RDBMS use SQL as database query language. RDBMS are a common choice for the
storage of information in new databases used for financial records, manufacturing and logistical
information, personnel data, and other applications since the 1980s. Relational databases have
often replaced legacy hierarchical databases and network databases because they are easier to
understand and use. However, relational databases have received unsuccessful challenge
attempts by object databasemanagement systems in the 1980s and 1990s (which were introduced
trying to address the so-called object-relational impedance mismatch between relational
databases and object-oriented application programs) and also by XML database management
systems in the 1990s. Despite such attempts, RDBMSs keep most of the market share, which has
also grown over the years.

ix
CHAPTER III
PROBLEM ANALYSIS

Teradata is a massively parallel processing system running a shared-nothing architecture.


Its technology consists of hardware, software, database, and consulting. The system moves data
to a data warehouse where it can be recalled and analyzed. The systems can be used as back-up
for one another during downtime, and in normal operation balance the work load across
themselves.
In 2009, Forrester Research issued a report, "The Forrester Wave: Enterprise Data
Warehouse Platform," by James Kobielus, rating Teradata the industry's number one enterprise
data warehouse platform in the "Current Offering" category. Marketing research company
Gartner Group placed Teradata in the "leaders quadrant" in its 2009, 2010, and 2012 reports,
"Magic Quadrant for Data Warehouse Database Management Systems".

3.1 What is Teradata?

Figure 2.2 Logo of Teradata


(Ref: http://www.best-online-training.com)

Teradata may be a hot topic in the industry but it’s not a new concept. The Teradata
product is referred to as a "data warehouse system" and stores and manages data. The data
warehouses use a "shared nothing" architecture, which means that each server node has its own
memory and processing power. Adding more servers and nodes increases the amount of data that
can be stored. The database software sits on top of the servers and spreads the workload among
them. Teradata sells applications and software to process different types of data.
Teradata is the most popular data warehouse DBMS in the DB-Engines database ranking.
In 2010, Teradata was listed in Fortune’s annual list of Most Admired Companies. Teradata

x
Active Enterprise Data Warehouse is the platform that runs the Teradata Database, with added
data management tools and data mining software.
The data warehouse differentiates between “hot and cold” data – meaning that the
warehouse puts data that is not often used in a slower storage section. As of October 2010,
Teradata uses Xeon 5600 processors for the server nodes.Teradata Database 13.10 was
announced in 2010 as the company’s database software for storing and processing data.Teradata
Database 14 was sold as the upgrade to 13.10 in 2011 and runs multiple data warehouse
workloads at the same time. It includes column-store analyses. Teradata Integrated Analytics is a
set of tools for data analysis that resides inside the data warehouse.
Teradata RDBMS is also a complete relational database management system. The system
is based onoff-the-shelf Symmetric Multiprocessing (SMP) technology combined with a
communicationnetwork connecting the SMP systems to form a Massively Parallel Processing
(MMP) system. BYNET is a hardware inter-processor network to link SMP nodes. All
processors in a same SMPnode are connected by a virtual BYNET. We use the following figure
to explain how each component in this DBMS works together.

3.2 Teradata System Requirements

Table 3.1 Teradata System Requirements


(Ref: https://developer.teradata.com/)
Software or Hardware Description
Operating System:  Microsoft Windows Server 2012, 2008, 2003 R2
 Microsoft Windows 8.1, 8, 7 Enterprise x86 and
x64 Editions
 Microsoft Windows Vista Enterprise x86 and x64
Editions

Disk Storage:  29MB available disk space for .NET Data Provider for
Teradata files.
 100MB disk space on the TEMP drive for temporary
installation files.

.NET Framework:  Microsoft .NET Framework 4.5.1 x86 and x64


runtime (CLR 4.0).
 Microsoft .NET Framework 4.5 x86 and x64 runtime
(CLR 4.0).

xi
 Microsoft .NET Framework 4.0 x86 and x64 runtime
(CLR 4.0).
 Microsoft .NET Framework 3.5 SP1 x86 and x64
runtime (CLR 2.0).

Supported Teradata  Teradata Database 15.00


Versions:  Teradata Database 14.10
 Teradata Database 14.0
 Teradata Database 13.10
 Teradata Database 13.0

Microsoft Visual Studio:  Visual Studio 2013 Professional or higher


(Optional)  Visual Studio 2012 Professional or higher
 Visual Studio 2010 Professional or higher
 Visual Studio 2008 Professional or higher

3.3 Advantages and Disadvantages of Teradata


Advantages
1)Teradata database is linearly scalable.
- We can expand the database capacity by just adding more nodes to the existing database.If the
data volume grows we can add more hardware and expand the database capacity.
2) Extensive parallel processing.
- Teradata has an extensive parallel processing capacity, It can handle multiple adhoc requests
and many concurrent user
3)Shared nothing architecture
- Teradata database has shared nothing architecture, it has high fault tolerance and data
protection.
Disadvantages
- Teradata is not as open as Oracle in Tech. Many key tech is still under the control of Teradata
PS and Teradata is for enterprise application, not as widely used as Oracle or Sybase
- Does not support Non-ANSI SQL statements in stage generated SQL statements.
- Does not support byte data types.
- Does not support ‘like’ in the user defined SQL when used as lookup.
- Reject rows handling

xii
3.4Architecture of Teradata

SMP and MPP Platforms


The hardware that supports Teradata Database software is based on Symmetric
Multiprocessing (SMP) technology. The hardware can be combined with a communications
network that connects the SMP systems to form Massively Parallel Processing (MPP) systems.
The components of the SMP and MPP hardware platforms include the following.

Table 3.2 Teradata Architecture (Ref: http://www.info.teradata.com)


 
Component Description Function

A hardware assembly containing several,


tightly coupled, central processing units (CPUs)
in an SMP configuration. An SMP node is
connected to one or more disk arrays with the
following installed on the node:

• Teradata Database software


Processor Serves as the hardware platform
Node • Client interface software upon which the database
software operates.
• Operating system

• Multiple processors with shared-memory

• Failsafe power provisions

An MPP configuration is a configuration of two


or more loosely coupled SMP nodes.

Hardware interprocessor network to link nodes Implements broadcast,


on an MPP system. multicast, or point-to-point
BYNET communication between
Note: Single-node SMP systems use a software-
processors, depending on the
configured virtual BYNET driver to implement
situation.
BYNET services.

xiii
Virtual Processors
The versatility of Teradata Database is based on virtual processors (vprocs) that eliminate
dependency on specialized physical processors. Vprocs are a set of software processes that run
on a node under Teradata Parallel Database Extensions (PDE) within the multitasking
environment of the operating system.The following table contains information about the different
types of vprocs.

Table 3.3 Teradata Virtual Processor (Ref:http://www.info.teradata.com)

Vproc Type Description

AMP Access module processors perform database functions, such as executing database
queries. Each AMP owns a portion of the overall database storage.

GTW Gateway vprocs provide a socket interface to Teradata Database.

Node The node vproc handles PDE and operating system functions not directly related to
AMP and PE work. Node vprocs cannot be externally manipulated, and do not appear in
the output of the Vproc Manager utility.

PE Parsing engines perform session control, query parsing, security validation, query
optimization, and query dispatch.

RSG Relay Services Gateway provides a socket interface for the replication agent, and for
relaying dictionary changes to the Teradata Meta Data Services utility.

TVS Manages Teradata Database storage. AMPs acquire their portions of database storage
through the TVS vproc.

xiv
Figure 3.2 Teradata Architecture
(Ref:http://www.blog.usefulfreetips.com/2012/07/01/what-is-teradata-architecture/)

3.5 Comparison

Vertica Systems is an analyticdatabase management software company.Vertica was founded in


2005 by database researcher Michael Stonebraker, and Andrew Palmer. Former CEOs include
Ralph Breslauer and Christopher P. Lynch.

Vertica was acquired by Hewlett Packard on March 22, 2011. The acquisition expanded the HP
Softwaresoftware portfolio for enterprise companies and the public sector.

Table 3.4 Teradata Comparison (Ref: http://www.info.teradata.com)

Description Teradata Vertica


DB-Engines Ranking Score 77.08 Score 19.41
Rank #13 Database Rank #30

Server Operating Linux Linux


System
Supported C C++
Programming Cobol Java
Language Java (JDBC-ODBC) Perl
PL/1 Phyton
R
User Concepts Fine grained access rights according to Fine grained access rights
SQL-standard according to SQL-standard
APIs and other access .NET Client API ADO.NET
methods JDBC JDBC
JMS Adapter ODBC
ODBC
OLE DB

xv
From theTable 3.3 above we can conclude that Teradata is linearly scalable and more flexible to
use, and we can expand the database capacity by just adding more nodes to the existing database.
If the data volume grows we can add more hardware and expand the database capacity.

xvi
CHAPTER IV
SUMMARY

4.1. Conclusion
From this discussion we can conclude that Teradata is a part of the example of Relational
Database Managing System that appears in daily life. Teradata is a leading provider of data
analytics platform, applications, and services for IT and business leaders who are looking for a
results-oriented smarter, faster, and more precise answer to the most important question for them.
It focused on integrated solutions of data warehousing, big data analytics and business
applications. Teradata provides an integrated view of data. By collecting all your business
knowledge in one place, Teradata allows you to embrace your data driving really effective
decision-making.
Teradata has the most experienced professionals in the field, the best technology
performance standards, leading innovation in the field of industry, and the best network in the
world class for its clients and partners to delivers data integration and business support
organizations in taking the best decision for the benefit of more competitive.

4.2. Suggestion
Advice to Teradata corporation , for the next version of Teradata the authors hope that
Teradata can be installed and run in every operating system, allows row handling, support byte
data types and Non-ANSI SQL statements in stage generated SQL statements.

xvii
BIBLIOGRAPHY

1. http://www.reuters.com/finance/stocks/companyProfile?
symbol=TDC#cXXmjDL895mT3ku8.97 (18/11/15)
2. https://docs.oracle.com/javase/tutorial/jdbc/overview/database.html (18/11/15)
3. http://db-engines.com/en/system/Netezza%3BOracle%3BTeradata (19/11/15)
4.  http://www.teknologibank.com/berita-teradata.html#ixzz3rrJ7X6Pn (19/11/15)
5. http://www.databasedir.com/what-is-rdbms/ (21/11/15)
6. http://www.webopedia.com/TERM/R/RDBMS.html (21/11/15)
7. https://developer.teradata.com/doc/connectivity/tdnetdp/15.11/help/SystemRequirements
andSupportedTeradataVersions.html (23/11/15)

xviii

Вам также может понравиться