Академический Документы
Профессиональный Документы
Культура Документы
ELE320_ENGR. J. SADOL
Data VS Information
Data is a collection of raw facts Examples: customer name, graphics, images, address, sound, and video segments, telephone number Data is not information Data without context is useless
2
Data VS Information
The terms data and information are closely related, and in fact are often used interchangeably. Information is data that has been processed in such a way that it can increase the knowledge of the person who uses it. Information is data that is organized and meaningful.
Data VS Information
Data + Interpretation >Information/Knowledge Example:
Baker, Kenneth D. Doyle, Joan E. Finkle, Clive R. Lewis, Clark C.
Data VS Information
Converting data to information (by context)
Class List Course: Filedat Trimester: 1st 01-02 Section: S33 Name Student Number Baker, Kenneth D. 2001-12345 Doyle, Joan E. 1999-67890 Finkle, Clive R. 2000-00001 Lewis, Clark C. 1998-01023
Data VS Information
Converting data to information (summarized)
Percent of Enrollment by Major (200X) SSE 35% COE 30% CNE 35%
Retrieval
Distribution
Programs VS Systems
Program - a set of step-by-step instructions that directs a computer to perform specific tasks and produce certain results. Examples: Sorting, adding numbers, computation of taxes, computation of deductions, MS Excel, MS Word, etc.
Programs VS Systems
System - an organized set of related components established to perform a certain task. Examples: Payroll System, MS Office System, Inventory System, HRIS, DBMS etc.
What is a Database?
A collection of data, typically describing the activities of one or more related organizations It is a collection of data organized in a way that allows access retrieval use
9
ELE320_ENGR. J. SADOL
What is a Database?
Banking: all transactions Airline: reservation, schedules Universities: registration, grades, Sales: customers, products, purchases Manufacturing: production, inventory, orders, supply chain Human resources: employee records, salaries, tax deductions
10
ELE320_ENGR. J. SADOL
What is a Database?
For example, the list of names and address of friends The book catalog of a large library may contain half a million records A database of much greater size and complexity is maintained by the government to keep track of the tax information filed by taxpayers.
11
ELE320_ENGR. J. SADOL
What is a Database?
A schema is the definition of a database. It defines the meaning of data. An instance of a database is the collection of data in the database at a particular point of time (snap-shot). For example, in the above, the schema is Student Name, ID, Age, Gender, Entrance Year, Grade. The remaining rows in the table make up an instance of the database.
12
ELE320_ENGR. J. SADOL
What is a DBMS?
A DBMS is a collection of software programs to enable users to create, maintain and utilize a database. Database Management Systems (DBMS) Program or set of programs designed to control access to the database manage data resources
ELE320_ENGR. J. SADOL 13
What is a DBMS?
DBMS
ELE320_ENGR. J. SADOL
Commercial DBMS
Product
2.
Oracle 8i, 9i, 10g DB2, Universal Server Access, SQL Server Adaptive Server Dynamic Server
Worlds 2nd largest after Informix acquisition
ELE320_ENGR. J. SADOL
15
What is a DBMS?
Commercial DBMS
Product
Oracle 8i, 9i, 10g Access, SQL Server Adaptive Server Dynamic Server
To reduce application development time Suppose we are given a collection of raw files which occupy 500GB
ELE320_ENGR. J. SADOL
17
address of a customer in
ELE320_ENGR. J. SADOL
18
The balance cannot be below $1000 The day of a month cannot exceed 31
ELE320_ENGR. J. SADOL
20
Step 1: We deduct $100 from account A Step 2: Then, we increment $100 in account B
ELE320_ENGR. J. SADOL
21
Uncontrolled concurrent accesses can lead to inconsistencies E.g., consider a bank application There is an account shared by 2 customers A and B Customers A and B withdraw $1000 concurrently
A B
Read 5000 Read 5000 5000 - 1000 5000 - 1000 Write 4000
ELE320_ENGR. J.Write SADOL
DBMS makes sure that the concurrent access cannot lead to this problem
22
4000
We do not want system programmers to have permissions to read some data (e.g., Andy Laus saving account and Joey Yungs saving account)
ELE320_ENGR. J. SADOL
23
Advantages of DBMS
Data independence Efficient data access Data integrity and security Data administration Concurrent access and crash recovery
ELE320_ENGR. J. SADOL
Data Models
A data model is a collection of tools or concepts for describing data, the meaning of data, data relationships and data constraints Object-based logical models Entity-Relationship Model (ER Model)
id
Customer-name Customer-street
Customer-city
Loan-number
amount
Customer
Borrower
Loan
25
Data Models
Relational Model
Main concept: relation, basically a table with rows and columns. A column is also called a field or attribute Other models such as the Network Model, Hierarchical Model, object-oriented model, objected-relational model We will focus on the dominant Relational model.
A description of data in terms of a data model is called a schema.
ELE320_ENGR. J. SADOL 26
Data Abstraction
is the process by which data and programs are defined with a representation similar in form to its meaning (semantics), while hiding away the implementation details. Abstraction tries to reduce and factor out details so that the programmer can focus on a few concepts at a time. It hides certain details of how data is stored and maintained
27
ELE320_ENGR. J. SADOL
ELE320_ENGR. J. SADOL
28
Data Abstraction
Conceptual Level
A34455
C23444
29
Data Abstraction
Payroll section: view on employees, departments, salaries Sales department: view on products, prices, sales, customers Purchasing department: view on parts, with pricing
ELE320_ENGR. J. SADOL
30
Data Independence
Change of physical database schemas without change of conceptual database schemas or application programs Change of conceptual schemas without change of external views
ELE320_ENGR. J. SADOL
31
For retrieval, we query the database with the query language, which is part of the DML
ELE320_ENGR. J. SADOL
32
Database Administrator (DBA): Person(s) who has central control over the database and is responsible for the following tasks
Schema definition/modification Storage structure definition/modification Authorization of data access Integrity constraint specification Monitoring performance Responding to changes in requirements
33
ELE320_ENGR. J. SADOL
Application Programmers
Embed DML calls in program written in a host language (e.g., Cobol, C, Java). (DML stands for data manipulation language) e.g., programs that generates payroll checks, transfer funds between accounts
Form request in database query language Invokes one of the permanent application programs that have been written previously e.g. transfer transfer fund between accounts
Sophisticated users
Nave users
ELE320_ENGR. J. SADOL
34
Structure of a DBMS
Query Optimization and Execution Relational Operators Files and Access Methods Buffer Management Disk Space Management
ELE320_ENGR. J. SADOL
35
History
The first generall purpose DBMS was designed by Charles Bachman at GE in early 1960s In 1970 Codd at IBM proposed a new data representation framework called the relational data model. The SQL query for relational databases, developed as part of IBMs System R project, was standardized in the late 1980s. The current standard, SQL-92, was adopted by ANSI (American National Standards Institute) and ISO (International Standards Organization).
36
ELE320_ENGR. J. SADOL
History
In late 1980s and 1990s, several vendors (e.g., IBMs DB2, Oracle 8) have extended their systems with the ability to store new data types such as images and text. Specialized systems developed for data warehouses, consolidating data from several databases. Entering the Internat Age, a new markup language XML is proposed for data access through a Web browser. As more and more data are collected, companies are also interested to mine useful information from their data.
37
ELE320_ENGR. J. SADOL