Академический Документы
Профессиональный Документы
Культура Документы
Data Management
Companies store their data and information in databases. Information technologies and systems
support organizations in managing—that is, acquiring, organizing, storing, accessing, analyzing,
and interpreting—data. From confidential customer information, to financial transactions, to
social media posts, organizations possess massive amounts of data that are critical to
organizational success and that they need to manage.
A database is a collection of data organized in a manner so as to allow for easy access, retrieval
and use of that data. Computers process the data in a database into information. A database at a
video rental club, for example, contains data about its members – personal data, rental data etc.
When someone wishes to join the club, a video rental employee enters data about the new
member into the database. When the member borrows a video, the computer processes the data
and calculates the Due date for the video. When the member returns a video, the computer
processes the data and calculates an overdue fine, if applicable.
With database software, called a database management system (DBMS), users can create a
computerized database; insert, modify and delete data in the database; sort and retrieve data from
the database; and create reports from the data in the database. Databases are stored on disk to
allow for quick, direct access to information.
The Hierarchy of Data
A computer system organizes data in a hierarchy that starts with bits and bytes and progresses to
fields, records, files, and databases (see Figure).
A bit represents the smallest unit of data a computer can handle. A group of bits, called a byte,
represents a single character, which can be a letter, a number, or another symbol. A grouping of
characters into a word, a group of words, or a number (such as a person’s name or age) is called
a field. A group of related fields, such as a student ID, the course taken, the date, and the grade,
comprises a record; a group of records of the same type is called a file. For example, the records
could constitute a student course file. A group of related files makes up a database. The student
course file illustrated in the figure could be grouped with files on students’ personal histories and
financial backgrounds to create a student database. Databases are at the heart of all information
systems because they keep track of the people, places, and things that a business must deal with
on a continuing basis.
A primary key is a field or a combination of multiple fields that uniquely identifies each record
in a file. The data in a primary key is unique to a specific record. For example, the Student ID
field uniquely identifies each student because no two students at a university can have the same
Student ID.
A record describes an entity. An entity is a person, place, thing, or event on which we store and
maintain information. To run a business, you most likely will be using data about entities such as
customers, suppliers, employees, orders, products, shippers, and parts. Each characteristic or
quality describing a particular entity is called an attribute. For example, in the example above,
COURSE would be an entity, and Student_ID, Course, Date, and Grade would be its attributes.
The specific values that these attributes can have are found in the fields of the record describing
the entity COURSE. If you were a business keeping track of parts you used and their suppliers,
the entity SUPPLIER would have attributes such as the supplier’s name and address, which
would most likely include the street, city, state, and zip code. The entity PART would typically
have attributes such as part description, price of each part (unit price), and the supplier who
produced the part.
File Maintenance
File maintenance refers to the procedures that keep data current. File maintenance procedures
include adding records to, modifying records in, and deleting records from a file.
Users add new records to a file when they obtain new data. When a new student is enrolled at
the university, this student’s record needs to be added to the Student file. If a student has a
change of address, this information must be modified in the student’s file. Similarly, if the
student’s GPA is being calculated and stored in the database, this information must be modified
every time new grades for the student are added to the database. Unwanted records can be
deleted from the file.
THE TRADITIONAL FILE ENVIRONMENT
In most organizations, systems tended to grow independently without a companywide plan.
Accounting, finance, manufacturing, human resources, and sales and marketing all developed
their own systems and data files. Each area, of course, required its own data files and its own
computer program to operate. For example, the human resources functional area might have a
personnel master file, a payroll file, a medical insurance file, a pension file, a mailing list file,
and so forth, until a number of data files and programs existed. In the company as a whole, this
process led to multiple master files created, maintained, and operated by separate divisions or
departments. As this process goes on for 5 or 10 years, the organization is saddled with hundreds
of data files and programs to manage.
The resulting problems are data redundancy and inconsistency, program-data dependence,
inflexibility, poor data security, and an inability to share data among applications that are very
difficult to maintain.
Poor Security
Because data is dispersed across the organization, there is little centralized control or
management of data. Access to and dissemination of information may be out of control.
Management may have no way of knowing who is accessing or even making changes to the
organization’s data.
Student Data
Database Admissions Reports
Courses and Management
Grades Data System
Bursary Data
Exams Reports
division
Bursary Reports
Relational DBMS
Contemporary DBMS use different database models to keep track of entities, attributes, and
relationships. The most popular type of DBMS today for PCs as well as for larger computers and
mainframes is the relational DBMS. Relational databases represent data as two-dimensional
tables (called relations). Tables may be referred to as files. Each table contains data on an entity
and its attributes. Microsoft Access is a relational DBMS for desktop systems, whereas Oracle
Database, and Microsoft SQL Server are relational DBMS for large mainframes and midrange
computers. MySQL is a popular open source DBMS.
Principal tools and technologies for accessing information from databases to improve
business performance and decision making
Businesses use their databases to keep track of basic transactions, such as paying suppliers,
processing orders, keeping track of customers, and paying employees. But they also need
databases to provide information that will help the company run the business more efficiently
and help managers and employees make better decisions. If a company wants to know which
product is the most popular or who is its most profitable customer, the answer lies in the data.
Companies often build enterprise-wide data warehouses, where a central data warehouse serves
the entire organization, or they create smaller, decentralized warehouses called data marts. A
data mart is a subset of a data warehouse in which a summarized or highly focused portion of
the organization’s data is placed in a separate database for a specific population of users. For
example, a company might develop marketing and sales data marts to deal with customer
information.
ANALYTICAL TOOLS
Once data have been captured and organized using business intelligence technologies, they are
available for further analysis using software for database querying and reporting,
multidimensional data analysis (OLAP), and data mining.
OLAP supports multidimensional data analysis, enabling users to view the same data in different
ways using multiple dimensions. Each aspect of information—product, pricing, cost, region, or
time period—represents a different dimension. So, a product manager could use OLAP to learn
how many washers were sold in the East in June, how that compares with the previous month
and the previous June, and how it compares with the sales forecast. OLAP enables users to
obtain online answers to ad hoc questions such as these in a fairly rapid amount of time, even
when the data are stored in very large databases, such as sales figures for multiple years.
DATA MINING
Traditional database queries answer such questions as “How many units of product number 403
were shipped in February 2019?” OLAP, or multidimensional analysis, supports much more
complex requests for information, such as “Compare actual sales of product #403 relative to
forecasts by quarter and sales region for the past two years.” With OLAP and query-oriented data
analysis, users need to have a good idea about the information for which they are looking.
Data mining is more discovery-driven. Data mining provides insights into data that cannot be
obtained with OLAP, by finding hidden patterns and relationships in large databases and
inferring rules from them to predict future behavior. The patterns and rules are used to guide
decision making. There are data mining applications for all the functional areas of business and
for government and scientific work. One popular use for data mining is to provide detailed
analyses of patterns in customer data for one-to-one marketing campaigns or for identifying
profitable customers.
These are all made possible through the use of web databases. The only way for organizations to
conduct their online businesses is to link their databases to the Internet, and allow the customers
access to these databases.