Академический Документы
Профессиональный Документы
Культура Документы
Data Control
1. To check if the data to be processed is correct
2. Need of data control: Error
Garbage-In-Garbage-Out (GIGO)
Source of Errors
Type Source Example
Data Source Error Incorrect data from source Intended fake data from interviewee
Transcription Error Misread or mistyped data “u” read as “v”
Class: 2 characters
Range Check Check if the numeric/character data lies within a range Class no: 1 – 40
M.C.: A – D
Format Check Check if the data fulfill a prescribed format Date: dd/mm/yyyy
Email: xxx@xxx.xxx
Check Digit Calculate a number by putting the data into a function. HKID Number
Data Verification
Check if the data input matches with the ones on the source document
Common data verification methods:
Method Description
Double entry Enter the same data set by two operators
Input data twice Enter the same data set twice
Data Organization
1. Structure
name age gender sid Field
Chan Tai Man 15 M S012345 Data
Wong Siu Ming 14 F S012378 Record
…… Table
Tam Ka Ming 15 M S012784
2. Key field: A field which can unique identify a particular record in a table
Example: HKID number can uniquely identify a person from all HK citizens
Key of a table can be single field, e.g. student_no, or composed of a number
of fields, e.g. class + class_no can unique identify a student in a school
In other words, a field cannot be a key if there is possibility of duplicated
occurrence of data ind that field. E.g. Name
Modes of Operation
1. Comparisons between batch processing and real-time processing
Batch Processing Real-time Processing
Description Data are collected in batched before Data are processed immediately after
processing collection
Feature Significant delay between data Very short delay from collection
collection and data processing to processing